In this section we assume a strictly de Sitter background throughout with
$$\begin{aligned} {a=e^{Ht}\,.} \end{aligned}$$
(24)
Up until now we have written the expectation value of the energy-momentum tensor symbolically as \(\langle \hat{T}_{\mu \nu }\rangle \), without specifying how it is to be derived. As mentioned in the introduction, our method for calculating the expectation value will deviate from what is generally used for the semi-classical prescription [50]. Before this important topic we need to however cover some basic features.
First, we will adopt the cosmologically motivated choice where de Sitter space is described in terms of the expanding FLRW coordinates used for example when calculating the cosmological perturbations from inflation. This is a consistent approach, since for an observer at rest with the expanding FLRW coordinates the contracting patch is not accessible.
Next we need to define the specific state to be used as the initial condition. In the black hole context making the physical choice for the vacuum is an essential ingredient for the understanding of the evaporation process, see [69] for important pioneering work and [47] for a clear discussion. Again conforming to the usual choice made in inflationary cosmology, we will use the Bunch–Davies vacuum [80, 81] as the quantum sate. A compelling motivation for this choice comes from the fact that the Bunch–Davies vacuum is an attractor state in de Sitter space [33], provided we make the natural assumption that the leading divergences of the theory coincide with those in flat space [82, 83].
A very important feature of the Bunch–Davies vacuum is that it covers the entire FLRW de Sitter patch and hence extends also to regions that would be hidden behind the horizon. This is a natural requirement for an initial condition in a case where the Universe was not always dominated by vacuum energy, which from the cosmological point of view is well-motivated: for example, the Universe may start out as radiation or matter dominated and only at late times asymptotically approach the exponentially expanding de Sitter space as in the \(\Lambda \)CDM model. This will turn out to be crucial for our calculation.
For deriving the Bunch–Davies vacuum in the FLRW coordinates it proves convenient to make use of the conformal time coordinate
$$\begin{aligned} {\,d\eta \!= \!\frac{{dt}}{{a}}\quad \Rightarrow \quad \eta \!=\! \frac{{-1}}{{aH}}\quad \Rightarrow \quad ds^2=a^2\big [-d\eta ^2+d\mathbf {x}^2\big ]\,,} \end{aligned}$$
(25)
with which the equation of motion (2) becomes
$$\begin{aligned} {\bigg [\partial _\eta ^2+(n-2)\frac{{a'}}{{a}}\partial _\eta -a^2\partial _i\partial ^i+\frac{{n-2}}{{4(n-1)}}a^2R\bigg ]\hat{\phi }=0\,,} \end{aligned}$$
(26)
where \(a'\equiv \partial a / \partial \eta \) and \(a^2R=2(n-1)a''/a+(n-1)(n-4)(a'/a)^2\). The solutions can be written as the mode expansion
$$\begin{aligned} {\hat{\phi }=\int d^{n-1}\mathbf {k}\left[ \hat{a}_\mathbf {k} u_{\mathbf {k}}+\hat{a}_{\mathbf {k}}^\dagger u^{*}_{{\mathbf {k}}}\right] \,\, ,} \end{aligned}$$
(27)
with \(k\equiv \vert \mathbf {k}\vert \), with the commutation relations \([\hat{a}_{\mathbf {k}},\hat{a}_{\mathbf {k}'}^\dagger ]=\delta ^{(n-1)}(\mathbf {k}-\mathbf {k}'),~~[\hat{a}_{\mathbf {k}},\hat{a}_{\mathbf {k}'}]=[\hat{a}_{\mathbf {k}}^{{\dagger }},\hat{a}_{\mathbf {k}'}^\dagger ]=0\) and the modes
$$\begin{aligned} {u_{\mathbf {k}}=\frac{{1}}{{\sqrt{(2\pi )^{n-1}a^{n-2}}}}\frac{{1}}{{\sqrt{2k}}}e^{-i (k\eta -\mathbf {k\cdot \mathbf {x}})}\, ,} \end{aligned}$$
(28)
which define the Bunch–Davies vacuum state \(\vert 0 \rangle \) via
$$\begin{aligned} {\hat{a}_{\mathbf {k}}\vert 0 \rangle =0\, .} \end{aligned}$$
(29)
The Klein-Gordon inner product between two solutions to the equation of motion \(\phi _1\) and \(\phi _2\) can be defined in terms of a spacelike hypersurface \(\Sigma \), a future oriented unit normal vector \(n^\mu \) and the induced spatial metric \(\gamma _{ij}\) as
$$\begin{aligned} {\big (\phi _1,\phi _2\big )\!=\!- i\int _\Sigma d^{n-1}\mathbf {x}\sqrt{\gamma }n^\mu \,\phi _1 \overset{\leftrightarrow }{\nabla }_\mu \phi ^*_2\,;\quad \overset{\leftrightarrow }{\nabla }_\mu \!=\! \overset{\rightarrow }{\nabla }_\mu \!-\!\overset{\leftarrow }{\nabla }_\mu \,.} \end{aligned}$$
(30)
It is easy to show in the conformal coordinates (25) that using the vector \(\partial _\eta \) for normalization gives \(n^\eta =a^{-1}, n^i=0\) and \(\sqrt{\gamma } = a^{n-1}\) with which the inner product takes the form
$$\begin{aligned} {\big (\phi _1,\phi _2\big )=-i\int d^{n-1}\mathbf {x}\,a^{n-2}\,\phi _1 \overset{\leftrightarrow }{\nabla }_\eta \phi ^*_2\,,} \end{aligned}$$
(31)
and that the expansion (27) in terms of the Bunch–Davies modes (28) is properly normalized,
$$\begin{aligned} {\big (u_{\mathbf {k}},u_{\mathbf {k}'}\big )=\delta ^{(n-1)}(\mathbf {k}-\mathbf {k}')\,.} \end{aligned}$$
(32)
For completeness we also show the derivation for the Bunch–Davies modes in the spherical coordinates in four dimensions. Using an ansatz
$$\begin{aligned} {\psi _{\ell m k}=f_{\ell k}(r)Y^m_\ell \frac{{e^{-ik\eta }}}{{a}}\,,} \end{aligned}$$
(33)
where the \(Y_\ell ^m\) are the spherical harmonics normalized according to
$$\begin{aligned} \int d\Omega \,Y_\ell ^m \, Y_{\ell '}^{m'}{}^*&=\int _{\theta =0}^\pi \int _{\varphi =0}^{2\pi }d\theta d\varphi \sin \theta \,Y_\ell ^m \, Y_{\ell '}^{m'}{}^* \nonumber \\&=\delta _{\ell \ell '}\, \delta _{mm'}\,, \end{aligned}$$
(34)
the equation of motion (26) reduces to a purely radial equation
$$\begin{aligned} {\bigg \{r^{-2}\frac{{\partial }}{{\partial r}}\Big (r^2\frac{{\partial }}{{\partial {{r}}}}\Big )+{k^2} -\frac{{\ell (\ell +1)}}{{r^2}}\bigg \}f_{\ell k}({r})=0\,,} \end{aligned}$$
(35)
which has solutions expressible as linear combinations of the spherical Bessel functions \(j_\nu (x)\). Writing the inner product in spherical coordinates
$$\begin{aligned} {\big (\phi _1,\phi _2\big )=-i\int d\Omega \int _0^\infty dr\,r^2\,a^{2}\,\phi _1 \overset{\leftrightarrow }{\nabla }_\eta \phi ^*_2\,,} \end{aligned}$$
(36)
and making use of the orthogonality properties of the Bessel functions allows one to derive the properly normalized positive frequency modes
$$\begin{aligned} {\psi _{\ell m k}=\sqrt{\frac{{k}}{{\pi }}}j_\ell (kr)Y^m_\ell \frac{{e^{-ik\eta }}}{{a}}\,.} \end{aligned}$$
(37)
The spherical modes (37) provide another representation for the scalar field and the Bunch–Davies vacuum via
$$\begin{aligned} {\hat{\phi }=\sum _{\ell =0}^\infty \sum _{m=-\ell }^\ell \int _0^\infty dk\Big [\psi _{\ell m k}\hat{a}_{\ell m k}+\psi ^{*}_{\ell m k}\hat{a}^\dagger _{\ell m k}\Big ]} \end{aligned}$$
(38)
and
$$\begin{aligned} {\hat{a}_{\ell m k}|0\rangle =0\,.} \end{aligned}$$
(39)
The equivalence of the states as defined by (29) and (39) can be demonstrated for example by showing that the Wightman function as defined by the two states coincides, which can be easily done with the help of the Rayleigh or plane wave expansion.
In order to obtain well-defined quantum expectation values we must define a regularization and a renormalization prescription for the ultraviolet divergences. Perhaps the most elegant way would be to analytically continue the dimensions to n and redefine the constants of the original action to obtain physical results. Dimensional regularization does have a drawback however, which is that consistency requires one to calculate everything in n dimensions, which is surely more difficult than to perform the calculation in 2 dimensions, for example. For our purposes the most convenient choice is the adiabatic subtraction technique [84,85,86], which in the non-interacting case is a consistent and a covariant approach [87], but where no explicit regularization is needed as the counter terms can formally be combined in the same integral as the expectation value. This also allows us to use a strictly 2- or a 4-dimensional theory.
Tracing over the unobservable states
So far our approach has followed standard lines. If in some given state \(\vert \Psi \rangle \) we were to calculate the relevant expectation values \(\langle \hat{T}_{\mu \nu }\rangle \equiv \langle \Psi \vert \hat{T}_{\mu \nu }\vert \Psi \rangle \) and use them as the sources in the Friedmann equations we would obtain a result that exponentially fast approaches a configuration with \(\dot{H}=0\) and conclude that de Sitter space is a stable solution also when back reaction is taken into account. This is a manifestation of the de Sitter invariance and the attractor nature of the Bunch–Davies vacuum and true as well for the non-conformal case [33]. However, as discussed in Sect. 4 the horizon in de Sitter space splits the FLRW manifold into two patches only one of which is visible to a local observer. This is very much analogous to how a black hole horizon blocks the observational access of an observer outside the horizon [42, 72]. Here is where our approach will differ from what is traditionally done in semi-classical gravity: following [36] when calculating \(\langle \hat{T}_{\mu \nu }\rangle \) we will use a prescription where we average over only those states that are inside the horizon and thus observable. We note that quite generally coarse graining a state is expected to bring about a qualitative change in the results since it often leads to a violation of de Sitter invariance [36].
A configuration where a state is not completely observationally accessible can be described in terms of an open quantum system. For more discussion, see for example the textbook [88]. If we assume that the quantum state \(\vert \Psi \rangle \) can be written as a product of orthonormal states \(\vert n,A\rangle \) of the observable system and \(\vert n,B\rangle \) of the unobservable environment as
$$\begin{aligned} {\vert \Psi \rangle = \sum _n p_n\vert n,A\rangle \vert n,B\rangle \,,} \end{aligned}$$
(40)
we can express expectation values with a coarse grained density matrix \(\hat{\rho }\)
$$\begin{aligned} {\langle \hat{O}\rangle \equiv \mathrm{Tr}\big \{\hat{O}\hat{\rho }\big \}\,,} \end{aligned}$$
(41)
where the density matrix \(\hat{\rho }\) is obtained by neglecting or tracing over the unobservable states
$$\begin{aligned} \hat{\rho }&\equiv \mathrm{Tr}_{B}\big \{\vert \Psi \rangle \langle \Psi \vert \big \} \\&=\sum _m\langle B,m\vert \bigg \{\Big [\sum _n p_n\vert n,A\rangle \vert n,B\rangle \Big ]\\&\quad \times \Big [\sum _{n'} p_{n'}^* \langle B,n'\vert \langle A,n'\vert \Big ]\bigg \}\vert m,B\rangle \,, \end{aligned}$$
leading to
$$\begin{aligned} {\Leftrightarrow \quad \hat{\rho }=\sum _m\vert p_m\vert ^2\vert m,A\rangle \langle A,m\vert \,.} \end{aligned}$$
(42)
If in the state \(\vert \Psi \rangle \) there is entanglement between the observable states and the unobservable states we coarse grain over, the initially pure quantum state becomes mixed and the Von Neumann entropy of the density matrix will be non-zero
$$\begin{aligned} {\hat{\rho }^2\ne \hat{\rho }\quad \Leftrightarrow \quad -\mathrm{Tr}\big (\hat{\rho }\log \hat{\rho }\big )>0\,,} \end{aligned}$$
(43)
signalling that part of the information of the initial state \(\vert \Psi \rangle \) is lost or unobservable. When coarse graining leads to entropy increase/information loss it is a generic feature that the expectation values will not remain the same [36]. For example for the energy-momentum tensor one would expect to have
$$\begin{aligned} \mathrm{Tr}\big \{\hat{T}_{\mu \nu }\hat{\rho }\big \}\quad {\ne }\quad \langle \Psi |\hat{T}_{\mu \nu }| \Psi \rangle \,, \end{aligned}$$
(44)
implying that the coarse grained system has a different gravitational response compared to the un-coarse grained case.
Importantly, the Bunch–Davies vacuum in de Sitter space before coarse graining is a zero entropy state, but as explained covers also regions that are hidden from a local observer. If tracing over the unobservable states leads to a non-zero entropy it also suggests the presence of a non-zero energy density, which in light of the arguments given in Sect. 3 implies \(\dot{H}\ne 0\) and gives an important link between loss of information from coarse graining and a potentially non-trivial back reaction in our prescription.
Our choice of neglecting the unobservable states from the expectation values can be motivated as follows. First of all it is a standard procedure in branches of physics where having only partial observable access to a quantum state is a typical feature. An important example is the decoherence program: without an unobservable environment the quantum-to-classical transition does not take place [89]. Neglecting unobservable information is crucial also for the inflationary paradigm: in order to obtain the correct evolution of large scale structure as seeded by the inherently quantum fluctuations from inflation one must calculate the gravitational dynamics from the classicalized i.e. coarse grained energy-momentum tensor [90]. Perhaps most importantly, the energy-momentum tensor one obtains after neglecting the unobservable states corresponds to what an observer would actually measure and in this sense has clear physical significance.
A profound feature of our prescription is that since the horizon in de Sitter space is an observer dependent quantity, so is then the back reaction itself. Although a rather radical proposition, this does not imply an immediate inconsistency. After all, observer dependence is a ubiquitous feature in general – and even special – relativity. Furthermore, the well-known observer dependence of the concept of a particle in quantum theories on curved backgrounds was argued to lead to such a conclusion already in the seminal work [42].
Although our prescription of using a coarse grained energy-momentum tensor as the source term for semi-classical gravity deviates from the standard approach making use of \(\langle \hat{T}_{\mu \nu }\rangle \equiv \langle \Psi \vert \hat{T}_{\mu \nu }\vert \Psi \rangle \), we would like to emphasize that at the moment there is no method for conclusively determining precisely which object is the correct one [50]. This stems from the fact that semi-classical gravity is not a complete first principle approach, but rather an approximation for describing some of the gravitational implications from the quantum nature of matter. Before a full description of quantum gravity is obtained it is likely that this state of affairs will remain.
Tracing over inaccessible environmental states that are separated by a sharp boundary from the accessible ones generically leads to divergent behaviour close to the boundary. This is encountered for example in the context of black hole entropy [91] and entanglement entropy in general [92, 93]. Although by introducing a cut-off or a smoothing prescription well-defined results can be obtained [73], there is valid suspicion of the applicability of the semi-classical approach when close to the horizon. However, we can expect reliable results at the limit when the horizon is far away. At this limit there exists a natural expansion in terms of physical distance in units of the horizon radius, or more specifically in terms of the dimensionless quantities
, in the notation of Sect. 4. The neglected terms we will throughout denote as
. This limit can be expressed equivalently as being far away from the horizon or close to the center of the Hubble sphere and can equally well be satisfied when H is large such as during primordial inflation or when it is very small as it is during the late time Dark Energy dominated phase we are currently entering. The limit where the observer is far from the horizon is also the limit taken in the standard black hole analysis [37, 38].
Two dimensions
For completeness we first go through the steps of the 2-dimensional argument presented in [34], before proceeding to the full 4-dimensional derivation.
The various coordinate systems in de Sitter space in two dimensions can be expressed analogously to what was discussed in four dimensions in Sect. 4. The main modification is that since there is only one spatial coordinate there are now two horizons, at \(\pm 1/H\). For the patch inside the horizon in two dimensions one may use the FLRW
$$\begin{aligned} {\left\{ \begin{array}{ll}y^0=H^{-1}\sinh (Ht)+(H/2){x}^2e^{Ht}\\ y^1=e^{Ht}x \\ y^4=H^{-1}\cosh (Ht)-(H/2){x}^2e^{Ht}\end{array}\right. }\,, \end{aligned}$$
(45)
or static coordinates
giving
$$\begin{aligned} {ds^2=-dt^2+e^{2Ht}d{x}^2\,,} \end{aligned}$$
(47)
and
respectively, with
and
. Since in two dimensions there are two horizons there are also two patches beyond the horizon that can be covered with the FLRW coordinates or with
where
and
, and
where
and
.
The relations between the various coordinate systems become quite simple when using the light-cone coordinates defined in terms of conformal time (25) as
where the notation implies the same definition in all three regions A, B and C. As is clear from the definitions (51) the V and U coordinates can also be conveniently used to split the FLRW patch in terms of the regions A, B and C since they vanish at the horizons 1 / H and \(-1/H\), respectively. This is illustrated in Fig. 4. Furthermore, in the static patches we define the tortoise coordinates
with
,
and
. It is now a question of straightforward algebra to express of the light-cone coordinates U and V in terms of the static ones. The results can be summarized as
$$\begin{aligned} {\left\{ \begin{array}{ll}V{=}-H^{-1}e^{-Hv_{{\tiny { A}}}}\\ U{=}-H^{-1}e^{-Hu_{\tiny { A}}}\end{array}\right. }\,,\,\mathrm{Reg. }~A\,, \end{aligned}$$
(53)
$$\begin{aligned} {\left\{ \begin{array}{ll}V{=}+H^{-1}e^{-Hv_{B}}\\ U{=}-H^{-1}e^{-Hu_{\tiny {B}}}\end{array}\right. }\,,\,\mathrm{Reg. }~B\,, \end{aligned}$$
(54)
and
$$\begin{aligned} {\left\{ \begin{array}{ll}V{=}-H^{-1}e^{-Hv_{C}}\\ U{=}+H^{-1}e^{-Hu_{\tiny {C}}}\end{array}\right. }\,,\,\mathrm{Reg. }~C\,, \end{aligned}$$
(55)
and it is also convenient to define light-cone coordinates with respect to the static coordinates
The core of the calculation is finding an expression for the coarse grained density matrix (42), from which the unobservable information related to states beyond the horizon is removed. If we assume that any possible entanglement occurs only between modes with the same momentum, the density matrix where the hidden states are traced over can be written as the product in momentum space
$$\begin{aligned} \hat{\rho }\equiv \mathrm{Tr}_{BC}\big \{\vert 0 \rangle \langle 0\vert \big \}=\prod _{\mathbf {k}}\mathrm{Tr}_{BC}\big \{\vert 0_{\mathbf {k}} \rangle \langle 0_{\mathbf {k}}\vert \big \}\equiv \prod _{\mathbf {k}} \hat{\rho }_{\mathbf {k}}\,, \end{aligned}$$
(57)
where we define \(\mathbf {k}\) to be a scalar going from \(-\infty \) to \(\infty \) and where \(\vert 0_{\mathbf {k}} \rangle \) is the \(\mathbf {k}\)’th Fock space contribution to the Bunch–Davies vacuum, \(\vert 0 \rangle =\prod {\,}^{\,}_{\mathbf {k}} \vert 0^{\,}_{\mathbf {k}} \rangle \).
Next we need to find an expression for the Bunch–Davies vacuum in terms of observable and unobservable states. This is obtained by relating the Bunch–Davies modes to the ones defined in the 2-dimensional static coordinates found in (46), (49) and (50). From (26–29) we see that in two dimensions the mode expansion in de Sitter space coincides with the flat space result and can be written in the light-cone coordinates (51) as
$$\begin{aligned} \hat{\phi }&\equiv \hat{\phi }_V+\hat{\phi }_U\nonumber \\&=\int _0^\infty \frac{{dk}}{{\sqrt{4\pi k}}}\Big [e^{-ikV}\hat{a}_{-k} +e^{ikV}\hat{a}^{\dagger }_{-k} +e^{-ikU}\hat{a}_{k} +e^{ikU}\hat{a}^{\dagger }_{k}\Big ]\,. \end{aligned}$$
(58)
In (58) we have split the quantum field to two contributions according their dependence on U or V since these newer mix and can be thought as separate sectors, as is evident by taking into account.Footnote 4 \([\hat{a}_{-k},\hat{a}^{{\dagger }}_{k}]=0\), \(V=V(v)\) and \(U=U(u)\) from (53–56). Since \(\hat{\phi }_V\) is expressed only in terms of the \(\hat{a}_{-k}\) operators it consists solely of particles moving towards the left and similarly for \(\hat{\phi }_U\) and the right-moving particles.
In two dimensions also the static coordinates give rise to a trivial equation of motion
but much like for the Unruh effect, we must carefully determine the correct normalization for the modes in the static patches. Namely, we need to make sure that we are consistent in terms of defining a positive frequency mode.
As discussed after equation (30), the modes in (58) are defined to be positive frequency in terms of the vector \(\partial _\eta \) and we need to respect this definition also in the static patches A, B and C. If we choose
,
and
for A, B and C respectively, it is a simple matter of using the vector transformations \(\partial _{\mu }=\frac{{\partial x^{\tilde{\alpha }}}}{{\partial x^\mu }}\partial _{\tilde{\alpha }}\) with (53–56) to show that
which with the help of Fig. 4 one may see to be time-like in terms of conformal time and future-oriented in their respective regions.Footnote 5
Normalizing the vectors (60–62) we can then from (30) write inner products in the static patches
with which the expression for the scalar field in the static coordinates becomes
$$\begin{aligned}&\hat{\phi }= \int _0^\infty \frac{{dk}}{{\sqrt{4\pi k}}}\nonumber \\&\times \left\{ \begin{array}{l}\displaystyle \Big (e^{-ik{v}_{\tiny { A}}}\hat{a}^{{\tiny { A}}}_{-k}+ \mathrm{H.C}\Big )+\Big (e^{-ik{u}_{\tiny { A}}}\hat{a}^{{\tiny { A}}}_{k} + \mathrm{H.C}\Big )\,,\, \mathrm{Reg. }~A\,,\\ \displaystyle \Big (e^{+ik{v}_{{\tiny {B}}}}\hat{a}^{{\tiny {B}}}_{-k} + \mathrm{H.C}\Big )+\Big (e^{-ik{u}_{\tiny {B}}}\hat{a}^{{\tiny {B}}}_{k} + \mathrm{H.C}\Big )\,,\,\mathrm{Reg. }~B\,,\\ \displaystyle \Big (e^{-ik{v}_{{C}}}\hat{a}^{{\tiny {C}}}_{-k} + \mathrm{H.C}\Big ) +\Big (e^{+ik{u}_{{C}}}\hat{a}^{{\tiny {C}}}_{k} + \mathrm{H.C}\Big )\,,\,\mathrm{Reg. }~C\,. \end{array}\right. \end{aligned}$$
(66)
where ‘\(\mathrm{H.C.}\)’ stands for ‘hermitian conjugate’. The form in (66) can be used to define the scalar field \(\hat{\phi }\) on the entire expanding FLRW patch, precisely as (58). The crucial point is that in general the Bunch–Davies vacuum as defined by (29) is an entangled combination of states inside and outside the horizon, which leads to an increase in entropy once the hidden states are traced over.
We will first perform the entire calculation for \(\hat{\phi }_V\), after which writing the results for \(\hat{\phi }_U\) becomes trivial. We can first focus only on the regions A and B, since for \(\hat{\phi }_V\) only the horizon at \(V=0\) is relevant, which we elaborate more below. From (66) we then get
$$\begin{aligned} \hat{\phi }_V&=\int _0^\infty \frac{{dk}}{{\sqrt{4\pi k}}}\Big [e^{-ik{v}_{\tiny { A}}}\hat{a}^{{\tiny { A}}}_{-k} + e^{ik{v}_{\tiny { A}}}\hat{a}^{{\tiny { A}}\,\dagger }_{-k}\nonumber \\&\quad +e^{ik{v}_{\tiny {B}}}\hat{a}^{{\tiny {B}}}_{-k} +e^{-ik{v}_{\tiny {B}}}\hat{a}^{{\tiny {B}}\,\dagger }_{-k}\Big ]\,, \end{aligned}$$
(67)
where the modes defined in the A region are to be understood to vanish in region B and vice versa for the modes in B.
The approach we will use was originally presented in [69] and is based on the fact that any linear combination of positive modes defines the same vacuum as a single positive mode [50]. The main constraint is that the Bunch–Davies modes (58) are continuous across the horizon. Making use of (53–56) we can write
$$\begin{aligned} e^{-iv_{\tiny { A}} k}&=e^{\frac{{ik}}{{H}}\ln (-HV)}\,;~~ \mathrm{Reg. }~A\,, \end{aligned}$$
(68)
$$\begin{aligned} e^{-\frac{{\pi k}}{{H}}}\big (e^{ikv_{\tiny {B}}})^*=e^{-\frac{{\pi k}}{{H}}}e^{\frac{{ik}}{{H}}\ln (HV)}&=e^{\frac{{ik}}{{H}}\ln (-HV)}\,;~~ \mathrm{Reg. }~B\,, \end{aligned}$$
(69)
where in the last line we chose the complex logarithm to have a branch cut as \(\ln (-1)=i\pi \). Because of this choice the sum of (68) and (69) is continuous across the horizon and analytic when \(\mathfrak {I}[V]<0\), so it must be expressible as a linear combination of \(e^{-ikV}\) i.e the positive frequency Bunch–Davies modes from (58). In a similar fashion starting from \(e^{+iv_{\tiny { A}} k}\) it is straightforward to find a second continuous and well-behaved linear combination. With such linear combinations we have yet another representation for the scalar field in addition to (58) and (67)
$$\begin{aligned} \hat{\phi }_V&=\int _0^\infty \frac{{dk}}{{\sqrt{4\pi k}}}\frac{{1}}{{\sqrt{1-\gamma ^2}}}\bigg \{\Big [e^{-ikv_{\tiny { A}}}+\gamma \big (e^{ikv_{\tiny {B}} })^*\Big ]\hat{d}^{(1)}_{-k}\nonumber \\&\quad +\Big [\gamma \big (e^{-ikv_{\tiny { A}}})^*+e^{ikv_{\tiny {B}}}\Big ]\hat{d}^{(2)}_{-k}+ \mathrm{H.C.}\bigg \}\,, \end{aligned}$$
(70)
where we have defined \(\gamma \equiv e^{-\pi k/H}\) and used (63–64) to get properly normalized modes. An important result may be derived by realizing that the operators \(\hat{d}^{(1)}_{-k}\) and \(\hat{d}^{(2)}_{-k}\) annihilate the Bunch–Davies vacuum since the modes in the square brackets of (70) are continuous and must be linear combinations of the positive frequency Bunch–Davies modes and (67) allows us to express them in terms of \(\hat{a}^{{\tiny { A}}}_{-k}\) and \(\hat{a}^{{\tiny {B}}}_{-k}\)
$$\begin{aligned} {\left\{ \begin{array}{ll} \big (\hat{a}^{{\tiny { A}}}_{-k}-\gamma \hat{a}^{{\tiny {B}}\,\dagger }_{-k}\big )|0_{-k}\rangle =0\\ \big (\hat{a}^{{\tiny {B}}}_{-k}-\gamma \hat{a}^{{\tiny { A}}\,\dagger }_{-k}\big )|0_{-k}\rangle =0\end{array}\right. } \,. \end{aligned}$$
(71)
The above relations are identical to what is found in the 2-dimensional Unruh effect, and black hole evaporation and imply that \(|0_{-k}\rangle \) is an entangled state in terms the number bases as defined by \(\hat{a}^{{\tiny { A}}}_{-k}\) and \(\hat{a}^{{\tiny {B}}}_{-k}\). Following [73] the normalized solution to (71) can be written as
$$\begin{aligned} {|0_{-k}\rangle =\sqrt{1-\gamma ^2}\sum _{n_{-k}=0}^\infty \gamma ^{n_{-k}}|n_{-k},A\rangle |n_{-k},B\rangle \,,} \end{aligned}$$
(72)
where \(|n_{-k},A\rangle \) and \(|n_{-k},B\rangle \) are particle number eigenstates as defined by \(\hat{a}^{{\tiny { A}}}_{-k}\) and \(\hat{a}^{{\tiny {B}}}_{-k}\). If as in (42) we trace over the unobservable states the density matrix becomes precisely thermal
$$\begin{aligned} \hat{\rho }_{-{k}}&=\mathrm{Tr}_{B}\big \{|0_{-k}\rangle \langle 0_{-k}|\big \}\nonumber \\&=(1-\gamma ^2)\sum _{n_{-k}=0}^\infty \gamma ^{2n_{-k}}|n_{-k},A\rangle \langle A, n_{-k}|\,. \end{aligned}$$
(73)
A physical argument can also be used to rule out one of the two possible choices for the branch cut. Had we made the different choice the result would have given an infinite number of produced particles at the ultraviolet limit, which is an unphysical solution as the ultraviolet modes should be indifferent to the global structure of spacetime and experience no particle creation.
As mentioned, only the regions A and B are relevant for \(\hat{\phi }_V\). The reason why one may neglect the contribution from region C is apparent from the relations (53–56) and (66): \(\hat{\phi }_V\) has no dependence on U so no mixing of modes is needed in order to obtain analytic behaviour across the horizon \(U=0\). Thus, including all regions A, B and C in (70) would still give (72), which we have also explicitly checked.
So far we have only studied \(\hat{\phi }_V\) i.e. the particles moving to the left. By using (53–56) and (66) the calculation involving \(\hat{\phi }_U\) proceeds in an identical manner resulting also in a thermal density matrix, but in terms of the right-moving particles \(|n_k,A\rangle \).
Putting everything together, the density matrix (57) obtained by tracing over the states beyond the horizon in the Bunch–Davies vacuum is precisely thermal with the Gibbons–Hawking de Sitter temperature \(T=H/(2\pi )\)
$$\begin{aligned} \hat{\rho }=\prod _{\mathbf {k}}\big (1-e^{- \frac{{2\pi k}}{{H}}}\big )\sum _{n_{\mathbf {k}}=0}^\infty e^{- \frac{{2\pi k}}{{H}}n_{\mathbf {k}}}|n_{\mathbf {k}},A\rangle \langle A, n_{\mathbf {k}}|\,. \end{aligned}$$
(74)
Since in (74) all states except the ones belonging to region A are neglected we can write the expectation value of the energy-momentum tensor from (5) by expressing \(\hat{\phi }\) as the top line from (66)
$$\begin{aligned} {\langle \hat{T}_{vv}\rangle \!=\!\langle \hat{T}_{uu}\rangle \!=\!\int _0^\infty \frac{{dk}}{{2\pi }}\bigg [\frac{{k}}{{2}}+\frac{{k}}{{e^{2\pi k/H}-1}}\bigg ]\,;~~~\langle \hat{T}_{uv}\rangle =0\,,} \end{aligned}$$
(75)
where for simplicity we have dropped the A labels. The final unrenormalized expression in the FLRW coordinates (47) can be obtained by using the tensor transformation law \(T_{\mu \nu }=\frac{{\partial x^{\tilde{\alpha }}}}{{\partial x^\mu }}\frac{{\partial x^{\tilde{\beta }}}}{{\partial x^\nu }}T_{\tilde{\alpha }\tilde{\beta }}\) with (53–56) and (51). This gives
and
As the discussion after Eq. (42) addressed, (76) and (77) have divergent behaviour on the horizons
. This is distinct to the usual ultraviolet divergences encountered in quantum field theory, which are also present in (76) and (77) as the divergent integrals. For our purposes the relevant limit of being close to the origin is obtained with an expansion in terms of
giving
The last step in the calculation is renormalization. When we neglect the
contributions i.e. study only the region far from the horizon the result is precisely homogeneous and isotropic for which the counter terms can be found by calculating the energy-momentum tensor as an expansion in terms of derivatives the scale factor. This is the adiabatic subtraction technique [84,85,86], with which the 2-dimensional counter terms were first calculated in [95] giving coinciding results to [96]. This technique gives the counter terms for the energy and pressure components as formally divergent integrals
$$\begin{aligned} \delta T_{00}=\int \frac{{d\mathbf {k}}}{{2\pi }}\frac{{k}}{{2}}+\frac{{H^2}}{{24\pi }}\,;\qquad \delta T_{ii}/a^2=\int \frac{{d\mathbf {k}}}{{2\pi }}\frac{{k}}{{2}}-\frac{{H^2}}{{24\pi }}\,. \end{aligned}$$
(80)
Note that the apparent divergence in the flux contribution (79) is an artefact of our use of non-regulated integrals. When dimensionally regulated the sum of the energy and pressure density divergences cancels, also for massive particles. Physically one can understand this from the requirement that Minkowski space must be stable under back reaction. This issue is discussed more in Sect. 6, see also the equation (113). For now we can simply neglect the divergences in (79) or following [34] formally derive the flux counter terms by demanding covariant conservation of \(\delta T_{\mu \nu }\). The renormalized energy-momentum is then
We can clearly see that far away from the horizon the energy-momentum describes a homogeneous and isotropic distribution of thermal particles. It can be checked to be covariantly conserved and to have the usual conformal anomaly [50], although in de Sitter space the conformal anomaly is not important and one may cancel the \(\pm \,\frac{{H^2}}{{24\pi }}\) contributions by a redefinition of the cosmological constant. Contrary to what one usually encounters in cosmology, the energy density in (81) is constant despite the fact that space is expanding and generically diluting any existing particle density. This is explained by the additional term (83) representing a continuous incoming flux of particles, which replenishes the energy lost by dilution. This naturally raises an important question concerning the source of the flux, which will be addressed in Sect. 6 where we write down solutions that are consistent in terms of semi-classical back reaction.
In the language of Sect. 3 the \(\pm H^2/(24\pi )\) terms are state independent contributions resulting from renormalization and the integral over the thermal distribution is the state dependent contribution \(\rho ^S_m\). The results (81–83) can be seen to be in agreement with the arguments of Sect. 3, in particular the 2-dimensional version of (13) when taking into account of the modifications arising from our choice of not to implement dimensional regularization.
Before ending this subsection we comment on a technical detail regarding the divergences generated in the coarse grained state. A general – or certainly a desirable – feature of quantum field theory is the universality of the generated divergences and renormalization. For a quantum field on a de Sitter background, one should be able to absorb all divergences in the redefinition of the cosmological constant, preferably also in the coarse grained state (74). But from (76) and (77) we can see that this does not hold due to the
-dependence of the generated divergences, likely related to coarse graining and the additional divergence when approaching the horizon. We emphasize however that even if in a carefully defined coarse graining divergences
are not generated, the renormalized result would coincide with (81–83), because up to the accuracy we are interested in all the needed counter terms could be derived via adiabatic subtraction, which satisfies \(\delta T_{00}=-\delta T_{ii}/a^2\) [33].
Four dimensions
The calculation in four dimensions proceeds in principle precisely as the 2-dimensional derivation of the previous subsection. The main differences are that the solutions in four dimensions are analytically more involved and that there is only one horizon. Quantization of a scalar field in the static de Sitter patch has been studied in [97,98,99] to which we refer the reader for more details. Quite interestingly, although the line element for a Schwarzschild black hole and de Sitter space in static coordinates are very similar, the latter has an analytic solution for the modes while the former does not.
We begin by writing useful coordinate transformations between the 4-dimensional FLRW coordinates and static coordinates of Sect. 4, in region A
and region B
The tortoise coordinates and light-cone coordinates can be obtained trivially from the 2-dimensional results (51), (52) and (53–56) with the replacements \({x}\rightarrow {r}\) and
.
Due to spherical symmetry and time-independence of the metric (22) in the coordinates (21) we introduce a similar ansatz to the equation of motion (2) as in the spherical from of the FLRW metric in (33)
leading to the radial equation
where we have used \(\xi _4 R = 2H^2\). With a suitable ansatz the above may be reduced to a hypergeometric equation and has the solution in terms of the Gaussian hypergeometric function
where \(D^{\tiny {A}}_{\ell k}\) is a normalization constant. With the help of (84) and (85) we can show that
so inside the horizon in the static coordinates we can use
for defining positive frequency modes and the inner product via (30), which reads
Using (86) and (88) in the above allows one to solve for the normalization constant \(D_{\ell k}\). For details we refer the reader to [98, 99], but here we simply write the result
$$\begin{aligned} \frac{{1}}{{D_{\ell k}^{_{\tiny {A}}}}}&=\frac{{\sqrt{4\pi k}}}{{H}}{\,}_2F_1\bigg [\frac{{\ell }}{{2}} +\frac{{ik}}{{2H}} +\frac{{1}}{{2}},\frac{{\ell }}{{2}} +\frac{{ik}}{{2H}} +{1};\ell +\frac{{3}}{{2}};1\bigg ]\nonumber \\&=\frac{{\sqrt{4\pi k}\Gamma \big [\ell +\frac{{3}}{{2}}\big ]\Gamma \big [\frac{{-ik}}{{H}}\big ]}}{{H\Gamma \big [\frac{{1}}{{2}}\big (\ell - \frac{{ik}}{{H}} +1\big )\big ]\Gamma \big [\frac{{1}}{{2}}\big (\ell - \frac{{ik}}{{H}} +2\big )\big ]}}\,, \end{aligned}$$
(91)
up to factors of modulus one.
Having derived the solutions in the static patch inside the horizon (21), the solutions in coordinates covering the outside of the horizon (23) follow trivially from the fact that the line element and thus the radial equation (87) have identical form. Again however, we must carefully determine the correct normalization for the positive frequency mode beyond the horizon. With (84) and (85) we can show
which, similarly to the 2-dimensional derivation, implies that
plays the role of time. The inner product in the B region then becomes
Since the inner product (93) is independent of the choice of hypersurface we can evaluate it at the limit
. It is then a simple matter of using (91) to show that
in region B is the correctly normalized positive frequency mode provided that
with \(D^{\tiny {B}}_{\ell k}=(D^{\tiny {A}}_{\ell k})^*\).
In the 4-dimensional expanding FLRW patch the scalar field can then be written in the coordinates (21) and (23) as
$$\begin{aligned} \hat{\phi }= \left\{ \begin{array}{l}{\mathop {\sum }\limits _{\ell =0}^\infty }{\mathop {\sum }\limits _{m=-\ell }^\ell }\int _0^\infty dk\Big [\psi _{\ell m k}^{\tiny {A}}\hat{a}^{\tiny { A}}_{\ell m k}+\mathrm{H.C.}\Big ]\qquad \mathrm{Reg. }~A\,,\\ {\mathop {\sum }\limits _{\ell =0}^\infty }{\mathop {\sum }\limits _{m=-\ell }^\ell }\int _0^\infty dk\Big [\psi _{\ell m k}^{\tiny {B}}\hat{a}^{\tiny {B}}_{\ell m k}+\mathrm{H.C.}\Big ]\qquad \mathrm{Reg. }~B\,. \end{array}\right. \end{aligned}$$
(96)
Despite the analytically involved structure of the modes the arguments we used in the 2-dimensional case apply practically identically in four dimensions. This is due to the simplification that occurs when approaching the horizon. At this limit with the help of (91) and the tortoise coordinates (52) we get for the mode in region A
and for the mode in region B
where \(\sim \) denotes an equality up to constant factors of modulus one. Comparing the above to (66) reveals that the discontinuity is precisely of the same form as in two dimensions, leading to two non-trivial linear combinations that are continuous across the horizon
$$\begin{aligned} \psi _{\ell m k}^{\tiny {A}}+\gamma (\psi _{\ell m k}^{\tiny {B}})^*\qquad \text {and}\qquad \gamma (\psi _{\ell m k}^{\tiny {A}})^*+\psi _{\ell m k}^{\tiny {B}}\,, \end{aligned}$$
(99)
where again \(\gamma \equiv e^{-\pi k/H}\). Expressing the scalar field in terms of two representations, as was done in two dimensions in (70), one may reproduce the steps of the 2-dimensional derivation and deduce the thermality of the 4-dimensional coarse grained density matrix
$$\begin{aligned} {\hat{\rho }=\prod _{\ell m k}\big (1-e^{-\frac{{2\pi k}}{{H}} }\big )\sum _{n_{\ell m k}=0}^\infty e^{- \frac{{2\pi k}}{{H}}n_{\ell m k}}|n_{\ell m k},A\rangle \langle A, n_{\ell m k}|\,.} \end{aligned}$$
(100)
The energy density in four dimensions is of course more difficult to write in a clear form than the 2-dimensional result due to the presence of the hypergeometric functions in the mode solution (88). However for our purposes only the limiting case of a being far from the horizon is relevant, for which the static line element (22) coincides with flat space up to small terms
. This and the fact that our theory is conformal imply that up to
the energy density in the coarse grained state (100) should coincide with that of a black-body with \(T=H/(2\pi )\). In appendix A we verify this assertion with an explicit calculation.
Tracelessness of the unrenormalized energy-momentum tensor immediately fixes the ratio of the energy and pressure densities, again dropping the A labels as irrelevant
where the off-diagonal components cancel due to the lack of flux and shear in a static spherically symmetric case. From the results expressed in the static coordinates in (101) we already see a very important outcome from coarse graining the energy-momentum tensor with respect to states beyond the horizon: the sum of the pressure- and energy-density does not vanish, a relation which is satisfied by the ’00’ and ’ii’ components of Einstein tensor in the static coordinates at the centre of the Hubble sphere. This implies that there is non-trivial back reaction since Einstein’s equation with (101) as the source is not solved by the static line element describing de Sitter space (22). We can conclude that coarse graining such that only observable states are left leads to a violation of de Sitter invariance.
Following the 2-dimensional procedure of the previous subsection, the usual tensor transformations with the help of (84) and (85) allow us to write the result in the FLRW form, which we can renormalize by using the 4-dimensional adiabatic counter terms that can be found for example in [36]. The final result is
$$\begin{aligned}&\rho _m=\int \frac{{d^3\mathbf {k}}}{{(2\pi )^3}}\frac{{k}}{{e^{2\pi k/H}-1}}+\frac{{H^4}}{{960\pi ^2}}\,, \end{aligned}$$
(102)
$$\begin{aligned}&p_m=\frac{{1}}{{3}}\int \frac{{d^3\mathbf {k}}}{{(2\pi )^3}}\frac{{k}}{{e^{2\pi k/H}-1}}-\frac{{H^4}}{{960\pi ^2}}\,, \end{aligned}$$
(103)
where \(f_{m,\,i}\equiv T_{i0}/a\) and we have neglected terms of
. So quite naturally, the 4-dimensional result has exactly the same thermal characteristics to the 2-dimensional one in (81–83). Far away from the horizon (102–104) is homogeneous and isotropic with
$$\begin{aligned} {\rho _m+p_m=\frac{{4}}{{3}}\rho ^S_m=\frac{{4}}{{3}}\int \frac{{d^3\mathbf {k}}}{{(2\pi )^3}}\frac{{k}}{{e^{2\pi k/H}-1}}=\frac{{H^4}}{{360\pi ^2}}\,, } \end{aligned}$$
(105)
and again the terms \(\pm \,\frac{{H^4}}{{960\pi ^2}}\) responsible for the conformal anomaly are irrelevant and could have been removed with a redefinition of \(\Lambda \). The above can be seen to be in agreement with the discussion of Sect. 3, specifically the right-hand side of equation (14).
For clarity we summarize the arguments of this section here once more: in de Sitter space as described by the expanding FLRW coordinates (18) initialized to the Bunch–Davies vacuum the energy-momentum of a quantum field has a thermal character when in the density matrix one includes only the observable states inside the horizon. The energy density inside the horizon is maintained at a constant temperature by a continuous flux of radiation incoming from the horizon that precisely cancels the dilution from expansion. This results in \(\rho _m+p_m\ne 0 \), which is independent of the details of renormalization and the conformal anomaly due to the symmetries of the counter terms in de Sitter space. Thus even at the limit when the distance to the horizon is very large and the result is isotropic and homogeneous the sum of the energy and pressure densities does not cancel and because of this the dynamical Friedmann equation (9) then implies that a strictly constant Hubble rate H is not a consistent solution. This is also visible in the result given in the static coordinates (101), which does not solve Einstein’s equation if the background is assumed to be strictly de Sitter.