Comments on Jacobson's"Entanglement equilibrium and the Einstein equation"

Using holographic calculations, we examine a key assumption made in Jacobson's recent argument for deriving Einstein's equations from vacuum entanglement entropy. Our results involving relevant operators with low conformal dimensions seem to conflict with Jacobson's assumption. However, we discuss ways to circumvent this problem.


Introduction and Summary of Results
Entanglement is now recognized to play an important role in the emergence of space (and space-time) in quantum gravity with calculations from a variety of different approaches [1][2][3][4][5][6]. Of course, one of the most natural frameworks where this connection can be investigated is the AdS/CFT correspondence [7]. In particular, the elegant prescription for holographic entanglement entropy [8] reveals a deep connection between entanglement entropy and spacetime geometry [3]. However, this connection was further extended to the dynamics of the spacetime in [9] which related the first law of entanglement in the vacuum of the boundary CFT to the Einstein equations linearized around the AdS vacuum in the bulk. Of course, the latter is reminiscent of Jacobson's derivation of Einstein's equations from thermodynamic arguments involving Rindler horizons in [2]. More recently, Jacobson [10] proposed an intriguing argument in which the full nonlinear Einstein equations arise from the postulate that the vacuum entanglement entropy for small balls is extremal. This argument makes a precise and profound connection between entanglement and gravity. The aim of this paper is to examine a certain key assumption that was needed for this argument to hold. In particular, it was assumed that the variation of the entanglement entropy for quantum fields in small casual diamonds takes a specific form. In the following, we test this assumption using holographic calculations but first let us briefly review Jacobson's argument: Jacobson [10,11] begins by considering the entanglement entropy of a small (spacelike) spherical region R in the vacuum and makes small deformations of the geometry δg ab and of the state of the matter fields δ|ψ . With the assumption that the vacuum entanglement entropy is extremal, the variation of entanglement entropy then vanishes (to leading order) through the cancellation of two contributions: δS total = δS UV + δS IR = 0 . (1.1) Here, δS UV is a universal UV contribution, arising from the change in geometry. Quantum gravity is expected to render this UV contribution finite and produce a result proportional to the change in the boundary area, i.e., δS UV = δA/(4G). The IR contribution δS IR arises from the change in the state of the matter fields. The crucial point for the remaining discussion is the precise form of this IR variation. Quite generally, the first law of entanglement [12] allows the leading contribution to the latter to be written as where H is the modular Hamiltonian for the density matrix ρ produced when the global state is reduced to a given region of interest, i.e., H = − log ρ. However, this result is not particularly useful except in special cases. One such special case arises when the vacuum state of a CFT in flat space is reduced to a spherical region. In this case, the modular Hamiltonian is given by the integral of the energy density T 00 with a simple profile across the spherical region [13] -see eq. (3.2) below. In his argument, Jacobson chooses the radius R of the sphere to be much smaller than any (length) scale in the geometry or in the quantum field theory, but still much larger than Planck scale ℓ P -see section 3 for further discussion. Hence in this small sphere limit, the energy density is essentially constant throughout this region and the first law (1.2) yields where δ T 00 is the change in the energy density in comparison to the vacuum state. Further, d is the spacetime dimension, and Ω d−2 = 2π is the volume of a (d-2)-dimensional unit sphere. Next, evaluating the expression for the change in the boundary area at fixed volume in some maximally symmetric reference geometry, one finds a result involving the time-time component of the Einstein tensor. Combining this result and the above expression (1.3) for δS IR of spheres in all reference frames and positions, one can then derive the Einstein equations (with a cosmological constant) for a CFT coupled to gravity [11], G ab + Λ g ab = 8πG δ T ab . (1.4) One would like to extend the above discussion to the case of a nonconformal quantum field theory. In this general case, Jacobson [10,11] makes the assumption that eq. (1.3) extends to the following form: where X is some scalar operator in the QFT. For example, an initial suggestion was that X = −T a a /d in which case, the expression on the right-hand side of this equation is proportional to (the time-time component of) the traceless part of the stress tensor [10]. In any event, with this assumption (1.5), it is straightforward to generalize the arguments above and derive eq. (1.4) for a general quantum field theory coupled to gravity. Although the spacetime geometry is considered dynamical here, we emphasize that eq. (1.5) is a conjecture for quantum field theories in a fixed flat spacetime. 1 In this paper, we analyze the validity of this key assumption (1.5) within the context of the AdS/CFT correspondence. In particular, we consider a holographic CFT perturbed by a relevant operator O ∆ of scaling dimension ∆. That is, the action of the boundary theory becomes (1. 6) with some (dimensionful) coupling λ. The dual gravitational problem corresponds to solving the Einstein equations coupled to a massive scalar field (and a negative cosmological constant) in d + 1 dimensions. Further, we can compute the entanglement entropy by using the holographic prescription of [8]. Since we are only considering small spherical regions, the holographic entanglement entropy is only probing the asymptotic geometry of the bulk spacetime and we can proceed by only considering the asymptotic behaviour of the bulk fields. The metric perturbation and bulk scalar become vanishingly small as we approach the asymptotic boundary and hence we calculate perturbatively to leading order in the amplitude in the scalar field. In terms of the boundary theory, R d T ab , R ∆ O ∆ and R d−∆ λ are all small and we work to leading order in these (dimensionless) quantities. The details of the calculations will be described below in the next section, but let us present here the main results coming from this analysis. For relevant operators with d 2 < ∆ < d, we find where C T is the central charge appearing in the vacuum correlator of the stress tensor -see eq. (2.18) below. In the deformed boundary theory, various expectation values may be nonvanishing in the vacuum state and hence our notation above indicates, e.g., δ T 00 ≡ T 00 − T 00 vacuum . 2 Now the first line in the above expression matches the desired form given in eq. (1.5). However, we note that it is not the traceless part of the stress tensor that appears here. We have also included an extra contribution in the second line of eq. (1.7), which is proportional to δ O ∆ 2 and, as dictated by dimensional analysis, this term is accompanied by a factor R 2∆ . Now the expectation values, δ T ab and δ O ∆ 2 are determined by infrared scales that are independent of R and hence because d < 2∆ as we take a limit R → 0, the contribution in the second line becomes negligible compared to the contributions involving δ T ab . Hence this result (1.7) agrees with the form of δS IR required in the derivation of Einstein's equations.
Note that the coefficient of the δ T a a in eq. (1.7) is singular in the limit ∆ → d/2. Hence the holographic calculations must be redone for the particular case of ∆ = d/2 -see section 2.3 -with the result is the harmonic number defined by H n = 1 0 dx 1−x n 1−x . Also note that we have a new renormalization scale µ appearing in the logarithmic term. In this case, the dimension of δ O ∆ 2 matches that of the stress-energy tensor and so at this order, all of 2 For clarity, let us add that our notation is that vacuum above and throughout the paper. the contributions appear with the same overall factor of R d . Hence eq. (1.8) almost has the desired form (1.5) except for the appearance of an extra logarithmic dependence on R in the term proportional to δ T a a . Now the unitarity bound in CFTs also allows for scalar operators with conformal dimension (d − 2)/2 < ∆ < d/2. However, this regime requires the so-called alternative quantization of the holographic theory [14], which will subsequently involve an alternative holographic renormalization procedure. Interestingly, this procedure yields the same result as for ∆ > d/2. That is, δS IR is still given by eq. (1.7) in this new regime. However, with ∆ < d/2, the term proportional to R 2∆ becomes the leading contribution in the small R expansion. Hence our holographic results for operators in this regime present a challenge for Jacobson's argument. However, let us emphasize that the contribution proportional to R 2∆ δ O ∆ 2 in eq. (1.7) goes beyond the first law variation (1.2) which was central to Jacobson's reasoning. In section 3, we discuss this point and several ways in which, in principle, this extra contribution can be incorporated into Jacobson's argument so that one could still derive Einstein's equations from an principle of maximal vacuum entanglement.
The remainder of the paper is organized as follows: We begin in section 2 by describing the details of the holographic model and of our holographic entanglement entropy calculations. In subsection 2.1, we compute δS IR for operators with conformal dimension in the regime d/2 < ∆ < d, while subsections 2.2 and 2.3 describe the computations for the regime (d − 2)/2 < ∆ < d/2 and the specific case ∆ = d/2, respectively. In section 3, we conclude with a brief discussion of the implications of our results for the proposal in [10,11]. We have three appendices to discuss certain technical details. Appendix A describes the details of holographic renormalization in the context of the alternate quantization. Appendix B presents a short calculation of the contribution in the shift in the entanglement entropy proportional to T 00 2 for a thermal state in a d = 2 CFT. Appendix C extends the calculation of δS IR to include the next-to-leading order contributions in the coupling λ.

Main Results
In this section, we provide the details of the calculations that produced the results given in eqs. (1.7) and (1.8) above. We start by setting up our holographic model in subsection 2.1. The results in that section are valid for general values of the scaling dimension ∆. However, certain values require extra consideration; this is presented in subsections 2.2 and 2.3 -see also appendix A. In appendix C, we also present calculations to next order in the perturbation parameter and compute subleading contributions in the variation of the entanglement entropy.

Holographic Framework
Our holographic model is comprised of the following action, Here for completeness, we included a cubic term in the potential but this term will be neglected in all of our calculations in the main text and it will only play a role in Appendix C. Of course, if Φ = 0, the metric solution for the vacuum will be pure AdS d+1 with L being the curvature scale. However, in general, our calculations involve examining the Einstein and scalar field equations together and finding solutions where the scalar has a nontrivial profile reflecting the presence of the relevant perturbation (1.6) in the boundary theory. As noted previously, we will only be examining the asymptotic region near the boundary of the bulk spacetime where the metric perturbation and bulk scalar become vanishingly small. This allows us to construct the solutions perturbatively in the amplitude in the scalar field. Near the asymptotic boundary, it is useful to introduce Fefferman-Graham coordinates for the metric, where z is the holographic coordinate with z = 0 corresponding to the boundary. Near the boundary, we can expand g ab (z, x) as where the first term (i.e., η ab ) is chosen for a flat boundary metric. The standard AdS/CFT dictionary relates the expectation value of the boundary stress tensor with the z d coefficient in this expansion, as T ab = d ab . However, for the nonconformal case, we reconsider this expression in section 2.1.1. Similarly, the scalar field has two independent asymptotic solutions, where, as the notation indicates, the exponent ∆ is the conformal dimension of the dual operator and is given by Now, the usual holographic dictionary relates the first coefficient φ 0 to the coupling λ, and φ 1 to the expectation value of the relevant operator O ∆ -the explicit relations will be given below in section 2.1.1. Implicitly with eq. (2.6) and throughout this subsection, we are assuming the relevant operators have d/2 < ∆ < d. As commented before, the cases of (d − 2)/2 < ∆ < d/2 and ∆ = d/2 will be treated separately in subsequent subsections.
In the above discussion, the ansatz has been fairly general so that it could describe expectation values which vary across the boundary spacetime, e.g., O ∆ (x). However, in the problem of interest, we want to probe the boundary theory by examining the entanglement entropy of spheres that are much smaller than the scale of any such variations. Hence for simplicity, we will assume that our holographic background is invariant under translations in both space and time in the following. Hence the metric perturbations g (k) ab in eq. (2.4) and the coefficients φ 0,1 in the scalar (2.5) are constants, and our metric ansatz in eq. (2.3) is simplified with g ab (z, x) reduced to g ab (z). We also impose the boundary condition that g ab (z) → η ab asymptotically to recover the flat boundary metric in eq. (2.4). Note, however, that we are not otherwise restricting the metric perturbations, which will allow us to consider states which are anisotropic and stationary (but not static).
With these choices, the Einstein equations become, e.g., [15,16]: Similarly, the scalar field equation following from eqs. (2.1) and (2.2) becomes Moreover, we note that these equations are redundant. For example, the second equation in eq. (2.7) will automatically be solved if the first set of equations, as well as the scalar field equation (2.8), are solved. However, this equation may still be used as a consistency check for the solutions.
As commented above, we will construct the solutions perturbatively in the amplitude in the scalar field. Hence we introduce a small expansion parameter ε, so that the scalar is written as as z approaches zero -we re-iterate that φ 0 and φ 1 are now simply constants. Solving eq. (2.8) to first order in ε yields the usual solution for ∆ given in eq. (2.6). Next, we solve eq. (2.7) to second order in ε. Here, we formulate the most general ansatz for the metric functions g ab (z) which approach η ab as z → 0, where m ab , a ab , b ab and c ab are all matrices with constant coefficients. The role of the m ab terms will be to introduce additional contributions to the stress-energy tensor which are independent of the conformal perturbation. Implicitly, we have also set L = 1 here. Substituting this ansatz (2.10) into the Einstein equations (2.7), we find the solution , The trace in the last term is made with the boundary metric, i.e., m a a = η ab m ab . Here, we must comment that there is some ambiguity in the metric ansatz (2.10) because both m ab and c ab appear at order z d . In particular, if we shift these two matrices by δm ab = −δc ab φ 0 φ 1 and δc ab , respectively, the metric is left invariant at this order in the ε expansion. In fact, the Einstein equations (2.7) only fix the trace of the z d contribution in eq. (2.10) and hence, in writing the second line of eq. (2.11), we are making a convenient choice for these matrices which simplifies our results in the following.

Holographic Renormalization
The ultimate aim is to express the variation of the entanglement entropy for a spherical region in terms of field theoretic quantities. In this section, we apply the usual holographic renormalization to evaluate various expectation values in the field theory in terms of dual parameters in the gravitational solution.
First, we must evaluate the on-shell bulk action but to regulate the result, we introduce a cut-off surface at z = z ǫ near the boundary. Then, we add a counterterm action that cancels the divergences [17]. Hence the total gravitational action becomes where I bulk is just the action in eq. (2.1) and the two boundary contributions are

13)
In these expressions, γ is the (determinant of the) induced metric on the cut-off surface. In general, the counterterm action will contain extra terms involving boundary curvatures and derivatives of the scalar field, e.g., see [17,18]. However, eq. (2.13) is sufficient for the present purposes. Now we are interested in finding the expectation value of the stress-energy tensor and the operator O ∆ , so we take functional derivatives of the action, i.e., where λ = ε φ 0 . The limit z ǫ → 0 yields the (finite) renormalized expectation values. We performed the above calculations following appendix C of [18] and found (2.17) For convenience, we have also included the trace of the stress tensor above -as before, the trace is performed with the flat boundary metric, i.e., T a a = η ab T ab . Finally, it is useful to consider the central charge C T which appears in the two-point function of the stress tensor [19,20], where the structure of I µν,αβ (x) is completely fixed by conformal invariance. This expression applies in the vacuum state on R d for any general CFT. Holographic calculations of this correlator then yield, e.g., [21] for the boundary CFT dual to eq. (2.1). Recall that we have set the AdS scale L = 1 and hence the last factor implicitly corresponds to (L/ℓ P ) d−1 -in fact, the same factor appears in each of the expectation values above in eqs. (2.15-2.17).

Entanglement Entropy Calculation
We want to compute the holographic entanglement entropy for a spherical boundary region R of radius R. According to the Ryu-Takayanagi prescription [8], this is given by where A(v) is the area of the (d-1)-dimensional bulk surface v and we extremize over all such surfaces which are homologous to R.
In the AdS vacuum, the extremal surface v for a spherical entangling surface is well known [8]. In this case, the bulk metric (2.3) simplifies with g ab (z) = η ab and one can take advantage of the spherical symmetry to consider a bulk profile z(r) where The minimal surface is just the hemisphere: In the present case, we should extremize the area functional with a general profile z(x) in the bulk geometry defined by our general ansatz for g ab (z). However, we are only working to leading order in a perturbative expansion around the vacuum AdS geometry. In this case, one can show that any change in the position of the surface will not contribute to the first order correction in the value of A(v), e.g., see [12]. Hence it is sufficient to compute A(v) with the vacuum profile z 0 (r) but with the perturbed metric functions g ab (z).
In this case, the induced metric on the bulk surface becomes 3 where, as described above, z 2 0 = R 2 − r 2 and z ′ 0 = dz 0 /dr. Further µ i are direction cosines for the spatial coordinates, i.e., x i = rµ i with the normalization µ 2 i = 1, e.g., see [22]. Eq. (2.10) gives the background metric but for simplicity, we combine the various metric perturbations as Then to leading order, the area functional becomes . Further, it is convenient to replace the radial integral with an integration over the bulk coordinate z, which then yields where r 2 0 = R 2 − z 2 . Note that implicitly both eqs. (2.23) and (2.24) require a UV regulator because various terms in the radial integral diverge as z → 0.
We note that gravity is absent in the boundary theory and so our holographic entanglement entropy calculations are only evaluating the variation due to matter fields in a fixed background geometry, i.e., we are calculating δS IR in eq. (1.1). In particular, we wish to evaluate δS = S 1 −S 0 , where S 1 corresponds to the holographic entanglement entropy with the full perturbed metric g ab (z) as in eq. (2.3) and S 0 corresponds to that when the expectation values (2.15-2.17) have some 'vacuum' values. In the latter case, the metric is given by Note that δS is finite as all UV divergences in the entanglement entropy correspond to the vacuum divergences and they are cancelled in the difference S 1 − S 0 . Therefore we may ignore the UV cut-off for the radial integral mentioned below eq. (2.24). Substituting the expressions forĝ ij (z) in eq. (2.10) into eq. (2.24), as well as r 2 0 = R 2 − z 2 , the difference leaves the following expression at order ε 2 where as above, we have introduced the notation δX ≡ X − X vac . Finally after making the z integration, we obtain where we have also used the tracelessness of m ab in eq. (2.11) to replace δ ij δm ij = δm 00 .
Finally, it is easy to express eq. (2.27) in terms of field theory quantities using eqs. (2.15-2.19). Hence, we can see to order ε 2 , As noted above, the calculations described in section 2.1.1 are valid for operators with d/2 < ∆ < d. Hence our translation of eq. (2.27) to eq. (2.28) should only be accepted as valid for this particular range of conformal dimensions. In this case, the contribution proportional to δ O ∆ 2 R 2∆ is negligible in the limit R → 0. The remaining contributions in the first line then agree with the desired form (1.5) required for Jacobson's construction.

Alternate quantization
The holographic result presented in eq. (2.27) is valid for any generic value of the scaling dimension ∆. That is, eq. (2.27) is not only valid for d/2 < ∆ < d but also for (d − 2)/2 < ∆ < d/2, where ∆ = (d − 2)/2 is the limit set by unitarity constraints. 4 However, as noted above, our translation of eq. (2.27) to field theory quantities in eq. (2.28) has only been justified for the first range of ∆. In the regime (d − 2)/2 < ∆ < d/2, we need a different holographic renormalization procedure, that goes under the name of alternative quantization [14]. This is because for ∆ < d/2, we cannot longer have the usual relation between ∆ and the mass of the scalar field as in eq. (2.6). Instead, we need Note that with this choice the roles of the normalizable and non-normalizable modes will be interchanged. That is, the scalar field still has the asymptotic expansion in (2.9), but now the mode corresponding to φ 0 , which is still dual to the coupling λ, actually decays more rapidly in the limit z → 0. Even though the need of an alternate quantization approach was pointed out in an early discussion [14] of the AdS/CFT correspondence, the computation of renormalized one-point functions using this approach appears not to have been carried in detail. Interesting ideas on how to construct a well-defined action in this regime and beyond the unitary bound were analyzed in [23,24]. In this section, we comment on how this procedure is performed and will present the main results coming from the alternate holographic renormalization. We leave the details and discussion of this method to Appendix A -see also [25].
By requiring that the expectation values are finite and that the Ward identities hold, we find the following unique regulated action for our case, where the first two terms are the standard contributions given in eqs. (2.1) and (2.13) and the last two boundary terms are given by Note that in the counterterm action (2.31), the coefficient of Φ 2 is modified here compared to the expression in eq. (2.13). With this choice, the sum I bulk + I GHBY + I ct is finite, however, it does not satisfy the standard Ward identities, i.e., the conformal and diffeomorphism Ward identities. The latter are restored by the addition of I Legendre . In this last expression,n is the unit normal outgoing vector to the surface with z = z ǫ . Hence this term does not have the usual form of the counterterms considered in the holographic renormalization procedure. Instead, it should be considered as the term required for the Legendre transformation between the effective action and the generating functional [14] -see further discussion in Appendix A.
Using the above action (2.30), we find that the renormalized expectation values for (d − 2)/2 < ∆ < d/2 take exactly the same form given in eqs. (2.15-2.17) for d/2 < ∆ < d. Hence, the translation of eq. (2.27) to field theory quantities is unchanged and δS is given by precisely the same expression as before, namely, eq. (2.28). However, in this regime with ∆ < d/2, the contribution proportional to δ O ∆ 2 R 2∆ dominates over the R d terms involving the stress tensor. Therefore, these results do not agree with the desired form (1.5) for δS IR and they seem to present a challenge for Jacobson's derivation of Einstein's equations [10,11]. We will return to discuss this point in section 3.

∆ = d/2
It is quite clear that the expression for δS in eq. (1.7) does not apply for ∆ = d/2. In particular, the coefficient of δ T a a diverges with ∆ → 0. From the holographic perspective, the problem arises because the asymptotic expansion of the bulk scalar in eq. (2.9) contains a single power of z for both φ 1 and φ 0 when ∆ = d/2. Of course, the correct expansion takes the form to leading order in ε. Note that we need to introduce an additional scale µ (with units of mass) to make the argument of the logarithm dimensionless. While this scale is rather arbitrary in the asymptotic expansion here, we can expect that it would be determined by infrared physics if we had a complete model of the holographic RG flow. Note that making a different choice of µ will change the value of φ 1 , which is still dual to O ∆ -see holographic renormalization below. The ansatz for the metric needs to change accordingly, where m ab , b ab , c ab and d ab are matrices with constants to be determined by the Einstein's equations. This ansatz is motivated by the form of the scalar field expansion (2.33), i.e., every factor φ 0 is accompanied by a log(µz). Now we solve eqs. (2.7-2.8) order by order in powers of z (and log(µz)) with m 2 L 2 = −d 2 /4 (and κ = 0). Here we use a redundancy in the parameterization of the metric function in eq. (2.34) to simplify the following results. We obtain with the trace of the matrix m ab given by m a a = φ 2 0 2d(d−1) . Next step is to compute the area functional with the given metric to get S 1 and the same with vacuum expectation values for S 0 . As in section 2.1.2, at order ε 2 , we continue to use the hemisphere found in the AdS vacuum as the extremal surface and we just need to evaluate the area functional on this surface with the new metric. Then expanding the integrand in eq. (2.23) to order ε 2 and considering the desired difference of entropies, we find, after doing the angular integrals, The remaining radial integral is finite and so there is no need to introduce a UV cut-off. Moreover, the integration can be performed analytically yielding Note that there is an additional term proportional to R d log(µR), which comes entirely from the logarithmic term already present in ∆s.
Next step is to write the expression in terms of the quantities in the boundary field theory. For that, we would like to obtain the renormalized expectation value for both the stress tensor and the operator of dimension ∆ = d/2. For this special case, we will need to introduce extra counterterms, e.g., as shown in [26]. To eliminate the extra divergences arising from the logarithmic expansion of the scalar field, the counterterm action becomes This boundary term removes all of the divergences in the expectation values to order ε 2 . Note that the logarithmic term will also introduce some renormalization ambiguities in the definition of the expectation values, e.g., see [26]. However, as we are interested in differences of expectation values, this will not be an issue here since the ambiguities cancel out in the subtraction. Following a process analogous to that described in section 2.1.1 but now with the above counterterm action, we obtain (2.42) Given these results, as well as eq. (2.19), we can now write the variation in the entanglement entropy as Hence in the small R expansion, the leading term is proportional to R d log(µR) and the desired R d contribution is actually a subdominant contribution. Hence, we again find that our holographic results here are in conflict with the form assumed in eq. (1.5).
We will return to discuss this point in section 3.

Massive Dirac Fermions in d = 2
It is interesting that with precisely ∆ = d/2, δS acquires a logarithmic term in our holographic calculation above. It is important to understand the appearance of this term is special to such a holographic framework or if similar terms arise with general CFT's. In the case of the free massive Dirac fermion in d = 2, the modular Hamiltonian is known exactly and so can be computed perturbatively for small mass [27,28]. In this case, the mass operator has dimension ∆ = d − 1 = 1 = d/2 and so it is possible to check whether the logarithmic behaviour is also present in that context. We will expand the modular Hamiltonian of a Dirac field with mass m on an interval of size 2R in d = 2 for small mR. The modular Hamiltonian for one interval is (2.44) The kernel H(x, y) is given by where C(x, y) = 0|ψ(x)ψ † (y)|0 is the correlator kernel in the interval. Expanding C to first order in the mass we have Here γ 0 , γ 1 are the Dirac matrices and γ 3 = γ 0 γ 1 . The zeroth order calculation gives the expected conformal result [27] -see also eq. (3.2) below: To compute the first order in the mass, since we are interested in the small size limit of the expectation values of H, we can replace where we have used that the first order contribution is proportional to the Dirac γ 0 matrix. That is, we only need the kernel H 1 integrated in the interval. A more detailed calculation of local and non-local terms in H 1 will be presented in [29]. We insert the second term of (2.47) into (2.45), do the integral in β in (2.45), and use L 0 dx ψ s (x) = π 2 to do the integrals in x, y in (2.54). We find The constant term in the brackets can be integrated using (2.55). The integral of the logarithmic term can be obtained passing to the variables u = s + s ′ , v = s − s ′ and doing the integrals over u, v first, and then the integrals over x, y. We finally get Together with the leading term (2.53), this gives where we have used T 00 = (i/2)Ψ † γ 3 ↔ ∂ x Ψ + mΨΨ and T a a = mΨΨ. Of course, γ E denotes the Euler-Macheroni constant, i.e., γ E ≃ 0.5772157. Now if consider applying the first law of entanglement (1.2), we arrive at which exactly coincides with the holographic result (1.8), except for the constant coefficient multiplying T a a . 6 In order to properly compare this coefficient, a physical choice for the mass scale µ in (1.8) must first be fixed in a more complete holographic model. In any event, we may conclude that the logarithmic contribution is not an artifact of the holographic calculations.

Discussion
Jacobson's derivation [10,11] of Einstein's equations makes a precise connection between entanglement and gravity. However, his argument relies on two key assumptions: The first is that the entanglement entropy for the vacuum reduced to a small ball is maximal for variations holding the volume fixed. We have nothing to add on this point in this paper and will simply accept this postulate in the following discussion. The second assumption is that the variation of the entanglement entropy coming from variation of the matter fields takes a certain form given in eq. (1.5). This is the assumption that we examined in detail here for a class of holographic models. In particular, our holographic calculations evaluated the variation of the entanglement entropy for small spheres where the boundary theory is deformed by a relevant operator O ∆ .
Let us begin with two technical comments on our results in eqs. (1.7) and (1.8): First, the variation of the entropy δS IR is only a scalar quantity but ref. [10] is deriving the full tensor comprising Einstein's equations (1.4). Hence it is important that the variation has the form δS IR = Y abt atb where Y ab is some symmetric tensor andt a is the unit time-like vector orthogonal to the Cauchy slice containing the spherical region for which we are evaluating the entanglement entropy. Since our analysis was done for a general state (i.e., state which is anisotropic and stationary), it is straightforward to verify that our holographic results for δS IR take this form. In particular, we can boost any given background to a new reference frame while leaving the entangling sphere fixed, and the form of our results is unchanged. That is, the first contribution in either eq. (1.7) or (1.8) is proportional to δ T 00 in the new frame and hence corresponds to a term proportional to δ T ab in Y ab . Similarly, the contributions proportional to δ T a a and δ O ∆ 2 are left unchanged and so we can interpret these two terms as appearing in Y ab with a factor of −g ab . 6 Above, we have the coefficient 1 + α + γ E ≃ −2.95 for the free fermion, while substituting d = 2 into the holographic result (1.8), the corresponding constant is 2 3 − 1 2 H 1/2 ≃ −0.0323.
Our second technical comment has to do with the factor of 1/C T appearing in the contributions proportional to δ O ∆ 2 . Naively, one may think that this factor indicates that this term is suppressed relative to the others because our holographic framework requires that we are working with a large central charge, e.g., this is the usual large N limit for holographic gauge theories. However, our normalization is such that the expectation values of any (single-trace) operators are themselves proportional to C T , e.g., as revealed by the factor of 1/ℓ d−1 P appearing in eqs. (2.15-2.17). Hence this factor of 1/C T ensures that all three contributions in eqs. (1.7) and (1.8) are contributing at the same order in this regard, i.e., they are all proportional to C T . A short calculation in Appendix B of δS IR for a thermal state in a general d = 2 CFT shows that the appearance of such factors is natural in evaluating entanglement entropy for CFTs, even beyond holography.
Our results in eqs. (1.7) and (1.8) found the appearance of a contribution to δS IR proportional to R 2∆ δ O ∆ 2 , where the power of R is completely fixed on dimensional grounds. Such a contribution seems problematic in the regime (d − 2)/2 < ∆ < d/2 since this term decays more slowly than the desired R d terms in the limit R → 0. 7 Hence it is natural to think that δS IR would be dominated by this contribution and so Jacobson's argument, which relies on the form (1.5), would be invalid. One obvious resolution of this problem would be if there were no such operators in the UV fixed point theory that describes the matter fields in our universe. Again, the unitarity bound for general CFT's allows (scalar) operators with conformal dimensions in this problematic regime. 8 Hence this requirement would be a restriction on the spectrum of operators appearing in the matter sector of a theory described Einstein gravity.
Let us also observe that the expectation values appearing in eq. (1.7) are all set by infrared physics scales which are independent of the size of the sphere R. Now in general, we would have Of course, we are considering the regime where all of these energy scales are much smaller than that set by the radius of the sphere, i.e., µ O,0,T ≪ 1/R. However, in arriving at the conclusion that the δ O ∆ 2 contribution creates a problem, we are implicitly assuming that these scales are all roughly the same, i.e., µ O ≃ µ 0 ≃ µ T ≃ µ, so that the different contributions in eq. (1.7) can be compared with powers of the same dimensionless product µR. However, in general, there is no need for these scales to be 7 In Appendix C, we also identified a contribution proportional to R 3∆ δ O ∆ 3 , which also becomes problematic for d−2 2 < ∆ < d 3 with d < 6. However, this contribution will always be subdominant compared to the term discussed above in the main text. 8 In fact, explicit examples are known in certain supersymmetric gauge theories, e.g., see [14].
the same. In particular, one can imagine that there will be broad families of states where µ O ≪ µ 0,T . Then even if there are operators with ∆ < d/2, one may still have (µ O R) 2∆ ≪ (µ 0,T R) d ≪ 1 for small but finite R in a broad class of states. In this case, gravity would be properly described by Einstein's equations in this family of states but it raises the intriguing possibility that this description would breakdown in other 'low entropy' states. 9 We return to discussing this possibility later in this section. We also considered the special case of ∆ = d/2 for which δS IR is given by eq. (1.8). In this case, the term proportional to δ O ∆ 2 carries a factor of R d and so this contribution can simply be absorbed into the X term in eq. (1.5). However, there is a additional contribution proportional to δ T a a with a factor of R d log(µR). In fact, in section 2.3.1, we confirmed the appearance of this extra logarithmic dependence beyond the framework of holography. There, we found that that the same term appears for a free Dirac fermion in two dimensions, for which the entanglement Hamiltonian is explicitly known [27,28]. When the theory is perturbed by a small mass, i.e., mR ≪ 1, it is quite remarkable that the same logarithmic term appears in eq. (2.59) with the precisely same coefficient as in the holographic result (1.8).
The extra logarithm gives an enhanced, although only mildly enhanced, dependence on the radius R so that the appearance of this term in δS IR is again problematic for Jacobson's construction. Since the appearance of this term requires a precise value for the conformal dimension, i.e., ∆ = d/2, it may seem more reasonable to require no such operators appear in the UV fixed point theory of the matter fields. Of course, in four dimensions, a mass term for a free scalar field would be a canonical example of such a term. However, one should expect that unless the scalar is completely free that even weak interactions will induce a small anomalous dimension and hence eliminate the appearance of this problematic contribution to δS IR .
We should emphasize that, apart from section 2.3.1, the calculations here are limited to holographic theories with an Einstein gravity dual. Hence there are the usual caveats that the corresponding CFTs should have a large central charge, be strongly coupled and have a sparse spectrum. A priori, it is not clear how universal the results obtained here would be for more general theories. However, it seems that in fact our calculations may extend to generic CFTs following the approach of [30]. The latter reference argued that when a generic CFT is deformed as in eq. (1.6), δS IR is completely determined by universal two-and three-point correlators in the CFT and further that the result could be evaluated by recasting it into the form of a holographic gravity calculation similar to those presented here. Note, however, that ref. [30] evaluated the change in the entanglement entropy between the CFT vacuum and the vacuum of the deformed theory. Hence it remains to consider excited states in the deformed theorysee [31]. However, the primary challenge is to extend these calculations to conformal dimensions in the regime found to be of most interest here, i.e., d−2 2 < ∆ ≤ d 2 .

Modular Hamiltonians
As described in the introduction, an important contribution to the variation of the entropy comes from the first law of entanglement (1.2). In the present discussion, examining the expectation value H for small spheres is related to examining what would be commonly referred to as the 'operator product expansion' (OPE) of the modular Hamiltonian. That is, H would be given by some complex and generally nonlocal expression involving collections of operators restricted in a finite region, i.e., the sphere of radius R. However, if it is only examined with long wavelength probes, we can effectively approximate H by a sum of local operators. 10 In fact, reducing the flat space vacuum of a CFT to a sphere of radius R yields a remarkably simple expression for the modular Hamiltonian [13] where the constant c ′ is fixed by demanding that the corresponding density matrix is normalized with unit trace. Hence in this case, the OPE only involves the energy density and its derivatives, but of course, the derivative terms are accompanied higher powers of R, i.e., Hence in these additional terms are higher order contributions in the limit of small R, which are then negligible for the purposes of Jacobson's argument. Let us add that Jacobson's general argument compares a given state to the vacuum in a maximally symmetric background, i.e., Minkowski space, de Sitter space or anti-de Sitter space. This approach allows him to accommodate the possibility of a cosmological constant, as well as the scalar contribution X from nonconformal matter fields in eq. (1.5). One can easily extend the construction of [13] to evaluate the modular Hamiltonian for a CFT in the dS or AdS backgrounds. For example, let us consider the static patch of dS space, i.e., The interested reader can find more detailed considerations of the OPE for Wilson lines and surface operators in gauge theories in [32] and of twist operators in higher dimensional CFTs in [21].
For a spherical region of radius R placed at the origin in the above coordinates, the modular Hamiltonian becomes Of course, the modular Hamiltonian (3.5) is still given by a local integral of the energy density alone. We are interested in the regime where the radius of the sphere R is much smaller than the dS curvature scale L. 11 Then the leading curvature correction in the OPE expansion is That is, the modifications due to the curvature scale in the first law (1.2) are suppressed by powers of R/L ≪ 1 and make a negligible contribution in Jacobson's construction. We expect that curvature contributions will again be suppressed in a similar way for the case of a deformed CFT. Given eq. (3.2), one may conclude that for a CFT, the contributions proportional to O ∆ 2 appearing in eqs. (1.7) and (1.8) take us beyond the first law of entanglement (1.2). The coupling λ does not explicitly appear in these terms and so they would also appear for excited states of the CFT, i.e., even when λ = 0. However, as noted above, all of the contributions from the modular Hamiltonian in a CFT will only involve the energy density and its derivatives. Hence the O ∆ 2 contribution cannot be contained in the expression for δ H on the right-hand side of eq. (1.2). In this sense, these must be 'higher order' contributions to δS IR that go beyond the first law. At this point, we should stress that Jacobson's arguments only considered first law contributions to δS IR -we return to this point in the discussion below. Further, we observe that in the case of a CFT, we have O ∆ vacuum = 0 and so Hence the sign of the corresponding coefficient in eqs. (1.7) and (1.8) is such that δS IR ≤ δ H as required by the positivity of relative entropy, e.g., [12]. A similar discussion applies for a deformed CFT, where we may expect that O ∆ vacuum = 0. In this case, we may write . Now the second term on the right-hand side is linear in the deviation of O ∆ away from the deformed vacuum. Hence this contribution must come from the variation of the expectation value of the modular Hamiltonian (of the deformed CFT) in the first law. 12 Note that this second term above can have either sign. In contrast, the first term in eq. (3.8) is positive, being quadratic in the deviation from the deformed vacuum. Hence this contribution goes beyond the first law again and gives the leading term in the relative entropy with δ H − δS ≥ 0. Further, let us add that the contribution to δS IR proportional to the trace of the stress tensor δ T a a also comes from the modular Hamiltonian, since again it is linear in the deviation of the expectation value from that in the deformed vacuum. Our assertion is also supported by results in section 2.3.1 for massive fermions in two dimensions, which only considers the first law contribution. Further support comes from the results in [30], which indicate that simply replacing T 00 by T 00 − g 00 T a a /(2∆ − d) in eq. (3.2) yields the modular Hamiltonian of the deformed CFT to leading order in a λ expansion. 13 This result would apply for d/2 < ∆ < d in generic CFTs and would yield precisely the first line of our result for δS IR in eq. (1.7). Note however that this simple expression for the modular Hamiltonian is singular for ∆ = d/2 and so will not apply for this value of the conformal dimension. Our d = 2 fermion calculations in section 2.3.1, which apply to this case, emphasize that the local expressions such as those in eq. (1.7) or (1.8) emerge from taking the OPE limit of the modular Hamiltonian. That is, for the massive free fermions, the modular Hamiltonian is a nonlocal expression in general, even to leading order in the mass deformation.
In general then, the OPE limit will yield δS IR as a sum of expectation values of local operators, plus expectation values of local operators squared and higher powers. As in eq. (3.3), the coefficients of higher dimension operators or higher powers of expectation values will include higher powers of the radius R and so these tend to give subleading terms. This structure will be the completely general independent of holography or the particular details of the theory under study. Moreover, one may ask which of the contributions in this expansion are determined by the modular Hamiltonian, i.e., the first law (1.2), and which are not? In general, the expectation value of modular Hamiltonian will yield contributions which are linear in the excitation of the expectation value of any operator above its vacuum expectation value (e.g., δ T 00 , δ T a a or δ O ∆ ). Any contributions which are not linear in such expectation values extend δS IR beyond the first law.

Four Roads to Quantum Gravity
At this point, we would like to assess the implications of our results for Jacobson's derivation of Einstein's equations [10,11]. It seems that there are at least four different possible interpretations: 14 The first and most straightforward seems to be that the derivation only applies to linear variations around the vacuum. That is, Jacobson's original argument only considered variations of the entanglement entropy consistent with the first law (1.2), whereas we showed that the problematic term proportional to R 2∆ δ O ∆ 2 (with ∆ < d/2) in eq. (1.7) extends δS IR beyond this range. Hence, the correct interpretation of Jacobson's derivation may be to restrict attention to linear variations about the vacuum. As pointed out in [10], this interpretation already seems to be necessary when considering coherent states, which can have a finite energy density while leaving the entanglement entropy unchanged [33,34]. 15 Unfortunately, in this case, the derivation would only reveal the linearized Einstein equations, similar to the holographic analysis in [9]. Moreover, it would be useful to investigate more thoroughly (i.e., beyond holography) the possibility that low dimension terms may appear in the OPE expansion of the modular Hamiltonian to test the validity of this interpretation [29].
Above, we already suggested an alternate resolution of this issue. Namely, no problematic second order terms would arise if there were no ∆ < d/2 operators in the UV fixed point theory that describes the matter fields in our universe. That is, the spectrum of operators appearing in the ultraviolet would be restricted in order for the infrared theory to be described by Einstein gravity. In fact, the first approach would still require such a restriction to avoid the logarithmic contribution which appears in the first law when ∆ = d/2, as shown in eq. (1.8). Ruling out only operators with precisely ∆ = d/2 in the UV fixed point theory would seem to be a more mild restriction.
A third proposal (by Ted Jacobson) is that the contributions proportional to R 2∆ δ O ∆ 2 could be absorbed in a manner analogous to the treatment of δ X in eq. (1.5). 16 In the latter case, the curvature of the maximally symmetric reference background is chosen locally to absorb this scalar expectation value, i.e., If we extend this choice to allow dependence on both the state and the size of the sphere, then settingΛ will absorb the second order term and produce the expected Einstein equations (1.4). Here, the (positive) constant k ∆ is given by the ratio of the numerical coefficients of δ O ∆ 2 and δ T 00 in eq. (1.7). This approach has the uncomfortable feature thatΛ grows arbitrarily large as R → 0 (when ∆ < d/2). Note, however, that C T (µ O R) 2∆ (ℓ P /R) d−2 ≪ 1 ensures that the radius of curvature set byΛ is still much larger than the radius of the sphere R. 17 This interpretation would then to allow for the small but finite variations and the derivation of the full nonlinear Einstein equations. However, this approach still calls for a better understanding to the role of the maximally symmetric reference geometry in Jacobson's construction.
A last more speculative possibility is that Jacobson's derivation indicates that particular class of states which 'gravitate' according to Einstein's equations. Recall that we suggested that the R 2∆ δ O ∆ 2 contribution would not be a problem, even with ∆ < d/2, as long as µ O ≪ µ 0,T , where these are the scales characterizing the various expectation values in eq. (3.1). Hence, Jacobson's derivation still carries through and gravity would still be described by Einstein's equations for the family of states. However, this also raises the intriguing prospect that Einstein's equations will 'fail' for low entropy states, where µ O µ 0,T . The idea that Einstein's equations break down for highly excited states is not a surprising one. For example, when the stress energy reaches the Planck scale, i.e., µ 0,T ∼ 1/ℓ P , in the early universe, one no longer expects that the cosmological evolution is described by Einstein's equations. However, the interesting feature of the breakdown anticipated here is that none of the scales involved need to be Planckian. Instead, one can anticipate a failure of Einstein's equations even when the energy density is much much less than the Planck scale.
If we naively carry Jacobson's derivation ahead with eq. (1.7) (and use eq. (3.9) to choose the reference geometry), the gravitational equation becomes where as in eq. (3.10), k ∆ corresponds to the ratio of the coefficients of δ O ∆ 2 and δ T 00 in eq. (1.7). Hence the new δ O ∆ 2 term might be seen as an additional contribution to the cosmological constant term in eq. (3.11). However, to properly interpret 17 Here ℓ d−2 P ≡ 8πG (rather than the Planck scale in the holographic theory). Also recall that µ O was defined in eq. (3.1). Of course, Jacobson's derivation requires both µ O R ≪ 1 and ℓ P /R ≪ 1. this equation, it remains to understand the role of R. Of course, R was the radius of the spherical entangling surface but this is simply an auxiliary scale in Jacobson's derivation that disappears from the final Einstein equation (1.4). One suggestion would be that R can be regarded roughly as a renormalization scale in eq. (3.11). That is, we should think of eq. (3.11) as the appropriate gravitational equation when we are probing the spacetime on (length) scales of the order of R. Of course, this interpretation seems incomplete since we cannot choose R arbitrarily rather the construction requires µ O,0,T ≪ 1/R.
Further, let us comment that the sign of the coefficient k ∆ is fixed to be positive by the positivity of relative entropy, as commented below eq. (3.7). As further noted there, δ O ∆ 2 is also guaranteed to be positive at a conformal fixed point and so the new contribution in eq. (3.11) would resemble to a negative energy density. Beyond a conformal fixed point, the sign of δ O ∆ 2 might have either sign. Hence if we apply this interpretation of Jacobson's derivation, it seems that we are naturally led to conclude that Einstein's equations may 'fail' for certain classes of low energy states. It will be interesting to better understand how gravity is modified in these states and the implications of these modifications. In particular, they may have important consequences for our understanding of the cosmological constant problem, early universe cosmology and perhaps warp drive engineering [35].

A Alternate Holographic Renormalization
The aim of this Appendix is to show how to perform holographic renormalization for operators in the range (d−2)/2 < ∆ < d/2. In particular, we are interested in obtaining renormalized one-point functions of the operator and the stress energy tensor which: (a) are UV finite and (b) satisfy both the conformal and the diffeomorphism Ward identities. In the usual regime with ∆ > d/2, the well-studied procedure of holographic renormalization gives the desired expectation values that satisfy both Ward identities. However, the extension to cover the ∆ < d/2 case is not straightforward and here we present a consistent way of doing so. For that, we will merge the ideas that first appeared in [14] with the modern approach of holographic renormalization. We also consider a slightly different approach considered in [24] to derive the same action.
In [14], it was proposed that the generating functional of the theory in which ∆ is given by eq. (2.29) is in fact the Legendre transformation of the one with ∆ given by eq. (2.6). This would suggest that in the holographic renormalization procedure we should add a term that would play the role of this Legendre transformation. In fact, this will be a boundary term of the form Φ (n·∇) Φ, wheren is the unit normal vector to the surface with fixed z = z ǫ . So the most general action that includes all possible counterterms that are relevant to our case 18 , and also includes this Legendre transformation term is where the first two terms are the standard contributions given in eqs. (2.1) and (2.13) and the last two boundary terms are given by Here, γ is the determinant of the induced metric in the surface z = z ǫ , and A and B are numerical constants to be determined in order to satisfy all our requirements. In the counterterm action, we have already fixed the coefficient of Φ 2 so that the sum I bulk + I GHBY + I ct is finite and yields finite expectation values. 19 However, these expressions do not satisfy the desired Ward identities.
Hence our goal will now be to determine A and B so that it is possible to get finite expectation values and satisfy both Ward identities. Note that we only have two free parameters to satisfy three (or four) different requirements. Getting finite expectation values for O ∆ and T ab will give B = B(A). Satisfying the trace Ward identity will then fix A. If this procedure is consistent, and we will show it is, then we should be able to automatically satisfy the diffeomorphism Ward identity, that is independent from the other. Checking that the second Ward identity is satisfied then provides a highly nontrivial check of our results. At this point, let us add that in fact, the finite part of this resulting (finite) term (A.3) yields Hence we have produce exactly the expression required for the Legendre transform in the boundary theory -more details on this procedure are given in [25]. So the first step is to compute the expectation values for the stress tensor and the operator and set B as a function of A by requiring them to be finite. Note that in this regime we will need to cancel terms proportional to z −d+2∆ ǫ , whereas in the usual holographic renormalization approach the counterterms are set to cancel the terms proportional to z d−2∆ ǫ . An important point in our calculation is that even though we are mostly treating φ 1 and φ 0 as independent in this perturbative analysis, in a more general setup φ 1 , proportional to the expectation value of O ∆ , will be a function of φ 0 , the coupling. Then, if we want to compute the variation of some element X with respect to the source λ we should consider both contributions, In the case of the usual holographic renormalization, the second term will be negligible, but will be important in this case. In fact, we will be assuming that φ 1 is just proportional to φ 0 , i.e., φ 1 = k 1 φ 0 . This assumption is reasonable, at least at this order in the ε expansion, as any higher order terms will include additional powers of ε. In any event, even though this will give contributions to the different parts of the action, the final result will not depend on this assumption. Now we need to take variations with respect to the different terms in the regulated action. For that, it would be useful to consider, To compute the variations we will be closely following Appendix C in [18], fitting our formulas to the present simpler case where the scalar field does not have any dependence on the boundary spacetime coordinates. For instance, The variations of the Legendre action are also straightforward, i.e., the variation with respect to the induced metric is analogue to eq. (A.10), but with the term B Φ (n·∇) Φ. Then we need to recall eq. (A.5) when taking variations with respect to the source λ. Now we can compute the expectation values of interest. The first requirement we want to impose is to have finite expectation values, so we will concentrate in the divergent terms, i.e., again, in this regime, these are the ones proportional to z −d+2∆ ǫ .
In the computation of the energy density, T 00 , the divergent terms read as Note that B = 0 implies A = 0, which, as mentioned, will give a finite I reg . We might also consider choosing A = −∆/2 with which the term proportional to Φ 2 in the Legendre action will cancel that in the counterterm action. This would produce a boundary term (with B = 1/2) resembling that appearing in the original discussion of [14]. However, we will find that neither of these choices yields the desired Ward identities. It is important to mention that by satisfying eq. (A.13), the expectation values of the spatial components of the stress tensor, as well as of O ∆ , also become finite.
Next step is to determine the coefficient A by requiring that the conformal Ward identity is satisfied. Given the relation, between A and B, the finite expectation values for the operator and the trace of the stress energy tensor turn out to be, The conformal Ward identity reads where λ = εφ 0 . Comparing these expressions, we conclude that Substituting these values back into eq. (A.3), we obtain the regulated action that appears in eqs. (2.30-2.32) in the main text. Of course, by construction, it is finite and satisfies the conformal Ward identity. Still, the inclusion of the Legendre term is novel, so we would like to check whether with this action, we also satisfy the diffeomorphism Ward identity, Of course this is trivial to satisfy in the case where neither the stress energy tensor nor the coupling have any space-time dependence. But we are looking for a nontrivial check.
For that, we will assume now that all our coefficients have some temporal dependence. Of course, in general this can set up a much more complicated problem such as the one considered in [18]. However, working at leading order in the perturbation parameter ε, we will have some simplifications. In particular, we will assume that to leading order the expectation values won't be modified with the exception that φ 0 , φ 1 and m 00 will be now functions of time. Then, we have, 19) and the Ward identity, as there is only time dependence, simplifies to Now, it is obvious that the naive evaluation of the identity will not be satisfied, as the LHS will contain a term proportional to m ′ 00 (t) and the RHS will not. 20 However, we have an extra constraint coming from the Einstein's equations. By adding time dependence, there will be a new nontrivial zt equation, R zt − 1 2 ∂ t Φ∂ z Φ = 0, which determines m ′ 00 (t) in terms of φ ′ 0 (t) and φ ′ 1 (t). Computing that extra equation we obtain, Now it is now straightforward to show that the diffeomorphism Ward identity (A. 20) is perfectly satisfied if we impose this condition. The fact that we obtain a consistent Ward identity given our choice of A and B is highly nontrivial and provides an argument in favour of this alternate holographic renormalization procedure. After the whole process we obtained an action that provides finite expectation values that satisfy both Ward identities in the regime where (d − 2)/2 < ∆ < d/2. As far as our knowledge goes, this was not calculated before in the literature and it would be nice to have further checks of this proposal. For example, we can test that this generates the correct thermodynamics when we analyze the full black hole case [25]. As for now, this shows that holography can deal with this kind of cases. Moreover, as detailed in the main text, this provides a confirmation that in this regime, the leading term in the entanglement entropy expansion for small spheres is not simply related to the stress energy tensor alone but rather involves δ O ∆ 2 . Finally, let us comment on how the same action was derived in [24] without asking that the Ward identities be satisfied. Instead, the two requirements to fix A and B were finiteness of the on-shell action and stationarity of the action with either Dirichlet or Neumann boundary conditions. The scalar field part of the action is, Now, varying the bulk action by parts (with respect to Φ) one gets equations on motion plus a boundary term. Then we impose the usual boundary behaviour to the scalar field, i.e., Φ(z) = εφ 0 z d−∆ + εφ 1 z d .
By carefully analyzing the finite part of the action, we arrive to a variation that reads, and hence eq. (B.3) becomes The first term matches precisely the expected first law contribution given in eq. (1.3) for d = 2. Two observations for the second order contribution are: first, the sign is negative ensuring that δS ≤ δ H and second, the coefficient appears with a factor of the inverse of the central charge. This factor is precisely analogous to those appearing in the second order contributions in eqs. (1.7) and (1.8). We might also observe that this factor arises here because the expectation value of the energy density is itself proportional c, as shown in eq. (B.4). Similar formulae can be obtained holographically in any number of dimensions -see section 3.2 of [12].

C Higher order corrections
In all of our computations, we have two different deformations: the first one is the deformation of the theory away from the conformal one; and the second is the excitation of the state away from the new vacuum. We are parameterizing these small deformations with our parameter ε. In this Appendix, we will compute the next-to-leading order corrections, i.e., the ε 3 corrections, to the entanglement entropy. The procedure is analogous to the calculations in section 2.1 but with a series of modifications. First, we need to include κ = 0 in the scalar potential (2.2). Without this cubic term, we would not find an extra contribution to δS at order ε 3 . Then, since the scalar field equation (2.8) is now nonlinear, we need to include cross-terms between the φ 0 and the φ 1 series in the scalar field expansion. Note that while the following analysis is valid for generic values of ∆, at this order, there will be certain special values (e.g., ∆ = 2d/3) that need to be analyzed separately because of the appearance of extra logarithms, as appeared for ∆ = d/2 in section 2.3. We do not contemplate those cases in the rest of this Appendix.
At order ε 2 , the ansatz for the scalar field will have the most general form As before, φ 0 and φ 1 will be free parameters related to the coupling and the expectation value of O ∆ and the f i 's are constants to be determined by the equations of motion. The ansatz for the metric is completely analogous, i.e., we write all the possible ε 3 terms possible. We report here just the answer obtained after solving the equations to order ε 3 (and linear order in κ). The scalar field solution can be written as For simplicity, we limited the present calculations to a state described by a static and isotropic bulk geometry, i.e., g 0i (x) = 0 and g ij (z) ∝ δ ij in eq. (2.4), but our results for the third order corrections are still general. At second order, this choice corresponds to restricting the coefficients in eq. (2.10) to have a simple diagonal form and in the end, it only affects the solution for m ab . The third order corrections for the metric have a simple form with: g ab (z) = g (ε 2 ) ab + ε 3 η ab g 3 (z) where 23 .
Of course, the functions at order ε 2 are precisely the same as found in section 2.1.
Hence the solution goes back to the previous one if we set κ = 0 and restrict to terms of order up to ε 2 . With the perturbation of the metric in hand, we can now proceed to compute the entanglement entropy. However, a few comments are in order before that though. First, in the holographic computation of the entanglement entropy for perturbed states, we have two different contributions: one related to the change in the background metric and the other related to the change in the position of minimal surface. The leading metric corrections appear at order ε 2 . This means that corrections in the position of the extremal surface (the semi-sphere for CFT's vacuum) will appear at this same order but the leading corrections which this shift in the position makes to δS appear at order ε 4 . Of course, this argument tells us that we can neglect that contribution for the current computation, which only considers corrections up to order ε 3 . Now let us write δS in the form where we already integrated the angular coordinates. Of course, ∆s 2 (z) is the leading order contribution that we already computed in eq. (2.26). After some algebra, ∆s 3 (z) turns out to be .
Again, the integrand is everywhere finite so there is no need to introduce a cut-off. After integration, the entanglement entropy receives three types of ε 3 -contributions, Each of the three contributions are proportional to the cubic coupling in the scalar potential κ. Hence we see that it was important to add that term in order to get a nonzero result at this order in the ε expansion. Next we would like to write δS in terms of field theory quantities. For that, we should carry out the holographic renormalization procedure as in section 2.1.1. As we introduced an extra cubic term in the scalar potential, we should expect that new divergences will arise and that we need to modify our counterterm action in order to remove them. In fact, one can show that a finite action is produced with Next, we need to compute the expectation value for the stress tensor and the scalar operator. However, this computation does not produce any extra contributions to either the expectation value of the stress tensor or of the operator. So the expectation values are unchanged at order ε 3 . Using eq. (2.17), we can rewrite the entanglement entropy contributions (C.7) as functions of the field theory coupling and expectation value. In doing so, we find So, in all, we have three different possible contributions to order ε 3 , each one proportional to a different power of the expectation value for the operator. Note that the last two terms will give higher order contributions in the R expansion when ∆ < d, but depending on the space-time dimension, the first term can also be important when (d − 2)/2 < ∆ < d/2. Of course, an alternative renormalization procedure would be needed in that case to make sure that the term remains unchanged in that regime. But assuming this is true, then that term would dominate over R d contributions in the small R expansion. We can write ∆ as ∆ = d/2 − ν, with 0 < ν < 1. Then the first term would scale as R to the power 3d/2 − 3ν, that consequently gives that a power of R smaller than d if ν > d/6. Of course for this to happen, d < 6 (because ν < 1). Even in that case, there will be a term proportional to ε 2 that scales as R 2∆ , that would be the leading contribution to the entanglement entropy.