Abstract
We show that Minkowski higher-derivative quantum field theories are generically inconsistent, because they generate nonlocal, non-hermitian ultraviolet divergences, which cannot be removed by means of standard renormalization procedures. By “Minkowski theories” we mean theories that are defined directly in Minkowski spacetime. The problems occur when the propagators have complex poles, so that the correlation functions cannot be obtained as the analytic continuations of their Euclidean versions. The usual power counting rules fail and are replaced by much weaker ones. Self-energies generate complex divergences proportional to inverse powers of D’Alembertians. Three-point functions give more involved nonlocal divergences, which couple to infrared effects. We illustrate the violations of the locality and hermiticity of counterterms in scalar models and higher-derivative gravity.
Similar content being viewed by others
Avoid common mistakes on your manuscript.
1 Introduction
The ultraviolet structure of quantum field theories is notoriously a fundamental problem in high-energy physics. Nowadays, with the Large Hadron Collider currently running at a center-of-mass energy of 13 TeV, the standard model is experimentally verified at the TeV scale. On the theoretical side, the renormalizability of the standard model, together with the relatively small value found for the Higgs mass, \(m_{H}\simeq 125\) GeV, imply that the model could be valid at energies much higher than the ones investigated so far. The interest in a high-energy modification of the standard model is, therefore, rather limited, on the practical side. As far as quantum gravity is concerned, the situation is different. The negative mass dimension of the coupling constant (compared with the dimensionless gauge couplings of the standard model) makes the Hilbert–Einstein action nonrenormalizable [1,2,3,4,5]. This fact, together with the difficulties to build up a phenomenology, render the investigation of alternative high-energy structures of quantum gravity more attractive.
An interesting class of higher-derivative quantum field theories are those whose propagators have complex poles. In that case, the Euclidean and Minkowski versions of the theories are not related to each other by the analytic continuation. In this paper, we concentrate on the Minkowski formulation of such theories, that is to say, we integrate the loop energies along the real axis. We show that such theories are generically inconsistent, because they violate both the locality and the hermiticity of counterterms. For example, the one-loop bubble diagram \(\Sigma (p)\) of massless higher-derivative scalar fields in six spacetime dimensions evaluates to
where \(\Lambda _{UV}\) is a hard ultraviolet cutoff on the space momenta, M is the scale associated with the higher-derivative terms and the dots denote convergent terms. Similar results occur in four dimensions, when vertices carry derivatives. More involved nonlocal structures appear in triangle diagrams. Moreover, gauge symmetries are unable to protect the locality and hermiticity of counterterms. We prove this fact by extending the calculations to a model of higher-derivative quantum gravity [6, 7].
The rules of power counting obeyed by Minkowski higher-derivative theories are much weaker than the standard ones, because a propagator calculated on the pole of another propagator falls off half less rapidly than expected. This property implies that the higher-derivative terms often have an “anti-regulating” effect, in the sense that they enhance divergences rather than suppressing them. The divergences (1.1) can also be related to specific pinch singularities occurring for \(p^{2}\rightarrow 0\), which have no direct analog in standard field theories.
It is well known that, in general (for example, when the free propagators have poles infinitesimally close to the real axis, in the second and fourth quadrants), Minkowski higher-derivative theories are physically unacceptable, because they violate perturbative unitarity. Our results show that when the free propagators also contain poles that are located at finite distances from the real axis, in the first and third quadrants, the theories are in general unacceptable from the mathematical point of view, because they violate both the locality and the hermiticity of counterterms.
The problems we have found do not occur in Euclidean theories, Lee–Wick models [8,9,10,11] or Minkowski theories that are analytically equivalent to their Euclidean versions. In those cases, nonlocal divergences may appear in some intermediate steps of the calculations (examples being the residues of the integrals on the energies, if taken separately), but they cancel out at the end.
The paper is organized as follows. In Sect. 2 we study some key aspects of the higher-derivative Minkowski propagator. In Sect. 3, we calculate the nonlocal divergent part of the bubble diagram in six dimensions and generalize the calculation to the bubble diagram with nontrivial numerators in four dimensions. In Sect. 4, we study the one-loop triangle diagram and provide an interpretation for its nonlocal divergent part. In Sect. 5 we investigate the modified power counting of Minkowski quantum field theories. In Sect. 6, we study a higher-derivative version of quantum gravity in four dimensions and prove that gauge symmetries fail to protect the locality and hermiticity of counterterms. Finally, in Sect. 7 we draw our conclusions.
In Appendix A we show that the dimensional regularization (of the integrals on the space momenta) allows us to apply the residue theorem on the energy integrals, even when they are divergent. In Appendix B we discuss the gauge fixing of Minkowski higher-derivative gravity and show that the Ward–Takahashi–Slavnov–Taylor (WTST) identities [12,13,14,15] are also plagued with nonlocal divergences.
2 Higher-derivative propagator
The standard propagator of a spinless particle reads
To increase the convergence of the loop integrals for large virtualities, \( |p^{2}|\gg m^{2}\), it is natural to introduce additional powers of \(p^{2}\) in the denominator to obtain, for example, a modified propagator of the form
For small virtualities, \(|p^{2}|\ll M^{2}\), the modified propagator approaches the standard one, \(S(p,m)\simeq \Delta (p,m)\), while in the asymptotic region
it decays quite faster:
By means of a partial fractioning in \(p^{2}\), the higher-derivative propagator (2.2) can also be written as
where \(\omega _{\epsilon }(p_{s})\equiv \sqrt{p_{s}^{2}+m^{2}-i\epsilon }\), \( \Omega \mathcal {(}p_{s})\equiv \sqrt{p_{s}^{2}-iM^{2}}\), \(p^{\mu }=\left( p_{0},\mathbf {p}\right) \) and \(p_{s}=|\mathbf {p}|\). The poles are located at \(p_{0}=\pm \omega _{\epsilon }(p_{s})\), \(p_{0}=\pm \Omega (p_{s})\) and \( p_{0}=\pm \bar{\Omega }(p_{s})\), where the bar denotes the complex conjugation. Since poles are present in every quadrant, the Euclidean theory and the Minkowski theory are not related in a simple way. In this paper we concentrate on the Minkowski theory, that is to say, we assume that the loop integral is defined by integrating the energy along the real axis.
In general, the factor
is expected to have an ultraviolet regulating effect, by suppressing the states with \(\left| p^{2}\right| \gg M^{2}\). We show that it is not the case in Minkowski theories. Actually, often the factor (2.5) roughly has an opposite, “anti-regulating” effect.
For large space momenta, \(p_{s}\gg M\), the complex poles come close to the real axis, since
where \(\eta \) is the small positive quantity
Therefore, in the asymptotic region \(p_{s}\gg M\) the terms involving M effectively act as \(\pm i\eta \) prescriptions for the propagation of exotic, high-energy excitations on the light cone, with the large lifetimes
Both \(\epsilon \) and \(\eta \left( p_{s}\right) \) are positive quantities, but \(\epsilon \) is infinitesimal, while \(\eta \left( p_{s}\right) \) is small and finite.
Since \(\eta \left( p_{s}\right) \rightarrow 0\) for \(p_{s}\rightarrow +\infty \), there is a pinch singularity of the pole located in the first quadrant with the two poles located in the fourth quadrant, and a similar pinch singularity of the pole located in the third quadrant with the two poles located in the second quadrant. The violations of power counting that we find in the next sections can be traced back to this pinching and ultimately to the presence of both the \(+i\eta \) and the \(-i\eta \) terms at \(p_{s}\gg M\).
3 Bubble diagrams
In this section we compute the nonlocal divergent parts of the higher-derivative, one-loop scalar bubble diagrams in six and four dimensions, with trivial and nontrivial numerators.
As usual, the ultraviolet divergent part is a sum of powerlike divergences and logarithmic divergences. The powerlike divergences are less interesting than the logarithmic ones for the purpose of singling out inconsistencies, because they depend on the subtraction scheme and can be removed in a renormalization-group invariant way. The one-loop logarithmic divergences, on the contrary, do not depend on the regularization scheme, and provide meaningful tests of the locality of counterterms. For these reasons, we focus our attention mostly on them. We either use the dimensional regularization technique or a sharp cutoff \(\Lambda _{UV}\) on the space momenta of the loops, according to convenience. We always convert the outcome to the cutoff notation.
3.1 Bubble diagram in six dimensions
The bubble diagram with different masses in D spacetime dimensions gives the loop integral
where \(S\left( p,m\right) \) is given in Eq. (2.2), \(\Lambda _{UV}\) is an ultraviolet cutoff on the space momenta \( \mathbf {k}\) and \(k_{s}=|\mathbf {k}|\).
Since the higher-derivative theory is relativistically invariant, \(\Sigma (p) \) is expected to be a function of \(p^{2}\) only.Footnote 1 It is then convenient to consider a timelike external momentum, \( p^{2}>0\), and select a Lorentz frame in which \(p^{\mu }\) has only the time component, \(p^{\mu }=\left( p_{0},\mathbf {0}\right) \). Once the loop integral is evaluated, we can retrieve the Lorentz invariant result by means of the replacement \(p_{0}^{2}\rightarrow p^{2}\). The values of the bubble diagram for a spacelike external momentum, \(p^{2}<0\), are obtained by means of the analytic continuation in \(p^{2}\).
The first step is to integrate \(k_{0}\) over the real line, which we can make in two equivalent ways. The first method is by applying the residue theorem, after closing the integration path with a semicircle at infinity, say in the upper half \(k_{0}\) plane. The second method involves the partial fractioning in \(k_{0}\). Since there are only simple poles for \(p_{0}\ne 0\), one ends up with a \(k_{0}\) integrand of the form
where \(s_{i}\) are the poles of the propagators and \(c_{i}\) are coefficients that depend on the nonintegrated variables \(k_{s},p_{0},m_{1},m_{2},M\). Since each propagator contains six poles, there are 12 terms in total. Then one integrates over \(k_{0}\) term by term. In practice, \(1/\left( k_{0}-s\right) \) gives \(\pm i\pi \), depending on whether \(\mathrm {Im}\,s>0\) or \(\mathrm {Im}\,s<0\).
The next step is to integrate over the space momentum \(\mathbf {k}\). The angular integration is trivial, because of our choice of the Lorenz frame, and gives the volume \(\Omega _{D-2}\) of the unit sphere in \(D-2\) dimensions.
Since we are only interested in the ultraviolet divergences for \( k_{s}\rightarrow \infty \), we expand the integrand for large \(k_{s}\), in order to avoid special functions due to the \(k_{s}\) integral. Consider the residues calculated at the first step. In each of them, either \(k^{2}\) or \( (k-p)^{2}\) is equal to a constant. The two cases are symmetrical, so we just assume \(k^{2}\) = constant. Then the propagator \(S\left( k,m_{1}\right) \) gives a contribution \(\sim 1/k_{s}\) for large \(k_{s}\), by Eq. (2.4). Instead, the propagator \(S\left( k-p,m_{2}\right) \) behaves as \( 1/((k-p)^{2})^{3}\sim 1/(p\cdot k)^{3}\sim 1/k_{s}^{3}\) (having used \(p\cdot k=p_{0}k_{0}\)). The product of the two behaves as \(1/k_{s}^{4}\), so the \( k_{s}\) integral diverges like
However, it is easy to check that the contributions of such type coming from the poles with \(k^{2}=\) constant compensate analogous contributions coming from the poles with \((k-p)^{2}=\) constant. In the end, the integrand of (3.1) behaves as \(1/k_{s}^{5}\), so we find that the leading \(\Sigma \) divergence is proportional to
We conclude that for \(D<6\) the bubble diagram is ultraviolet finite, while it is divergent for \(D\geqslant 6\). In particular, at \(D=6\) there is a logarithmic divergence,
which leads to the final result
where by “\(\mathrm {finite}\)” we mean terms that are finite or infinitesimal for \(\Lambda _{UV}\rightarrow +\infty \).
The divergent part is nonlocal, equal to the sum of a term proportional to \( 1/(p^{2})^{2}\) plus a term proportional to \(1/p^{2}\). Differently from the usual divergences of local theories, which are anti-hermitian, the ones of (3.2) are not, since the coefficients have nontrivial real and imaginary parts. For these reasons, we cannot absorb the divergent part in the usual way, by shifting the bare masses, rescaling the bare fields and adding new local, hermitian terms to the Lagrangian. We cannot even add nonlocal hermitian terms. We conclude that the locality and hermiticity of counterterms are both violated.
Since we are exploring an uncharted territory, we wish to make an explicit check of Lorentz invariance and analyticity. We consider the usual bubble in the case \(p^{2}<0\), by taking \(p^{\mu }=(0,\mathbf {p}_{s})\). As in the previous computation, we integrate over \(k_{0}\) by means of the residue theorem or the partial fractioning in \(k_{0}\). Then we have to integrate over the angles, which is a nontrivial operation now. Writing \(k^{\mu }=(k_{0},\mathbf {k}_{s})\), we switch to spherical coordinates, letting \( \theta \) denote the angle between \(\mathbf {p}_{s}\) and \(\mathbf {k}_{s}\). The integral over the remaining three angles is trivial, which leads to the replacement
where \(u\equiv \cos \theta \). At this point, we should expand the integrand for large \(k_{s}\). This cannot be done naively without generating u poles. For example, consider a typical denominator that is met in the calculation, such as
If we expand it for large \(k_{s}\), we obtain divergent u integrals. However, according to (3.3), the u pole has a positive imaginary part. If we first replace u by \(u-i\epsilon \), with \(\epsilon \) arbitrarily small, then the expansion for large \(k_{s}\) is safe. So doing, no fictitious singularity is generated.
The procedure works as long as the integrand can be arranged so that the powers \(1/(u-i\epsilon )^{n}\) do not mix with the powers \(1/(u+i\epsilon )^{n}\). It can be shown that the bubble diagram has this property. Carrying on the computation to the end, we find (3.2) again.
3.2 Bubble diagram in four dimensions
As shown in the previous section, the bubble diagram with unit numerator has nonlocal divergences only in dimensions \(D\geqslant 6\). On the other hand, bubble diagrams with nontrivial numerators may have nonlocal divergences also in four dimensions. In this section we study typical one-loop integrals of this type. Their applications to higher-derivative gravity will be considered in Sect. 6.
We assume that the propagator has again the form (2.2) and the vertices contain an arbitrary number of derivatives. We study the scalar integrals
The nonlocal divergent part can be calculated with the method explained in the previous subsection. We set \(p^{2}>0\) and choose \(p^{\mu }=(p_{0}, \mathbf {0})\). First, we integrate on the energy by means of the residue theorem, closing the integration path on the upper half complex plane. In Appendix A we show that, if we use the dimensional regularization, the energy integral can always be evaluated by summing the residues, even when it is divergent, because the contribution of the integration path at infinity is always zero.
Then we remain with the integral on the space momentum \(\mathbf {k}\). The logarithmic nonlocal divergences \(I_{r,n}^{\text {nld}}\) of \( I_{r,n}\) are obtained by expanding the integrand in powers of the absolute value \(k_{s}=|\mathbf {k}|\) and isolating the contributions proportional to \( \mathrm {d}k_{s}/k_{s}\).
We report results in various cases, starting from the massless limit. There, we find
where \(c_{r,n}\) are positive integer numbers. As anticipated, the pole \( 1/(4-D)\) of the dimensional regularization has been converted into the logarithm \(\log (\Lambda _{UV}/M)\) of a generic ultraviolet cutoff \(\Lambda _{UV}\) divided by M. The lowest-order coefficients are
The basic features of these results may be justified by evaluating the residues associated with the propagators of \(I_{r,n}\), as explained in Sect. 3.1. The residues with \(k^{2}=\) constant have a superficial degree of divergence \( \omega _{1}=D-5+r\), where \(D-1\) powers come from the \(k_{s}\) integration measure, \(-1\) and \(-3\) from the propagators and r from the numerator. Instead, the residues with \(\left( k-p\right) ^{2}=\) constant have degree of divergence \(\omega _{2}=D-5+r+n\), where the n additional powers come from the numerator, using \(k^{2}\sim 2k\cdot p\). For \(r=1\), \(n=0\), we have \( \omega _{1}=\omega _{2}=D-4\), so ultraviolet logarithmic divergences are expected from both types of residues in \(D=4\). However, formula (3.6) shows that the coefficient \(c_{1,0}\) vanishes. This may be interpreted as an eikonal cancellation between the two types of residues. For \(r=0\), \(n=1\), we have \(\omega _{2}>\omega _{1}=D-4\), so the cancellation cannot occur in \(D=4\). Indeed, \(c_{0,1}\) is different from zero. More generally, cancellations between the ultraviolet divergences of the residues are unlikely to occur for \(n>0\). Indeed, all the coefficients (3.6) with \(n>0\) are nonvanishing.
In the massive case, we find Eq. (3.5) with coefficients \(c_{r,n}\) that depend on m / M. The lowest-order ones are
Instead, if we replace the propagators S(p, m) with the more general ones
we obtain Eq. (3.5) with
where \(\Delta m^{2}=m^{2}-\mu ^{2}\).
We see that the locality and hermiticity of counterterms are violated again. The nonlocal behavior is always of the form \(1/p^{2}\), but it must be recalled that the integrals (3.4) contain r powers of \(p^{\mu }\) in the numerator, through the term \((k\cdot p)^{r}\). If we divide by those powers, the true nonlocal behavior of the divergent part is \(\sim 1/(p^{2})^{(2+r)/2}\).
4 Triangle diagrams
In this section we consider the one-loop three-point function. A peculiar, double-logarithmic structure
is found, coming from the overlap between the collinear (infrared) region \( \mu ^{2}\ll k_{\perp }^{2}\ll Q^{2}\) and the ultraviolet region \(M^{2}\ll k_{s}^{2}\ll \Lambda _{UV}^{2}\), where \(k_{\perp }\) is the transverse loop momentum, Q is the hard scale and \(\mu \) is the virtuality of external legs.
As in the case of the bubble diagram, in order to effectively generate such divergences, we need to introduce nontrivial numerators, which we assume to be scalar for simplicity. Specifically, we consider the following three-index family of amplitudes:
The nonlocal divergent part \(I_{r,n,t}^{\text {nld}}\) can be calculated with a procedure similar to the one used for the bubble diagram. First, we use Lorentz invariance and analyticity to impose the requirement that the incoming momenta are all timelike, i.e.
and choose \(p_{1}^{\mu }=(E_{1},\mathbf {0})\), while \(p_{2}^{\mu }=(E_{2}, \mathbf {p}_{2})\) remains generic. Then we integrate on the energy \(k_{0}\) by means of the residue theorem or a partial fractioning. At that point, we expand the integrand in powers of \(1/k_{s}\) for \(k_{s}\) large and integrate term by term over \(u=\cos \theta \), \(\theta \) being the angle between the vectors \(\mathbf {p}_{2}\) and \(\mathbf {k}\). When the conditions (4.2) hold, this operation is safe. Indeed, the expansion highlights factors
where \(p_{2s}=|\mathbf {p}_{2}|\). The subleading corrections have denominators that are equal to powers of those shown in formula (4.3). We see that every term of the expansion leads to a regular u integral. At the end, the logarithmic divergences are the coefficients of \( \mathrm {d}k_{s}/k_{s}\).
The symmetry relation \(I_{r,n,t}^{\text {nld}}\left( p_{1},p_{2}\right) =I_{t,n,r}^{\text {nld}}\left( p_{2},p_{1}\right) \) obviously holds, so we can assume, for example, \( r\geqslant t\). By explicit calculation of \(I_{r,n,t}\) for different values of its indices, we find that
This result may be justified by evaluating the residues associated with the propagators of \(I_{r,n,t}\), as explained in Sect. 3.1. The residues with \(k^{2}=\) constant have a superficial degree of divergence \( \omega _{0}=D-8+r+t\), while the residues with \(\left( k+p_{1}\right) ^{2}=\) constant and \(\left( k+p_{2}\right) ^{2}=\) constant have the generally larger degrees \(\omega _{1,2}=D-8+r+n+t\). Barring cancellations, the degree of divergence of \(I_{r,n,t}\) at \(D=4\) is \(\omega =r+n+t-4\), which is nonnegative when \(r+n+t\geqslant 4\).
Now we report the results of explicit calculations that confirm that \( I_{r,n,t}\) is indeed nonlocally divergent when the inequality of (4.4) holds. We begin with the massless case \(m=0\). The simplest nontrivial integral is
where we have defined \(Q^{2}\equiv -(p_{2}-p_{1})^{2}\). The term \(1/Q^{2}\) is reminiscent of the pole-dominance models of form factors and seems to signal – if we insist with some physical interpretation – the propagation of a massless particle in the t-channel, with an ultraviolet logarithmically divergent coefficient.
A more interesting case is provided by the amplitude
where
In order to understand the dynamic properties of the first term of Eq. (4.6) (the second term has the same form as the one of the previous amplitude), let us assume the kinematics of the deep inelastic scattering, i.e.
Infrared singularities (soft and/or collinear) are then regulated by nonzero virtualities of the external legs. Using the asymptotic expansion
it is easy to show that
where we have defined
It is convenient to assume \(\mu _{2}^{2}>0\) in order to have a real logarithm and then avoid absorptive parts related to the “decay” of the \(p_{2}\) leg. Formula (4.7) exhibits a collinear divergence for \(\mu _{2}^{2}\rightarrow 0\) (or, equivalently, \( Q^{2}\rightarrow +\infty \)), which overlaps the nonlocal divergence already found in the previous cases. The asymmetry of the result, namely the absence of a collinear singularity for \(\mu _{1}^{2}\rightarrow 0\), is related to the fact that the power \((k\cdot p_{1})^{2}\) appearing in the numerator screens the singularity related to the emission of a particle collinear to the particle with momentum \(p_{1}\). In the case of the previous amplitude \( I_{1,2,1}\), collinear singularities related to the emission from any leg were screened by the factor \((k\cdot p_{1})(k\cdot p_{2})\) in the numerator. By generalizing the example just discussed, we expect that nonlocal divergences overlap the usual, logarithmic infrared singularities found in vertex functions.
It is natural to expect that nonlocal divergences also occur in one-loop box diagrams, pentagon diagrams, etc., and that they overlap the usual logarithmic structures (infrared divergences, small-x logarithms, etc.).
We conclude by briefly reporting results concerning the massive case \(m\ne 0 \). In both \(I_{1,2,1}^{\text {nld}}\) and \(I_{2,2,0}^{\text {nld}}\), it is sufficient to replace \(M^{8}\) with \( M^{10}/(M^{2}+im^{2})\) in Eqs. (4.5) and (4.6).
5 Power counting for nonlocal divergences
The standard loop integrals in Minkowski spacetime are related to Euclidean integrals by the Wick rotation, so the power counting rules governing their ultraviolet behaviors are the same. This fact is actually far from trivial. A Minkowski integral
is intuitively expected to be more singular than the corresponding Euclidean integral in the ultraviolet region. Because of the Euclidean metric, the Euclidean integrand falls off when any component of the momentum \(k_{\mu }\) gets large. At first sight, the Minkowski pseudometric fails to provide an equivalent suppression in several subdomains of integration, such as the regions where the loop momentum is close to surfaces of the form \(k^{2}+a\cdot k+b=0\) (determined by the poles of the propagators), where a is a vector and b is a constant. The reason why this intuitive argument is not correct is that it overlooks the role played by the \(+i\epsilon \) prescription, which allows the Wick rotation.
On the other hand, the higher-derivative propagator S(k, m) has poles in all quadrants, so the analytic continuation to Euclidean space is not possible. Then the rules of power counting are no longer guaranteed to coincide in Minkowski and Euclidean spaces and actually turn out to be different.
To illustrate this fact, we work out a formula for the degree of divergence of a generic one-loop diagram in Minkowski higher-derivative theories. Assume that the propagators S(k, m) behave like \(1/(k^{2})^{N}\) for large \( |k^{2}|\) and that the vertices contain up to \(N^{\prime }\) derivatives. Consider a one-particle irreducible diagram with V vertices, equal to the number of internal lines. We assume that \(V>1\), i.e. exclude the tadpoles, because they are independent of the external momenta and cannot originate nonlocal divergences.
Letting k denote the loop momentum, we integrate over the energy \(k_{0}\) by means of the residue theorem. In each residue, \((k-q)^{2}\) is equal to some constant, q being a linear combination of external momenta. Making a translation, we can assume that the integrand is evaluated at \(k^{2}=\) constant. Then by Eq. (2.4) the propagator S(k, m) gives a contribution that behaves like \(1/k_{s}\) for large \(k_{s}\), while each one of the other \(V-1\) propagators behaves like \(1/((p-k)^{2})^{N}\sim 1/(p\cdot k)^{N}\), where p is also a linear combination of the external momenta. If we use analyticity to assume \(p^{2}>0\), the factors \(1/(p\cdot k)^{N}\) are regular everywhere.
On the other hand, the vertices provide at most \(N^{\prime }V\) powers of \( k_{s}\) and the integration measure is \(k_{s}^{D-2}\mathrm {d}k_{s}\). Collecting these pieces of information, the degree of divergence \(\omega _{ \text {nl}}\) of the \(k_{s}\) integral is at most equal to
An integral with \(\omega _{\text {nl}}<0\) is ultraviolet convergent, while an integral with \(\omega _{\text {nl}}\geqslant 0\) may be divergent. The divergent parts are in general nonlocal, because, as shown in Eq. (4.3), the large \(k_{s}\) expansion makes the ratios \(1/(p\cdot k)^{N}\) factorize as \(1/k_{s}^{N}\) times nonpolynomial functions of the external momenta.
The condition \(\omega _{\text {nl}}\geqslant 0\) is necessary to have a divergence, but not sufficient. In many cases, it is possible to enhance it by means of more sophisticated arguments. For example, it is possible to show that the bubble diagram (\(V=2\)) benefits from an enhancement of one unit when N is odd. Then
The reason is a simplification between the contributions \({\sim }1/(p\cdot k)^{N}\) of each propagator, calculated on the poles of the other propagator.
If \(D=6\), \(N=3\), \(N^{\prime }=0\), \(V=2\), which is the case treated in Sect. 3.1, we have \(\omega _{\text {nl}}^{\prime }=0\), which confirms that there is a logarithmic divergence. The same diagram in four dimensions has no nonlocal divergences (\(\omega _{\text {nl}}^{\prime }=-2\)), unless we equip it with nontrivial numerators. If we take \(N^{\prime }=1\), we raise \(\omega _{\text {nl}}^{\prime }\) to 0, which is confirmed by the nonvanishing coefficient \(c_{0,1}\) of Eq. (3.6). On the other hand, it is not enough to have a vertex with one derivative and a vertex with no derivatives (which can be formally obtained by setting \(N^{\prime }=1/2\)), as the vanishing of the coefficient \(c_{1,0}\) confirms.
In the case of the triangle diagram (\(D=4\), \(N=3\) and \(V=3\)), we may distribute the \(r+2n+t\) derivatives over the three vertices by formally writing \(N^{\prime }=(r+2n+t)/3\). The integrals \(I_{1,2,1}^{\text {nld}}\) and \(I_{2,2,0}^{\text {nld}}\) have \(N^{\prime }=2\) and \(\omega _{\text {nl}}>0\), indeed formulas (4.5) and (4.6) shows that they are divergent. Moreover, \(r+2n+t<4\) implies \(\omega _{\text {nl}}<0\), which agrees with Eq. (4.4). A better agreement can be obtained by improving the power counting as shown in Sect. 4. Indeed, after a residue is evaluated, a \(k^{2}\) factor in the numerator does not provide two powers of \(k_{s}\), but one at most. This is equivalent to setting \(N^{\prime }=(r+n+t)/3\). Then Eq. (4.4) follows in all cases. Moreover, both \(I_{1,2,1}^{\text {nld}}\) and \(I_{2,2,0}^{ \text {nld}}\) have \(N^{\prime }=4/3\), \(\omega _{\text {nl}}=0\), which implies that the ultraviolet divergence is at most logarithmic, as is actually the case.
Let us inquire which theories have no nonlocal divergences at one loop, i.e. when \(\omega _{\text {nl}}<0\) for every \(V>1\). Formula (5.1) shows that this happens when \(D-2<N-2N^{\prime }\) and \(N\geqslant N^{\prime }\). All the scalar and fermion theories with nonderivative interactions satisfy these conditions for N sufficiently large, in arbitrary dimensions. For example, in four-dimensional scalar models with nonderivative interactions it is sufficient to take \(N=3\). As far as the fermions are concerned, assume that their propagators \(S_{F}(k,m)\) behave as \(k^{\mu }/(k^{2})^{N}\) for large \( |k^{2}|\). Then we can attach their numerator \(k^{\mu }\) to a nearby vertex, so the arguments given above apply with \(N^{\prime }\rightarrow N^{\prime }+1 \). The higher-derivative theories of gauge fields have \(N^{\prime }=2N-1\), while those of gravity have \(N^{\prime }=2N\), so neither of the two satisfies the conditions for having no nonlocal divergences at one loop. Both are expected to violate the locality of counterterms, if their propagators have poles in the first or third quadrants. In the next section we study the case of gravity explicitly.
6 Higher-derivative gravity
In this section we use the results of the previous ones to work out the nonlocal divergences of the graviton two-point function in a relatively simple model of four-dimensional higher-derivative gravity with complex poles. We simplify the calculations as much as possible by choosing a specific Lagrangian and a convenient gauge fixing. The loop integrals are linear combinations of the scalar integrals (3.4).
The simplest model of higher-derivative gravity is the Stelle theory [6, 7], which contains the scalars R, \(R^{2}\) and \(R_{\mu \nu }R^{\mu \nu }\). However, it is not suitable for our investigation, because its propagators do not have poles in the first or third quadrants. The simplest model with the features we need is the one with Lagrangian
We expand the metric tensor \(g_{\mu \nu }\) around the flat-space metric \( \eta _{\mu \nu }=\)diag\((1,-1,-1,-1)\) by writing
where \(\kappa \) is a constant of dimension \(-1\) in units of mass and \(h_{\mu \nu }\) is the quantum fluctuation. After the expansion around flat space, we raise and lower the indices by means of the flat-space metric. We further define \(h\equiv h_{\mu }^{\mu }\).
We choose the De Donder gauge-fixing function
and perform the gauge fixing as explained in Appendix B. The gauge-fixed Lagrangian then reads
where \(\square =\eta ^{\mu \nu }\partial _{\mu }\partial _{\nu }\) is the flat-space D’Alembertian, while the ghost Lagrangian is
The graviton propagator
has the same form as that of the propagators of the previous sections, apart from the constant matrices in the numerator.
Normally, the ghosts contribute to the renormalization, because they must compensate the contributions of the temporal and longitudinal components of the gauge fields, to give a total gauge invariant result. However, we can easily show that in our case they can be ignored, because they cannot give nonlocal divergences at one loop. Indeed, after the redefinition \(\bar{C} ^{\mu \prime }=\left( 1+\square ^{2}/M^{4}\right) \bar{C}^{\mu }\), the ghost Lagrangian (6.4) turns into the usual one, which is
For this reason, the ghost contribution to the graviton self-energy coincides with the usual one, which has a local divergent part.
It is sufficient to work out the three-graviton vertex, since the one-loop diagrams involving four-leg vertices are tadpoles, which can only have local divergent parts. In the end, we just evaluate two diagrams, which are
where the wiggled line represents the graviton \(h_{\mu \nu }\), the solid line represents the source \(K^{\mu \nu }\) coupled to the \(h_{\mu \nu }\) transformation and the continuous line with the arrow represents the ghosts. Diagram (a) encodes the nonlocal divergences of the graviton self-energy. Diagram (b) encodes the nonlocal renormalization of the \(h_{\mu \nu }\) transformation, which is necessary to derive the corrections to the Ward identities satisfied by (a), as explained in Appendix B.
6.1 Results
Given these ingredients, we are ready to perform the calculation, as well as the consistency checks. We can reduce to the scalar integrals \(I_{r,n}\) of Eq. (3.4) by means of the Passarino–Veltman decomposition [16], which gives identities such as
where \(A_{i}(p)\) are scalar integrals and \(T_{i}^{\mu _{1}\ldots \mu _{n}}(p) \) are completely symmetric tensors built with \(\eta ^{\mu \nu }\) and \(p^{\mu }\). Since the graviton two-point functions has four indices, we just need Eq. (6.6) for \(n=1,2,3,4\). The cases \(n=1,2\) give, for example,
The one-loop nonlocal divergent part of the graviton two-point function is equal to
where \(r\equiv p^{2}/M^{2}\). It is easy to check that it is doubly transverse, i.e.
up to local terms. However, it is not transverse, since \(p^{\nu }\langle h_{\mu \nu }(p)h_{\rho \sigma }(-p)\rangle _{1}^{\text {nld}}\) does not vanish. The reason is that the gauge transformation is itself affected by nonlocal divergences. The correct Ward identity is (B.5), derived from diagram (b) as explained in Appendix B. It is easy to show that (6.7) does satisfy (B.5), which provides a good check of the result.
Coherently with what we found in the previous sections, the divergences are nonlocal and truly complex. It is impossible to subtract them away by means of reparametrizations and (local as well as nonlocal) field redefinitions that preserve hermiticity.
In conclusion, Minkowski higher-derivative theories of gravity violate the locality and hermiticity of counterterms, when the propagators have poles in the first or third quadrants. Gauge symmetries are unable to protect those properties.
The gravitational Lagrangian (6.1) is the simplest one that exhibits the effects we have uncovered. Similar effects are expected to occur in the theories with Lagrangians
where \(\square _{c}\) denotes the covariant D’Alembertian and \(P_{n}\), \(Q_{n}\) are real polynomials of degree \(n>0\). The denominators of the free propagators have the form \(p^{2}R_{n+1}(p^{2})\), where \(R_{n+1}\) is a real polynomial of degree \(n+1\). For every \(n>0\), generic real polynomials \(P_{n}\), \(Q_{n}\) lead to poles in the first and third quadrants, which in turn generate nonlocal, non-hermitian divergences. Only by choosing the polynomials \(P_{n}\), \(Q_{n}\) in very specific ways, we can manage to have all the poles on the real axis. In that case, and only in that case, the Euclidean and Minkowski theories are equivalent and the renormalization is local.
If gravity is coupled to matter, we expect to find similar behaviors in the matter sector. In particular, if the kinetic terms of the matter fields have the same numbers of higher derivatives as the gravitational sector has, the power counting is the same.
7 Conclusions
We have shown that Minkowski higher-derivative quantum field theories whose propagators have complex poles are generically inconsistent, because they generate nonlocal, non-hermitian ultraviolet divergences. Bubble diagrams, for example, contain logarithmic divergences multiplied by inverse powers of D’Alembertians. Triangle diagrams present more involved nonlocal divergences, where ultraviolet effects mix with standard infrared effects.
Contrary to intuitive expectations, the introduction of higher-derivative terms in the Lagrangian does not have a regulating effect, because the constraints coming from power counting are much weaker. Indeed, the contribution of one propagator calculated on the pole of another propagator does not decay fast enough. This unusual behavior can also be explained by the appearance of pinch singularities, unrelated to the usual absorptive parts of amplitudes, which occur because the extra excitations introduced by the higher derivatives come with effective prescriptions of both signs.
We have extended the calculations to higher-derivative quantum gravity and proved, in particular, that gauge symmetries are unable to protect the locality and hermiticity of counterterms. The problems we have outlined add up to the well-known problems that higher-derivative Minkowski theories have with perturbative unitarity.
Notes
To be rigorous, one should use an ultraviolet regularization that preserves Lorentz invariance, such as the dimensional regularization, instead of a cutoff on the space momenta. However, we are only interested in the logarithmic divergences, which, as already noted, are independent of this choice.
References
G. ’t Hooft, M. Veltman, One-loop divergences in the theory of gravitation. Ann. Inst. Poincaré. 20, 69 (1974)
P. van Nieuwenhuizen, On the renormalization of quantum gravitation without matter. Ann. Phys. 104, 197 (1977)
M.H. Goroff, A. Sagnotti, The ultraviolet behavior of Einstein gravity. Nucl. Phys. B 266, 709 (1986)
A. van de Ven, Two loop quantum gravity. Nucl. Phys. B 378, 309 (1992)
S. Weinberg, in Ultraviolet divergences in quantum theories of gravitation, ed. by S. Hawking, W. Israel. An Einstein centenary survey (Cambridge University Press, Cambridge, 1979), p. 790
K.S. Stelle, Renormalization of higher derivative quantum gravity. Phys. Rev. D 16, 953 (1977)
E.S. Fradkin, A.A. Tseytlin, Renormalizable asymptotically free quantum theory of gravity. Nucl. Phys. B 201, 469 (1982)
T.D. Lee, G.C. Wick, Negative metric and the unitarity of the S-matrix. Nucl. Phys. B 9, 209 (1969)
T.D. Lee, G.C. Wick, Finite theory of quantum electrodynamics. Phys. Rev. D 2, 1033 (1970)
R.E. Cutkosky, P.V. Landshoff, D.I. Olive, J.C. Polkinghorne, A non-analytic S-matrix. Nucl. Phys. B 12, 281 (1969)
B. Grinstein, D. O’Connell, M. B. Wise, Causality as an emergent macroscopic phenomenon: the Lee-Wick O(N) model. Phys. Rev. D 79, 105019 (2009). arXiv:0805.2156 [hep-th]
J.C. Ward, An identity in quantum electrodynamics. Phys. Rev. 78, 182 (1950)
Y. Takahashi, On the generalized Ward identity. Nuovo Cimento 6, 371 (1957)
A.A. Slavnov, Ward identities in gauge theories. Theor. Math. Phys. 10, 99 (1972)
J.C. Taylor, Ward identities and charge renormalization of Yang–Mills field. Nucl. Phys. B 33, 436 (1971)
G. Passarino, M.J.G. Veltman, One-loop corrections for e+e- annihilation into \(\mu \) + \(\mu \) - in the Weinberg model. Nucl. Phys. B 160, 151 (1979)
I.A. Batalin, G.A. Vilkovisky, Gauge algebra and quantization. Phys. Lett. B 102, 27–31 (1981)
I. A. Batalin, G. A. Vilkovisky, Quantization of gauge theories with linearly dependent generators. Phys. Rev. D 28, 2567 (1983). Erratum-ibid. D 30 (1984) 508
S. Weinberg, The quantum theory of fields, vol. II (Cambridge University Press, Cambridge, 1995)
Acknowledgements
One of us (D.A.) is grateful to M. Piva for useful discussions.
Author information
Authors and Affiliations
Corresponding author
Appendices
Residue theorem in dimensional regularization
In this appendix we show that the dimensional regularization allows us to evaluate the energy integrals in a straightforward way, even when they are divergent: it is sufficient to sum the residues, while the contribution of the integration path at infinity is always negligible.
The dimensional regularization is defined as follows. The integral on the space momenta is continued to \(D-1\) dimensions and done first. The energy integral is not modified and done second. Strictly speaking, the two can be exchanged when the energy integral is convergent. However, we show that it is always legitimate to integrate on the energy first, if we apply the residue theorem.
Without loss of generality, we can write a one-loop integral as a linear combination of integrals of the form
where \(a(\mathbf {k})\), \(b_{i}(\mathbf {k})\) are polynomials of \(\mathbf {k}\) and \(r\geqslant 1\), \(s\geqslant 0\) are integers. The denominators contain the prescriptions to move the poles away from the real axis.
The energy integral is divergent if \(s+1\geqslant 2r\), so we write \(s=2r+n-1\) and take \(n\geqslant 0\). When |E| is much larger than all the other scales, its divergent contributions are
All of the \(\mathbf {k}\) integrals are integrals of polynomials of \(\mathbf {k} \) and give zero by the rules of the dimensional regularization. Thus, if we close the integration path by means of a semicircle at infinity, in the lower or upper half plane, we add a vanishing contribution. This allows us to safely apply the residue theorem without having to worry about the closure of the integration path.
Gauge fixing and WTST identities in higher-derivative gravity
To handle the WTST identities [12,13,14,15] of quantum gravity in a compact form, we use the Batalin–Vilkovisky formalism [17,18,19]. We collect the fields into the row
where \(C_{\mu }\), \(\bar{C}_{\mu }\) and \(B_{\mu }\) are the ghosts and the antighosts of diffeomorphisms and the Lagrange multipliers for the gauge fixing, respectively. We introduce conjugate sources
and define the antiparentheses of two functionals X and Y of \(\Phi \) and K as
where the integral is over the spacetime points associated with repeated indices and the subscripts l, r in \(\delta _{l}\), \(\delta _{r}\) denote the left and right functional derivatives, respectively.
The total action is then
where \(S_{\text {HD}}=\int \mathcal {L}_{\text {HD}}\) is the classical action, \( \Psi (\Phi )\) is a functional of the fields that performs the gauge fixing, called gauge fermion, and the terms
collect the symmetry transformations coupled to the external sources K. The Lagrangian \(\mathcal {L}_{\text {HD}}\) is given by formula (6.1).
The action S satisfies the master equation
which collects the gauge invariance of \(S_{\text {HD}}\) and the closure of the symmetry transformations.
The generating functional Z of the correlation functions and the generating functional W of the connected correlation functions are defined by the formulas
while the generating functional \(\Gamma (\Phi ,K)=W(J,K)-\int \Phi ^{\alpha }J_{\alpha }\) of the one-particle irreducible diagrams is the Legendre transform of W(J, K) with respect to J. Formula (B.1) implies that \(\Gamma \) satisfies an identical master equation
which collects the WTST identities in a compact form.
We choose the gauge fermion
where \(\mathcal {G}_{\mu }\) is given in Eq. (6.2). Observe that the indices of all the fields \(\Phi ^{\alpha }\) and the sources \(K_{\alpha }\) are raised and lowered by means of the flat-space metric. We find
where \(S_{\text {gh}}=\int \mathcal {L}_{\text {gh}}\) and \(\mathcal {L}_{\text {gh }}\) is given in Eq. (6.4). We can integrate \(B_{\mu }\) out, which is equivalent to replacing it with the solution of its own field equation:
So doing, we get
The gauge-fixed Lagrangian is thus (6.3).
Now we work out the WTST identity satisfied by the graviton two-point function. Expand the functional \(\Gamma \) as
where \(\Gamma _{i}\) collects the contributions of the i-loop diagrams. Note that \(\Gamma _{i}\) cannot depend on B, \(K_{\bar{C}}\) and \(K_{B}\), because no one-particle irreducible diagrams can be built with external legs of this type. The master equation (B.2) gives, at one loop,
Expanding the antiparentheses on the left-hand side of this equation and setting \(\bar{C}=B=K=0\), we get
This identity encodes the modified gauge invariance of the classical action \( S_{\text {HD}}\). Indeed, it has the form
where \(\tilde{\Gamma }_{1}=\left. \Gamma _{1}\right| _{\bar{C}=B=K=0}\) and
In turn, the gauge invariance of \(S_{\text {HD}}\), combined with Eq. (B.4), gives
up to two-loop corrections. This identity states that the corrected action \( S_{\text {HD}}+\tilde{\Gamma }_{1}\) is invariant under the corrected gauge transformations \(\Delta h_{\mu \nu }=\Delta _{0}h_{\mu \nu }+\Delta _{1}h_{\mu \nu }\). The derivatives \(\delta \tilde{\Gamma }_{1}/\delta h_{\mu \nu }\) and \(\left. \delta _{l}\Gamma _{1}/\delta K^{\mu \nu }\right| _{ \bar{C}=B=K=0}\) are calculated through the diagrams (a) and (b) shown in (6.5), respectively.
Precisely, we can write
where the one-loop correlation functions \(\langle \cdots \rangle _{1,J}\) are evaluated at nonvanishing external sources J.
Note that \(\langle h_{\mu \nu }\rangle _{1}^{\text {1PI}}\) has no nonlocal divergences, because it is a tadpole. On the other hand, \( \langle \partial _{\mu }C_{\nu }\rangle _{1,J}^{\text {1PI}}=0\), because the insertion is linear in the fields. As far as the term \( -2\langle \Gamma _{\mu \nu }^{\rho }C_{\rho }\rangle _{1,J}^{\text {1PI}}\) is concerned, we are just interested in its nonlocal divergent part to the zeroth order in \(h_{\mu \nu }\), which we denote by \(\left. \Delta _{1}h_{\mu \nu }\right| _{h=0}^{\text {nld}}\). We get, in momentum space,
where \(r=p^{2}/M^{2}\). The quadratic part of \(S_{\text {HD}}\) is, in momentum space,
where \(\Pi _{\mu \nu }=\eta _{\mu \nu }-p_{\mu }p_{\nu }/p^{2}\). Inserting these expressions in (B.3), we find the Ward identity
up to local corrections.
Rights and permissions
Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.
Funded by SCOAP3
About this article
Cite this article
Aglietti, U.G., Anselmi, D. Inconsistency of Minkowski higher-derivative theories. Eur. Phys. J. C 77, 84 (2017). https://doi.org/10.1140/epjc/s10052-017-4646-7
Received:
Accepted:
Published:
DOI: https://doi.org/10.1140/epjc/s10052-017-4646-7