Abstract
We revisit the definition and some of the characteristics of quadratic theories of gravity with torsion. We start from a Lagrangian density quadratic in the curvature and torsion tensors. By assuming that General Relativity should be recovered when the torsion vanishes and investigating the behaviour of the vector and pseudo-vector torsion fields in the weak-gravity regime, we present a set of necessary conditions for the stability of these theories. Moreover, we explicitly obtain the gravitational field equations using the Palatini variational principle with the metricity condition implemented via a Lagrange multiplier.
Similar content being viewed by others
Avoid common mistakes on your manuscript.
1 Introduction
General relativity (GR) radically changed our understanding of the universe. The predictions of this elegant theory have been confirmed up to the date [1, 2]. In order to fit extragalactic and cosmological observational data, however, the presence of a non-vanishing cosmological constant and six times more dark matter than ordinary matter have to be assumed in this framework [3]. In addition, the observed value of this cosmological constant differs greatly from the value expected for the vacuum energy. On the other hand, while the strong and electroweak forces are renormalisable gauge theories, that is not the case for GR, and the compatibility of GR with the quantum realm is still a matter of debate. Given this situation, there has been a renewed interest in alternative theories of gravity, which modify the predictions of GR.
A particular approach to formulating alternative theories of gravity involves an extension of the geometrical treatment that covers the microscopic properties of matter [4]. It should be noted that the mass is not enough to characterize particles at the quantum level given that they have another independent label, that is, the spin. Whereas at macroscopic scales the energy-momentum tensor is enough to describe the source of gravity, a description of the spacetime distribution of the spin density is needed at microscopic scales. Moreover, there are macroscopic configurations that may also need a description of the spin distribution, as super-massive objects (e.g. black holes or neutron stars with nuclear polarisation). In this spirit, a new geometrical concept should be related to the spin distribution in the same way that spacetime curvature is related to the energy-momentum distribution. Torsion is a natural candidate for this purpose [4, 5] and an important advantage of a theory of gravity with torsion is that it can be formulated as a gauge theory [6,7,8].
Since 1924 many authors have considered theories of gravity in a Riemann–Cartan \(U_4\) spacetime. In this manifold the non-vanishing torsion can be coupled to the intrinsic spin density of matter and, in this way, the spin part of the Poincaré group can change the geometry of the manifold as the energy-momentum tensor does it. The first attempt to introduce torsion in a theory of gravity was the Einstein–Cartan theory, which is a reformulation of GR in a \(U_4\) spacetime. In this theory the scalar curvature of the Einstein–Hilbert action is constructed from a \(U_4\) connection instead of using the Christoffel symbols. However, the resulting theory was not completely satisfactory because the field equations relate the torsion and its source in an algebraic way and, therefore, torsion is not dynamical. Hence the torsion field vanishes in vacuum and the Einstein–Cartan theory collapses to GR except for unobservable corrections to the energy-momentum tensor [4]. In order to obtain a theory with propagating torsion, we need to consider an action that is at least quadratic in the curvature tensors [4, 6,7,8,9,10,11]. Moreover, an important advantage of adding quadratic terms \(\mathcal{{R}}^2\) to the Einstein–Hilbert action is the possibility of making the theory renormalisable [9]. In addition, it can be shown [4, 6] that, considering a gauge description, the torsion and curvature tensors correspond to the field-strength tensors of the gauge potentials of the Poincaré group \((e_{\mu }^{\ a }, w_{\mu }^{\ a b })\), which are the vierbein and the local Lorentz connection, respectively. Thus, a pure \(\mathcal{{R}}^2\) gauge theory of gravity has some resemblance to electroweak and strong theories.
From an experimental point of view there have been many attempts to detect torsion or to set an upper bound to its gravitational effects. One of the most debated attempts was the use of the Gravity Probe B experiment to measure torsion effects [12]. Nevertheless, this experiment was criticized because torsion will never couple to the gyroscopes installed in the satellite [13]. Therefore, this probe cannot measure the gravitational effects due to torsion. On the other hand, other unsuccessful experiments aimed to constrain torsion with accurate measurements on the perihelion advance and the orbital geodetic effect of a satellite [14]. The experimental difficulty is the need of dealing with elementary particles with spin to obtain a maximal coupling with torsion.
In this paper we present a self-contained introduction to quadratic theories of gravity with torsion in the geometrical approach (gauge treatment is not considered). We partly recover well-known results about the stability of these theories using simple methods. Therefore, we simplify the existent mathematical treatment and reinforce the critical discussion as regards some controversial results published in the literature.
The paper is organized as follows: In Sect. 2 we present a general introduction to the basic concepts on general affine geometries and introduce the conventions used throughout the paper. In Sect. 3 we present our main results. In the first place, we consider a Lagrangian density quadratic in the curvature and torsion tensors. In Sect. 3.1 we discuss the different methods presented in the literature to obtain the field equations and explicitly derive them in the Palatini formalism. In Sect. 3.2 we obtain conditions on the parameters of the Lagrangian necessary to avoid large deviations from GR and instabilities. Then, in Sect. 3.3, we analyse the Lagrangian density with the aim of setting necessary conditions for avoiding ghost and tachyon instabilities. The conclusions are summarized in Sect. 4. We relegate some calculations and further comments to the appendices: in Appendix A we include the Gauss–Bonnet term in Riemann–Cartan geometries; in Appendix B we include detailed expressions necessary to obtain the equations of the dynamics using the Palatini formalism; in Appendix C we discuss the source terms of these equations; and, in Appendix D, we include relevant expressions for the study of the vector and pseudo-vector torsion fields around Minkowski.
2 Basic concepts and conventions
The geometric structure of a manifold can be catalogued by the properties of the affine connection. A general affine connection \(\tilde{\varGamma }\) provides three main characteristics: curvature, torsion, and non-metricity. Combinations of these quantities in the affine connection generate the geometric structure [5]. In GR it is assumed that the spacetime geometry is described by a Riemannian manifold, thus the affine connection reduces to the so-called Levi-Civita connection and the gravitational effects are only produced by the consequent curvature in terms of the metric tensor alone. Nevertheless, in a general geometrical theory of gravity the gravitational effects are generated by the whole connection, which involves a post-Riemannian approach described by curvature, torsion and non-metricity. In this scheme, there are many ways to deal with torsion and non-metricity due to different conventions. For that reason, it is important to set the conventions and definitions used throughout this work. Thus, the notation assumed for the symmetric and the antisymmetric part of a tensor A is
respectively, where P(s) is the set of all the permutations of \(1,\ldots , s\) and \(\text {sgn}(\pi )\) is positive for even permutations whereas it is negative for odd permutations.
In the first place, the Cartan torsion is defined as the antisymmetric part of the affine connection as [4, 15,16,17]
Note that a dot appears below the index \(\mu \) to indicate the position that it takes when it is lowered with the metric. As the difference of two connections transforms as a tensor, the Cartan torsion is a tensor. Thus, from now on we call it just the torsion and emphasise that it cannot be eliminated with a suitable change of coordinates.
In the second place, non-metricity can also be described by a third rank tensor. This is
where \(\tilde{\nabla }\) is the covariant derivative defined from the affine connection \(\tilde{\varGamma }\). The non-metricity tensor is usually split into a trace vector \(\omega _{\rho }\equiv \frac{1}{4}Q^{\ \ \ \nu }_{\rho \nu \cdot }\), called the Weyl vector [18], and a traceless part \(\overline{Q}_{\rho \mu \nu }\),
It should be noted that there are manifolds with non-metricity where the cancellation of the \(\omega _{\rho }\) or the traceless part of Q are demanded.
Since the general connection \(\tilde{\varGamma }\) is asymmetric in the last two indices, a convention is needed for the covariant derivative of a tensor. Let \(A^{\mu _1 \cdots \mu _r}_{\cdot \ \ \cdots \ \cdot \ \nu _1 \cdots \nu _s}\) be the components of a tensor type (r, s), then
It is important to emphasise the syntax of the lower indices in the affine connections, that is, the index \(\rho \) of the derivative is written in the last position in the affine connection.
Using the definitions presented in this section, the general connection \(\tilde{\varGamma }\) is written as [4, 15, 19]
with \(\varGamma ^{\mu }_{\cdot \nu \sigma }\) the Levi-Civita connection,
which is expressed in a compact form by the permutation tensor [20]
and the additional tensor \(W^{\mu }_{{}^{.} \nu \sigma }\) defined by the following expression:
where \(K^{\mu }_{{}^{.} \nu \sigma }\) is called the contortion tensor,
Note that \(Q_{\rho \mu \nu }\) is symmetric in the last two indices, while \(T^{\mu }_{\cdot \nu \sigma }\) is antisymmetric in these indices. However, the contortion, \(K^{\mu }_{{}^{.} \nu \sigma }\), is antisymmetric in the first pair of indices. This property ensures the existence of a metric-compatible connection when the non-metricity tensor vanishes.
Furthermore, it is useful to write the torsion through its three irreducible components. These are [19]
-
(i)
the trace vector \( T^{\mu }_{{}^{.} \nu \mu }\equiv T_{\nu }\);
-
(ii)
the pseudo-trace axial vector \(S^{\nu }\equiv \epsilon ^{\alpha \beta \sigma \nu }T_{\alpha \beta \sigma }\);
-
(iii)
the tensor \(q^{\alpha }_{{}^{.} \beta \sigma }\) , which satisfies \( q^{\alpha }_{{}^{.} \beta \alpha }=0 \) and \( \epsilon ^{\alpha \beta \sigma \nu } q_{\alpha \beta \sigma }=0 \).
Thus, the torsion field can be rewritten as
The introduction of these new geometrical degrees of freedom leads to the generalisation of the usual definition of the curvature tensor in the Riemann spacetime, \([\nabla _{\rho },\nabla _{\sigma }]V^{\mu }=R^{\mu }_{ \cdot \nu \rho \sigma }V^{\nu } \), by the following commutative relations associated with a connection \(\tilde{\varGamma }\):
where the curvature tensor reads
Using Eq. (7), the curvature tensor can be rewritten as
with \(R^{\mu }_{ \cdot \nu \rho \sigma }\) the curvature tensor of the Riemann spacetime, commonly called the Riemann tensor, and \(\nabla \) the covariant derivative constructed from the Levi-Civita connection.
On the other hand, the generalisation of the two Bianchi identities can be computed from Eq. (14). Taking into account Eq. (3), the new Bianchi identities are
Moreover, it is well known that not all the components of the curvature tensor (14) are independent. By definition, this tensor is antisymmetric in the last pair of indices \(\tilde{R}^{\mu }_{\cdot \nu \rho \sigma }=\tilde{R}^{\mu }_{\cdot \nu [\rho \sigma ]}\). A simple calculation using Eq. (15) shows that
Thus, when the connection is set to be metric-compatible, the curvature tensor is also antisymmetric in the first pair of indices. The symmetry of the curvature tensor under the exchange of pair of indices depends on the torsion and non-metricity tensors. In general, for non-trivial values for those tensors, this symmetry does not hold. However, there are particular conditions under which the exchange symmetry is recovered for non-trivial values.
From now on we consider a metric-compatible connection, focusing our attention only on curvature and torsion. We denote by a hat the objects constructed from a metric-compatible connection with torsion:
All the conventions and identities that we have already presented are, of course, still valid. The Ricci tensor and the scalar curvature are obtained with the usual contractions, \( \widehat{R}_{\mu \nu }=\widehat{R}^{\sigma }_{\cdot \mu \sigma \nu } \) and \(\widehat{R}=g^{\mu \nu }\widehat{R}_{\mu \nu }\). However, the absence of symmetry in the exchange of pair of indices in Eq. (14) allows the Ricci tensor \(\widehat{R}_{\mu \nu }\) to be non-symmetric. Indeed, the antisymmetric part of this tensor is
In view of this identity, a modified torsion tensor can be defined
and a modified covariant derivative can be introduced,
Hence the antisymmetric part of the Ricci tensor is rewritten as
It should be stressed the importance of this modified derivative for vectors, since \(\partial _{\mu }(\sqrt{-g}A^{\mu })=\sqrt{-g}\,\overline{\nabla }_{\mu }A^{\mu }\), for any vector \(A^{\mu }\).
3 Quadratic theory of gravity
As we have already argued in the introduction, we are going to consider an action that is quadratic in the curvature tensor, in order to obtain a theory with propagating torsion [4, 6,7,8,9,10,11]. Excluding parity violating pieces, a total of six independent scalars can be formed from the curvature tensor (14) and its contractions. In addition, three other scalars can be constructed from the torsion tensor (3). On the other hand, the Gauss–Bonnet action is known to lead to a total divergence in a 4-dimensional Riemannian manifold and, therefore, it does not produce any contribution through the variational process of the action. It is worth noting that the Gauss–Bonnet Lagrangian does not contribute to the field equations even in a Riemann–Cartan geometry [6, 21].Footnote 1 Therefore, the terms \(\widehat{R}^2\), \(\widehat{R}_{\nu \sigma } \widehat{R}^{ \sigma \nu }\), and \(\widehat{R}_{\mu \nu \rho \sigma } \widehat{R}^{\rho \sigma \mu \nu }\) in the Lagrangian density are not independent. Throughout this work, we are going to consider the quadratic Lagrangian density from Poincaré gauge theory of gravity, as written in Refs. [6, 7, 10, 11]. This is
with \(\lambda \), a, b, c, p, q, r, s and t the free parameters of the theory. The particular combinations of the parameters that appear in the Lagrangian density have been chosen for convenience without loss of generality. Note that the scalar curvature is also included, which is the only term present in the Einstein–Cartan theory. The procedure to obtain the field equations of this Lagrangian density is summarized in Sect. 3.1. In addition, parity violating pieces can also be assumed in a natural way in the Lagrangian density leading to interesting results; see Refs. [8, 22].
In this work we are interested in the stability of theories of gravity with dynamical torsion that avoid large deviations from the predictions of GR where this theory is satisfactory. In this spirit, we focus on quadratic theories, because that is the minimal modification leading to dynamical torsion, and we will not assume that all the components obtained by the irreducible decomposition of the torsion necessarily propagate. In order to study the stability of the theory, we will focus on two regimes where the metric and torsion degrees of freedom completely decoupled from each other through the consideration of the following conditions:
-
(a)
GR must be recovered when the torsion vanishes.
-
(b)
The theory must be stable in the weak-gravity regime.
Note that condition (a) implies both that the general relativistic predictions will be recovered when the torsion is small and that the theory is stable at least when the torsion vanishes. This condition will be imposed in Sect. 3.2 by means of the geometrical structure of the manifold, whereas the second condition will be investigated in Sect. 3.3 considering the propagation of the torsion modes in a Minkowki space. Both conditions have been studied separately in the literature using different approaches; see Refs. [6,7,8].
3.1 Field equations
The field equations of the Lagrangian density (24) have to be obtained, as usual, from a variational principle where the action is extremised with respect to the dynamical variables. However, different sets of dynamical variables can be chosen and different field equations will be obtained accordingly. On one hand, the metric and the affine connection can be taken as completely independent variables. Then the field equations are obtained from varying the action with respect \(g^{\mu \nu }\) and \(\tilde{\varGamma }^{\sigma }_{\cdot \mu \nu }\). This is called the Palatini formalism.Footnote 2 On the other hand, the connection can be taken to be metric-compatible from the beginning. Hence, the field equations are obtained varying with respect to g and T, or to g and K. This procedure is sometimes called the metric or Hilbert variational method. The Palatini and Hilbert methods are known to differ only on the constraint on the symmetric part of the connection \(\tilde{\varGamma }_{(s)}{}^{\sigma }_{\cdot \mu \nu }=\varGamma ^{\sigma }_{\cdot \mu \nu }-T^{ \ \mu }_{\nu {}^{.} \sigma }-T^{ \ \mu }_{\sigma {}^{.} \nu }\); that is, they differ on a Lagrange multiplier for the metricity condition, see Refs. [23, 24]. Therefore, the two methods coincide without imposing the Lagrange multiplier when after solving the field equations the related quantity turns out to be zero. In addition, a third method consists in treating the theory as a gauge theory. This may be seen as being more natural, since the variables are the gauge potentials \((e_{\mu }^{\ a }, w_{\mu }^{\ a b })\). The field equations in this formalism can been found in Refs. [8, 10].
Let us use the Palatini formalism with the metricity condition implemented as a constraint via a Lagrange multiplier \(\varLambda \) to obtain the field equations. The total Lagrangian density of the theory can by written as
with \(\mathcal{{L}}_g\) from Eq. (24), \(\mathcal{{L}}_M\) the Lagrangian density for matter fields minimally coupled to gravity, and \(\varLambda ^{\ \ \ \rho }_{\nu \mu \cdot }\) a Lagrange multiplier. The use of the Lagrange multipliers in theories of gravity has been studied in Refs. [20, 25, 26]. For the sake of simplicity, we rewrite the Lagrangian density \(\mathcal{{L}}_g\) as
with the permutation tensors \(f_{{}_{{}_T} \lambda \alpha }^{\ \ \eta \rho \beta \gamma }\) and \(f_{{}_{{}_R} \lambda \alpha }^{\ \ \eta \rho \sigma \beta \gamma \delta }\) defined in Appendix B. This decomposition factorizes \(\mathcal{{L}}_g\) in parts depending purely on the metric and parts depending on the connection—those are the permutation tensors, and the curvature tensors and the torsion tensors, respectively; thus, the application of the Euler–Lagrange equations is straightforward. The field equations for the Lagrangian density (25) are
Note that the metricity condition is obtained as a field equation from the variation of the action with respect to the Lagrange multiplier. The definitions used in the above equations are
The tensor \(\tilde{\mathcal {E}}_{\mu \nu }\) could be considered as the generalisation of the Einstein tensor for the Lagrangian density \(\mathcal {L}_g\), as it contains the dynamical information of the metric. Analogously, the tensor \(\tilde{\mathcal {P}}_{\tau }^{\cdot \mu \nu }\) is the generalisation of the Palatini tensor. The source tensors are the energy-momentum tensor
and the hypermomentum tensor
Now, taking into account the expression of \(\mathcal {L}_g\) in Eq. (26), the generalized Einstein and Palatini tensors are
where \(\tilde{G}_{(\mu \nu )}\) is the symmetric part of the Einstein tensor, and
respectively. The full expressions of these tensors in terms of the free parameters of the Lagrangian density are shown in Appendix B.
As the metricity condition has arisen as a field equation, from now on we can consider a metric-compatible connection \(\widehat{\varGamma }\). Then the field equations (27) and (28) reduce to
To obtain the final expression for the field equations, the Lagrange multiplier \(\varLambda \) must be solved from Eqs. (36) and (37). To this end, note that a generic third rank tensor A can always be written as
where \(\varDelta _{\beta \alpha \gamma }^{\mu \nu \rho }\) is defined in Eq. (9). As \(\varLambda ^{\ \ \ \rho }_{\nu \mu \cdot }\) is symmetric in the first two indices, we can solve from Eq. (36)
Thus, the field equations become
These are the general expressions of the field equations of any theory of gravity with metricity and torsion. This set of equations is obviously equivalent to the equations obtained from a Hilbert variational principle over the variables (g, K) or (g, T), as can easily be checked. Now, taking into account the calculations showed in Appendix B for the Lagrangian density (24), these equations are
and
For an interpretation of the right sides of both field equations, see Appendix C.
3.2 Reduction to GR
We want to obtain a theory which reduces to GR when the torsion vanishes. Thus, the theory will not only be stable in this regime, but it will also deviate only slightly from the predictions of GR when the torsion is small. Note that when the torsion is set to zero, the usual Riemannian structure is recovered. Therefore, the Riemann tensor is now symmetric under the exchange of the first and the second pair of indices and the Ricci tensor is symmetric. From the first Bianchi identity (16), it follows that
Then, when \(T=0\), the Lagrangian density (24) becomes
From this expression, it is clear that GR is recovered when \(T=0\) if and only if \(p=r\) and \(s=0\). This is the only choice of parameters that leads to GR when the torsion vanishes.
Note that the same conclusion can be extracted from a different and longer approach. That is, considering the field equations (42) and (43), it can be concluded that this is the only choice of parameters that produce the Einstein equations of GR when the torsion vanishes. The same conclusion was reached in Ref. [8].
3.3 Stability in Minkowski spacetime
It is well known that the Lagrangian density (24) contains, along with the usual graviton \(2^+\), up to six new modes or torsions. These are \(2^+\), \(2^-\), \(1^+\), \(1^-\), \(0^+\) and \(0^-\), in the representation \(S^P\) where S is the spin and P is the parity of the mode. A physically meaningful restriction is to demand the theory to be stable in all the \(S^P\) sectors; see Refs. [6, 7, 28,29,30]. Quadratic theories in the curvature and torsion tensors are usually treated as a gauge theory, hence the variables considered are the gauge potentials of the Poincaré group \((e_{\mu }^{\ a }, w_{\mu }^{\ a b })\). Then the stability analysis is made through the construction of the spin projection operators.
In this work, however, we consider the metric formulation. We will examine the decoupling limit between the torsion and curvature degrees of freedom. Thus, in view of Eq. (15), we focus on the case where \(g_{\mu \nu }=\eta _{\mu \nu }\), with \(\eta _{\mu \nu }\) the Minkowski metric. For the sake of simplicity, we do not consider the purely tensor component of the torsion in Eq. (12). As the only torsion components compatible with a Friedmann–Lemaître–Robertson–Walker (FLRW) universe are the vectorial \(T^{i}\) and pseudo-vectorial \(S^{i}\) components [31], we assume that they are the minimum non-vanishing components that should be taken into account in this framework. In the spirit of investigating only slight modifications of GR, we assume that they are the only non-vanishing torsion components for a minimal modification over the FLRW background. Under these considerations, we will now impose the absence of ghost and tachyon instabilities for the theory given by the Lagrangian density (24). The quadratic Riemann and torsion terms that appear in this Lagrangian density are computed in Appendix D.
As we consider only the vector and pseudo-vector torsion components in Minkowski spacetime, the Lagrangian density (24) reduces in this regime to an ordinary vector and pseudo-vector field theory in flat spacetime. A general quadratic action for a vector \(A^{\mu }\) in flat spacetime comes from [32,33,34]
where \(\mathcal {V}\) is a possible potential for \(A^{\mu }\). However, not all the kinetic terms are independent from each other. The terms with factors \(\beta \) and \(\gamma \) are related by
as can be seen from Eq. (13). Thus, in flat spacetime these terms are related by a total derivative. On the other hand, as is well known, the Hamiltonian density of a system is obtained by performing a Legendre transformation. For this vector system, it is
where \(\dot{A}_{\mu }\equiv \partial _0 A_{\mu }\) are the generalized velocities and \(\pi ^{\mu }\) the canonical momenta defined as \(\pi ^{\mu }\equiv \frac{\partial \mathcal {L}}{\partial \dot{A}^{\mu }}\). The canonical momenta of the Lagrangian density (46) are
or written in terms of the components of the four-vector,
Then, performing the Legendre transformation (48), the Hamiltonian density reads
with \(F_{ij}=2\partial _{[i}A_{j]}\). Unfortunately, the kinetic energy of this system is unbounded from below and, therefore, suffers from ghost-type instabilities whatever the signs of \(\alpha \), \(\beta \) and \(\gamma \) are. This behaviour confirms that vector theories suffer from ghost-type instabilities if all the degrees of freedom of the four-vector \(A^{\mu }\) propagates (see Refs. [32, 33]). Hence, a necessary condition for the absence of this kind of instabilities is to make the scalar mode non-dynamical. Alternatively, the vector degrees of freedom can be frozen and propagate only the scalar mode, but this corresponds to a scalar theory rather than a vectorial one. To remove the scalar mode, the free parameters of the theory must be chosen in such a way that the canonical momenta given in Eq. (50) vanish. Since \(\partial _0{A}^0\) and \(\partial _{i}A^{i}\) are independent quantities, the only possibility to cancel out the contribution of \(\partial _{i}A^{i}\) to the canonical momenta of the scalar mode is to set \(\gamma =0\). In addition, \(\alpha +\beta =0\) is also needed to remove the contributions of the two remaining kinetic terms in the Lagrangian density (46) to the dynamics of the scalar mode. With these conditions, the kinetic terms in the vector Lagrangian density becomes a Maxwell-type \( F_{\mu \nu } F^{\mu \nu }\) that only propagates the spatial degrees of freedom of the four-vector \(A^{\mu }\). This conclusion is in agreement with the well-known fact that the only ghost-free vector theory in flat spacetime is the Maxwell–Proca Lagrangian density. Then the Hamiltonian density can be positive-defined with \(\alpha =-\beta <0\). For a more detailed discussion on this item see Ref. [34].
Back to the Lagrangian density (24), when the metric corresponds to the Minkowski spacetime the expression reduces to
where \(\mathcal{{V}}(T, S)\) are potential-type terms of the torsion fields; see Appendix D. As discussed previously, the free parameters p, q, r, s and t must be carefully selected to produce ghost-free kinetic terms, i.e. Maxwell-type kinetic terms for the trace four-vector \(T^{\mu }\) and pseudo-trace four-vector \(S^{\mu }\). After suitable integrations by parts the expression above simplifies to
Since we have two dynamical fields, there are two canonical momenta. These are
Written in terms of the scalar and vectorial degrees of freedom of the four-vectors we have
As here we have two fields with their own kinetic terms, we need to ensure that neither of them introduces a ghost. Thus, to remove the scalar \(T^0\) and pseudo-scalar \(S^0\) degrees of freedom, we consider \(p-r+2s=0\) and \(q=0\), respectively. Then the Hamiltonian density reads
The kinetic energy can be bounded from below with the extra conditions of \(p+s+t<0\) and \(2r+t<0\) for the vectorial and pseudo-vectorial torsion fields, respectively. These conditions are summarised in Table 1.
On the other hand, we now require the absence of tachyon instabilities. In the first place, we consider the weak torsion fields regime, that is, the regime where the quadratic terms in torsion fields lead the evolution of the potential. Thus, the potential in the Lagrangian density (54) takes the form
see Appendix D. Note that the mass terms in an action for a vector field comes from a potential of type \(V(\phi )\propto \frac{1}{2}m^2\phi _{\mu }\phi ^{\mu }\). Hence, the roles of the masses \(m^2\) for the vector and pseudo-vector torsion fields are played by the combinations of the coupling constants b, c and \(\lambda \). For these combinations, the correct sign must be taken for the spatial components to avoid tachyon-like instabilities. In our convention, \(\phi _{\mu }\phi ^{\mu }=\phi _0^2-\mathbf {\phi }^{\,\,2}\), then the combinations \(c+3\lambda \) and \(b+3\lambda \) must be positive to ensure a well-behaved vector and pseudo-vector sector, respectively (see Table 1). In summary, with these simple arguments we have found a set of conditions for the ghost and tachyon stability of the Lagrangian density (24) at the decoupling limit and the weak torsion regime, summarized in Table 1.
In Refs. [6, 7], Sezgin and Nieuwenhuizen provided a detailed analysis of the stability of the Lagrangian density (24) for the weak torsion field regime. These two articles were the first systematic stability analysis of this kind of theories, made with the spin projectors formalism, and they are a key reference point in this issue. The conclusions they showed for the \(1^-\) torsions are compatible with those obtained here. Their ghost-free condition is the same we have obtained here, and the tachyon-free condition is compatible. On the other hand, for the \(1^+\) sector both conclusions are, however, incompatible. While the condition obtained for a well-defined kinetic term for \(S^{\mu }\) in this section is \(2r+t<0\), they claim that \(2r+t>0\) is needed. It is worth noting that other authors have suggested that the analysis carried out by Sezgin and Nieuwenhuizen is not restrictive enough to ensure a ghost- and tachyon-free spectrum; see Refs. [28, 29]. In fact, in Ref. [28] the authors pointed out that they even obtain a different expression of the spin projector operator for the pseudo-vector mode. Furthermore, they argue the relevance of considering the additional condition for the absence of \(p^{-4}\) poles in all spin sectors, which is not done in the analysis of Refs. [6, 7]. In Ref. [35], Fabbri analyses the stability of the most general quadratic gravitational action with torsion and Dirac fields by demanding, in addition, a consistent decoupling between curvature and torsion that preserves continuity in the torsionless limit, concluding that the only non-vanishing component of the torsion is given by the pseudo-vector mode and that parity-violating terms are not allowed in the Lagrangian density. Nevertheless, due to some lack of clarity in the existing literature, a deeper analysis of the origins of these differences is not available yet.
Let us now go beyond the weak torsion regime when analysing the potential \(\mathcal{{V}}\). Thus, higher orders in the potential can dominate its evolution. The highest order that appears in the potential is quartic, symbolically \(\mathcal{{V}}^{(4)}\),
As there are terms mixing the vector and pseudo-vector fields, we note that the potential can be diagonalized in the following basis:
with \(\mathbb {V}^{(4)}\) a \(3\times 3\) matrix. The eigenvalues of \(\mathcal{{V}}^{(4)}\) are
with
For a positive-defined quadratic form, the three eigenvalues must be positive. Since we are only interested in the vector and pseudo-vector torsion degrees of freedom, we can assume \(p-r+2s=0\) and \(q=0\), which are the conditions found for making the scalar and pseudo-scalar mode non-dynamic, respectively. Then the expressions of the eigenvalues reduce to
It is easy to see that these eigenvalues cannot be positive at the same time for any combination of p and s. Hence, the quartic order in the potential in Eq. (61) is unstable and, therefore, this order must be removed to obtain a stable theory. This can be done taking \(3s+p=0\). Furthermore, the third order in the potential is not present once we consider that GR is recovered when the torsion vanishes. Therefore, when we take \(p=r\), \(s=0\) and \(3s+p=0\), there are only quadratic terms in the potential. Thus, the potential is stable under the same conditions as those obtained in the weak torsion field approximation with the additional constraint of \(p+3s=0\); see Table 1.
On the other hand, we should stress that the stability analysis developed in the literature is usually made using a weak curvature approximation for the metric. However, our stability analysis is made in the limit where the degrees of freedom of the torsion are completely decoupled from those of the metric. For this purpose, we have considered that GR is recovered when \(T=0\) and we have investigated the stability of the torsion in Minkowski flat spacetime, assuming that only the vector and pseudo-vector modes propagate. These conditions are combined and summarized in Table 2. Therefore, we expect that the conditions obtained, which are found to be necessary and sufficient for the stability in this regime, are necessary but no longer sufficient conditions for the stability of the theory when both curvature and torsion are present.
4 Summary
In this work we have investigated a quadratic and parity preserving action with curvature and torsion [6, 7, 10, 11] in order to obtain a stable theory of gravity with dynamical torsion. For this purpose, we have analysed two regimes where the degrees of freedom of the metric and those of the torsion are completely decoupled. The assumptions made in those regimes are also motivated by looking for theories of which the predictions are expected not to be in great disagreement with those of GR.
On the one hand, we have assumed that the theory reduces to GR when the torsion vanishes. This implies the stability of the metric degrees of freedom in the regime where there are no torsion modes. Therefore, we have imposed the requirement that the only term independent of the torsion is contained in the scalar curvature \(\widehat{R}\), obtaining two conditions for the parameters of the general quadratic Lagrangian.
On the other hand, we have investigated the stability of the torsion when the metric is flat, following an approach that differs from the usual techniques used in the literature. We have focussed attention on the stability of the vector and psuedo-vector torsion components in Minkowski because they are the only components that propagate in a FLRW spacetime [31] from the torsion irreducible decomposition. Therefore, it is not necessary to consider the purely tensor component if we are interested in “minimal” modifications of the predictions of GR. We have studied the stability of these fields analysing the Hamiltonian formulation of the theory to ensure a ghost and tachyon-free spectrum in this regime. Thus, we have obtained several conditions for the parameters of the general quadratic action with propagating torsion that we have summarized in Table 1. Moreover, we have contrasted the conditions obtained in the weak torsion limit of this regime with those already presented in the literature [6, 7, 28, 29]. As we have discussed in detail, the disagreement with the conclusions of Ref. [6, 7] regarding the pseudo-vector field may be due to the arguments exposed in Refs. [28, 29]. It should be stressed that, after the first approach, we have gone beyond the weak torsion approximation, obtaining the general conditions for the stability of the vector and pseudo-vector torsion fields in Minkowski spacetime.
In summary, we have found the most general subfamily of the Lagrangian density (24) that is stable in both decoupling regimes. This is described by
where \(b+3\lambda >0\), \(c+3\lambda >0\), and \(t<0\), and we restrict our study to theories where only the vector and pseudo-vector torsion components of the irreducible decomposition propagate.
Change history
24 October 2022
An Erratum to this paper has been published: https://doi.org/10.1140/epjc/s10052-022-10930-9
Notes
We include the definition of the Gauss–Bonnet action in the presence of the torsion and check this property in Appendix A, since incompatible definitions are used throughout the literature.
It should be stressed that, for the Palatini method, the general connection \(\tilde{\varGamma }\) should be considered. Then the conditions of metricity and of being torsion-free must be implemented via Lagrange multipliers.
References
M.C. Will, Resource letter PTG-1: precision tests of gravity. Am. J. Phys. 78, 1240 (2010). arXiv:1008.0296 [gr-qc]
B.P. Abbott et al., Observation of gravitational waves from a binary black hole merger. Phys. Rev. Lett. 116(6) (2016). arXiv:1602.03837 [gr-qc]
E. Papantonopoulos (ed.), Modifications of Einsteins theory of gravity at large distances. Lecture Notes in Physics 892 (2015)
V. De Sabbata, M. Gasperini, Introduction to Gravitation (World Scientific, Singapore, 1985)
Y. Mao, Constraining gravitational and cosmological parameters with astrophysical data. arXiv:0808.2063 [astro-ph]
E. Sezgin, P. van Nieuwenhuizen, New ghost-free gravity Lagrangians with propagating torsion. Phys. Rev. D 21, 3269 (1980)
E. Sezgin, Class of ghost-free gravity Lagrangians with massive or massless propagating torsion. Phys. Rev. D 24, 1677 (1981)
Y.N. Obukhov, V.N. Ponomarev, V.V. Zhytnikov, Quadratic poincare Gauge theory of gravity: a comparison with the general relativity theory. Gen. Relat. Gravit. 21, 1107 (1989)
D.E. Neville, Gravity theories with propagating torsion. Phys. Rev. D 21, 867 (1980)
R. Rauch, H.T. Nieh, Birkhoff’s theorem for general Riemann–Cartan type \(R+R^2\) theories of gravity. Phys. Rev. D 24, 2029 (1981)
R.T. Rauch, Asymptotic flatness, reflection symmetry, and Birkhoff’s theorem for \(R+R^2\) actions containing quadratic torsion terms. Phys. Rev. D 25, 577 (1982)
Y. Mao, M. Tegmark, A.H. Guth, S. Cabi, Constraining torsion with gravity probe B. Phys. Rev. D 76, 104029 (2007)
F.W. Hehl, Y.N. Obukhov, D. Puetzfeld, On Poincaré gauge theory of gravity, its equations of motion, and Gravity Probe B. Phys. Lett. A 377, 1775 (2013). arXiv:1304.2769 [gr-qc]
R. March, G. Bellettini, R. Tauraso, S. Dell’Agnello, Constraining spacetime torsion with the Moon and Mercury. Phys. Rev. D 83, 104008 (2011). arXiv:1101.2789 [gr-qc]
L.L. Smalley, Variational principle for general relativity with torsion and non-metricity. Phys. Lett. A 61(7) (1977)
S. Capozziello, R. Cianci, C. Stornaiolo, S. Vignolo, f(R) cosmology with torsion. Phys. Scripta 78, 065010 (2008). arXiv:0810.2549 [gr-qc]
S. Capozziello, R. Cianci, C. Stornaiolo, S. Vignolo, f(R) gravity with torsion: the Metric-affine approach. Class. Quant. Gravit. 24, 6417 (2007). arXiv:0708.3038 [gr-qc]
O.V. Babourova, B.N. Frolov, Gauss–Bonnet type identity in Weyl–Cartan space. Int. J. Mod. Phys. A 12(21), 3665 (1997). arXiv:gr-qc/9609004
I.L. Shapiro, Physical aspects of the space-time torsion. Phys. Rept. 357, 113 (2002). arXiv:hep-th/0103093
F.W. Hehl, G.D. Kerlick, Metric-affine variational principles in General Relativity I. Riemannian space-time. Gen. Relat. Gravit. 9, 691 (1978)
H.T. Nieh, Gauss–Bonnet and Bianchi identities in Riemann-Cartan type gravitational theories. J. Math. Phys. 21, 1439 (1980)
P. Baekler, F.W. Hehl, Beyond Einstein–Cartan gravity: quadratic torsion and curvature invariants with even and odd parity including all boundary terms. Class. Quant. Grav. 28, 215017 (2011). arXiv:1105.3504 [gr-qc]
J.L. Safko, M. Tsamparlis, Variational methods with torsion in general relativity. Phys. Lett. A 60, 1 (1977)
J.L. Safko, F. Elston, Lagrange multipliers and gravitational theory. J. Math. Phys. 17, 1531 (1976)
V.N. Ponomariev, Tseytlin, Correct use of Palatini principle in gravity theory. Vestnik Mosk. Univ. (ser. fiz. astr.) 6, 57 (1978)
W. Kopczynski, The Palatini principle with constraints. Bulletin de l’académie Polonaise des sciences, Série de sciences math. astr. et phys. 23, 4 (1975)
F.W. Hehl, G.D. Kerlick, P. Von der Heyde, On hypermomentum in general relativity III. Coupling hypermomentum to geometry. Z. Naturforsch. 31A, 823 (1976)
R. Kuhfuss, J. Nitsch, Propagating modes in gauge field theories of gravity. Gen. Relat. Gravit. 18, 1207 (1986)
M. Blagojevic, M. Vasilic, Extra gauge symmetries in a weak-field approximation of an \(R+T^2 + R^2\) theory of gravity. Phys. Rev. D 35(12) (1987)
D.E. Neville, Gravity Lagrangian with ghost-free curvature-squared terms. Phys. Rev. D 18, 3535 (1978)
H. Gonner, F. Mueller-Hoissen, Spatially homogeneous and isotropic spaces in theories of gravitation with torsion. Class. Quant. Gravit. 1, 651 (1984)
N.L. Gagne, Hamiltonian constraint analysis of vector field theories with spontaneous Lorentz symmetry breaking. Colby College (2008)
J. Beltrán Jiménez, A.L. Maroto, Viability of vector-tensor theories of gravity. JCAP 0902, 025 (2009). arXiv:0811.0784 [astro-ph]
G. Esposito-Farese, C. Pitrou, J.P. Uzan, Vector theories in cosmology. Phys. Rev. D 81, 063519 (2010). arXiv:0912.0481 [gr-qc]
L. Fabbri, A discussion on the most general torsion-gravity with electrodynamics for Dirac spinor matter fields. Int. J. Geom. Meth. Mod. Phys. 12(09), 1550099 (2015). arXiv:1409.2007 [gr-qc]
H. Stephani, Relativity: An Introduction to Special and General Relativity (Cambridge University Press, London, 2004)
Acknowledgements
The authors acknowledge Y. N. Obukov for useful discussions. This work was partly supported by the projects FIS2014-52837-P (Spanish MINECO) and FIS2016-78859-P (AEI/FEDER, UE), and Consolider-Ingenio MULTIDARK CSD2009-00064. PMM was funded by MINECO through the postdoctoral training contract FPDI-2013-16161 during part of this work.
Author information
Authors and Affiliations
Corresponding author
Appendices
Appendices
Appendix A: The Gauss–Bonnet term in Riemann–Cartan geometries
We have noted that there is no agreement about the expression of the Gauss–Bonnet term in a Riemann–Cartan manifold throughout the literature, probably due to several misprints. Therefore, in this appendix, we present the correct expression for the Gauss–Bonnet action. This is
One can easily check that this is the correct order of the indices focussing attention on the vectorial and pseudo-vectorial torsion fields in the weak curvature approximation. In this regime we have
Let us now prove that, order by order in the fields \(h_{\alpha \beta }, T_{\alpha } \ and S_{\alpha }\), the term (A.1) leads to a total divergence. The expressions of \(R^{\mu }_{\cdot \nu \rho \sigma }\), \(R_{\nu \sigma }\) and R in terms of h are well known in linearized gravity [36]. These are
with \(\Box =\partial _{\mu }\partial ^{\mu }\). Then, from Eq. (15), it is clear that in the action (A.1) will appear a Gauss–Bonnet term for the Levi-Civita connection, terms quadratic in torsion and a term mixing torsion and h terms. This action can be expressed as
The first term on the r.h.s. of this equation is known to be invariant. Nevertheless, this invariance can be proven with an explicit calculation from Eqs. (A.3), (A.4) and (A.5) with the appropriate boundary conditions on h. The second term is calculated with the results of Appendix D. It can be seen that
After integration by parts, the expression above leads to a total divergence. Taking the torsion to be zero at the boundary of \(\mathcal{{U}}_4\), \(S_{GB}^{(2)}\) is identically zero. Finally, the third term on the r.h.s. of Eq. (A.6), \(S_{GB}^{(3)}(\partial h,\partial T , T, S)\), is analysed using Eqs. (D.27), (D.28) and (D.29) for the torsion part and (A.3), (A.4) and (A.5) for the metric dependent part. Thus,
Note that there are no mixing terms between \(\partial h\) and \(\partial S\) or ST, as expected from parity conservation. After some algebraical manipulations and integration by parts, the equation for \(S_{GB}^{(3)}\) vanishes. Hence, we have checked the invariance of an action upon addition of the action (A.1) in the weak curvature limit. As was pointed by Nieh [21], the Gauss–Bonnet term will remain invariant even in a curved non-flat metric \(g_{\mu \nu }\). But, for this work, the invariance in weak field limit is sufficient.
Appendix B: Variations in the Palatini formalism
The Palatini formalism for varying the action consists in taking the metric \(g^{\mu \nu }\) and the generic connection \(\tilde{\varGamma }^{\sigma }_{\cdot \alpha \beta }\) as the dynamical variables. So, it is useful to rewrite the action in terms of those variables. Some useful well-known relations for considering that variation are
Thus, one can easily obtain
Let us know consider the variation of the action written in terms of the Lagrangian density (26). This is
where the permutation tensors are
In order to compute the complete generalized Einstein tensor in Eq. (34), the following expressions are needed:
For the calculation of the generalized Palatini tensor in Eq. (35), we need the following expressions:
Then, taking into account the definition of the torsion and curvature tensors, Eqs. (3) and (14), respectively, the generalized Einstein and Palatini tensors of the quadratic Lagrangian density (24) read
Appendix C: Source tensors
In order to understand the r.h.s. of the field equations (42) and (43), it is necessary to make a distinction between the Hilbert definition of the energy-momentum tensor and the definition carried through in Eq. (32). Hilbert’s definition is made in a Riemannian \(\mathcal {V}_4\) spacetime and, therefore, there is a dependence of the matter Lagrangian density on \(\partial g\) introduced by the Levi-Civita connection. This definition is
Nevertheless, in the Palatini formalism this dependence on \(\partial g\) does not exist, since the matter Lagrangian depends on g and \(\tilde{\varGamma }\) as independent variables. Therefore, the energy-momentum tensor is as in Eq. (32). This is
There is a clear difference between the two definitions.
However, when the metricity condition is implemented, the connection \(\tilde{\varGamma }\) becomes \(\widehat{\varGamma }=\varGamma +K\) and, therefore, there appears a dependence on \(\partial g\) in the definition (32). The term \(\varDelta _{\nu \mu \kappa }^{\alpha \beta \gamma }\overline{\nabla }^{\kappa }\widehat{\varSigma }_{\alpha (\beta \gamma )}\) in the r.h.s. of Eq. (42) takes into account this new dependence, which is not present in the original definition of \(\widehat{\tau }_{\mu \nu }\). To check the consistency of this argument, let us take
where different tensors have been defined in Eqs. (8), (32) and (33). This leads to
The r.h.s. of Eq. (C.24) is exactly the expression on the r.h.s. of Eq. (42), while the l.h.s. is similar to Hilbert’s definition of the energy-momentum tensor (C.21). Indeed \(\widehat{\tau }_{\mu \nu }+\frac{1}{2}\varDelta _{\nu \mu \kappa }^{\alpha \beta \gamma }\overline{\nabla }^{\kappa }\widehat{\varSigma }_{\alpha (\beta \gamma )}\) is the generalisation of Hilbert’s definition of the energy-momentum tensor to the Riemann–Cartan \(U_4\) spacetime.
On the other hand, \(\varSigma _{[\tau \mu ]\nu }\) is related to the contortion tensor, which is the remaining part of the connection, see Ref. [27]. Thus, the r.h.s. of Eq. (43) corresponds to the spin distribution tensor
Appendix D: Vector and pseudo-vector torsion in the weak-gravity regime
In this appendix we are going to take the vector \(T^{\mu }\) and pseudo-vector \(S^{\mu }\) torsion components as the only non-vanishing torsion fields and calculate the expressions needed for the analysis carried out in Sect. 3.3.
Assuming that the only non-vanishing components of the torsion tensor in the decomposition (12) are the vector \(T_{\mu }\) and pseudo-vector \(S_{\mu }\) torsion components, the expression for the contortion tensor (11) can be rewritten as
Under this assumption, the curvature tensor (15) takes the form
The Ricci tensor is obtained by the usual contraction \(\widehat{R}^{\mu }_{{}^{.} \nu \mu \sigma } \),
and the scalar curvature \(\widehat{R}=\eta ^{\nu \sigma }\widehat{R}_{\nu \sigma } \),
As we want to get a set of stability condition on the parameters of the theory when \(g_{\mu \nu }=\eta _{\mu \nu }\), we take the expression of the curvature tensors (D.27), (D.28) and (D.29) to compute the scalars in the Lagrangian density (24). These are
Note that there are no terms \(\partial T \partial S\), \(\partial S TT\), or STT , as expected from parity conservation. On the other hand, it is also possible to compute the pure torsion squared terms via Eq. (12). These are,
In view of these calculations, the potential that appears in Eq. (53) is
Note that the parameter t does not appear in the expression of the potential, since the antisymmetric part of the Ricci tensor does not give rise to potential-type terms for the vector and pseudo-vector torsion degrees of freedom.
Rights and permissions
Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.
Funded by SCOAP3
About this article
Cite this article
Vasilev, T.B., Cembranos, J.A.R., Valcarcel, J.G. et al. Stability in quadratic torsion theories. Eur. Phys. J. C 77, 755 (2017). https://doi.org/10.1140/epjc/s10052-017-5331-6
Received:
Accepted:
Published:
DOI: https://doi.org/10.1140/epjc/s10052-017-5331-6