1 Introduction

The successful detection of gravitational waves made up the last puzzle missing in the experimental verification of general relativity [1,2,3,4,5]. Therefore, general relativity has become the most successful theory of gravity so far.

However, there are still many theoretical problems that can not be explained by general relativity, such as how to explain the hierarchy between the Planck scale and the electroweak scale [6,7,8] and how to quantize gravity [9]. In addition, the phenomena observed in experiments, such as the accelerated expansion of the Universe [10] and the flat rotation curves of galaxies [11], can not be explained by general relativity. For these reasons, many modified theories of gravity were considered [6,7,8, 12,13,14,15,16] in the hope of answering the problems that general relativity could not answer.

Adding additional scalar field is one way to modify gravity. Theories obtained in this way are called scalar–tensor theories. In Ref. [17], a tentative indication for scalar transverse gravitational waves was reported. If this is further confirmed in the future, it will strongly suggest that the gravity theory describing our world should have a scalar degree of freedom. In order to avoid the Ostrogradsky instability [18,19,20,21], we expect to give priority to those theories that can derive second-order field equations. In the metric formalism, the most general scalar–tensor theory that can derive second-order field equations is Horndeski theory [13].

However, Refs. [22,23,24] pointed out that the observation of the speed of tensor gravitational waves in the Universe by the gravitational wave event GW170817 together with the gamma ray burst GRB170817A would severely constrain the possible parameter space of metric Horndeski theory. Specifically, GW170817 and GRB170817A require the tensor gravitational wave speed \(c_g\) to meet [25, 26]

$$\begin{aligned} -3\times 10^{-15} \le \frac{c_g}{c}-1 \le 7\times 10^{-16}. \end{aligned}$$
(1)

This shows that in a very high precision, we can say that the tensor gravitational wave speed in the Universe is equal to the basic constant c (speed of light). Considering that the cosmic background is also evolving during gravitational wave propagation, the most economic and natural assumption made by this observation result for the theory seems to be that: in any cosmological background, tensor gravitational waves always propagate at the speed of light. However, the possible subclasses of metric Horndeski theory satisfying this assumption only remain [22, 27]

$$\begin{aligned} S=\int d^4x \sqrt{-g} \big [ K(\phi ,X)-G_{3}(\phi ,X){\Box }\phi +G_{4}(\phi )R \big ].\nonumber \\ \end{aligned}$$
(2)

This constraint limits the application of scalar–tensor theories. Therefore, We expect to find scalar–tensor theories beyond the metric Horndeski framework. Reference [28] studied the orbital evolution of eccentric binary systems in metric Horndeski gravity. There are also many studies using GW170817 to constrain modify gravity theories [29,30,31,32,33,34,35,36,37,38,39,40].

Further analysis shows that not all higher derivative theories have the Ostrogradsky instability. The higher derivative theory without the Ostrogradsky instability is required to satisfy the degeneracy condition [21, 41, 42]. In the metric formalism, the scalar–tensor theory with higher derivative but without the Ostrogradsky instability is called degenerate higher-order scalar–tensor (DHOST) theory [43,44,45,46,47]. In addition to DHOST theory, considering the teleparallel framework is another way to go beyond the metric Horndeski framework. In teleparallel Horndeski theory established by Bahamonde et al., metric Horndeski theory is included in the teleparallel framework as one of many subclasses [48, 49].

Considering the scalar–tensor theory in the Palatini formalism may be another way to go beyond metric Horndeski framework. There have been some works on scalar–tensor gravity in the Palatini formalism [50,51,52,53,54,55,56,57,58,59,60,61,62,63,64,65,66,67]. Cosmology in Palatini–Horndeski theory is different from that in metric Horndeski theory and their stability properties are different [60]. Different from metric Horndeski theory, under some parameter spaces, the connection of Palatini–Horndeski theory will introduce some new degrees of freedom [60]. In addition, the polarization modes of gravitational waves in Palatini–Horndeski theory are different from that in metric Horndeski theory [63]. Thus, it seems that Palatini–Horndeski theory may be different from metric Horndeski. However, it is necessary to further investigate the possible parameter space of Palatini–Horndeski theory.

In this paper, we will find possible subclasses of Palatini-Horndeski theory that satisfied the following condition: the speed of tensor gravitational waves is the speed of light in any spatially flat cosmological background. In Sect. 2, we review Palatini–Horndeski theory. In Sect. 3, we develop a general method in the Palatini formalism to obtain the speed of tensor gravitational waves in the spatially flat cosmological background and constrain the parameter space. The conclusion is given in Sect. 4.

We use the natural system of units in this paper. Greek alphabet indices \((\mu ,\nu ,\lambda ,\rho )\) and Latin alphabet indices (ijkl) range over spacetime indices (0, 1, 2, 3) and space indices (1, 2, 3), respectively.

2 Palatini–Horndeski theory

In the Palatini formalism, the connection is independent of the metric. Therefore, it is necessary to take the variations of the action with respect to the metric and the connection independently. The Riemann tensor \({\tilde{R}}^{\mu }_{\ \nu \rho \sigma }\) and Ricci tensor \({\tilde{R}}_{\mu \nu }\) in the Palatini formalism are defined as

$$\begin{aligned} {\tilde{R}}^{\rho }_{~\mu \lambda \nu }= & {} \partial _{\lambda }\varGamma ^{\rho }_{\mu \nu } -\partial _{\nu }\varGamma ^{\rho }_{\mu \lambda } +\varGamma ^{\rho }_{\sigma \lambda }\varGamma ^{\sigma }_{\mu \nu } -\varGamma ^{\rho }_{\sigma \nu }\varGamma ^{\sigma }_{\mu \lambda }, \end{aligned}$$
(3)
$$\begin{aligned} {\tilde{R}}_{\mu \nu }= & {} \partial _{\lambda }\varGamma ^{\lambda }_{\mu \nu } -\partial _{\nu }\varGamma ^{\lambda }_{\mu \lambda } +\varGamma ^{\lambda }_{\sigma \lambda }\varGamma ^{\sigma }_{\mu \nu } -\varGamma ^{\lambda }_{\sigma \nu }\varGamma ^{\sigma }_{\mu \lambda }. \end{aligned}$$
(4)

Furthermore, we assume that the connection is nontorsion: \(\varGamma ^{\lambda }_{\mu \nu }=\varGamma ^{\lambda }_{\nu \mu }\).

The action of Palatini–Horndeski theory is defined as follows:

$$\begin{aligned} S = \int d^4x \sqrt{-g} \;\Bigl ({\mathcal {L}}_{2}+{\mathcal {L}}_{3}+{\mathcal {L}}_{4} +{\mathcal {L}}_{5}\Bigl ), \end{aligned}$$
(5)

where

$$\begin{aligned} {\mathcal {L}}_{2}= & {} K(\phi ,X), \end{aligned}$$
(6)
$$\begin{aligned} {\mathcal {L}}_{3}= & {} -G_{3}(\phi ,X){\tilde{\Box }}\phi ,\end{aligned}$$
(7)
$$\begin{aligned} {\mathcal {L}}_{4}= & {} G_{4}(\phi ,X){\tilde{R}}\nonumber \\{} & {} +G_{4,X}( \phi ,X) \left[ \left( {{\tilde{\Box }}\phi }\right) ^{2} -\left( {\tilde{\nabla }}_{\mu }{\tilde{\nabla }}_{\nu }\phi \right) \left( {\tilde{\nabla }}^{\mu }{\tilde{\nabla }}^{\nu }\phi \right) \right] , \end{aligned}$$
(8)
$$\begin{aligned} {\mathcal {L}}_{5}= & {} G_{5}(\phi ,X)\left( {\tilde{R}}_{\mu \nu } -\frac{1}{2} g _{\mu \nu }{\tilde{R}}\right) {\tilde{\nabla }}^{\mu }{\tilde{\nabla }}^{\nu }\phi \nonumber \\{} & {} -\frac{1}{6}G_{5,X}(\phi ,X) \left[ \left( {\tilde{\Box }}\phi \right) ^{3}-3{\tilde{\Box }}\phi \left( {\tilde{\nabla }}_{\mu }{\tilde{\nabla }}_{\nu }\phi \right) \left( {\tilde{\nabla }}^{\mu }{\tilde{\nabla }}^{\nu }\phi \right) \right. \nonumber \\{} & {} +2\left. \left( {\tilde{\nabla }}^{\lambda }{\tilde{\nabla }}_{\rho }\phi \right) \left( {\tilde{\nabla }}^{\rho }{\tilde{\nabla }}_{\sigma }\phi \right) \left( {\tilde{\nabla }}^{\sigma }{\tilde{\nabla }}_{\lambda }\phi \right) \right] . \end{aligned}$$
(9)

Here, \({\tilde{\Box }}={\tilde{\nabla }}^{\mu }{\tilde{\nabla }}_{\mu }\), \(X=-\frac{1}{2}\partial _{\mu }\phi \partial ^{\mu }\phi \), and \( K, G_{3}, G_{4}\) and \(G_{5}\) are real analytic functions of the variables \(\phi \) and X. To distinguish the quantities in the metric formalism, we add tilde to represent the corresponding quantities defined in the Palatini formalism. A comma in subscript means partial derivative, e.g., \(G_{4,X}\equiv \partial G_4 /\partial X\).

In the Palatini formalism, the compatibility condition \({\tilde{\nabla }}_{\lambda }g_{\mu \nu }=0\) is generally no longer valid. Therefore, the definition of \({\tilde{\Box }}\phi , {\tilde{\nabla }}^{\mu }{\tilde{\nabla }}^{\nu }\phi \) and \( {\tilde{\nabla }}^{\mu }{\tilde{\nabla }}_{\nu }\phi \) in the action (5) in the Palatini formalism is not unique [60, 62]. In this paper, we take the definition in Ref. [63]:

$$\begin{aligned} {\tilde{\Box }}\phi= & {} g^{\mu \nu }{\tilde{\nabla }}_{\mu } {\tilde{\nabla }}_{\nu }\phi ,\nonumber \\ {\tilde{\nabla }}^{\mu }{\tilde{\nabla }}^{\nu }\phi= & {} g^{\mu \rho }{\tilde{\nabla }}_{\rho }(g^{\nu \sigma }{\tilde{\nabla }}_{\sigma }\phi ),\\ {\tilde{\nabla }}^{\mu }{\tilde{\nabla }}_{\nu }\phi= & {} g^{\mu \rho }{\tilde{\nabla }}_{\rho }{\tilde{\nabla }}_{\nu }\phi . \nonumber \end{aligned}$$
(10)

3 The speed of tensor gravitational waves

In this section, we will calculate the speed of tensor gravitational waves propagating in a spatially flat cosmological background and find possible subclasses of Palatini–Horndeski theory that satisfy the following condition: the speed of tensor gravitational waves is the speed of light in any spatially flat cosmological background.

We first consider the background evolution of a spatially flat Universe in the Palatini–Horndeski theory. For a spatially flat Universe, the metric \(g_{\mu \nu }\) is the spatially flat Friedmann-Robertson-Walker (FRW) metric, and the connection \(\varGamma ^{\lambda }_{\mu \nu }\) and scalar field \(\phi \) are only functions of time:

$$\begin{aligned} ds^2= & {} -N(t)^2 dt^2+a(t)^2\delta _{ij}dx^idx^j,\nonumber \\ \varGamma ^{\lambda }_{\mu \nu }= & {} \varGamma ^{\lambda }_{\mu \nu }(t), \quad \phi =\phi (t). \end{aligned}$$
(11)

We consider that the connection has spatial isotropy, that is, under the spatial rotation transformation, the components of the connection \(\varGamma ^{\lambda }_{\mu \nu }\) are invariant. This condition further limits the value of the connection. Specifically, under the spatial rotation transformation, the transformation law of the components of the connection is the same as that of the third-order tensor, which requires that the connection \(\varGamma ^{\lambda }_{\mu \nu }\) satisfies [68, 69]

$$\begin{aligned} \varGamma ^{0}_{0i}= & {} \varGamma ^{i}_{00}=0, \quad \varGamma ^{0}_{ij}=\varGamma ^{0}_{11} \delta _{ij}, \quad \varGamma ^{i}_{0j}=\varGamma ^{1}_{01} \delta ^{i}_{j}, \nonumber \\ \varGamma ^{i}_{jk}= & {} \varGamma ^{1}_{23} \delta ^{il} \varepsilon _{ljk} =\varGamma ^{1}_{23} \delta ^{il} \varepsilon _{l(jk)} =0. \end{aligned}$$
(12)

Here, \(\delta _{ij}\) is the Kronecker delta, and \(\varepsilon _{ijk}\) is the Levi-Civita tensor. It can be seen that for the components of the connection, only \(\varGamma ^{0}_{00}, \varGamma ^{0}_{11}=\varGamma ^{0}_{22}=\varGamma ^{0}_{33}\) and \(\varGamma ^{1}_{01}=\varGamma ^{2}_{02}=\varGamma ^{3}_{03}\) may not be zero.

By substituting Eqs. (11) and (12) into the action (5), we obtain the action that describes the evolution of a spatially flat Universe:

$$\begin{aligned} S= & {} \int dt\int d^3x \nonumber \\{} & {} \times L\left( \ddot{\phi },{\dot{\phi }},\phi ,{\dot{N}},N, a,\varGamma ^{0}_{00},{\dot{\varGamma }}^{0}_{11}, {\varGamma }^{0}_{11},{\dot{\varGamma }}^{1}_{01},{\varGamma }^{1}_{01}\right) , \end{aligned}$$
(13)

where,

$$\begin{aligned} \frac{L}{a^3 N}= & {} K +\frac{3G_3\varGamma ^{0}_{11}{\dot{\phi }}}{a^2} -\frac{G_3}{N^2}\left( \varGamma ^{0}_{00}{\dot{\phi }}-\ddot{\phi }\right) \nonumber \\{} & {} + \frac{3G_4}{a^2}\left( \varGamma ^{0}_{00}\varGamma ^{0}_{11}+\varGamma ^{0}_{11}\varGamma ^{1}_{01} +{\dot{\varGamma }}^{0}_{11} \right) \nonumber \\{} & {} -\frac{3G_4}{N^2}\left( \varGamma ^{0}_{00}\varGamma ^{1}_{01}-{\varGamma ^{1}_{01}}^2-{\dot{\varGamma }}^{1}_{01} \right) \nonumber \\{} & {} + \frac{3G_5\varGamma ^{1}_{01}{\dot{\phi }}}{2a^2N^4} \left[ N^2\left( \varGamma ^{0}_{00}\varGamma ^{0}_{11}+\varGamma ^{0}_{11}\varGamma ^{1}_{01}+{\dot{\varGamma }}^{0}_{11} \right) \right. \nonumber \\{} & {} + \left. 3a^2\left( -\varGamma ^{0}_{00}\varGamma ^{1}_{01}+{\varGamma ^{1}_{01}}^2+{\dot{\varGamma }}^{1}_{01} \right) \right] \nonumber \\{} & {} + \frac{3G_5}{2a^2N^5} \left[ N^2\left( \varGamma ^{0}_{00}\varGamma ^{0}_{11}+\varGamma ^{0}_{11}\varGamma ^{1}_{01} +{\dot{\varGamma }}^{0}_{11} \right) \right. \nonumber \\{} & {} +\left. a^2\left( \varGamma ^{0}_{00}\varGamma ^{1}_{01}-{\varGamma ^{1}_{01}}^2-{\dot{\varGamma }}^{1}_{01} \right) \right] \nonumber \\{} & {} \times \left[ -2{\dot{N}}{\dot{\phi }}+N\left( \varGamma ^{0}_{00}{\dot{\phi }}+\ddot{\phi }\right) \right] \nonumber \\{} & {} +\frac{G_{4,X}{\dot{\phi }}}{a^4N^5} \left[ 9N^5{\varGamma ^{0}_{11}}^2{\dot{\phi }} \right. \nonumber \\{} & {} - 3a^2N^3\varGamma ^{0}_{11} \left( 2\varGamma ^{0}_{00}{\dot{\phi }}+\varGamma ^{1}_{01}{\dot{\phi }}-2\ddot{\phi } \right) \nonumber \\{} & {} +\left. 2a^4\left( N\varGamma ^{0}_{00}-{\dot{N}} \right) \left( \varGamma ^{0}_{00}{\dot{\phi }}-\ddot{\phi } \right) \right] \nonumber \\{} & {} - \frac{G_{5,X}}{2a^6N^7}{\dot{\phi }} \left[ -11N^7{\varGamma ^{0}_{11}}^3{\dot{\phi }}^2 \right. \nonumber \\{} & {} -\left. 3a^4N^2\varGamma ^{0}_{11} \left( N\left( 2\varGamma ^{0}_{00}+\varGamma ^{1}_{01}\right) -2{\dot{N}} \right) {\dot{\phi }}\left( \varGamma ^{0}_{00}{\dot{\phi }}-\ddot{\phi }\right) \right. \nonumber \\{} & {} + \left. 9a^2N^5{\varGamma ^{0}_{11}}^2{\dot{\phi }} \left( \varGamma ^{0}_{00}{\dot{\phi }}+\varGamma ^{1}_{01}{\dot{\phi }}-\ddot{\phi } \right) \right. \nonumber \\{} & {} +\left. 2a^6\left( N\varGamma ^{0}_{00}-{\dot{N}}\right) \left( -\varGamma ^{0}_{00}{\dot{\phi }}+\ddot{\phi }\right) ^2 \right] . \end{aligned}$$
(14)

Here and below, the dot on the letter represents the derivative of the corresponding quantity with respect to time, \(K,G_3,G_4,G_5,G_{4,X}\), and \(G_{5,X}\) are functions of \((\phi ,\frac{{\dot{\phi }}^2}{2N^2})\).

One may want to set \(N(t)=1\) at the level of action, and then obtain the evolution equations by varying the variables \(\left( a,\phi ,\varGamma ^{0}_{00},\varGamma ^{0}_{11},\varGamma ^{1}_{01}\right) \). However, this will miss one equation [70]. Thus, in order to obtain complete evolution equations, N should be kept in the action.

In addition to the gravitational field, the ideal fluid material field is also distributed in the spatially flat Universe. Therefore, in addition to the gravitational field action (13), we should also add an action \(S_m\) that describes the ideal fluid into the the total action

$$\begin{aligned} S_{tot}=S+S_m. \end{aligned}$$
(15)

Here, S is defined by (13). In the Palatini formalism, \(S_m\) is only a function of the metric and the material field, and it is independent of the connection. Varying the action \(S_m\) with respect to \(g_{\mu \nu }\), we obtain

$$\begin{aligned} \delta S_m=-\frac{1}{2}\int d^4x \sqrt{-g} T^{\mu \nu } \delta {g_{\mu \nu }}. \end{aligned}$$
(16)

Here, \(T^{\mu \nu }\) is the energy-momentum tensor of the ideal fluid:

$$\begin{aligned} T^{\mu \nu }=(P+\epsilon )u^{\mu }u^{\nu }+Pg^{\mu \nu }, \end{aligned}$$
(17)

where \(\epsilon \) is the matter density, P is the matter pressure. The four-velocity \(u^{\mu }\) satisfies \(u^{0}=\frac{1}{N}, u^{i}=0\).

By varying the action (15) with respect to \(N,\phi ,a,\varGamma ^{0}_{00},\varGamma ^{0}_{11}\) and \(\varGamma ^{1}_{01}\), we obtain the background equations:

$$\begin{aligned} \begin{aligned}{} & {} \frac{d}{dt}\frac{\partial {{L}}}{\partial {{\dot{N}}}} -\frac{\partial {{L}}}{\partial {N}}+a^3\epsilon =0, \\{} & {} \frac{d^2}{{dt}^2}\frac{\partial {{L}}}{\partial {\ddot{\phi }}} -\frac{d}{dt}\frac{\partial {{L}}}{\partial {{\dot{\phi }}}} +\frac{\partial {{L}}}{\partial {{\phi }}}=0, \\{} & {} \frac{\partial {L}}{\partial {a}}-3a^2 P=0,~ \frac{\partial {L}}{\partial {\varGamma ^{0}_{00}}}=0, \\{} & {} \frac{d}{dt}\frac{\partial {{L}}}{\partial {{\dot{\varGamma }}^{0}_{11}}} -\frac{\partial {{L}}}{\partial {{\varGamma }^{0}_{11}}}=0, \\{} & {} \frac{d}{dt}\frac{\partial {{L}}}{\partial {{\dot{\varGamma }}^{1}_{01}}} -\frac{\partial {{L}}}{\partial {{\varGamma }^{1}_{01}}}=0. \end{aligned} \end{aligned}$$
(18)

Here, L is defined by (14). Because the specific expressions of the background equations (18) are very lengthy and easy to obtain, they will not be listed in this paper. In the following, we take \(N(t)=1\).

In order to study the tensor gravitational waves, we need to obtain the linear perturbation equations of the tensor perturbations on the spatially flat cosmological background.

Since the metric and connection are independent in the Palatini formalism, they should be perturbed independently:

$$\begin{aligned} g_{\mu \nu } \rightarrow g_{\mu \nu }+h_{\mu \nu }, \quad \varGamma ^{\lambda }_{\mu \nu } \rightarrow \varGamma ^{\lambda }_{\mu \nu }+\varSigma ^{\lambda }_{\mu \nu }. \end{aligned}$$
(19)

We take the part describing tensor gravitational waves in perturbations:

$$\begin{aligned} h_{00}= & {} h_{0i}=0,~ h_{ij}=H_{ij},~ \varSigma ^{0}_{00}=\varSigma ^{0}_{0i}=\varSigma ^{i}_{00}=0,\nonumber \\ \varSigma ^{0}_{ij}= & {} A_{ij},~ \varSigma ^{i}_{0j}=B^i_j,~ \varSigma ^{i}_{jk}=\partial ^{i}C_{jk}+\partial _{(j}D^{i}_{k)}. \end{aligned}$$
(20)

Here, \(H_{ij}, A_{ij}, B_{ij}, C_{ij}\) and \(D_{ij}\) are symmetric transverse traceless tensors. They satisfy

$$\begin{aligned} H_{ij}= & {} H_{ji},~ A_{ij}=A_{ji},~ B_{ij}=B_{ji},~\nonumber \\ C_{ij}= & {} C_{ji},~ D_{ij}=D_{ji},\nonumber \\ H_{i}^{i}= & {} A_{i}^{i}=B_{i}^{i}=C_{i}^{i}=D_{i}^{i}=0,\\ \partial ^{i}H_{ij}= & {} \partial ^{i}A_{ij}=\partial ^{i}B_{ij} =\partial ^{i}C_{ij}=\partial ^{i}D_{ij}=0.\nonumber \end{aligned}$$
(21)

Only in this paragraph, we use \(\delta ^{ij}\) (\(\delta _{ij}\)) to raise and lower the index. In Appendix 1, we give the decomposition of the connection and explain why the perturbations describing the tensor gravitational waves are given by Eq. (20).

Without losing generality, we consider the propagation direction of gravitational waves as \(+z\) direction. At this time, it can be seen from (20) that the components of the perturbations \(h_{\mu \nu }\) and \(\varSigma ^{\lambda }_{\mu \nu }\) that may not be zero are

$$\begin{aligned} h_{11}= & {} -h_{22},~h_{12},~ \varSigma ^{0}_{11}=-\varSigma ^{0}_{22},~ \varSigma ^{0}_{12},~ \nonumber \\ \varSigma ^{1}_{01}= & {} -\varSigma ^{2}_{02}, ~ \varSigma ^{1}_{02}=\varSigma ^{2}_{01},~ \varSigma ^{1}_{13}=-\varSigma ^{2}_{23},\nonumber \\ \varSigma ^{1}_{23}= & {} \varSigma ^{2}_{13},~ \varSigma ^{3}_{11}=-\varSigma ^{3}_{22},~ \varSigma ^{3}_{12}. \end{aligned}$$
(22)

By expanding the second-order terms of the perturbations (22) in the action (15) and varying the action with respect to the perturbations, we can obtain the linear perturbation equations describing the tensor gravitational waves. These equations are very lengthy and easy to obtain, so they are not listed here.

Now we have obtained the linear perturbation equations describing the tensor gravitational waves. Next, we will use the equations to obtain the speed of the tensor gravitational waves.

Before that, we will take metric Horndeski theory as an example to demonstrate how to obtain the speed of tensor gravitational waves from the linear perturbation equation. In metric Horndeski theory, the linear perturbation equation describing the tensor gravitational waves is given by [27]:

$$\begin{aligned} \ddot{h}+b(t){\dot{h}}-\frac{{c_t}^2(t)}{a^2(t)}\varDelta h=0, \end{aligned}$$
(23)

where h is the component \(h_{11}\) or \(h_{12}\), b and \(c_t\) are functions of time, and \(\varDelta \) is the Laplace operator. For h(tz) propagating along the \(+z\) direction, we make a Fourier transform:

$$\begin{aligned} h(t,z)=\int d^3 k_3 f_{k_3}(t) e^{-ik_{3 z}}. \end{aligned}$$
(24)

By substituting Eq. (24) into Eq. (23), and using the linearity of Eq. (23), we obtain the following equation:

$$\begin{aligned} \ddot{f}_{k_3}+b(t){\dot{f}}_{k_3}+\frac{{c_t}^2(t)}{a^2(t)} k_3^2 f_{k_3}=0. \end{aligned}$$
(25)

This allows us to consider only the case with a single spatial wave vector \(k_3\):

$$\begin{aligned} h=f(t) e^{-i {k_3} z}, \end{aligned}$$
(26)

where f(t) can always be expressed as

$$\begin{aligned} f(t)=F(t) e^{i {k_0}(t) t}. \end{aligned}$$
(27)

Here, F is the norm of f(t) and \({k_0}(t) t\) is the argument. Therefore, F and \({k_0}\) are real numbers.

Considering that the gravitational wave is observed near time \(t_0\) and the observation duration is \(\varDelta T\), that is, the observation time \(t \in [t_0-\frac{\varDelta T}{2},t_0+\frac{\varDelta T}{2}]\). The duration \(\varDelta T\) is about the same order of magnitude as the period of the gravitational wave, and during this time, the amplitude and phase of the gravitational wave change very little:

$$\begin{aligned} \varDelta T \sim \frac{2\pi }{k_{0}} \sim \frac{1}{k_{0}}, \quad {\dot{F}}\varDelta T \ll F, \quad \dot{k_0}\varDelta T \ll k_0. \end{aligned}$$
(28)

Thus, \(h=F(t)e^{i[{k_0(t) t-k_3 z}]}\) can be approximated as a plane gravitational wave near \(t_0\):

$$\begin{aligned} h=F(t_0)e^{i[k_0(t_0) t-k_3 z]}. \end{aligned}$$
(29)

For the evolution of the cosmic background, the changes of ab and \(c_t\) in Eq. (23) during this period are also small:

$$\begin{aligned} {\dot{a}}\varDelta T \ll a, \quad {\dot{b}}\varDelta T \ll b, \quad \dot{c_t}\varDelta T \ll c_t. \end{aligned}$$
(30)

So Eq. (23) near \(t_0\) can be approximated as

$$\begin{aligned} \ddot{h}+b(t_0){\dot{h}}-\frac{{c_t}^2(t_0)}{a^2(t_0)}\varDelta h=0. \end{aligned}$$
(31)

By substituting Eq. (29) into Eq. (31), we can obtain

$$\begin{aligned} -{k_0}^2(t_0)+i b(t_0) k_{0}(t_0)+\frac{{c_t}^2(t_0)}{a^2(t_0)} {k_3}^2=0. \end{aligned}$$
(32)

The gravitational waves we can observe have large \(k_0\) and \(k_3\), which makes the linear term of wave vector component (uniformly recorded as k) in the above equation negligible compared with the quadratic term of k. Thus, by Eq. (32), the relationship between \(k_0\) and \(k_3\) will satisfy

$$\begin{aligned} -{k_0}^2(t_0)+\frac{{c_t}^2(t_0)}{a^2(t_0)} {k_3}^2=0. \end{aligned}$$
(33)

Just write (29) as

$$\begin{aligned} h=F(t_0)e^{ik_0(t_0) t}e^{-\left( \frac{k_3}{a(t_0)}\right) \left( a(t_0)z\right) }, \end{aligned}$$
(34)

and using Eq. (33), we can see that the tensor gravitational wave speed \(c_g\) at time \(t_0\) is

$$\begin{aligned} c_g(t_0)=\frac{a(t_0)k_0(t_0)}{k_3}=\frac{a(t_0)}{k_3} \frac{c_t(t_0)}{a(t_0)} k_3=c_t(t_0). \end{aligned}$$
(35)

The speed (1) obtained by this method is the same as that of Refs. [27, 71].

Similar to the above analysis, for Palatini–Horndeski theory near a certain time, we also approximate the coefficients of perturbations in the linear perturbation equations to constants which are independent of time. In addition, we also approximate the perturbations (22) to the form of plane gravitational waves:

$$\begin{aligned} h_{\mu \nu }={\bar{h}}_{\mu \nu } e^{i\left( k_0 t-k_3 z\right) },\quad \varSigma ^{\lambda }_{\mu \nu }={\bar{\varSigma }}^{\lambda }_{\mu \nu } e^{i\left( k_0 t-k_3 z\right) }. \end{aligned}$$
(36)

Here, \({\bar{h}}_{\mu \nu }\) and \({\bar{\varSigma }}^{\lambda }_{\mu \nu } \) are amplitudes. Similar to the above example of metric Horndeski theory, by substituting (36) into the approximated linear perturbation equations, we can obtain the linear equations with amplitudes \({\bar{h}}_{\mu \nu }\) and \({\bar{\varSigma }}^{\lambda }_{\mu \nu }\) as the variable. This equations can be written in matrix form:

$$\begin{aligned} AX=0, \end{aligned}$$
(37)

where A is a \(10 \times 10\) matrix and it depends on variables \(k_0\) and \(k_3\). X is a column vector composed of the components of the amplitudes \({\bar{h}}_{\mu \nu }\) and \({\bar{\varSigma }}^{\lambda }_{\mu \nu }\). The specific expression of A is very lengthy and easy to obtain, so it is not listed in this paper.

Equation (37) has a gravitational wave solution if and only if

$$\begin{aligned} \det (A)=0. \end{aligned}$$
(38)

As in the above example, we consider \(k_0\) and \(k_3\) to be large. Thus, the lower power terms of k in \(\det (A)\) are ignored, and only the highest power terms of k are retained. We mark the remaining quantity in \(\det (A)\) as \({\mathcal {A}}\). Therefore, according to the equation

$$\begin{aligned} {\mathcal {A}}(k_0,k_3)=0, \end{aligned}$$
(39)

we can know the relationship between \(k_0\) and \(k_3\), so as to solve the speed of tensor gravitational waves.

Now, we will calculate the speed of tensor gravitational waves propagating in a spatially flat cosmological background.

We divide the parameter space of Palatini–Horndeski theory into two classes.

Class I: \(G_{5}(\phi ,X)=0\). In this class, by solving Eq. (39), we find that the tensor gravitational wave speed \(c_g\) is given by

$$\begin{aligned} c_g^2=\frac{a^2 k_0^2}{k_3^2}=\left( 1+\frac{1}{2} \frac{G_{4,X}}{G_4} {{\dot{\phi }}}^2\right) ^2. \end{aligned}$$
(40)

Thus, the condition that the tensor gravitational wave speed is always the speed of light in any spatially flat cosmological background requires

$$\begin{aligned} {G_{4,X}} {{\dot{\phi }}}^2=0 \end{aligned}$$
(41)

for any spatially flat cosmological background.

In this class, by solving the background equation (18), we find that \(\big (\ddot{a},\dddot{\phi },\varGamma ^{0}_{00},{\varGamma }^{1}_{01},{\varGamma }^{0}_{11}\big )\) can be expressed as functions of \(\big (a,{\dot{a}},\phi ,{\dot{\phi }},\ddot{\phi },P,\epsilon \big )\). Further considering the equation of state and the energy conservation equation of the ideal fluid, we can also use \(\big (P,a,{\dot{a}}\big )\) to express \(\big ({\dot{P}},{\dot{\epsilon }}\big )\). Therefore, as long as we know \(\big (a,{\dot{a}},\phi ,{\dot{\phi }},\ddot{\phi },P,\epsilon \big )\), we can obtain \(\big (\ddot{a},\dddot{\phi },\varGamma ^{0}_{00},{\varGamma }^{1}_{01},{\varGamma }^{0}_{11},{\dot{P}},{\dot{\epsilon }}\big )\). This determines the initial value condition of the background equation (18). The specific expressions of these variables are very lengthy and easy to obtain, so we have not listed them.

Considering that there are different equations of state for different types of matters, P and \(\epsilon \) can be considered as independent variables. Therefore, the condition that the tensor gravitational wave speed is the speed of light in any spatially flat cosmological background is equivalent to the following condition: at any values of the variables \(\big (a,{\dot{a}},\phi ,{\dot{\phi }},\ddot{\phi },P,\epsilon \big )\), condition (41) is always true. This requires that \(G_{4,X}\) is always vanishing.

In this way, we find that in Class I, only subclass

$$\begin{aligned} G_5=0,\quad G_{4,X}=0 \end{aligned}$$
(42)

satisfies the condition that the tensor gravitational wave speed is the speed of light in any spatially flat cosmological background.

Class II: \(G_{5}(\phi ,X)\ne 0\). In this class, by solving the background equation (18), we can see that \(\big ({\dot{a}},\dddot{\phi },{\dot{\varGamma }}^{0}_{00},{\dot{\varGamma }}^{0}_{11},{\dot{\varGamma }}^{1}_{01}\big )\) can be expressed as the functions of \(\big (a,\phi ,{\dot{\phi }},\ddot{\phi },{\varGamma }^{0}_{00},{\varGamma }^{0}_{11},{\varGamma }^{1}_{01},P,\epsilon \big )\). Further considering the state equation and the energy conservation equation of the ideal fluid, we can also use \(\big (P,a,{\dot{a}}\big )\) to express \(\big ({\dot{P}},{\dot{\epsilon }}\big )\). Therefore, as long as we know \(\big (a,\phi ,{\dot{\phi }},\ddot{\phi },{\varGamma }^{0}_{00},{\varGamma }^{0}_{11},{\varGamma }^{1}_{01},P,\epsilon \big )\), we can solve \(\big ({\dot{a}},\dddot{\phi },{\dot{\varGamma }}^{0}_{00}, {\dot{\varGamma }}^{0}_{11},{\dot{\varGamma }}^{1}_{01},{\dot{P}},{\dot{\epsilon }}\big )\). This determines the initial value condition of the background equation (18). The specific expressions of these variables are very lengthy, so we do not list them. By substituting these expressions into Eq. (39) and solving it, we can obtain the tensor gravitational wave speed expressed by the variables \(\big (a,\phi ,{\dot{\phi }},\ddot{\phi },{\varGamma }^{0}_{00},{\varGamma }^{0}_{11},{\varGamma }^{1}_{01},P,\epsilon \big )\).

In fact, the tensor gravitational wave speed we solved is not unique in this class, and it has two possible solutions \(c_{g1}\) and \(c_{g2}\). These two speeds are generally different. However, when the matter pressure \(P=0\), we have \(c_{g1}=c_{g2}\). The specific expressions of \(c_{g1}\) and \(c_{g2}\) are very lengthy, so we do not list them.

The first speed \(c_{g1}^2\) can be expressed as a fraction

$$\begin{aligned} c_{g1}^2=\frac{{\mathcal {N}}}{{\mathcal {D}}}. \end{aligned}$$
(43)

It can be seen that \(c_{g1}=1\) is equivalent to the numerator part \({\mathcal {N}}\) on the right side of Eq. (43) minus the denominator part \({\mathcal {D}}\) equal to 0:

$$\begin{aligned} M\equiv {{\mathcal {N}}}-{{\mathcal {D}}}=0. \end{aligned}$$
(44)

By expanding the brackets, M can be expressed as a polynomial about the variables \(\big (a,\phi ,{\dot{\phi }},\ddot{\phi },{\varGamma }^{0}_{00},{\varGamma }^{0}_{11},{\varGamma }^{1}_{01},P,\epsilon \big )\). If we require the tensor gravitational wave speed \(c_{g1}\) to be the speed of light under any spatially flat cosmological background, then for any values of the variables \(\big (a,\phi ,{\dot{\phi }},\ddot{\phi },{\varGamma }^{0}_{00},{\varGamma }^{0}_{11},{\varGamma }^{1}_{01},P,\epsilon \big )\), this polynomial should be 0. We notice that in this polynomial, the term where \((\varGamma ^{0}_{00})^4\epsilon \) appears is

$$\begin{aligned} 6 a^{14} (G_5)^5 {{\dot{\phi }}}^5 \Big ( 5 G_{5,X} {{\dot{\phi }}}^2 -2 {G_5} \Big ) (\varGamma ^{0}_{00})^4\epsilon . \end{aligned}$$
(45)

Therefore, the above condition requires

$$\begin{aligned} G_5=0 ~~\text {or}~~G_5=\frac{5}{2}{{\dot{\phi }}}^2G_{5,X}. \end{aligned}$$
(46)

If substituting the condition \(G_5=\frac{5}{2}{{\dot{\phi }}}^2G_{5,X}\) into Eq. (44), we again notice that in this polynomial, the term where \(({\varGamma ^{1}_{01}})^4\epsilon \) appears is

$$\begin{aligned} -\frac{1171875}{16} a^{14} (G_{5,X})^6 {{\dot{\phi }}}^{17} (\varGamma ^{1}_{01})^4\epsilon . \end{aligned}$$
(47)

Therefore, the above condition further requires \(G_{5,X}=0\). Combining with the condition (46) we have \(G_5=0\). However, this is inconsistent with the assumption \(G_5 \ne 0\) in this class.

For the second solution \(c_{g2}\), using the same analysis method as that used to analyze the first solution \(c_{g1}\), we find that the condition of \(c_{g2}=1\) also requires \(G_5=0\).

To sum up, for Palatini–Horndeski theory, the parameter space satisfying the condition that the tensor gravitational wave speed is the speed of light under any spatially flat cosmological background is only \(G_{4,X}=0\) and \(G_5=0\).

4 Conclusion

In this paper, we calculated the speed of tensor gravitational waves in the spatially flat cosmological background. Unlike the metric formalism, in the Palatini formalism, the background perturbation describing tensor gravitational waves is no longer just the tensor perturbation of metric \(h^{TT}_{\mu \nu }\), but there are also several tensor perturbations of connection (see Eq. (22)). Therefore, linear perturbation equations describing tensor gravitational waves in the Palatini formalism are often a system of equations. For such a system of equations, we developed a general method in the Palatini formalism to obtain the speed of tensor gravitational waves in the spatially flat cosmological background in Sect. 3. This method is generally applicable to any theory in the Palatini formalism.

It is worth noting that we found that there are two possible speeds of tensor gravitational waves in Class II. This is due to the additional degrees of freedom introduced by the tensor perturbations of the connection. It seems to imply that if we observe two tensor gravitational waves with different speeds in the future, the theory of gravitation describing our world may be described by the Palatini formalism.

However, if we further require the tensor gravitational wave speed to be the speed of light c in any spatially flat cosmological background, then only

$$\begin{aligned}{} & {} S\left( g,\varGamma ,\phi \right) \nonumber \\{} & {} \quad =\int d^4x \sqrt{-g}\big [K(\phi ,X)-G_{3}(\phi ,X) {{\tilde{\Box }}}\phi +G_{4}(\phi ){\tilde{R}} \big ]\nonumber \\ \end{aligned}$$
(48)

is left as the possible action in the above two subclasses of Palatini–Horndeski theory. Reference [60] pointed out that the action (48) in the Palatini formalism is actually equivalent to the following action in the metric formalism:

$$\begin{aligned}{} & {} S\left( g,\phi \right) \nonumber \\{} & {} \quad =\int d^4x \sqrt{-g}\big [{\bar{K}}(\phi ,X) -G_{3}(\phi ,X){{\Box }}\phi +G_{4}(\phi ){R}\big ].\nonumber \\ \end{aligned}$$
(49)

Here,

$$\begin{aligned} {\bar{K}}=K+\left( -2G_3 G_{4,\phi }+3 G_{4,\phi }^2-\frac{2}{3} G_3^2\right) \frac{X}{G_4}. \end{aligned}$$
(50)

It can be seen that the action (48) in the Palatini formalism actually still belongs to metric Horndeski theory. Therefore, Palatini–Horndeski theory described by action (48) does not have the Ostrogradsky instability. It should be noted that the action (49) is the only subclass of metric Horndeski theory that is compatible with the condition that the tensor gravitational wave speed is the speed of light c in any spatially flat cosmological background.

Despite some progress, the maximum parameter space without the Ostrogradsky instability in Palatini–Horndeski theory has not yet been fully found. Specifically, when \(G_{5}(\phi ,X)\ne 0\), the connection will introduce additional degrees of freedom. It is completely unknown whether the theory has the Ostrogradsky instability in this case [60]. When only considering the case of the evolution of a spatially flat Universe, conducting the Hamiltonian analysis of the action (13), one can find that there is at least one constraint in the theory. It can be expected that by carefully analyzing the constraint, all parameter spaces without the Ostrogradsky instability can be found in the considered case. Providing such an analysis will help us deepen our understanding of the Ostrogradsky instability in the Palatini-Horndeski theory. This requires future research.

Finally, it should be pointed out that this does not mean that the scalar–tensor gravity in the Palatini formalism must not beyond the framework of metric Horndeski theory. This is because in this paper, we have taken the simplest and natural assumption for the speed of tensor gravitational waves, that is, in any spatially flat cosmological background, the tensor gravitational wave speed is the speed of light c. By weakening this assumption, it is possible to find other parameter spaces of Palatini–Horndeski theory compatible with GW170817. In addition, Palatini–Horndeski theory considered in this paper is not the most general theory of scalar–tensor gravity in the Palatini formalism. A more general discussion needs to study more general action. These all need to be studied in future work.