The idea that inflation may be due to degrees of freedom already present in the standard model of particle physics or quantum general relativity is extremely attractive and has received much attention in the recent years. In particular two models stand out by their simplicity and elegance. Higgs inflation [13] with a large non-minimal coupling of the Higgs boson H to the Ricci scalar (\(\xi H^\dagger H R\)) and Starobinsky’s inflation model [4] based on \(R^2\) gravity are both minimalistic and perfectly compatible with the latest Planck data.

These two models should not be considered as physics beyond the standard model but rather both operators \(\xi H^\dagger H R\) and \(R^2\) are expected to be generated when general relativity is coupled to the standard model of particle physics. We will come back to that point shortly. The aim of this paper is to point out an intriguing distinct possibility, namely that Starobinsky inflation is generated by quantum effects due to a large non-minimal coupling of the Higgs boson to the Ricci scalar. In that framework, we do not need to posit that the Higgs boson starts at a high field value in the early universe which would alleviate constraints coming from the requirement of having a stable Higgs potential even for large Higgs field values [57].

We shall now argue that both terms necessary for Higgs inflation or Starobinsky’s model are naturally present when the standard model of particle physics is coupled to general relativity. While the quantization of general relativity remains one of the outstanding challenges of theoretical physics, it is possible to use effective field theory methods below the energy scale \(M_\star \) at which quantum gravitational effects are expected to become large. The energy scale \(M_\star \) is usually assumed to be of the order of the Planck scale \(M_P=\sqrt{8 \pi G_N}^{-1}=2.4335 \times 10^{18}\) GeV; however, recent work has shown that even in four space-time dimensions this energy scale is model dependent. At energies below \(M_\star \), we can describe all of particle physics and cosmology with the following effective field theory (see e.g. [810]):

$$\begin{aligned} S= & {} \int \mathrm{d}^4x \, \sqrt{-g} \biggl ( \left( \frac{1}{2} M^2 + \xi H^\dagger H \right) R\nonumber \\&- \Lambda _C^4 + c_1 R^2 + c_2 C^2 +c_3 E + c_4 \Box R\nonumber \\&- L_{SM} - L_{DM} + O(M_\star ^{-2}) \biggr ) \end{aligned}$$
(1)

where we have restricted our considerations to dimension four operators which are expected to dominate at least at low energies. Note that we are using the Weyl basis and the following notations: R stands for the Ricci scalar, \(R^{\mu \nu }\) for the Ricci tensor, \(E=R_{\mu \nu \rho \sigma }R^{\mu \nu \rho \sigma }- 4 R_{\mu \nu }R^{\mu \nu }+R^2\), \(C^2=E+2R_{\mu \nu }R^{\mu \nu } -2/3 R^2\), the dimensionless \(\xi \) is the non-minimal coupling of the Higgs boson H to the Ricci scalar, the coefficients \(c_i\) are dimensionless free parameters, the cosmological constant \(\Lambda _C\) is of order of \(10^{-3}\) eV, the Higgs boson vacuum expectation value, \(v=246\) GeV contributes to the value of the Planck scale,

$$\begin{aligned} (M^2+\xi v^2)=M_P^2, \end{aligned}$$
(2)

\(L_{\mathrm{SM}}\) contains all the usual standard model interactions (including mass terms for neutrinos), and finally \(L_{\mathrm{DM}}\) describes the dark matter sector (this is the only part of the model which has not been tested yet experimentally). Submillimeter pendulum tests of Newton’s law [11] lead to extremely weak limits on the parameters \(c_i\). In the absence of accidental cancellations between these coefficients, they are constrained to be less than \(10^{61}\) [12]. The discovery of the Higgs boson and precision measurements of its couplings to fermions and bosons at the LHC can be used to set a limit on \(\xi \). One finds that \(|\xi | < 2.6 \times 10^{15}\) [13]. Clearly very little is known about the values of \(c_i\) and \(\xi \).

Besides describing all of particle physics and late time cosmology, the action given in Eq. (1) can also describe inflation if some of its parameters take specific values and if some of its fields fulfil specific initial conditions in the early universe. This action, depending on the initial conditions can describe either Higgs inflation if \(\xi \sim 10^4\) and the Higgs field is chosen to take large values in the early universe or Starobinsky inflation if \(c_1 \sim 10^9\) and the corresponding scalar extra degree of freedom, which can be made more visible by going to the Einstein frame, takes large values in the early universe.

If we assume that the Higgs fields take small values in the early universe, Eq. (1) reduces to

$$\begin{aligned} S_{\mathrm{Starobinsky}}^J=\int \mathrm{d}^4 x \sqrt{g} \frac{1}{2} \left( M_P^2 R+ c_S R^2 \right) \end{aligned}$$
(3)

during inflation, which in the Einstein frame gives

$$\begin{aligned} S_{\mathrm{Starobinsky}}^E= & {} \int \mathrm{d}^4 x \sqrt{g}\nonumber \\&\times \left( \phantom {\left. \left( 1-\exp \left( - \sqrt{\frac{2}{3}} \frac{\sigma }{M_P}\right) \right) ^2 \right) }\frac{M_P^2}{2} R - \frac{1}{2} \partial _\mu \sigma \partial ^\mu \sigma - \frac{M_P^4}{c_S}\right. \nonumber \\&\times \left. \left( 1-\exp \left( - \sqrt{\frac{2}{3}} \frac{\sigma }{M_P}\right) \right) ^2 \right) . \end{aligned}$$
(4)

We have assumed that the scalar degree of freedom \(\sigma \) hidden in \(R^2\) takes large field values in the early universe. A successful prediction of the density perturbation \(\delta \rho /\rho \) requires \(c_S=0.97 \times 10^9\) [14, 15]. On the other hand, if we assume that only the Higgs field takes large values in the early universe,the action (1) reduces to

$$\begin{aligned} S_{\mathrm{Higgs}}^J= & {} \int \mathrm{d}^4 x \sqrt{-g} \left( \frac{M^2}{2} R+ \xi _H H^\dagger H R - L_{\mathrm{SM}} \right) \nonumber \\= & {} \int \mathrm{d}^4 x \sqrt{-g}\nonumber \\&\times \left( \frac{M^2+\xi _H h^2}{2} R -\frac{1}{2} \partial _\mu h \partial ^\mu h +\frac{\lambda }{4} (h^2-v^2)^2 \right) \nonumber \\&+\ldots \end{aligned}$$
(5)

In the Einstein frame, one obtains

$$\begin{aligned} S_{\mathrm{Higgs}}^E{=}\int \mathrm{d}^4 x \sqrt{\hat{g}} \left( \frac{M_P^2}{2} \hat{R} -\frac{1}{2} \partial _\mu \chi \partial ^\mu \chi +U(\chi ) +\cdots \right) \nonumber \\ \end{aligned}$$
(6)

with

$$\begin{aligned} \frac{\mathrm{d}\chi }{\mathrm{d} h}=\sqrt{\frac{\Omega ^2 + 6 \xi _H^2 h^2/M_P^2}{\Omega ^4}} \end{aligned}$$
(7)

where \(\Omega ^2=1+ \xi _H^2 h^2/M_P^2\) and

$$\begin{aligned} U(\chi )=\frac{1}{\Omega (\chi )^4} \frac{\lambda }{4} (h(\chi )^2 -v^2)^2. \end{aligned}$$
(8)

A successful prediction of the density perturbation \(\delta \rho /\rho \) requires \(\xi _H=1.8 \times 10^4\).

These two models are very attractive because they do not necessitate physics beyond the standard model. Furthermore, they are compatible with current cosmological observations which favor small tensor perturbations that so far have not been observed. It has actually been pointed out that both models are phenomenologically very similar [16, 17]. However, while Starobinsky’s inflation model does not suffer from any obvious problem, it has recently been pointed out that in the case of Higgs inflation, which necessitate the Higgs field to take very large field values, our universe will not end up in the standard model Higgs vacuum if it is metastable as suggested by the latest measurement of the top quark mass, but rather in the real vacuum of the theory which does not correspond to the world we observe. In this paper we point out that there is an alternative possibility. Namely when quantum corrections are taken into account, a large non-minimal coupling of the Higgs boson can generate Starobinsky inflation by generating a large coefficient for the coefficient of \(R^2\) in the early universe. While the model corresponds to Starobinsky’s model, the Higgs boson plays a fundamental role as it triggers inflation by generating a large coefficient for \(R^2\).

The action given in Eq. (1) needs to be renormalized. We will work in dimensional regularization to avoid having to discuss the dependence of observables on the cut-off (this problem is due to the non-renormalizability of quantum gravity). We shall neglect the cosmological constant which is not important for inflation purposes. In that case, Newton’s constant does not receive any correction to leading order. On the other hand, the coefficient \(c_1\) of \(R^2\) gets renormalized and one can define a renormalization group equation for this coupling constant. \(N_s\) scalar fields with a non-minimal coupling to the Ricci scalar \(\xi \) will lead to the following renormalization group equation [810]:

$$\begin{aligned} \mu \partial _\mu c_1(\mu )=\frac{(1- 12 \xi )^2}{1152 \pi ^2} N_s \end{aligned}$$
(9)

to leading order (i.e. neglecting the graviton contribution which is suppressed by \(1/\xi \)), note that fermions and vector fields do not contribute to the renormalization of \(c_1\) in the Weyl basis. The renormalization group equation can easily be integrated, one finds [810]

$$\begin{aligned} c_1(\mu _2)=c_1(\mu _1)+\frac{(1- 12 \xi )^2 N_s}{1152 \pi ^2} \log \frac{\mu _2}{\mu _1}. \end{aligned}$$
(10)

The bounds on \(c_1\) in today’s universe are very weak as mentioned before. Even if \(c_1(\mathrm{today})\) is of order unity, it would have been large in the early universe if the Higgs non-minimal coupling \(\xi \) is large. Indeed, we assume that inflation took place at some high energy scale e.g. \(\mu \sim 10^{15}\) GeV, the \(\log \) term is a factor of order 60 if we take the scale \(\mu _1\) of the order of the cosmological constant. A Higgs non-minimal coupling to the Ricci scalar of \(\xi =1.8 \times 10^4\) would lead to a coefficient \(c_1=0.97\times 10^9\) for \(R^2\). Assuming that the scalar extra degree contained in \(R^2\) took large field values in the early universe, a large non-minimal coupling of the Higgs boson to the Ricci scalar can trigger Starobinsky inflation even if the standard model vacuum is metastable as the Higgs boson itself does not roll down its potential during inflation. Inflation is due entirely to \(R^2\), but it is triggered by the Higgs large non-minimal coupling.

Let us emphasize two important points. The first one is that \(c_1 \sim 0.97\times 10^9\) is fixed by the CMB constraint. This parameter only takes such a large value at inflationary energy scales due to its renormalization group evolution. The second one, is that we are neglecting the running of the Higgs boson non-minimal coupling to the Ricci scalar. However, this is a very good approximation. The leading contributions of the standard model to the beta-function of the non-minimal coupling are known [20]:

$$\begin{aligned} \beta _\xi =\frac{6 \xi +1}{(4 \pi )^2} \left[ 2 \lambda + y_t^2 -\frac{3}{2} g^2 -\frac{1}{4} g^{\prime 2} \right] \end{aligned}$$
(11)

where \(\lambda \) is the self-interaction coupling of the Higgs boson, g the SU(2) gauge coupling, and \(g^\prime \) the U(1) gauge coupling. Quantum gravitational corrections will be suppressed by powers of the Planck mass and can thus be safely ignored as long as we are at energies below the Planck mass.

One might worry that if the large non-minimal coupling of the Higgs boson triggers a large coefficient for the operator \(R^2\), it might also generate new terms in the effective action, which could destabilize the potential. The leading order effective action to the second order in the curvature expansion induced by scalar fields non-minimally coupled to gravity is known [8, 9]:

$$\begin{aligned}&S_{\mathrm{EFT}}= \frac{1}{16 \pi G} \int \mathrm{d}^4x \sqrt{-g}\nonumber \\&\quad \times \left( R + \alpha R^2 + \beta R \log \frac{-\Box }{\mu ^2} R + \gamma C^2 + \cdots \right) . \end{aligned}$$
(12)

Note that here we are neglecting the cosmological constant, \(\alpha = c_1 \times 16 \pi G \) and \(\gamma = c_2 \times 16 \pi G \) are renormalized coupling constants and we shall assume that \(c_2\) is small at the scale of inflation, it is not sensitive to the Higgs boson’s non-minimal coupling, while we have fixed the Higgs non-minimal coupling such that \(c_1=0.97\times 10^9 \). The coefficient \(\beta \) is a prediction of the effective action and is given by \(N_s (1-12 \xi )^2/(2304 \pi ^2) \times 16 \pi G\) where \(N_s\) is the number of scalar field degrees of freedom in the model, in our case 4. The coefficient \(N_s (1- 12\xi )^2/(2304 \pi ^2)\) is indeed large and of the order of \(7.8 \times 10^6\) and we have to check that the log term does not lead to sizable contributions to the effective potential of the Starobinsky field. Before verifying this explicitly, let us mention that the large non-minimal coupling between the Higgs boson and the Ricci scalar which is necessary to induce Starobinsky inflation does not lead to perturbative unitarity problems [18] (see the appendix).

Note that the coefficients of E and of \(C^2\) do not depend on the non-minimal coupling of the Higgs boson to the Ricci scalar. Furthermore in 4 dimensions, E does not contribute to the equations of motion. The coefficient of the term \(C^2\) is assumed before renormalization to be of the same order as that of \(R^2\), i.e. of order 1. However, after renormalization the coefficient of \(R^2\) is tuned to be very large and of the order of \(10^9\) while the coefficient of \(C^2\) remains small compared to the renormalized coefficient of \(R^2\). \(C^2\) is thus negligible.

We shall treat the effective action (12) as a F(R) gravity with \(F(R)=R + \alpha R^2 + \beta R \log \frac{-\Box }{\mu ^2} R\). There is a well established procedure to map such models from the Jordan frame to the Einstein frame; see e.g. [19]. The potential for the inflaton in the Einstein frame is given by

$$\begin{aligned} V(\phi )= \frac{1}{2 \kappa ^2} \left( e^{\sqrt{\frac{2}{3}} \kappa \phi } R(\phi ) - e^{2 \sqrt{\frac{2}{3}} \kappa \phi } F(R(\phi )) \right) \end{aligned}$$
(13)

where \(\kappa ^2 = 8 \pi G\) and \(R(\phi )\) is a solution to the equation

$$\begin{aligned} \phi = -\sqrt{\frac{3}{2}} \frac{1}{\kappa } \log \frac{\mathrm{d}F(R)}{\mathrm{d}R}. \end{aligned}$$
(14)

We can find a formal solution to this equation

$$\begin{aligned} R(\phi )= \frac{1}{2 \alpha } \left( \frac{1}{1+\frac{\beta }{2 \alpha } \log \left( \frac{-\Box }{\mu ^2} \right) } \right) \left( e^{-\sqrt{\frac{2}{3}} \kappa \phi } -1 \right) . \end{aligned}$$
(15)

This expression for \(R(\phi )\) can be understood as a series in \(\frac{\beta }{2 \alpha }\), which is a small parameter:

$$\begin{aligned} R(\phi )= & {} \frac{1}{2 \alpha } \left( 1 - \sum _{n=1}^\infty (-1)^{n+1} \left( \frac{\beta }{2 \alpha } \log \left( \frac{-\Box }{\mu ^2} \right) \right) ^n \right) \nonumber \\&\times \left( e^{-\sqrt{\frac{2}{3}} \kappa \phi } -1 \right) , \end{aligned}$$
(16)

where the \( \log \) term can be expressed using

$$\begin{aligned} \log \left( \frac{-\Box }{\mu ^2} \right) = \int _0^\infty \mathrm{d}s \left( \frac{1}{\mu ^2 + s} - \frac{1}{-\Box + s} \right) . \end{aligned}$$
(17)

The zeroth order term in \(\frac{\beta }{2 \alpha }\sim 4 \times 10^{-3}\) corresponds to the usual Starobinsky solution:

$$\begin{aligned} R(\phi )^{(0)}=R(\phi )_{\mathrm{Starobinsky}}= \frac{1}{2 \alpha } \left( e^{-\sqrt{\frac{2}{3}} \kappa \phi } -1 \right) . \end{aligned}$$
(18)

The series expansion will generate higher order terms corresponding to operators of the type \(\exp ({-\sqrt{\frac{2}{3}} \kappa \phi }) (2/3 \kappa ^2 \partial _\mu \phi \partial ^\mu \phi -\sqrt{2/3} \kappa \Box \phi )\) and higher derivatives thereof. These new terms are, however, suppressed by powers of \(\frac{\beta }{2 \alpha }\) and can be safely ignored. It is easy to check that the \(\log \) term appearing in the F(R) term of the potential (13) is also suppressed by \(\frac{\beta }{2 \alpha }\) compared to the usual Starobinsky’s potential.

We conclude that the large quantum corrections induced by the large Higgs boson non-minimal coupling do not affect the flatness of Starobinsky’s potential. Let us add a few remarks. The model discussed above is not a new model. Physics (including reheating or preheating and all of particle physics) is identical to that predict in Starobinsky’s model. We merely identify a new connection between the Higgs boson and inflation. As in the case of the standard Starobinsky model, a coupling \(\phi ^2 h^2\) will be generated. It is, however, suppressed by factors of \(m_{\mathrm{Higgs}}^2/M_P^2\), which is a small number, particle physics will thus not be affected and the Higgs boson behaves as the standard model Higgs boson. Furthermore, the Higgs field does not take large values in the early universe, we can thus safely ignore the term \(H^\dagger H R\) when studying the inflationary potential. Note that there are subtleties when considering the equivalence of quantum corrections in different parameterizations/representations of the theory (i.e. when going from the Jordan frame to the Einstein frame). Here we are avoiding this problem: we renormalized the theory in the Jordan frame where the model is defined and then map the effective action to the Einstein frame. When proceeding this way, there are no ambiguities or risks to mix up the orders in perturbation theory and the expansion in the conformal factor (see e.g. [2123]).

In this paper, we have identified a new connection between the Higgs boson and inflation. In the model envisaged here, the Higgs boson is not the inflaton but it generates inflation by creating a large Wilson coefficient for the \(R^2\) operator and it is thus at the origin of Starobinsky’s inflation. This mechanism is interesting as it does not require physics beyond the standard model. The Higgs boson does not need to take large field values in the early universe and we could thus be living in a metastable potential.