1 Introduction

The simplest extension of pure \({\mathcal {N}}=1\) supergravity in flat spacetime is the anti de Sitter (AdS) supergravity, where a negative cosmological constant \(\Lambda \) is included [1]. In order to preserve local supersymmetry, a gravitino effective mass term has to be added, linked to \(\Lambda \) through \(\Lambda =-3m_{3/2}^2\), which describes a massless gravitino in AdS spacetime. It is simply obtained by considering a constant superpotential \(W=m_{3/2}\). An arbitrary cosmological constant cannot be introduced without breaking explicitly supersymmetry, or considering non-linear realisation [2]. In the presence of an abelian vector multiplet a constant Fayet–Iliopoulos (FI) term can be introduced only if the U(1) gauges the R-symmetry, in which case a constant superpotential is forbidden, leading to a de Sitter (dS) supergravity describing a massive gravitino through curvature effects [3, 4].

Recently, a new type of constant FI-term was introduced which does not require the gauging of the R-symmetry [5]. It assumes that the D-auxiliary component of the U(1) vector multiplet has a non-vanishing vacuum expectation value (VEV) breaking spontaneously supersymmetry, in which case it can be expanded as D + fermion terms of higher dimensions. In the unitary gauge where the gravitino absorbs the U(1) gaugino and becomes massive, the fermion terms vanish and the new FI-term amounts adding a positive contribution to the cosmological constant of the AdS supergravity, since a constant superpotential is now allowed as the U(1) does not gauge the R-symmetry. In the presence of matter, the construction of [5] leads to a scalar potential but breaks Kähler invariance. On the other hand, the new and standard FI-terms can coexist in the case of gauge R-symmetry, providing interesting models of D-term inflation [6]. An alternative construction was made in [7] preserving Kähler invariance and leading to a constant FI-term in the presence of matter, that generates a constant uplift of the vacuum energy. More recently, such FI-terms were written in \({\mathcal {N}}=2\) supergravity exhibiting a much richer structure [8].

In this work, we generalise the above constructions in \({\mathcal {N}}=1\) supergravity, preserving the Kähler invariance and keeping the form of the bosonic action to be linear in D up to a field dependent coefficient. We show that the most general FI-term is characterised by an arbitrary function of the gravitino mass, taken as a functional of the chiral superfields. We then study applications to cosmology, building new models of inflation compatible with CMB observations and possessing a dS vacuum with tuneable (tiny) energy. We specialise to no-scale models [9, 10] of one chiral multiplet containing the inflaton, supplemented by a U(1) gauge symmetry with the new FI-term. Moreover, we choose for the latter a simple characteristic function of the gravitino mass which is a single power and an additive constant, thus depending on three parameters. We show that there is a region in the parameter space where the resulting scalar potential possesses an inflationary plateau describing successfully the cosmological observations with the inflaton rolling down to a minimum with tuneable vacuum energy, where the gravitino mass and the supersymmetry breaking scale are fixed in terms of the parameters of the model.

This paper is organised as follows. In Sect. 2, we review the recent construction of the new FI-term in \({\mathcal {N}}=1\) supergravity without gauging the R-symmetry, and its generalisation to a Kähler invariant FI Lagrangian leading to a positive constant uplift of the scalar potential in the presence of arbitrary matter chiral multiplets. In Sect. 3, we propose the most general modification of this construction that preserves Kähler invariance and is characterised by an arbitrary function of the gravitino mass as functional of chiral multiplets. We then study the consequences of such terms on inflation and supersymmetry breaking in a de Sitter vacuum with tuneable energy in Sect. 4, for the case of two no-scale models and for a simple choice of the functional dependence of the new FI D-term. Finally in Sect. 5, we discuss the gauging of the shift symmetry that gets rid of the massless particles in the spectrum without altering the inflationary predictions. Moreover, inspired by the low-energy limit of the heterotic string, we identify the inflaton with the string dilaton and gauge the perturbative axionic symmetry by the Green–Schwarz anomaly cancellation mechanism. These models provide new examples of inflation by supersymmetry breaking [11], where the inflaton belongs to the same multiplet with the Goldstino [12, 13], without gauging the R-symmetry. Our conclusions are presented in Sect. 6. There are also two appendices containing a summary of the conformal supergravity multiplets calculus (Appendix A) and details of the computation of the fermion masses in our models (Appendix B).

Throughout this work, we will use natural units \(\hbar =c\equiv 1\). The reduced Planck mass \(\kappa ^{-1}=(8\pi G)^{-1/2}=2.4\times 10^{18}\mathrm{GeV}\) is set equal to one, and numerical values are given in these units. We adopt the metric convention \((-,+,+,+)\).

2 Fayet–Iliopoulos term without gauged R-symmetry: a review

In \({\mathcal {N}}=1\) supergravity, a new Fayet–Iliopoulos term associated to a non-gauged R symmetry has first been introduced in [5]. In the superconformal formalism, denoting \(S_0=(s_0,P_L\Omega _0,F_0)\) and \({\bar{S}}_0=({\bar{s}}_0,P_R\Omega _0,{\bar{F}}_0)\) the chiral and anti-chiral compensator superfields, with (Weyl, Chiral) weights (1, 1) and \((1,-1)\) respectively, this new FI Lagrangian reads [5]:

$$\begin{aligned} {\mathcal {L}}_{FI}=-\xi \left[ S_0{\bar{S}}_0\frac{{\mathcal {W}}^2\bar{{\mathcal {W}}^2}}{T(\bar{{\mathcal {W}}^2}){\bar{T}}({\mathcal {W}}^2)}(V)_D \right] _D, \end{aligned}$$
(1)

where \(\xi \) is a constant parameter, \((V)_D\) is a real linear multiplet defined by , whose lowest component D is the real auxiliary field of the vector superfield V, the latter having (anti)-chiral field strength (\(\bar{{\mathcal {W}}}\)) \({\mathcal {W}}\) given by

$$\begin{aligned} {\mathcal {W}}^2=\frac{{\bar{\lambda }}P_L\lambda }{S_0^2},\quad \bar{{\mathcal {W}}^2}=\frac{{\bar{\lambda }} P_R\lambda }{{\bar{S}}_0^2}, \end{aligned}$$
(2)

so that \((V)_D\) is given by the super-covariant derivative of \({\mathcal {W}}\). The chiral projection operator T acting on an anti-chiral multiplet \({\bar{X}}=({\bar{X}},P_R\Omega ,{\bar{F}})\) of weights \((1,-1)\) gives a chiral multiplet of weights (2, 2) defined as . \({\bar{\lambda }}P_L\lambda \) has weights (3, 3) and reads, in components form:

(3)

with the covariant field strength \({\hat{F}}_{ab}\) and the self-dual and anti self-dual tensors \({\hat{F}}_{ab}^{\pm }\) given by

$$\begin{aligned} {\hat{F}}_{ab}= & {} e_a^{\mu }e_b^{\nu }\left( 2\partial _{[\mu }A_{\nu ]}+{\bar{\psi }}_{[\mu }\gamma _{\nu ]}\lambda \right) , \end{aligned}$$
(4)
$$\begin{aligned} {\hat{F}}_{ab}^{\pm }= & {} \frac{1}{2}({\hat{F}}_{ab}\pm \tilde{{\hat{F}}}_{ab}). \end{aligned}$$
(5)

The dual field strength is \(\tilde{{\hat{F}}}_{ab}=-\frac{i}{2}\epsilon _{abcd}{\hat{F}}^{cd}\), while the covariant derivative \({\mathcal {D}}_{\mu }\lambda \) is defined by

$$\begin{aligned} {\mathcal {D}}_{\mu }\lambda= & {} \left( \partial _{\mu }-\frac{3}{2}b_{\mu } +\frac{1}{4}w_{\mu }^{ab}\gamma _{ab}-\frac{3}{2}i\gamma _*{\mathcal {A}}_{\mu }\right) \lambda \nonumber \\&-\left( \frac{1}{4}\gamma ^{ab}{\hat{F}}_{ab}+\frac{1}{2}i\gamma _*D\right) \psi _{\mu }. \end{aligned}$$
(6)

\(\psi _{\mu }\) is the gravitino, the fields \(b_{\mu }\), \(w_{\mu }^{ab}\) and \({\mathcal {A}}_{\mu }\) are the gauge fields corresponding to dilatations, Lorentz transformations and \(T_R\) symmetry of the conformal algebra respectively, and \(\gamma _*\equiv i\gamma _0\gamma _1\gamma _2\gamma _3\).

Considering first pure supergravity coupled to an abelian U(1) gauge multiplet plus the FI term (1), the full Lagrangian reads:

$$\begin{aligned} {\mathcal {L}}=-3\left[ S_0{\bar{S}}_0\right] _D+\left[ S_0^3W_0\right] _F-\frac{1}{4g^2}\left[ {\bar{\lambda }}P_L\lambda \right] _F+{\mathcal {L}}_{FI}. \end{aligned}$$
(7)

Supersymmetry is broken via a non-vanishing VEV of the D-auxiliary component of the vector multiplet driven by the linear term in D, with the Goldstino being the U(1) gaugino. In component form, after having gauge fixed the compensator through \(S_0=1\), integrated the auxiliary fields, and in the unitary gauge where the Goldstino vanishes, one gets [5]:

$$\begin{aligned} e^{-1}{\mathcal {L}}= & {} \frac{1}{2}\left( R-{\bar{\psi }}_{\mu }\gamma ^{\mu \nu \rho }D_{\nu }\psi _{\rho } +m_{3/2}{\bar{\psi }}_{\mu }\gamma ^{\mu \nu }\psi _{\nu }\right) \nonumber \\&-\frac{1}{4g^2}F^{\mu \nu }F_{\mu \nu }-\left( -3m_{3/2}^2+\frac{1}{2}\xi ^2\right) \end{aligned}$$
(8)

with \(m_{3/2}=W_0\), a constant superpotential. Therefore, in the absence of chiral matter superfields, any \(\xi \ne 0\) uplifts the vacuum energy by a constant term \({\mathcal {V}}_{FI}={\xi ^2}/{2}\) and breaks supersymmetry. One can then tune \(\xi \) to get a de Sitter vacuum configuration, matching with the observational data. For instance, \(\xi =\sqrt{6}m_{3/2}\) gives a massive gravitino in flat Minkowski spacetime with spontaneously broken supersymmetry.

Introducing chiral matter multiplets \(X^i\) in the previous model, the Lagrangian is now given by:

$$\begin{aligned} {\mathcal {L}}= & {} -3\left[ S_0{\bar{S}}_0e^{-K(X^i,\bar{X^i})/3}\right] _D+\left[ S_0^3W(X^i)\right] _F\nonumber \\&-\frac{1}{4g^2}\left[ {\bar{\lambda }}P_L\lambda \right] _F+{\mathcal {L}}_{FI}. \end{aligned}$$
(9)

In component form, after having gauge fixed the compensator through \(S_0=e^{K/6}\) and integrated the auxiliary fields, the bosonic part of the previous Lagrangian reads [5]:

$$\begin{aligned} \left. e^{-1}{\mathcal {L}}\right| _{\text {bos}}= & {} \frac{1}{2}R-\frac{1}{4g^2}F^{\mu \nu }F_{\mu \nu }-G_{i\bar{j}}\partial X^i\cdot \partial {\bar{X}}^{\bar{j}}\nonumber \\&-\left( e^K(|\nabla _iW|^2-3|W|^2)+\frac{\xi ^2g^2}{2}e^{2/3K}\right) .\nonumber \\ \end{aligned}$$
(10)

Therefore, when matter fields are coupled, the scalar potential contribution from (1) becomes field dependent, \({\mathcal {V}}_{FI}=\frac{\xi ^2g^2}{2}e^{2K/3}\), and no longer Kähler invariant, which basically comes from the fact that the FI Lagrangian (1) is not itself Kähler invariant. To remedy this, a generalized Kähler invariant FI term has been built in [7]. From the generic Kähler transformations for a Kähler potential \(K(X,{{\bar{X}}})\), a superpotential W(X) and the compensator \(S_0\),

$$\begin{aligned} K(X,{{\bar{X}}})\rightarrow & {} K(X,{{\bar{X}}})+J(X)+{{\bar{J}}}({{\bar{X}}}),\nonumber \\ W(X)\rightarrow & {} W(X)e^{-J(X)},\nonumber \\ S_0\rightarrow & {} S_0e^{J(X)/3}, \end{aligned}$$
(11)

this new construction is based on the modification of the FI term (1) by introducing in it the Kähler potential according to

$$\begin{aligned} {\mathcal {L}}_{FI}=-\xi \left[ \left( S_0{\bar{S}}_0e^{-K/3}\right) ^{-3}\frac{({\bar{\lambda }}P_L\lambda )({\bar{\lambda }} P_R\lambda )}{T(\bar{{\mathcal {W}}'^2}){\bar{T}}({\mathcal {W}}'^2)}(V)_D \right] _D.\nonumber \\ \end{aligned}$$
(12)

The modified and henceforth Kähler invariant gauge field strengths are given byFootnote 1

$$\begin{aligned} {\mathcal {W}}'^2=\frac{{\bar{\lambda }}P_L\lambda }{(S_0\bar{S_0}e^{-K/3})^2},\quad \bar{{\mathcal {W}}}'^2=\frac{{\bar{\lambda }} P_R\lambda }{(S_0\bar{S_0}e^{-K/3})^2}. \end{aligned}$$
(13)

The new bosonic contribution to the scalar potential arising from this new term reads \({\mathcal {V}}_{FI}=\frac{\xi ^2g^2}{2}\), which is constant whether matter fields are included or not. The aim of this letter is to generalise the work carried out in [5, 7] by building the most general extended FI terms whose bosonic component is linear in the auxiliary field D, up to a general field dependent coefficient, while preserving Kähler invariance at the same time.

3 A set of Kähler invariant Fayet–Iliopoulos terms

The starting point of the approach followed in this work is to modify the field strengths (2) by introducing the superpotential W in order to make them Kähler invariant. This can be done in the following wayFootnote 2:

$$\begin{aligned} {\mathcal {W}}^2=\frac{{\bar{\lambda }}P_L\lambda }{{S}_{0}^2 W(X)^{\frac{2}{3}}},\quad \bar{{\mathcal {W}}^2}=\frac{{\bar{\lambda }} P_R\lambda }{{\bar{S}}_{0}^2{\bar{W}}(\bar{X)}^{\frac{2}{3}}}, \end{aligned}$$
(14)

where the \(\frac{2}{3}\) exponent of W(X) is uniquely fixed by the Kähler transformations (11) to get Kähler invariant \({\mathcal {W}}^2\) and \(\bar{{\mathcal {W}}}^2\). The superpotential W has vanishing Weyl and Chiral weights and is assumed to have a non-vanishing VEV. Therefore, \({\mathcal {W}}^2\) and \(\bar{{\mathcal {W}}}^2\) have the same (Weyl, Chiral) weights (1, 1) and \((1,-1)\) as those of (2), and one can thus still apply the (anti-) chiral projection operators (\({\bar{T}}\)) T. The resulting multiplets \(T(\bar{{\mathcal {W}}}^2)\) and \({\bar{T}}({\mathcal {W}}^2)\) then carry weights (2, 2) and \((2,-2)\). The operation \([~]_D\) has to act on a multiplet of weights (2, 0). \((V)_D\) having already weights (2, 0), we need to multiply it with a multiplet with vanishing weights, which can be chosen \((S_0{\bar{S}}_0)^{-1}\frac{({\bar{\lambda }}P_L\lambda )({\bar{\lambda }} P_R\lambda )}{T(\bar{{\mathcal {W}}}^2){\bar{T}}({\mathcal {W}}^2)}\) as it can be easily checked knowing the weights (1, 1) and (3, 3) of \(S_0\) and \({\bar{\lambda }}P_L\lambda \), respectively. The Kähler potential K and the superpotential W having vanishing weights, we can include them for free in the previous combination in the following form:

$$\begin{aligned} {\mathcal {R}}(V)_D\equiv (S_0{\bar{S}}_0)^{-1}e^{nK}W^{\alpha }{\bar{W}}^{\alpha }\frac{({\bar{\lambda }}P_L\lambda )({\bar{\lambda }} P_R\lambda )}{T(\bar{{\mathcal {W}}^2}){\bar{T}}({\mathcal {W}}^2)}(V)_D, \end{aligned}$$
(15)

where at this point the parameters n and \(\alpha \) are arbitrary. As it can be seen from (11), the quantity \({\mathcal {R}}(V)_D\) is Kähler invariant provided that n and \(\alpha \) are related by

$$\begin{aligned} n=\frac{1}{3}+\alpha . \end{aligned}$$
(16)

Therefore, the most general Kähler invariant FI term involving both the Kähler potential and the superpotential is in fact a set of Lagrangians labelled by one free parameter \(\alpha \) according to:

$$\begin{aligned} {\mathcal {L}}_{FI}^{(\alpha )}&=-\xi _\alpha \nonumber \\&\left[ (S_0{\bar{S}}_0)^{-1}e^{(\frac{1}{3}+\alpha )K}W^{\alpha }{\bar{W}}^{\alpha }\frac{({\bar{\lambda }}P_L\lambda )({\bar{\lambda }} P_R\lambda )}{T(\bar{{\mathcal {W}}^2}){\bar{T}}({\mathcal {W}}^2)}(V)_D \right] _D.\nonumber \\ \end{aligned}$$
(17)

We now add a series of terms (17) in \({\mathcal {N}}=1\) supergravity coupled to the U(1) gauge multiplet (whose gauge kinetic function is chosen to be one for simplicity), plus a set of matter chiral multiplets denoted generically \(\{X\}\). Before gauge fixing the superconformal generators, the lagrangian for this model reads:

$$\begin{aligned} {\mathcal {L}}= & {} -3\left[ S_0{\bar{S}}_0 e^{-\frac{{\mathcal {K}}(X,\bar{X})}{3}}\right] _D\nonumber \\&+\left[ S_0^3W(X)\right] _F-\frac{1}{4g^2}\left[ {\bar{\lambda }}P_L\lambda \right] _F+\underset{i}{\sum } {\mathcal {L}}_{FI}^{(\alpha _i)}, \end{aligned}$$
(18)

where the sum is running for now over an arbitrary set of parameters \(\alpha _i\).

We are interested in the contribution of Eq. (17) to the scalar potential, and in particular we would like to check that Kähler invariance is preserved. For simplicity and in order to highlight the cosmological applications, we focus on the bosonic sector. The contribution to the fermion masses arising from these new FI terms is studied in Appendix B. Putting all fermions to zero for now, the remaining components of the chiral multiplet \({\bar{\lambda }}P_L\lambda \) given in Eq. (3) are:

$$\begin{aligned} {\bar{\lambda }}P_L\lambda =\left( 0,~0,~{\hat{F}}^-\cdot {\hat{F}}^--D^2\right) . \end{aligned}$$
(19)

With the chiral and anti-chiral compensators \(S_0=(s_0,P_L\Omega _0,F_0)\) and \({\bar{S}}_0=({\bar{s}}_0,P_R\Omega _0,{\bar{F}}_0)\), the bosonic components of the composite chiral and anti-chiral multiplets \(S_0^{-1}{\bar{\lambda }}P_L\lambda \) and \({\bar{S}}_0^{-1}{\bar{\lambda }} P_R\lambda \) are given byFootnote 3:

$$\begin{aligned} S_0^{-1}{\bar{\lambda }}P_L\lambda= & {} (0,~0,~s_0^{-1}({\hat{F}}^-\cdot {\hat{F}}^--D^2))\nonumber \\= & {} (0,~0,~-2s_0^{-1}({\hat{F}}^-\cdot {\hat{F}}^--D^2),~0,~0,~0,~0),\nonumber \\ \end{aligned}$$
(20)
$$\begin{aligned} {\bar{S}}_0^{-1}{\bar{\lambda }} P_R\lambda= & {} (0,~0,~{\bar{s}}_0^{-1}({\hat{F}}^+\cdot {\hat{F}}^+-D^2))\nonumber \\= & {} (0,~0,~0,~-2{\bar{s}}_0^{-1}({\hat{F}}^+\cdot {\hat{F}}^+-D^2)~0,~0,~0).\nonumber \\ \end{aligned}$$
(21)

Similarly, the bosonic components of the chiral superfield \({\mathcal {W}}^2\) given in Eq. (14) are:

$$\begin{aligned} {\mathcal {W}}^2=\left( 0,~0,~s_0^{-2}W^{-\frac{2}{3}}({\hat{F}}^-\cdot {\hat{F}}^--D^2)\right) , \end{aligned}$$
(22)

from which we deduce the components of the anti-chiral superfield \({\bar{T}}({\mathcal {W}}^2)\):

$$\begin{aligned} {\bar{T}}({\mathcal {W}}^2)=\left( s_0^{-2}W^{-\frac{2}{3}}({\hat{F}}^-\cdot {\hat{F}}^--D^2),~0,~0\right) . \end{aligned}$$
(23)

The product of the multiplets (20) and (21) already being a \(\theta ^2{\bar{\theta }}^2\) term, only the lowest components of the remaining quantities entering in the Lagrangian (17) will contribute to the bosonic sector. We can therefore rewrite it as:

$$\begin{aligned}&{\mathcal {L}}_{FI}^{(\alpha _i)}\nonumber \\&=-\xi _i\frac{e^{(\frac{1}{3}+\alpha _i)K}W^{\alpha _i}{\bar{W}}^{\alpha _i}D}{(s_0{\bar{s}}_0)^{-2}(W{\bar{W}})^{-\frac{2}{3}}({\hat{F}}^-\cdot {\hat{F}}^--D^2)({\hat{F}}^+\cdot {\hat{F}}^+-D^2)}\left[ {\mathcal {R}}\right] _D,\nonumber \\ \end{aligned}$$
(24)

with the real multiplet \({\mathcal {R}}\) defined as:

$$\begin{aligned} {\mathcal {R}}\equiv (S_0^{-1}{\bar{\lambda }}P_L\lambda )({\bar{S}}_0^{-1}{\bar{\lambda }} P_R\lambda ). \end{aligned}$$
(25)

Looking at the seven-components notation (20) and (21) for the multiplets \(S_0^{-1}{\bar{\lambda }}P_L\lambda \) and \({\bar{S}}_0^{-1}{\bar{\lambda }} P_R\lambda \) and the multiplication law (A8), we see that the only non-vanishing bosonic term of \({\mathcal {R}}\) arises from \(\frac{1}{2}f_{ij}{\mathcal {K}}^i{\mathcal {H}}^j\) in its D-component. More precisely, it reads \(({\mathcal {R}})_D=\frac{1}{2}f_{ij}{\mathcal {K}}^i{\mathcal {H}}^j=\frac{1}{2}f_{21}{\mathcal {K}}^2{\mathcal {H}}^1=2(s_0{\bar{s}}_0)^{-1}({\hat{F}}^-\cdot {\hat{F}}^--D^2)({\hat{F}}^+\cdot {\hat{F}}^+-D^2)\). The operation \([~]_D\) defined in (A7) immediately leads to \([{\mathcal {R}}]_D=\frac{e}{2}({\mathcal {R}})_D=e(s_0{\bar{s}}_0)^{-1}({\hat{F}}^-\cdot {\hat{F}}^--D^2)({\hat{F}}^+\cdot {\hat{F}}^+-D^2)\). The FI Lagrangian (24) is therefore given by:

$$\begin{aligned} e^{-1}{\mathcal {L}}_{FI}^{(\alpha _i)}=-\xi _is_0{\bar{s}}_0 e^{(\alpha _i+\frac{1}{3})K}(W{\bar{W}})^{\alpha _i+\frac{2}{3}}D. \end{aligned}$$
(26)

Since we are interested in matter coupled \({\mathcal {N}}=1\) supergravity, we use the Einstein frame where the conformal symmetry is gauge fixed through \(s_0={\bar{s}}_0=e^{\frac{K}{6}}\). This leads to a set of Kähler invariant terms parametrised by some constants \(\{\alpha _i,\xi _i\}\) according to:

$$\begin{aligned} e^{-1}{\mathcal {L}}_{FI}^{(\alpha _i)}=-\xi _ie^{\left( \alpha _i+\frac{2}{3}\right) K}(W{\bar{W}})^{\alpha _i+\frac{2}{3}}D=-\xi _ie^{\left( \alpha _i+\frac{2}{3}\right) {\mathcal {G}}}D\,,\nonumber \\ \end{aligned}$$
(27)

where \({\mathcal {G}}\equiv K+\ln |W|^2\). Therefore, after gauge fixing the conformal symmetry and integrating out the auxiliary fields, the pure bosonic sector arising from the Lagrangian (18) is given by

$$\begin{aligned} e^{-1}{\mathcal {L}}^{(B)}=\frac{1}{2}R-\frac{1}{4g^2}F_{\mu \nu }F^{\mu \nu }-G_{I\bar{J}}\partial X^I\cdot \partial {\bar{X}}^{{{\bar{J}}}}-{\mathcal {V}}\nonumber \\ \end{aligned}$$
(28)

with the scalar potential

$$\begin{aligned} {\mathcal {V}}=e^{{\mathcal {G}}}\left( \partial _{I}{\mathcal {G}}~G^{I\bar{J}}~\partial _{{{\bar{J}}}}{\mathcal {G}}-3\right) +{\mathcal {V}}_{FI}. \end{aligned}$$
(29)

The new FI contribution to the scalar potential, \({\mathcal {V}}_{FI}\), arising from Eq. (27) reads

$$\begin{aligned} {\mathcal {V}}_{FI}=\frac{g^2}{2}\left( \underset{i}{\sum }\xi _ie^{\left( \alpha _i+\frac{2}{3}\right) {\mathcal {G}}}\right) ^2, \end{aligned}$$
(30)

which is obviously Kähler invariant while field dependent at the same time.

The above construction therefore provides a way to obtain an arbitrary set of (Kähler invariant) FI terms from a single U(1) gauge field, in the presence of a superpotential W with non-vanishing expectation value. Each term of the sum is parametrised by two real constants \(\xi _i\) and \(\alpha _i\). A constant FI term is obviously recovered by choosing one \(\alpha _{i_0}=-{2}/{3}\). For this value, the bosonic part of the Lagrangian (17) is equal to the one of the Lagrangian (12). Being independent of W, it is in particular valid even for vanishing superpotential, like the new FI term (12). However, it is not clear that the fermionic parts of the Lagrangians (17) and (12) are equal, as well, for \(\alpha _{i_0}=-{2}/{3}\).

A general sum appearing in the Lagrangian (18), involving terms of the form (27), using that \(e^{{\mathcal {G}}/2}=m_{3/2}[X]\), amounts to adding a general function of the gravitino mass \(m_{3/2}[X]\) considered as a functional of the scalar fields \(\{X\}\):

$$\begin{aligned} e^{-1}{\mathcal {L}}_{FI}^{(B)}= & {} -f(m_{3/2}[X])D \quad \longrightarrow \quad {\mathcal {V}}_{FI}\nonumber \\= & {} \frac{g^2}{2} \mid f(m_{3/2}[X])\mid ^2 \end{aligned}$$
(31)

This construction allows us to refine the scalar potential by adding new field dependent and Kähler invariant terms. In the following, we will restrict ourselves as an illustration to the study of the simple case of one term of the type (27) up to an additive constant, corresponding to the choice \(i=1,2\) with \(\alpha _1\) an arbitrary parameter and \(\alpha _2=-2/3\). Considering the Kähler potential of no-scale type and a constant superpotential, we will show that this choice is sufficient to produce inflationary models compatible with the slow-roll conditions and consistent with the CMB observations, with the inflaton rolling towards a de Sitter vacuum with tuneable energy and spontaneously broken supersymmetry.

4 No-scale models and cosmological applications

In this section, we study the cosmological consequences of the previous modified FI-term construction in the case of simple no-scale models. Considering one chiral superfield X associated to the inflaton, we successively choose the Kähler potentials

$$\begin{aligned} K(X,{{\bar{X}}})=-\ln (X+{{\bar{X}}})\quad \text {and}\quad K(X,{{\bar{X}}})=-3\ln (X+\bar{X}),\nonumber \\ \end{aligned}$$
(32)

together with a constant superpotential \(W=W_0\) and an exponential one \(W(X)=e^{\beta X}\), respectively. In the context of string theory, these forms of Kähler potentials arise in all toroidal/orbifold compactifications as well as in the large volume limit of Calabi–Yau compactifications, both in heterotic string and in type II orientifolds. In this context, the first Kähler potential could describe for instance the kinetic term of the dilaton, associated to the string coupling, while the second may describe the internal volume of the 3-complex dimensional compact space. We will therefore refer to the “dilaton case” and “compact volume case” to describe these two models. From now on, we also restrict the sum (30) to only two terms parametrised by three constants \(\xi _1\), \(\alpha _1\equiv \alpha \) and \(\xi _2\), while \(\alpha _2=-\frac{2}{3}\).

4.1 Dilaton case

We first consider the Kähler potential \(K=-\ln (X+{\bar{X}})\). In terms of the gravitino mass \(m_{3/2}^2=e^{{\mathcal {G}}}\), this yields the scalar potential:

$$\begin{aligned} {\mathcal {V}}=-2m_{3/2}^2+\frac{1}{2}\left( \xi _1(m_{3/2}^2)^{\alpha +2/3}+\xi _2\right) ^2, \end{aligned}$$
(33)

where we have redefined the parameters \(\xi _i\) to absorb the gauge coupling constant g. As we will show below, there is a region of the parameter space \(\xi _1\), \(\xi _2\) and \(\alpha \), such that the above potential has an inflational plateau allowing slow-roll inflation compatible with the cosmological observations, and a minimum, where supersymmetry is spontaneously broken, with a tuneable vacuum energy by a fine tuning of the parameters (for instance to obtain a vanishing cosmological constant in the vacuum).

In order to compute the slow-roll parameters, one needs to work with the canonically normalised field \(\chi \), defined by its kinetic term through

$$\begin{aligned} \frac{\partial _{\mu }X\partial ^{\mu }{\bar{X}}}{(X+\bar{X})^2}=\frac{1}{2}\partial _{\mu }\chi \partial ^{\mu }\chi +\cdots \end{aligned}$$
(34)

where the dots denote terms containing the imaginary part of X, which has no influence on the discussion of this section. We will come back to it in Sect. 5, where the shift symmetry associated to this imaginary part will be gauged by the U(1). Focusing on the real part for now, we deduce from (34)

$$\begin{aligned} \mathrm{Re} X=e^{\sqrt{2} \chi }, \end{aligned}$$
(35)

and thus

$$\begin{aligned} m_{3/2}^2=e^{{\mathcal {G}}}=\frac{|W_0|^2}{2}e^{-\sqrt{2} \chi }. \end{aligned}$$
(36)

In the following, the ‘dilaton’ \(\chi \) will be identified with the inflaton, dynamically driving inflation starting from a large value, slightly rolling down along the potential, attaining the horizon exit denoted by \(\chi _*\) and ending at a value \(\chi _{\text {end}}\) when slow-roll stops. The field then continues to fall down towards the minimum, when reheating takes place. From now on, quantities observed at the horizon exit are specified with a star *, and the approximation of large inflaton field \(\chi>>1\) is assumed in this region.Footnote 4 The gravitino mass (36) vanishes exponentially and the potential (33) for \(\alpha \ge -2/3\) is therefore dominated by a constant, as required by slow-roll inflation. In the following, we will thus restrict to the region \(\alpha > -2/3\), with \({\mathcal {V}}_*\simeq \frac{\xi _2^2}{2}\). In terms of the canonical field \(\chi \), the slow-roll parameters are given as usual by

$$\begin{aligned} \epsilon \equiv \frac{1}{2}\left( \frac{\partial V/\partial \chi }{V}\right) ^2;\quad \eta \equiv \frac{\partial ^2V/\partial \chi ^2}{V}. \end{aligned}$$
(37)

At large field \(\chi \), by further assuming \(\alpha >1/3\), they can be expanded into

$$\begin{aligned} \epsilon \underset{\chi>>1}{\approx }\left( \frac{4m_{3/2}^2}{\xi _2^2}\right) ^2\underset{\chi>>1}{\approx }\frac{1}{4}\eta ^2;\quad \eta \underset{\chi>>1}{\approx }-\frac{8m_{3/2}^2}{\xi _2^2}. \end{aligned}$$
(38)

Actually, the large field condition is not really necessary. The required approximation is that the gravitino mass (36) should be small during inflation, so that the potential  (33) is approximately constant. This is an important point, implying that the models we study are consistent with small field inflation, avoiding trans-planckian initial conditions for the normalised inflaton field.

A central quantity to be taken into account in inflation is the number N of e-folds between the horizon exit and the end of inflation, a period observable through the CMB. This quantity, which must be set within the range [40, 60] to satisfy CMB observations, is given by:

$$\begin{aligned} N=\int _{\chi _*}^{\chi _{\text {end}}}\frac{d\chi }{\sqrt{2\epsilon (\chi )}}\in [40,60]. \end{aligned}$$
(39)

Two other observable quantities at the horizon exit are the amplitude of primordial density fluctuations \(A_S\) and the spectral index, or tilt \(n_s\), respectively given by

$$\begin{aligned} A_S= & {} \frac{{\mathcal {V}}_*}{24\pi ^2\epsilon _*}=2.2\times 10^{-9}, \end{aligned}$$
(40)
$$\begin{aligned} n_S= & {} 1+2\eta _*-6\epsilon _*=0.96, \end{aligned}$$
(41)

where the numerical equalities also follow from the CMB data.

To be consistent with observations, the inflaton potential during inflation should respect the three conditions (39), (40) and (41), which we now use in order to constrain the three parameters \(\xi _1\), \(\xi _2\) and \(\alpha \). In the large field limit at the horizon exit, the tilt (41) gives \(n_S=1+2\eta _*-6\epsilon _*\simeq 1+2\eta _*-\frac{3}{2}\eta _*^2\approx 1+2\eta _*=1-{16m_{3/2}^{*2}}/{\xi _2^2}\simeq 0.96\), from which we deduce that \(\xi _2^2\simeq 400 m_{3/2}^{*2}\). Moreover, the amplitude (40) leads to \(\frac{1}{768\pi ^2}\frac{\xi _2^6}{m_{3/2}^{*4}}=2.2\times 10^{-9}\). Therefore, in the large field limit, the \(\xi _1\) and \(\alpha \) dependence drops, and one can immediately find from these two relations the numerical values for the parameter \(\xi _2\) and the gravitino mass at the horizon exit \(m_{3/2}^{*2}\), namely:

$$\begin{aligned} \xi _2^2= & {} 1.04\times 10^{-10}, \end{aligned}$$
(42)
$$\begin{aligned} m_{3/2}^{*2}= & {} 2.6\times 10^{-13}. \end{aligned}$$
(43)

From the value of \(\epsilon \) at the horizon, given by the first equation of (38), and the two relations (42) and (43), we find the predicted value for the tensor-to-scalar ratio of primordial perturbations to be:

$$\begin{aligned} r=16\epsilon _*\simeq \left( \frac{16m_{3/2}^{*2}}{\xi _2^2}\right) ^2=1.6\times 10^{-3}, \end{aligned}$$
(44)

which is fixed and independent of any parameters of the model, as long as \(\alpha >1/3\) is considered.

On the other hand, the condition to have a (almost) vanishing potential at its minimum,Footnote 5 for a value of the gravitino mass denoted \({{\tilde{m}}}_{3/2}^2\) in what follows, can then be used in order to determine the parameter \(\xi _1\) in terms of \(\alpha \). This is obtained by numerically solving \({\mathcal {V}}({{\tilde{m}}}_{3/2}^2)=0\), with the result denoted \(\xi _1(\alpha )\) in the following. In order to constrain the last remaining parameter \(\alpha \) via the number of e-folds equation (39), we first need to determine the value of the inflaton field at the end of inflation, depending on \(\alpha \). Inflation stops when \(\chi \) reaches a value \(\chi _{\text {end}}\) such that \(\epsilon (\chi _{\text {end}})=1\) or \(|\eta (\chi _{\text {end}})|=1\). In this model, the condition first fulfilled turns out to be \(\eta (\chi _{\text {end}})=-1\), which leads to the equation:

$$\begin{aligned}&6m_{3/2}^{2~\text {end}}-\xi _1^2(\alpha )(m_{3/2}^{2~\text {end}})^{2\alpha +4/3} \left( \frac{1}{2}+4\left( \alpha +\frac{2}{3}\right) ^2\right) \nonumber \\&\quad -\xi _1(\alpha )\xi _2(m_{3/2}^{2~\text {end}})^{\alpha +2/3}\left( 1+2\left( \alpha +\frac{2}{3}\right) ^2\right) -\frac{\xi _2^2}{2}=0.\nonumber \\ \end{aligned}$$
(45)

This equation is solved numerically to get \(m_{3/2}^{2~\text {end}}\) in terms of \(\alpha \). The number of e-folds is then used in order to determine the parameter \(\alpha \). Indeed, Eq. (39) becomes:

$$\begin{aligned}&N(\alpha )=-\frac{1}{4}\int _{m_{3/2}^{*2}}^{m_{3/2}^{2~\text {end}}(\alpha )}\nonumber \\&\quad \frac{-4m_{3/2}^2+\left[ \xi _1(\alpha )(m_{3/2}^2)^{\alpha +2/3}+\xi _2\right] ^2}{2m_{3/2}^2-(\alpha +\frac{2}{3})\xi _1^2(\alpha )(m_{3/2}^2)^{2\alpha +4/3}-(\alpha +\frac{2}{3})\xi _1(\alpha )\xi _2(m_{3/2}^2)^{\alpha +2/3}}\,\nonumber \\&\quad \frac{dm_{3/2}^2}{m_{3/2}^2}\,. \end{aligned}$$
(46)

Using \(m_{3/2}^{2~\text {end}}\) given by the largest solution of Eq. (45), the value for \(\xi _2^2\) (42), and the expression for \(\xi _1(\alpha )\) given from the solution of \({\mathcal {V}}({{\tilde{m}}}_{3/2}^2)=0\), the above integral can be numerically evaluated in terms of \(\alpha \). It turns out that any \(\alpha \) larger or approximately equal to 1 leads to an acceptable e-fold number \(N\in [40,60]\). Thus, the only fine tuning of the model, besides fixing the overall scale of the potential by its asymptotic value determined by \(\xi _2\), is related to the vacuum energy. The gravitino mass at the minimum of the potential \({\tilde{m}}_{3/2}\) can be between \(10^{10}\)TeV and the Planck scale by choosing \(\alpha \) between 1 and 10.5 respectively.

As an illustration, we now choose \(\alpha \simeq 1\), which gives \(\xi _1(\alpha =1)\simeq 10^{11}\).Footnote 6 With these values, the scalar potential and the slow-roll parameters are plotted in terms of \(\chi \) in Figs. 1 and 2 respectively, where we have also set \(W_0=\sqrt{2}\). The vertical grey lines indicate the horizon exit and the end of inflation (from the right to the left). The corresponding numerical values for the gravitino mass are \(m_{3/2}^{*2}=2.64\times 10^{-13}\) and \(m_{3/2}^{2\text {end}}=1.56\times 10^{-11}\) in Planck units. The minimum is reached at \({{\tilde{m}}}_{3/2}^2=5.29\times 10^{-11}\). The associated values for the inflaton field are \(\chi _*=20.48+\frac{1}{\sqrt{2}}\ln {\frac{|W_0|^2}{2}}\), \(\chi _{\text {end}}=17.59+\frac{1}{\sqrt{2}}\ln {\frac{|W_0|^2}{2}}\) and \({\tilde{\chi }}=16.73+\frac{1}{\sqrt{2}}\ln {\frac{|W_0|^2}{2}}\). Notice that the values of the inflaton can be made less than one for an appropriate choice of \(W_0\), as we already mentioned in the beginning of the section. Finally, note that because of the space-time curvature during inflation, the value of \(m_{3/2}\) entering in the Lagrangian is not the physical gravitino mass, which should be computed taking into account the curvature contribution in an approximate de Sitter spacetime [14, 15].

Fig. 1
figure 1

Scalar potential as a function of the canonically normalised field \(\chi \), for \(\alpha =1\)

Fig. 2
figure 2

Slow-roll parameters \(\epsilon \) and \(\eta \) as a function of \(\chi \), for \(\alpha =1\)

The spectrum at the minimum contains the imaginary part of X and the U(1) gauge boson, which remain massless in this model, as it can be seen from the expression of the scalar potential, as well as the massive gravitino and inflaton whose masses are given by:

$$\begin{aligned} {{\tilde{m}}}_{3/2}^2&= 5.29\times 10^{-11},&{{\tilde{m}}}_{\chi }^2&= \left. \frac{\partial ^2V}{\partial \chi ^2}\right| _{\chi =\chi _{\text {min}}}=2.46\times 10^{-10}. \end{aligned}$$
(47)

There is also a massive spin-1/2 fermion corresponding to a linear combination of the U(1) gaugino and the fermionic component of the inflaton superfield, orthogonal to the Goldstino direction. Indeed at the minimum, supersymmetry is spontaneously broken by a non-vanishing expectation value of both a D and F-term. The Goldstino \(P_L\nu \) is thus a linear combination of the gaugino \(\lambda \) and of the chiral fermion \(\Omega \): \(P_L\nu =\frac{1}{\sqrt{2}}\Omega ^Xg_{X{{\bar{X}}}}{\bar{F}}^{{{\bar{X}}}}-\frac{i}{2}DP_L\lambda \), with \({\bar{F}}^{{{\bar{X}}}}\equiv -e^{K/2}g^{X{{\bar{X}}}}\nabla _X W\), evaluated at the minimum. In order to compute the direction of supersymmetry breaking, we consider:

$$\begin{aligned} \Vert F\Vert\equiv & {} \sqrt{F^Xg_{X{{\bar{X}}}}{\bar{F}}^{{{\bar{X}}}}}=\sqrt{e^{{\mathcal {G}}}\partial _X{\mathcal {G}}G^{X{{\bar{X}}}}\partial _{{\bar{X}}}{\mathcal {G}}}=m_{3/2}, \end{aligned}$$
(48)
$$\begin{aligned} \Vert D\Vert\equiv & {} \xi _1e^{5/3{\mathcal {G}}}+\xi _2=\xi _1(m_{3/2}^2)^{5/3}+\xi _2. \end{aligned}$$
(49)

At the minimum, we have:

$$\begin{aligned} \left\| \frac{D}{F}\right\| _{{\tilde{m}}_{3/2}}=\left[ \xi _1(\tilde{m}_{3/2}^2)^{7/6}+\xi _2({{\tilde{m}}}_{3/2}^2)^{-1/2}\right] \simeq 1.5, \end{aligned}$$
(50)

where we have used the values \(\xi _1=10^{11}\), \(\xi _2=10^{-5}\) and \({{\tilde{m}}}_{3/2}^2=5.29\times 10^{-11}\) obtained previously. At the minimum, the Goldstino is thus an approximately equal mixing of the chiral fermion \(\Omega \) and the gaugino \(\lambda \).

The computation of the fermion masses is detailed in Appendix B. The mass squared \(m_f^2\) of the physical fermion which remains after elimination of the Goldstino is given in Eq. (B36). For \(p=1\), its numerical value at the minimum where \({\tilde{m}}_{3/2}^2=5.29\times 10^{-11}\) is (in Planck units):

$$\begin{aligned} m_f^2=5.9\times 10^{-12}. \end{aligned}$$
(51)

4.2 Compact volume case

In this subsection, we consider the no-scale model with Kähler potential \(K(X,{{\bar{X}}})=-3\ln (X+\bar{X})\). If one takes a constant superpotential as in the previous subsection, the F-term of the scalar potential will vanish, and the new Fayet-Iliopoulos term will be ill-defined at the minimum, where D now vanishes.Footnote 7 Instead, we consider a superpotential of the form \(W(X)=e^{\beta X}\), with \(\beta \) a real constant. Note that the the imaginary shift of X becomes now a (global) R-symmetry [11]. The full scalar potential is then given by:

$$\begin{aligned} {\mathcal {V}}= & {} m_{3/2}^2\left[ -3+\frac{1}{3}\left( \beta (X+\bar{X})-3\right) ^2\right] \nonumber \\&+\frac{1}{2}\left( \xi _1(m_{3/2}^2)^{\alpha +2/3}+\xi _2\right) ^2. \end{aligned}$$
(52)

Choosing \(\beta<<\left. (X+{{\bar{X}}})^{-1}\right| _{*}\), the first term of (52) can be neglected at the horizon exit as well as during the inflationary period. However, outside of the inflationary plateau, the D-term starts decreasing significantly and the F-term cannot be neglected anymore. Supersymmetry at the minimum of the potential is then spontaneously broken by non-vanishing expectation values of both D and F-terms, and a tuning of the parameters would be required in order to get a vanishing potential at its minimum, as in the previous case studied above. We will not study this region in the following, focusing on the inflationary period where the F contribution to \({\mathcal {V}}\) can be neglected and the scalar potential is only given by its D-term:

$$\begin{aligned} \left. {\mathcal {V}}\right| _{\text {infla.}}=\frac{1}{2}\left( \xi _1(m_{3/2}^2)^{\alpha +2/3}+\xi _2\right) ^2. \end{aligned}$$
(53)

Now the normalised field \(\chi \) and the gravitino mass are given by:

$$\begin{aligned} \mathrm{Re}X=e^{\sqrt{\frac{2}{3}}\chi };\quad m_{3/2}^2=\frac{|W|^2}{(X+{\bar{X}})^3}=\frac{|W|^2}{8}e^{-\sqrt{6}\chi }. \end{aligned}$$
(54)

Like in the previous subsection, the potential at the horizon exit, where \(\chi>>1\) is assumed,Footnote 8 is given by \({\mathcal {V}}_*={\xi _2^2}/{2}\). The slow-roll parameters expanded in this limit read:

$$\begin{aligned}&\epsilon \underset{\chi>>1}{\approx }\frac{12\xi _1^2(\alpha +\frac{2}{3})^2(m_{3/2}^2)^{2\alpha +4/3}}{\xi _2^2};\nonumber \\&\eta \underset{\chi>>1}{\approx }\frac{12\xi _1(\alpha +\frac{2}{3})^2(m_{3/2}^2)^{\alpha +2/3}}{\xi _2}\,, \end{aligned}$$
(55)

and thus \(\eta ^2\underset{\chi>>1}{\approx }12(\alpha +\frac{2}{3})^2\epsilon \). With these two quantities, the tilt and amplitude analysis yields:

$$\begin{aligned} \xi _2(\alpha )&= \frac{6\times 10^{-6}}{\alpha +2/3},\nonumber \\ (m_{3/2}^{*2})^{\alpha +2/3}(\xi _1,\alpha )&= -\frac{1}{\xi _1}\,\frac{1.02\times 10^{-8}}{(\alpha +2/3)^3}\,. \end{aligned}$$
(56)

The gravitino mass at the end of inflation, \(m_{3/2}^{2~\text {end}}\), is still given by the condition \(\eta (m_{3/2}^{2~\text {end}})=\pm 1\), which is now solution of the equation:

$$\begin{aligned}&\xi _1^2\,(m_{3/2}^{2~\text {end}})^{2\alpha +4/3}\left( \frac{1}{2}\mp 12\left( \alpha +\frac{2}{3}\right) ^2\right) \nonumber \\&\quad +\xi _1\,\xi _2(\alpha )\,(m_{3/2}^{2~\text {end}})^{\alpha +2/3}\left( 1\mp 6\left( \alpha +\frac{2}{3}\right) ^2\right) +\frac{\xi _2(\alpha )^2}{2}=0.\nonumber \\ \end{aligned}$$
(57)

This can be solved analytically at fixed \(\alpha \), yielding:

$$\begin{aligned}&(m_{3/2}^{2~\text {end}})^{\alpha +2/3}_{\pm }(\xi _1,\alpha )=-\frac{\xi _2(\alpha )}{\xi _1}\nonumber \\&\quad \times \frac{1\mp 6(\alpha +2/3)^2-2\sqrt{3}(\alpha +2/3)\sqrt{3(\alpha +2/3)^2\pm 1}}{1\mp 24(\alpha +2/3)^2}.\nonumber \\ \end{aligned}$$
(58)

On the other hand, the number of e-folds is given by:

$$\begin{aligned} N_{\pm }(\alpha )= & {} \frac{-1}{12(\alpha +\frac{2}{3})}\int _{m_{3/2}^{*2}(\xi _1,\alpha )}^{{m_{3/2}^{2~\text {end}}}_{\pm }(\xi _1,\alpha )}\nonumber \\&\times \frac{\left[ \xi _1(m_{3/2}^2)^{\alpha +2/3}+\xi _2(\alpha )\right] ^2}{\xi _1^2(m_{3/2}^2)^{2\alpha +4/3}+\xi _1\xi _2(\alpha )(m_{3/2}^2)^{\alpha +2/3}}\frac{dm_{3/2}^2}{m_{3/2}^2},\nonumber \\ \end{aligned}$$
(59)

which is independent of \(\xi _1\), as can be seen from the change of variable \(m_{3/2}^2\rightarrow m_{3/2}^2\xi _1^{\frac{1}{\alpha +2/3}}\) and by using the second equation of (56) and Eq. (58).

Two regions for \(\alpha \) have to be considered: (i) \(-\frac{2}{3}<\alpha <\frac{\sqrt{3}-2}{3}\), where \(\eta (m_{3/2}^{2~\text {end}})=1\) is first fulfilled, and where the gravitino mass at the end of inflation and the number of e-folds are respectively given by \((m_{3/2}^{2~\text {end}})_{+}\) and \(N_{+}\); (ii) \(\alpha \ge \frac{\sqrt{3}-2}{3}\), where \(\eta (m_{3/2}^{2~\text {end}})=-1\) is first fulfilled, and where the gravitino mass at the end of inflation and the number of e-folds are respectively given by \((m_{3/2}^{2~\text {end}})_{-}\) and \(N_{-}\). Both e-fold numbers are plotted in terms of \(\alpha \) in Figs. 3 and 4. Any \(\alpha \gtrsim -0.46\) leads to an acceptable \(N\in [40,60]\), while the parameter \(\xi _1\) remains undetermined. From the value of \(\epsilon \) at the horizon given by the first equation of (55) and the two relations (56), one sees that the predicted value for the tensor-to-scalar ratio r of primordial perturbations remains independent of \(\xi _1\), and depends only on \(\alpha \):

$$\begin{aligned} r(\alpha )= & {} 16\,\epsilon _*(\alpha )=16\frac{12(\alpha +2/3)^2\xi _1^2(m_{3/2}^{*2})^{2\alpha +4/3}(\xi _1,\alpha )}{\xi _2^2(\alpha )}\nonumber \\\simeq & {} \frac{5.4\times 10^{-4}}{(\alpha +2/3)^2}. \end{aligned}$$
(60)

Thus, \(\alpha \) can be chosen such that the tensor-to-scalar ratio is large and close to the experimental bound, for instance \(r(\alpha =-0.45)\simeq 10^{-2}\) with \(N_{+}(\alpha =-0.45)\simeq 41\).

Fig. 3
figure 3

Number of e-folds \(N_{+}\) as a function of \(\alpha \), for \(-\frac{2}{3}<\alpha <\frac{\sqrt{3}-2}{3}\)

Fig. 4
figure 4

Number of e-folds \(N_{-}\) as a function of \(\alpha \), for \(\alpha \ge \frac{\sqrt{3}-2}{3}\)

5 Gauging the axion shift symmetry

In the two previous models, the spectrum contained two massless particles: the imaginary part of the complex inflaton field X, and the U(1) gauge boson \(A_{\mu }\), which is an unwanted phenomenological property. This can be avoided by gauging the imaginary shift symmetry by the U(1). Under a gauge transformation \(A_{\mu }\rightarrow A_{\mu }-2\partial _{\mu }\lambda \), one then has for the complex scalar \(X\rightarrow X+ic\lambda \), with \(\lambda \) the gauge parameter and c a constant related to the charge \(e^c\) of the field \(e^X\). In terms of superfields, this transformation reads \(X\rightarrow X+c\Lambda \), with \(\Lambda \) a chiral superfield gauge parameter. The gauge transformation of the vector superfield V is \(V\rightarrow V-\Lambda -{\bar{\Lambda }}\). In order to keep a gauge invariant Kähler potential with shift symmetry, \(K(X+{{\bar{X}}})\) must be modified as:

$$\begin{aligned} K(X+{{\bar{X}}})\rightarrow K(X+{{\bar{X}}}+cV)\,. \end{aligned}$$
(61)

Note that this modification does not change the pure bosonic part of the FI Lagrangian (17). Indeed, when fermions are put to zero, the only non-vanishing components of the chiral multiplets \({\bar{\lambda }}P_L\lambda \) and \(\bar{\lambda }P_R \lambda \) are their \(\theta \theta \) and \({\bar{\theta }}{\bar{\theta }}\) components. Therefore, only the lowest components of the other superfields involved in (17) contribute to the bosonic sector, and the lowest component of \(e^{(\alpha +\frac{1}{3})K}\) does not receive additional contributions from cV in the Wess-Zumino gauge.

In order to see how a massive gauge boson arises from this gauging, we work in global supersymmetry and compute the (bosonic) new terms appearing from this modification. Putting fermions to zero and expanding in components, we have

$$\begin{aligned} \left. X+{{\bar{X}}}+cV\right| _{\text {bos}}= & {} 2\mathrm{Re}X-\theta \sigma ^{\mu }{\bar{\theta }}(cA_{\mu }+2\partial _{\mu }\mathrm{Im}X)\nonumber \\&+\frac{1}{2}\theta ^2{\bar{\theta }}^2(cD-\partial ^2\mathrm{Re}X)+\theta ^2 F+{\bar{\theta }}^2{{\bar{F}}},\nonumber \\ \end{aligned}$$
(62)

from which we deduce:

$$\begin{aligned} \left. K(X+\bar{X}+cV)\right| _{\theta ^2{\bar{\theta }}^2}= & {} \frac{K'}{2}(cD-\partial ^2\mathrm{Re}X)\nonumber \\&-\frac{K''}{4}(cA_{\mu }+2\partial _{\mu }\mathrm{Im}X)^2+K''F{\bar{F}}.\nonumber \\ \end{aligned}$$
(63)

It follows that

$$\begin{aligned} \int d^4\theta K(X+{{\bar{X}}}+cV)= & {} \int d^4\theta K(X+{{\bar{X}}})\nonumber \\&-\frac{c^2}{4} K''A_{\mu }A^{\mu } -cK''A_{\mu }\partial ^{\mu }\mathrm{Im}X\nonumber \\&+\frac{c}{2}K'D+\, \mathrm{fermions}. \end{aligned}$$
(64)

As a result, there is a mass term for the gauge boson \(A_{\mu }\), as well as a new field dependent FI term \(-\xi (X)D\), with \(\xi (\chi )=-cK'/2\). It modifies the D-term of the scalar potential (30) according to \(D=g^2\left[ -cK'/2+\underset{i}{\sum }\xi _ie^{(\alpha _i+2/3){\mathcal {G}}}\right] \), which leads to the following D-term contribution to the scalar potential:

$$\begin{aligned} {\mathcal {V}}_{FI}=\frac{g^2}{2}\left( \underset{i}{\sum }\xi _ie^{\left( \alpha _i+\frac{2}{3}\right) {\mathcal {G}}}-\frac{c}{2}K'\right) ^2. \end{aligned}$$
(65)

It is easy to show that the extra contribution proportional to c, due to the gauging of the shift symmetry, does not alter the inflationary predictions discussed in the previous section, when restricting the D-auxiliary field to only two non-vanishing terms, as in the previous section. Consider for example the compact volume case with a Kähler potential \(K=-3\ln (X+{{{\bar{X}}}})\). The second term in (65) then becomes proportional to \(m_{3/2}^{2/3}\) which may be identified as a particular case of the potential (53) studied before for \(\alpha =-1/3\) and \(\xi _1=(3cg)/(2|W_0|^{2/3})\). \(\xi _2\) can then be obtained from the first sum in (65) by choosing one non-vanishing term, say \(\xi _2\) with \(\alpha _2=-2/3\). The analysis is then reduced to the one of the last section, in the compact volume case with a fixed value of the parameter \(\alpha =-1/3\), which is within the allowed region of the parameter space compatible with observational data, as seen in Fig. 3.

Let us finally consider another example inspired by the heterotic string with X identified with the string dilaton, as in the first model considered in the previous section, where its axionic imaginary part is dual to the Neveu–Schwarz antisymmetric tensor in four dimensions. In this case, the constant c is related to a U(1) anomaly which is cancelled by a Green–Schwarz term. The gauging of the shift symmetry is a consequence of the anomaly cancellation and the axion is absorbed by the U(1) becoming massive and no massless particle remains in the spectrum [16]. The gauge coupling is not anymore constant but is fixed by X: \(g^2=1/\mathrm{Re}X\), corresponding to a gauge kinetic function linear in X. The scalar potential can be easily obtained from Eq. (65) using \(K=-\ln (X+{{{\bar{X}}}})\) and the expressions (35) and (36):

$$\begin{aligned} {\mathcal {V}}=-2m_{3/2}^2+\frac{e^{-\sqrt{2}\chi }}{2}\left( \underset{i}{\sum }\xi _ie^{\left( \alpha _i+\frac{2}{3}\right) {\mathcal {G}}}+\frac{c}{4}e^{-\sqrt{2}\chi }\right) ^2.\nonumber \\ \end{aligned}$$
(66)

Again we restrict the D-term to only two non-vanishing contributions. In order to get an asymptotically constant potential at infinity, we choose \(\alpha _2=-\frac{7}{6}\), while \(\alpha _1=\frac{1}{3}\) is chosen to be able to absorb the constant c in \(\xi _1\). We obtain in this way a potential with the same form as in Eq. (33):

$$\begin{aligned} {\mathcal {V}}=-2m_{3/2}^2+\frac{1}{2}\left( \xi _1'(m_{3/2}^2)^{3/2}+\xi _2'\right) ^2, \end{aligned}$$
(67)

where we have defined \(\xi _1'\equiv \frac{\xi _1\sqrt{2}}{W_0}+\frac{c}{\sqrt{2}W_0^3}\) and \(\xi _2'\equiv \frac{\xi _2\sqrt{2}}{W_0}\).

The potential is thus the same as the one of Eq. (33), with \(\alpha =\frac{5}{6}\). This is an acceptable value since it leads to a number of e-folds \(N(5/6)\simeq 51\). The numerical predictions obtained in Sect. 4.1 are not modified by the gauging of the shift symmetry. The main improvement is that now the imaginary part of the inflaton has been absorbed by the U(1) gauge boson which acquires a mass. In order to compute this mass, one needs to rescale \(A_{\mu }\rightarrow gA_{\mu }\) so that the gauge field kinetic term becomes canonical. After this rescaling, the gauge boson mass square reads:

$$\begin{aligned} m_{A}^2(\chi )=\frac{e^{-3\sqrt{2}\chi }}{8}c^2=\frac{g^6}{8}c^2. \end{aligned}$$
(68)

With the values of \(\chi \) at the horizon crossing and at the minimum found above, we get \(m_{A}^2\) in terms of the parameters c and \(W_0\):

$$\begin{aligned} m_A^{*2}= & {} 1.84\times 10^{-38}\frac{c^2}{W_0^6}, \end{aligned}$$
(69)
$$\begin{aligned} m_A^{2\text {end}}= & {} 3.05\times 10^{-34}\frac{c^2}{W_0^6}, \end{aligned}$$
(70)
$$\begin{aligned} {{\tilde{m}}}_A^2= & {} 1.93\times 10^{-31}\frac{c^2}{W_0^6}, \end{aligned}$$
(71)

which can therefore vary in a large range of values consistent with all experimental bounds. The rest of the masses do not present any significant change from the previous analysis in the dilaton case without the gauging.

6 Conclusion

In this work, we generalised the construction of new FI D-terms in \({\mathcal {N}}=1\) supergravity that do not require the gauging of R-symmetry and preserve invariance under ordinary Kähler transformations. Their bosonic part is just linear in the D-auxiliary field with a multiplicative factor which is an arbitrary function of the gravitino mass, expressed as a functional of the chiral multiplets. We then used these terms to construct new models of D-term inflation. Considering just a U(1) and the inflaton multiplet with a no-scale Kähler potential and constant superpotential, we restricted to a simple form of the function associated to the new FI D-term which is a single positive power of the gravitino mass up to an additive constant. The later dominates the inflationary period by an asymptotically de Sitter regime, because the gravitino mass vanishes asymptotically in this region. The resulting models are consistent with observational CMB data and share common properties with the Starobinsky \(R^2\) model [17] on one hand and with the models of inflation by supersymmetry breaking on the other hand, where the inflaton is identified with the superpartner of the goldstino [6, 11]. Moreover, they predict a variable range of primordial gravitational waves that can be within experimental reach. After the end of inflation, the inflaton rolls down to the minimum of the potential which has a tuneable (tiny) vacuum energy and supersymmetry is broken by a combination of F and D-term VEVs. An interesting open problem is whether there exists a microscopic origin of these new FI D-terms, for instance within string theory.