1 Introduction

The inflationary scenario [1,2,3,4,5] has become the paradigm of early universe cosmology. However, it is unclear whether cosmological inflation can be embedded in an ultraviolet complete theory. Inflation is usually obtained using slow rolling on an almost constant potential in the context of an effective field theory of a canonically normalized scalar field \(\phi \) minimally coupled to Einstein gravity. Recently, it has been argued that any such effective field theory which is consistent with superstring theory must satisfy a number of conditions (which are called the swampland constraints) [6, 7]. The first condition is the distance conjecture [8] which states that any effective field theory has an applicability range which is limited by \(|\Delta \phi | < c_1 m_{pl}\), where \(m_{pl}\) is the four space-time dimensional Planck mass, and \(c_1\) is a coefficient of the order 1. The second condition (de Sitter conjecture) states [9, 10] that the potential is either sufficiently steep or sufficiently tachyonic, i.e.

$$\begin{aligned} |\frac{V^\prime m_{pl}}{V}| \,> & {} \, c_2 \,\,\,\,\, \mathrm{{or}} \nonumber \\ \frac{V^{\prime \prime } m_{pl}^2}{V} \,< & {} \, -c_3 \end{aligned}$$
(1)

where \(c_2\) and \(c_3\) are positive coefficients of the order 1. These conditions effectively rule out standard single field slow-roll models of inflation [11].Footnote 1 They also rule out false vacuum models of inflation constructed using the effective field theory approach, while still leaving room for non-perturbative constructions of metastable de Sitter space such as those discussed recently in [14,15,16].

While the swampland criteria severely constrain standard slow-roll inflation, they can be consistent with other approaches to inflation such as warm inflation [17], trapped inflation [18] and chromo-natural inflation [19]. In these scenarios, there are extra contributions to the equation of motion of the scalar field \(\phi \) due to couplings with other matter fields which allow for accelerated expansion of space on a steep potential. Note, however, that these models continue to be in tension with the recently proposed Trans-Planckian Censorship Conjecture (TCC) [20, 21], as discussed in [22]. The latter constrains severely the future dynamics of many dark energy models [23].

Before the swampland criteria gained attention, there had been many attempts to embed cosmological inflation into string theory (see e.g. [24] for a textbook treatment). Axion monodromy inflation [25, 26] is considered to be one of the most promising scenarios. In this approach, a large effective field range is rendered consistent with the sub-Planckian range of the size of the extra spatial dimensions by a monodromy effect. An intuitive way to visualize this [25] is to consider \(\phi \) to be related to the length of a string which connects a bulk brane to the brane we live on, and having the string wind the base manifold many times. The overall potential for \(\phi \) is then modulated each time the brane positions come to overlap. Initially, axion monodromy inflation was suggested as a way to implement standard slow-roll large field inflation. Later on it was realized [27] (see also [28, 29]) that at the brane crossing points the masses of other string degrees of freedom become small, and that these fields can then be produced by the rolling \(\phi \) field by a mechanism similar to how matter can be produced in the preheating process at the end of inflation [30,31,32,33,34,35,36]. The particle production creates a new effective friction term which can slow down the motion, as already shown in [18, 22].Footnote 2

In this note we take a closer look at the effects of moduli field production at enhanced symmetry points. We show that particle production via the intrinsic nonlinearities in the scalar field equation of motion produced by the modulation of the potential is insufficient to render axion monodromy inflation consistent with the swampland conjectures – the effect of \(\phi \) particle production is either negligible, or else it traps \(\phi \) in a false vacuum state, leading to a graceful exit problem. Moduli field production, on the other hand, leads to sufficient effective friction and allows for accelerated expansion for sub-Planckian field values for which the swampland criteria are satisfied, without trapping \(\phi \) in a false vacuum.

In the following section we will consider \(\phi \) particle production due to the nonlinear terms in the \(\phi \) equation of motion generated by the modulation of the \(\phi \) potential. We show that, depending on the amplitude of the modulation term, \(\phi \) is either trapped in one of the false vacua it encounters, or else the effect of \(\phi \) particle production is negligible.

From the point of view of string theory, however, it is not consistent to just consider the modulation of the potential for \(\phi \) but to neglect the effect of the rolling \(\phi \) on the moduli fields which become light at the enhanced symmetry points. In Sect. 3 we hence study, similar to what was done in [18], moduli production at these points, making use of the analysis of [27]. We find that for sufficiently large values of the coupling between \(\phi \) and the moduli fields, moduli production slows down the motion of \(\phi \) sufficiently such that accelerated expansion can occur for field values \(|\phi | \le m_{pl}\) for which the swampland criteria are obeyed: the relative slope of the potential is sufficiently large, and super-Planckian field values are not required. Thus, both the distance and de Sitter criteria are obeyed.

We will be using natural units in which the speed of light and Planck’s constant are set to 1. We analyze the evolution of the matter fields in a background cosmological model with scale factor a(t), in terms of which the Hubble expansion rate is \(H(t) = {\dot{a}}/a\), where t is the physical time and an overdot indicates a derivative with respect to t. The coordinates \(\mathbf{{x}}\) are comoving spatial coordinates, and \(\mathbf{{k}}\) are the associated comoving momenta, with k denoting the magnitude. The reduced Planck mass is denoted by \(m_{pl} \equiv [1/(8 \pi G)]^{1/2}\).

2 Particle production from the modulation of the potential

2.1 Model

We consider a particular version of the axion monodromy inflation model in which the axion is described as a scalar field \(\phi \) with a linear potential which is modulated by a sinusoidal correction term induced by instantons [26]

$$\begin{aligned} V(\phi ) \, = \, \mu ^3 \phi + \Lambda ^4 \cos \Bigg (\frac{\phi }{2\pi f}\Bigg ), \end{aligned}$$
(2)

where \(\mu \) and \(\Lambda \) are model-dependent parameters determined by the brane compactification and the instanton action respectively, satisfying \(\Lambda \ll \mu \), while f characterizes the frequency of the periodic correction, and \(f \ll m_{pl}\).

Assuming that the axion is the inflaton, and neglecting the effect of the sinusoidal term, the inflaton must start at \(\phi \sim 11 m_{pl}\) in order to obtain 60 e-foldings of accelerated expansion [26]. However, such field values are obviously in tension with the distance conjecture, and also the de Sitter conjecture, since for the linear potential of the axion the first line of the condition (1) demands that

$$\begin{aligned} \phi \, \lesssim \, 1 m_{pl} \, , \end{aligned}$$
(3)

setting the constant \(c_2 = 1\), and is thus in contradiction with the requirement from 60 e-foldings of inflation.

However, due to the nonlinearities in the equation of motion for \(\phi \) induced by the periodic correction to the potential, there will be production of \(\phi \) particles, and this particle production effect will back-react and slow down the evolution of \(\phi \). In the following, we will consider \(\phi \) particle production and its effect on the evolution of the background value of \(\phi \), in order to examine whether this effect is sufficient to cause slow-roll inflation with field values which are consistent with the de Sitter constraint.

2.2 First order perturbation of the axion evolution

Using the axion potential Eq. (2) in the scalar field action, the equation of motion of the spatially homogeneous background solution \(\phi _0(t)\) is

$$\begin{aligned} \ddot{\phi _0} + 3H\dot{\phi _0} + \mu ^3 - \frac{\Lambda ^4}{2\pi f}\sin \frac{\phi _0}{2\pi f} \, = \, 0 \, \end{aligned}$$
(4)

where the Hubble expansion rate is given by

$$\begin{aligned} H \, = \, \frac{1}{\sqrt{3}m_{pl}}\sqrt{\frac{1}{2}\dot{\phi _0}^2+V(\phi _0)} \, . \end{aligned}$$
(5)

We will study the space-dependent \(\phi \) fluctuations in first order perturbation theory which are induced by the nonlinearities. The ansatz for \(\phi \) is

$$\begin{aligned} \phi (\mathbf {x},t) \, = \, \phi _0(t)+\varphi (\mathbf {x},t). \end{aligned}$$
(6)

Expanding the equation of motion for \(\phi \) to first order in the amplitude of the fluctuations, and neglecting for a moment the expansion of space (we will include this effect later on), we obtain the following equation of motion for \(\varphi \)

$$\begin{aligned} \ddot{\varphi } - \nabla ^2 \varphi + V''(\phi _0)\varphi \, = \, 0. \end{aligned}$$
(7)

Since the equation of motion is linear, each Fourier mode \(\varphi _k (t)\) evolves independently according to the equation

$$\begin{aligned} \ddot{\varphi }_k + \Bigg (k^2 - \frac{\Lambda ^4}{(2\pi f)^2}\cos \frac{\phi _0}{2\pi f}\Bigg )\varphi _k \, = \, 0. \end{aligned}$$
(8)

This equation describes a harmonic oscillator with a time-dependent mass. The time-dependence of the mass can lead to particle production. Particle production is expected to be important if the adiabaticity condition on the effective frequency \(\omega _k\) is violated, i.e.

$$\begin{aligned} \alpha \, \equiv \, \frac{d\omega ^2/ dt }{\omega ^3} \, > \, 1. \end{aligned}$$
(9)

For modes with

$$\begin{aligned} k \, \gg \, \frac{\Lambda ^2}{2\pi f}, \end{aligned}$$
(10)

the adiabaticity condition is not violated, and we obtain oscillating behavior, no amplification and hence no particle production. Therefore, they do not effect the evolution of the background .

In contrast, modes with

$$\begin{aligned} k \, \ll \, \frac{\Lambda ^2}{2\pi f} \end{aligned}$$
(11)

can be amplified. This draws energy from the background field and can slow down the rolling. The typical time scale of the instability is expected to be shorter than the Hubble time step, thus justifying the neglect of the Hubble friction term. In the numerical parts of the following analysis, we use the parameter values \(\mu \sim 0.1\), \(\Lambda \sim 0.01\), \(f\sim 0.001\), all in Planck units.

For modes in the range (11), the approximate form of the mode equation becomes

$$\begin{aligned} \ddot{\varphi }_k-\frac{\Lambda ^4}{(2\pi f)^2}\cos \Bigg ( \frac{\phi _0}{2\pi f}\Bigg )\varphi _k=0. \end{aligned}$$
(12)

The evolution of \(\varphi _k\) in one period of \(\phi _0\) can be approximated separately in four phases, as shown in Fig. 1. In each phase, the sinusoidal term is taken to be linear in \(\phi _0\), and is hence also linear in time if we use the background solution \(\phi _0(t)\).

Fig. 1
figure 1

The periodical term in the equation of motion of \(\varphi _k\). \(\phi _0\) decreases as time proceeds, and the evolution of \(\varphi _k\) in each period is analysed separately for the phases (1), (2), (3), (4)

The division of the phases is determined by the adiabaticity parameter \(\alpha \) defined in (9). We start with

$$\begin{aligned} \omega ^2 \, \equiv \, -\frac{\Lambda ^4}{(2\pi f)^2}\cos \frac{\phi _0}{2\pi f}, \end{aligned}$$
(13)

and then \(\alpha \) becomes

$$\begin{aligned} \alpha \,= & {} \, \frac{1}{\Lambda ^2}\left| \frac{\sin \Bigg (\frac{\phi _0}{2\pi f}\Bigg )\dot{\phi }_0}{\cos ^{3/2}\Bigg (\frac{\phi _0}{2\pi f}\Bigg )}\right| \nonumber \\\simeq & {} \, \frac{1}{\Lambda ^2}\left| \frac{\sin \Bigg (\frac{\phi _0}{2\pi f}\Bigg )}{\cos ^{3/2}\Bigg (\frac{\phi _0}{2\pi f}\Bigg )}\right| \frac{m_{pl}}{\sqrt{3}}\mu ^{3/2}\phi _0^{-1/2} \end{aligned}$$
(14)

where we made use of

$$\begin{aligned} \dot{\phi }_0 \, \simeq \, -\frac{\mu ^3}{3H} \end{aligned}$$
(15)

and

$$\begin{aligned} H \, \simeq \, \frac{1}{\sqrt{3}m_{pl}}\sqrt{\mu ^3\phi _0} \, . \end{aligned}$$
(16)

The junctions between phases occur when \(\alpha =1\), dividing the period according to whether \(\omega ^2\) changes rapidly or not.

In phases (1) and (3), the adiabaticity condition is satisfied (\(\alpha \lesssim 1\)), so that \(\omega ^2\) can be approximated with a constant \(\omega ^2\simeq \pm \frac{\Lambda ^4}{(2\pi f)^2}\). However, the time duration of these phases is found to be much shorter than that of the whole period, and thus the \(\varphi _k\) evolution in these two phases has negligible effects.

In phase (2), \(\omega ^2(t)\) can be expanded linearly around \(\phi _0(t_2)\) where \(\cos \frac{\phi _0(t_2)}{2\pi f}=0\)

$$\begin{aligned} \omega ^2 \, \simeq \, -\frac{\Lambda ^4}{(2\pi f)^3}\dot{\phi }_0(t_2) (t-t_2). \end{aligned}$$
(17)

Substituting Eq. (17) into Eq. (12) we get

$$\begin{aligned} \ddot{\varphi }_k+P (t-t_2)\varphi _k \, = \, 0, \end{aligned}$$
(18)

in which

$$\begin{aligned} P \, \equiv \, -\frac{\Lambda ^4}{(2\pi f)^3}\dot{\phi _0}(t_2) \, \simeq \, \frac{\Lambda ^4}{(2\pi f)^3}\frac{m_{pl}}{\sqrt{3}}\mu ^{3/2}\phi _0^{-1/2}(t_2) \, . \end{aligned}$$
(19)

This equation has a solution

$$\begin{aligned} \varphi _k \,= & {} \, C_1 Ai(P^{1/3}e^{i\frac{\pi }{3}}(t-t_2))\nonumber \\&+ \, C_2 Bi(P^{1/3}e^{i\frac{\pi }{3}}(t-t_2)), \end{aligned}$$
(20)

where Ai and Bi are Airy functions, and \(C_1\) and \(C_2\) are constants. The solution can be further expanded around \(t=t_2\), because \(|P^{1/3}(t-t_2)|\lesssim 0.05\) given the values of the parameters which we use, and the Airy functions are approximately linear in this range:

$$\begin{aligned} Ai(x) \,= & {} \, \frac{1}{3^{2/3}\Gamma \Bigg (\frac{2}{3}\Bigg )}-\frac{x}{3^{1/3} \Gamma \Bigg (\frac{1}{3}\Bigg )}+\mathcal {O}(x^3),\nonumber \\ Bi(x) \,= & {} \, \frac{1}{3^{1/6}\Gamma \Bigg (\frac{2}{3}\Bigg )} +\frac{3^{1/6}x}{\Gamma \Bigg (\frac{1}{3}\Bigg )}+\mathcal {O}(x^3). \end{aligned}$$
(21)

Therefore, \(\varphi _k\) evolves linearly in phase (2). Similarly, in phase (4), it also has a linear behavior.

In summary, the evolution of \(\varphi _k\) in one period is nearly linear in time, when the boundary conditions between phases are matched. Connecting the solutions in different periods, \(\varphi _k\) is found to grow linearly during the whole process

$$\begin{aligned} \varphi _k \, = \, \frac{1}{\sqrt{2k}}+\sqrt{\frac{k}{2}}t, \end{aligned}$$
(22)

with the initial conditions determined by assuming that the modes start out in their vacuum state, and using quantum vacuum fluctuation initial conditions

$$\begin{aligned} \varphi _k(0) \,= & {} \, \frac{1}{\sqrt{2k}} \nonumber \\ \dot{\varphi }_k(0) \,= & {} \, \sqrt{\frac{k}{2}}. \end{aligned}$$
(23)

However, when we consider the \(\varphi _k\) evolution during the whole time evolution instead of during a single period, the effects of the expansion of space cannot be neglected. The equation of motion becomes

$$\begin{aligned} \ddot{\varphi }_k + 3H\dot{\varphi }_k - \frac{\Lambda ^4}{(2\pi f)^2}\cos \Bigg ( \frac{\phi _0}{2\pi f}\Bigg )\varphi _k \, = \, 0 \end{aligned}$$
(24)

instead of Eq. (12). The Hubble damping term can be removed by the following field redefinition [42]

$$\begin{aligned} \Phi _k \, \equiv \, a^{\frac{3}{2}}\varphi _k. \end{aligned}$$
(25)

In terms of the new field variable, the equation becomes

$$\begin{aligned} \ddot{\Phi }_k - \left( \frac{\Lambda ^4}{(2\pi f)^2}\cos \Bigg ( \frac{\phi _0}{2\pi f}\Bigg )+\delta \right) \Phi _k \, = \, 0, \end{aligned}$$
(26)

in which \(\delta :=\frac{9}{4}H^2+\frac{3}{2}\dot{H}\) is negligible compared to the other terms, so that the behavior of \(\Phi _k\) is similar to the solution Eq. (22), but with different initial conditions. Setting the initial time to be \(t = 0\), and normalizing the scale factor as \(a(t=0) = 1\), we get

$$\begin{aligned} \Phi _k(0) \,= & {} \, \frac{1}{\sqrt{2k}} \nonumber \\ \dot{\Phi }_k(0) \,= & {} \, \frac{3}{2}H(0)\frac{1}{\sqrt{2k}}+\sqrt{\frac{k}{2}} \end{aligned}$$
(27)

so that

$$\begin{aligned} \Phi _k \, = \, \frac{1}{\sqrt{2k}}+\left( \frac{3}{2}H(0) \frac{1}{\sqrt{2k}}+\sqrt{\frac{k}{2}}\right) t, \end{aligned}$$
(28)

and hence

$$\begin{aligned} \varphi _k \, = \, \frac{a^{-\frac{3}{2}}}{\sqrt{2k}}+a^{-\frac{3}{2}} \left( \frac{3}{2}H(0)\frac{1}{\sqrt{2k}}+\sqrt{\frac{k}{2}}\right) t. \end{aligned}$$
(29)

Note that in a background with accelerated expansion, the dilution due to the expansion of space overwhelms the increase in the number of particles.

2.3 Back-reaction of the perturbations

At quadratic order in the amplitude of the fluctuations, these back-react on the homogeneous mode of \(\phi \). Expanding the equation of motion for \(\phi \) in Eq. (6) to second order and averaging over the fluctuation field \(\varphi \) yields the equation

$$\begin{aligned} \ddot{\phi _0}+ & {} 3H\dot{\phi _0} + \mu ^3 \nonumber \\- & {} \frac{\Lambda ^4}{2\pi f}\sin \frac{\phi _0}{2\pi f}+ \frac{1}{2}\frac{\Lambda ^4}{(2\pi f)^3}\sin \left( \frac{\phi _0}{2\pi f}\right) \langle \varphi ^2\rangle \, = \, 0.\nonumber \\ \end{aligned}$$
(30)

This equation differs from Eq. (4) in that it includes the effects of the back-reaction of the fluctuation field \(\varphi \) on the spatial homogeneous solution \(\phi _0\).

Fig. 2
figure 2

The evolution of \(\phi _0\) for different values of f, with fixed \(\mu =0.1\), \(\Lambda =0.01\), compared with the background solution. Both the field values and the time are in Planck units

The expectation value \(\langle \varphi ^2\rangle \) can be found by integrating the contributions of all of the Fourier modes which are being excited, i.e. from \(k = 0\) until the critical value \(k=\frac{\Lambda ^2}{2\pi f}\):

$$\begin{aligned} \langle \varphi ^2\rangle&\simeq \int _0^{\frac{\Lambda ^2}{2\pi f}}dk 4\pi k^2 \langle \varphi ^2_k\rangle \nonumber \\&=\frac{4\pi }{a^3}\left[ \frac{1}{8}\left( \frac{\Lambda ^2}{2\pi f}\right) ^4+\frac{9}{16}H^2(0)\left( \frac{\Lambda ^2}{2\pi f}\right) ^2\right. \nonumber \\&\quad \left. +\frac{1}{2}H(0)\left( \frac{\Lambda ^2}{2\pi f}\right) ^3\right] t^2\nonumber \\&\quad +\frac{4\pi }{a^3}\left[ \frac{1}{3}\left( \frac{\Lambda ^2}{2\pi f}\right) ^3+\frac{3}{4}\left( \frac{\Lambda ^2}{2\pi f}\right) ^2H(0)\right] t \end{aligned}$$
(31)

where we used Eq. (30) and assumed \(\langle \varphi ^2\rangle |_{t=0}=0\). Substituting Eq. (31) into Eq. (30), the \(\phi _0\) evolution can be determined and compared with the background solution.

2.4 Numerical solutions of the \(\phi _0\) evolution

In the following we numerically solve the Eq. (30) for the homogeneous field mode corrected by the back-reaction effects and compare the result with the solution \(\phi _0\) of the background without any back-reaction terms. The background solution \(\phi _0\) comes from solving Eq. (4). We have checked that the precise values of \(\Lambda \) and F have a negligible effect on our overall conclusions, as long as \(\Lambda \ll \mu \).

The \(\phi _0\) evolution including the back-reaction of the produced particles comes from solving Eq. (30), using

$$\begin{aligned} H \, = \, \frac{1}{\sqrt{3}m_{pl}}\sqrt{\frac{1}{2}\dot{\phi _0}^2+V(\phi _0)}, \end{aligned}$$
(32)

and

$$\begin{aligned} a(t) \, = \, \exp \int _0^t H(t')dt'. \end{aligned}$$
(33)

To check whether \(\phi \) particle production can render the model consistent with the swampland conjectures, we start the evolution with the largest field value consistent with the distance conjecture, namely \(\phi _0 = 1\) (in Planck units). Fixing \(\mu \) and \(\Lambda \), we find that the parameter f influences the fate of \(\phi _0\), as shown in Fig. 2. If the period f of the oscillations of the potential is small, then there is a big back-reaction effect and \(\phi _0\) gets trapped almost immediately, leading to a graceful exit problem. On the other hand, if the local minima of the potential are well separated, then the effect of particle production is negligible, and no inflation can occur. As shown in the graph, the transition in the behavior of the solution as a function of f is very sharp.

3 Including the effects of moduli field production

3.1 Model

From the point of view of string theory, the model considered in the previous section is incomplete. It includes the modulation of the potential due to the monodromy effects, but it does not take into account that the monodromy points (the local minima of the modulated potential) are enhanced symmetry states at which a tower of string modes (which we call “moduli” in the following) become massless. If these fields couple to the inflaton, as is expected, then the rolling inflaton field will lead to the production of moduli quanta [27].Footnote 3

To study the effect of moduli production on the inflaton dynamics we consider the following Lagrangian [18]

$$\begin{aligned} \mathcal {L} \,= & {} \, \frac{1}{2}\partial _{\mu }\phi \partial ^{\mu }\phi -V(\phi ) \nonumber \\&+ \frac{1}{2}\sum _i (\partial _{\mu }\chi _i\partial ^{\mu }\chi _i-g^2(\phi -\phi _i)^2\chi _i^2)\, , \end{aligned}$$
(34)

where the potential is assumed to be

$$\begin{aligned} V(\phi ) \, = \, \mu ^3 \phi \, , \end{aligned}$$
(35)

where we are neglecting the modulation of the potential. As discussed in the previous section, this is a good approximation as long as the minima are not too closely packed. This model describes the axion monodromy field coupled to the string degrees of freedom \(\chi _i\), which become light periodically when the rolling axion field \(\phi \) reaches enhanced symmetry points \(\phi _i\). At each such value \(\phi _i\), a modulus field \(\chi _i\) becomes massless [25, 26].

3.2 Estimation of the back-reaction effect

In the Lagrangian Eq. (34), the \(\chi _i\) fields become massless at the enhanced symmetry points \(\phi _i\), which are expected to be evenly spaced in \(\phi \). We denote the interval between enhanced symmetry points by

$$\begin{aligned} \Delta \, \equiv \, \phi _{i+1}-\phi _i . \end{aligned}$$
(36)

Similar to Eqs. (4) and (30), the background solution of this model satisfies

$$\begin{aligned} \ddot{\phi }_0 + 3H\dot{\phi }_0 + V'(\phi _0) \, = \, 0 \, , \end{aligned}$$
(37)

while the full solution including the back-reaction effects of the \(\chi _i\) fluctuations produced when \(\phi \) crosses the enhanced symmetry points is

$$\begin{aligned} \ddot{\phi } + 3H\dot{\phi } + V'(\phi ) - \sum _i g^2|\phi -\phi _i|\langle \chi _i^2\rangle \, = \, 0. \end{aligned}$$
(38)

Following [42], the back-reaction term \(g^2|\phi -\phi _i|\langle \chi _i^2\rangle \) at the ith crossing can be approximated as \(gn_{\chi _i}\), where \(n_{\chi _i}\) is the number density of the \(\chi _i\) particles. As discussed in [18, 27], particle production happens when the adiabaticity condition is violated in the equation of motion of \(\chi _i\), which happens when

$$\begin{aligned} |\phi -\phi _i| \, \lesssim \, \delta \phi \, \equiv \, \sqrt{\frac{|\dot{\phi }(t_i)|}{g}} \, , \end{aligned}$$
(39)

and only for spatial Fourier modes \(\chi _{ik}\) satisfying

$$\begin{aligned} \frac{k^2}{g|\dot{\phi }(t_i)|} \, \lesssim \, 1 \, . \end{aligned}$$
(40)

We will show in Eq. (53) that \(\dot{\phi }\) is constant during inflation under the slow-roll approximation, so that \(\delta \phi \) is nearly the same for all of the enhanced symmetry points. The particles produced at the i’th crossing are later diluted by the expansion of space, with a dilution factor \(\frac{a(t_i)^3}{a(t)^3}\). As derived in [27], the full computation of the particle production for the ith crossing gives

$$\begin{aligned} n_{\chi _{ik}} \, = \, e^{-\pi \frac{k^2}{g|\dot{\phi }(t_i)|}} \frac{a(t_i)^3}{a(t)^3}, \end{aligned}$$
(41)

for particles of a specific Fourier mode k, and

$$\begin{aligned} n_{\chi _i} \, = \, \int \frac{d^3k}{(2\pi )^3}n_{\chi _{ik}} \, = \, \frac{g^{\frac{3}{2}}}{(2\pi )^3}(\dot{\phi }(t_i))^{\frac{3}{2}} \frac{a(t_i)^3}{a(t)^3} \end{aligned}$$
(42)

for the total number of \(\chi \) particles. To obtain a qualitative understanding of this result, note that the back-reaction effect is dominated by the largest value of k for which the mode functions undergo amplification, because the phase space of modes grows as \(k^3\). Thus, it is the critical value \(k\sim \sqrt{g|\dot{\phi }|}\) which dominates. The time interval of particle production is given by

$$\begin{aligned} \Delta t \, \sim \, \frac{\delta \phi }{\dot{\phi }}\sim \frac{1}{\sqrt{g|\dot{\phi }|}} \, . \end{aligned}$$
(43)

The Lagrangian (34) cannot be trusted to describe the interaction between \(\chi _i\) and \(\phi \) for values of \(\phi \) which are outside of the interval \(|\phi - \phi _i| < \delta \phi \) since the interaction term was based on a quadratic approximation about the enhanced symmetry point. We will hence consider a modified Lagrangian in which we truncate the \(\phi -\chi \) interaction terms to be non-zero only inside the regions of non-adiabatic particle production, and take the mass of \(\chi _i\) to be constant outside

$$\begin{aligned} \mathcal {L} \,= & {} \, \frac{1}{2}\partial _{\mu }\phi \partial ^{\mu }\phi - V(\phi ) \nonumber \\&+ \frac{1}{2}\sum _i (\partial _{\mu }\chi _i\partial ^{\mu }\chi _i\!-\!g^2(\phi -\phi _i)^2\chi _i^2 \Theta (\delta \phi \!-\!|\phi -\phi _i|)) \nonumber \\&- \frac{1}{2}\sum _i m^2\chi _i^2\Theta (|\phi -\phi _i|-\delta \phi ) \, , \end{aligned}$$
(44)

where \(\Theta (x)\) is the step function, and \(m \equiv g\delta \phi \) is the \(\chi \) mass outside the non-adiabatic regions. In Fig. 3, we show the evolution of the \(\chi _i\) mass during the rolling of \(\phi \).

Fig. 3
figure 3

The \(\chi _i\) mass near the enhanced symmetry point \(\phi _i\)

Fig. 4
figure 4

The evolution of \(\phi \) starting from \(\phi =1 m_{pl}\) for different values of F, with fixed \(\mu = 0.1\) (in Planck units), and compared with the background solution

Therefore, the equation of motion for \(\phi \) becomes

$$\begin{aligned} \ddot{\phi }+ & {} 3H\dot{\phi } + V'(\phi ) \nonumber \\- & {} \sum _i g^2|\phi -\phi _i|\langle \chi _i^2\rangle \Theta (\delta \phi -|\phi -\phi _i|) \, = \, 0 \, , \end{aligned}$$
(45)

which is equivalent to

$$\begin{aligned} \ddot{\phi } + 3H\dot{\phi } + V'(\phi ) - \sum _i gn_{\chi _i}\Theta (\delta \phi -|\phi -\phi _i|) \, = \, 0 \, .\nonumber \\ \end{aligned}$$
(46)

The back-reaction terms in Eq. (46) are multiplied by the additional step functions and thus differ from Eq. (38). In our model, the regions of production of \(\chi _i\) particles lead overlaps of a number \({\tilde{N}}\) of \(\phi \) intervals where particle production of different \(\chi _i\) fields occurs. The number is given by

$$\begin{aligned} {\tilde{N}} \, \simeq \, \frac{2\delta \phi }{\Delta } \, \simeq \, \frac{2\sqrt{|\dot{\phi }|/g}}{\Delta } \, . \end{aligned}$$
(47)

Combining this result with (42), the summation of the back-reaction terms can be approximated as

$$\begin{aligned} \sum _i gn_{\chi _i}\Theta (\delta \phi -|\phi -\phi _i|) \, \simeq \, \frac{2g^2}{(2\pi )^3}\frac{|\dot{\phi }|^2}{\Delta }, \end{aligned}$$
(48)

where we neglected the expansion of space in (42) since at any time it will be new instability bands which dominate the back-reaction. The effective equation of motion of the \(\phi \) field thus becomes

$$\begin{aligned} \ddot{\phi } + 3H\dot{\phi } + V'(\phi ) - \frac{2g^2}{(2\pi )^3}\frac{|\dot{\phi }|^2}{\Delta } \, = \, 0 \end{aligned}$$
(49)

with

$$\begin{aligned} H \, = \, \frac{1}{\sqrt{3}m_{pl}}\sqrt{\frac{1}{2}\dot{\phi }^2+V(\phi )} \, . \end{aligned}$$
(50)

We see that the amplitude of the \(\chi \) back-reaction term is determined by a free parameter

$$\begin{aligned} F \, \equiv \, \frac{g^2}{\Delta } \, . \end{aligned}$$
(51)

We have assumed in Eq. (50) that the energy density of the \(\chi \) particles is sub-dominant compared to \(V(\phi )\)

$$\begin{aligned} \rho _\chi\simeq & {} \sum _i g\delta \phi n_{\chi _i} \, \simeq \, \sum _i g\sqrt{\frac{|\dot{\phi }(t_i)|}{g}}\frac{g^{\frac{3}{2}}}{(2\pi )^3} \dot{\phi }(t_i)^{\frac{3}{2}}\frac{a(t_i)^3}{a(t)^3} \nonumber \\\simeq & {} \frac{g^2}{3H\Delta (2\pi )^3}|\dot{\phi }(t)|^3 \, \ll \, V(\phi ), \end{aligned}$$
(52)

so that it is negligible in the Friedmann equation. If we assume slow roll \(|\ddot{\phi }|\ll 3H|\dot{\phi }| \ll V'\), the Eq. (49) can be solved as

$$\begin{aligned} |\dot{\phi }| \, \simeq \, \left( \frac{(2\pi )^3 V'\Delta }{2g^2}\right) ^{\frac{1}{2}}. \end{aligned}$$
(53)

Substituting Eq. (53) into Eq. (52), and using the approximation \(V(\phi ) \simeq \mu ^3 m_{pl}\), we get a constraint on H, F and \(\mu \)

$$\begin{aligned} H^{-1}F^{-\frac{1}{2}}\mu ^{\frac{3}{2}} \, \ll \, m_{pl}. \end{aligned}$$
(54)

Using Eq. (53) and assuming \(\dot{\phi }^2\ll V(\phi )\) and \(\phi (t_{end}) \simeq 0\), the number N of e-foldings of the expansion of space is approximately’

$$\begin{aligned} N \, \equiv \, \int _0^{t_{end}} Hdt \, \simeq \, \frac{2\sqrt{2}m_{pl}^{\frac{1}{2}}}{3\sqrt{3} (2\pi )^{\frac{3}{2}}}F^{\frac{1}{2}}. \end{aligned}$$
(55)

3.3 Numerical solutions and observational constraints

In Fig. 4, we show the numerical solution of Eq. (49) for various values of the parameter F, but fixed \(\mu \), and compare the results with the background solution from Eq. (37). In contrast to the results for the previous model (where only modulation effects but no \(\chi \) particle production was considered) shown in Fig. 2, in the model considered in this section we see that slow rolling of \(\phi \) starting with Planckian initial field value can be realized provided that the parameter F is large enough.

Furthermore, we checked for which values of our model parameters we can get a sufficient number of e-foldings of accelerated expansion, fixing the initial value of \(\phi \) to be \(\phi = m_{pl}\) in order to satisfy the distance criterion, but varying the parameters \(\mu \) and F, in order to check whether there exists a range satisfying both of the criteria. Note that for field values \(|\phi | < m_{pl}\) the de Sitter constraint is automatically satisfied.

In order to obtain an inflationary model consistent with current observational constraints, we will fix the inflationary slow-roll parameters to be [43] (at the beginning of the period of inflation)

$$\begin{aligned} \epsilon \,\equiv & {} \, -\frac{\dot{H}}{H^2} \, \simeq \, 0.01 \,\,\, \mathrm{{and}} \nonumber \\ A_s \,\equiv & {} \, \frac{H^2}{8\pi ^2\epsilon m_{pl}^2} \, \simeq \, 2\times 10^{-9} \, , \end{aligned}$$
(56)

the second condition coming from the amplitude of the spectrum of cosmological perturbations, and the first from its slope. From the \(A_s\) (scalar amplitude) constraint and using Eq. (49), we get

$$\begin{aligned} H \, \lesssim \, \frac{1}{\sqrt{3}m_{pl}}\sqrt{\mu ^3 m_{pl}} \, \simeq \, 4\times 10^{-5} m_{pl}, \end{aligned}$$
(57)

which sets a range for \(\mu \):

$$\begin{aligned} \mu \, \lesssim \, 0.0017 m_{pl}. \end{aligned}$$
(58)

We also require at least 60 e-foldings of spatial expansion during the whole period of inflation

$$\begin{aligned} N \, \ge \, 60 \, , \end{aligned}$$
(59)

where the end time of inflation is determined by \(\epsilon (t_{end}) = 1\). It can also be numerically checked that \(\phi (t_{end}) \simeq 0\), \(\dot{\phi }^2(t_{end}) \lesssim \mu ^3 \phi (t_{end})\), verifying that the potential energy of \(\phi \) is larger than the kinetic energy during the period of inflation. Using Eq. (55), we get the constraint

$$\begin{aligned} F \, \simeq \, \frac{27(2\pi )^3N^2}{8m_{pl}}\simeq 3\times 10^6 m_{pl}^{-1}. \end{aligned}$$
(60)

Substituting Eqs. (57), (60) and (58) into (54), we find that the consistency constraint \(\rho _{\chi }\ll \rho _{\phi }\) is satisfied for the chosen parameters.

In Fig. 5, we show the results of a scan of the parameter space

$$\begin{aligned} 0 \,< & {} \, \mu \lesssim \, 0.01 m_{pl} \nonumber \\ 10^6 m_{pl}^{-1} \,\le & {} \, F \, \le \, 9\times 10^6 m_{pl}^{-1} \, , \end{aligned}$$
(61)

and calculate the number of e-foldings of inflation (which is shown on the vertical axis in orange colour) for each grid point. Comparing the results with the required 60 e-foldings (which is shown as the horizontal surface in blue), we find that inflation consistent with both observations and the swampland constraints can be obtained if the parameter values are close to the intersection line of the two surfaces. The numerical results are compatible with what can be obtained from Eq. (60). Note that – if we add the oscillating term in the potential considered in Sect. 2 – the value of \(\Delta \) is bounded from below by the analysis of Sect. 2. The rough bound is \(\Delta > 10^{-3} m_{pl}\). Hence, the requirement of a large value of F from (60) implies that the coupling of \(\phi \) to the moduli fields must be strong, i.e. \(g^2 > 10^3\).

To conclude, the axion monodromy model of Eq. (44) which includes the effects of moduli production at enhanced symmetry points can be consistent with both the slow-roll condition and the de Sitter Conjecture given suitable parameters. This result agrees with the conclusions of [27] (see also [22]).

4 Conclusions

We have studied the evolution of the axion field \(\phi \) in an axion monodromy model without and with the consideration of moduli production at enhanced symmetry points. Without considering the production of such moduli fields, the production of \(\phi \) particles due to the nonlinearities in the \(\phi \) potential which is induced by the monodromy effects is unable to render the model consistent with the swampland criteria. Either the particle production effects are too weak to significantly affect the background dynamics, or else they lead to trapping in a local minimum and thus to a graceful exit problem. On the other hand, moduli production at enhanced symmetry points is able to slow down (for a certain range of the model parameters) the background field dynamics sufficiently such that a long enough period of inflation starting with field values consistent with the distance conjecture is possible. The model is then also consistent with the de Sitter conjecture. The model which emerges is, however, not standard slow roll inflation, but rather a trapped inflation scenario.

Fig. 5
figure 5

The e-folds in a 2D parameter scan of \(\mu \) and F in Plank units (orange), compared with the 60 e-folds’ surface (blue)

There is a connection between our study and the arguments [11] based on the swampland criteria that minimal slow-roll inflation might not be realizable in string theory (naturally, it is also possible that string theory may provide a counterexample to the swampland conjectures). In the example we have studied here, string theory considerations transform a model which at first sight looks as if it could yield a realization of the standard slow roll scenario into a model which is different and which is consistent with the swampland criteria. The resulting model, however, is still in tension with the TCC, which states [20] that no length scale which was initially smaller than the Planck length is allowed to become super-Hubble. This sets an upper bound on the the duration of inflation. Demanding that inflation can explain the origin of structure on all currently observed scales sets a lower bound on the duration of inflation. These bounds are only consistent [21] if the energy scale of inflation is lower than about \(\eta \sim 10^{9} \mathrm{{GeV}}\) (for power law inflation the bound is somewhat relaxed [44, 45]). Models with such a low energy scale require severe fine tuning in order to be consistent with observations [21]. Once again, however, it is possible that the TCC criteria can be violated in the context of a more consistent theory of the early universe (see e.g. [46]).