1 Introduction

Spontaneous breaking of global \(N=1\) supersymmetry (SUSY) gives rise to a massless Goldstone fermion, or goldstino, which at low energies can be described via the non-linear realization of SUSY first studied by Volkov and Akulov [1]. Subsequently, it was realized that non-linear \(N=1\) SUSY can also be described in superspace formalism [2,3,4,5,6,7,8] (a review of the subject of constrained superfields can be found in Ref. [9]). Other approaches for realising \(N=1\) SUSY non-linearly can be found in [10,11,12]. In particular Komargodski and Seiberg [6] showed that the low-energy non-linear action of the goldstino can be constructed with the help of nilpotent chiral superfields, and in Ref. [13] Kuzenko and Tyler found non-linear field redefinitions relating Volkov–Akulov and Komargodski–Seiberg models.

In supergravity, constrained superfields have attracted a lot of interest due to their utility in constructing de Sitter vacua even in the absence of scalar fields [14,15,16,17,18,19,20,21,22,23,24,25]. The fact that they spontaneously break SUSY and automatically decouple heavy degrees of freedom makes them useful for building inflationary models as well [26,27,28,29,30,31,32,33,34,35,36,37,38].

The minimal goldstino model derived via the constrained superfield approach includes a single chiral superfield \({{\mathbf{S}}}\) satisfying the quadratic nilpotency constraint \({\mathbf{S}}^2=0\).Footnote 1 This superfield constraint eliminates the scalar component of \({\mathbf{S}}\), which we call S, in terms of the goldstino bilinear \(\chi ^2\) and introduces higher-derivative terms in the Lagrangian. In the cases where the goldstino is accompanied by other light fields, these fields can be embedded in orthogonal constrained superfields [6,7,8, 31, 32]. For example if we have a light Goldstone or a pseudo-Goldstone boson, which we will refer to as an axion, generated by a broken U(1) symmetry, we can embed it in a chiral superfield \({\mathbf{T}}\) satisfying the orthogonality constraint \({\mathbf{S}}({\mathbf{T}}+\overline{\mathbf{T}})=0\) (in which case the U(1) symmetry transforms \({\mathbf{T}}\) by imaginary shifts), such that the real part of \({\mathbf{T}}\) is eliminated, while its imaginary part can be associated with the axion. From the orthogonality constraint it follows that \(({\mathbf{T}}+\overline{\mathbf{T}})^3=0\). It is then natural to ask if imposing such a cubic constraint on an independent chiral superfield will yield non-trivial solutions. In this work we attempt to answer this question.

The outline of our paper is as follows. We first recall nilpotent chiral superfields by looking at a simple example in Sect. 2. In Sect. 3 we consider a chiral superfield transforming under a global U(1) symmetry which we assume is spontaneously broken, and impose cubic nilpotency constraints on a U(1)-invariant function of the chiral superfield. We solve the cubic constraints in the two cases: where the U(1) acts on the chiral superfield as a shift, and as a phase rotation. The general solutions to the constraints describe non-linear dynamics of a goldstino coupled to an axion, and in Sect. 4 we study a minimal Lagrangian as an example. In Sect. 5 we review Komargodski–Seiberg (KS) approach for describing goldstino-axion models, and find non-linear field redefinitions relating our construction to the KS approach.

2 Nilpotent chiral superfield

The minimal globally supersymmetric Lagrangian for a nilpotent chiral superfield is given by (we use Wess–Bagger [39] conventions)

$$\begin{aligned} {{{\mathcal {L}}}}=\int \!d^4\theta \,K+\left( \int \!d^2\theta \,W+{\mathrm{h.c.}}\right) , \end{aligned}$$
(1)

where K is the canonical Kähler potential and W is a linear superpotential. They have the following expressions

$$\begin{aligned} K={\mathbf{S}}\overline{\mathbf{S}},\quad W=\mu {\mathbf{S}}, \end{aligned}$$
(2)

where \(\mu \) is some constant parameter which we assume is real, and \({\mathbf{S}}\) is the chiral superfield. The component expansion of \({\mathbf{S}}\) (as a function of the chiral coordinate \(y^m=x^m+i\theta \sigma ^m{\bar{\theta }}\)) is

$$\begin{aligned} {\mathbf{S}}=S+\sqrt{2}\theta \chi +\theta ^2 F, \end{aligned}$$
(3)

where S is a complex scalar, \(\chi \) is a spin-1/2 fermion and F is an auxiliary scalar. The chiral superfield \({\mathbf{S}}\) satisfies the following nilpotency conditionFootnote 2

$$\begin{aligned} {\mathbf{S}}^2=0. \end{aligned}$$
(4)

In terms of the components of \({\mathbf{S}}\), the solution to constraint (4) is \(S=\chi ^2/(2F)\). This relation eliminates S in terms of the fermion \(\chi \), which can now be identified as the goldstino because the corresponding auxiliary field F must be non-vanishing. Hence SUSY is realised non-linearly.

After expanding Lagrangian (1) in terms of the component fields, and using \(S=\chi ^2/(2F)\), we have

$$\begin{aligned} {{{\mathcal {L}}}}=\frac{\chi ^2}{2F}\Box \frac{{\bar{\chi }}^2}{2\overline{F}}-i\chi \sigma ^m\partial _m{\bar{\chi }}+\mu (F+\overline{F})+F\overline{F}. \end{aligned}$$
(5)

The auxiliary field F can be integrated out via its equation of motion

$$\begin{aligned} F=-\mu -\frac{{\bar{\chi }}^2}{4\mu ^3}\Box \chi ^2+\frac{3\chi ^2{\bar{\chi }}^2}{16\mu ^7}\Box \chi ^2\Box {\bar{\chi }}^2, \end{aligned}$$
(6)

where the leading term is the usual contribution from the superpotential, while the fermionic terms come from the solution to the nilpotency constraint. Plugging this back into the Lagrangian yields,

$$\begin{aligned} {{{\mathcal {L}}}}=-i\chi \sigma ^m\partial _m{\bar{\chi }}-\mu ^2+\frac{\chi ^2}{4\mu ^2}\Box {\bar{\chi }}^2-\frac{\chi ^2{\bar{\chi }}^2}{16\mu ^6}\Box \chi ^2\Box {\bar{\chi }}^2\,.\nonumber \\ \end{aligned}$$
(7)

This minimal Lagrangian describes non-linear dynamics of the goldstino which transforms under SUSY as,

$$\begin{aligned} \delta _\epsilon \chi _\alpha =-\mu \epsilon _\alpha -i\mu ^{-1}\sigma ^m_{\alpha {\dot{\alpha }}}{\bar{\epsilon }}^{{\dot{\alpha }}}(\chi \partial _m\chi )+{{{\mathcal {O}}}}(\chi ^2,{\bar{\chi }}^2), \end{aligned}$$
(8)

where \(\epsilon \) is the constant transformation parameter of global SUSY.

3 Real cubic constraints

A superfield \({\mathbf{S}}\) that satisfies the quadratic chiral constraint (4) also trivially satisfies the real cubic constraints,

$$\begin{aligned} ({\mathbf{S}}+\overline{\mathbf{S}})^3=0,\quad {\mathrm{and}}\quad ({\mathbf{S}}\overline{\mathbf{S}})^3=0, \end{aligned}$$
(9)

where the former is invariant w.r.t. its imaginary shifts, while the latter is invariant w.r.t. phase rotations of \({\mathbf{S}}\). Our goal is to find general solutions to this class of cubic constraints, including the solutions that do not satisfy the quadratic constraints of the form \({\mathbf{S}}^2=0\). As we are going to show, the most general solutions to the cubic constraints eliminate one real scalar, so that the other one can be used to describe a light axion of a broken global abelian symmetry.

A similar class of cubic constraints was studied by Kuzenko [40], where it was applied to the deformed real linear superfield \({\mathbf{L}}\) defined by \(\overline{D}^2 {\mathbf{L}}=-4m\), where m is a non-vanishing deformation parameter.

3.1 Shift-symmetric case

First we consider a chiral superfield \({\mathbf{S}}\), whose components are given in (3), transforming by imaginary shifts under a (spontaneously broken) global U(1), i.e. \({\mathbf{S}}\rightarrow {\mathbf{S}}+i\vartheta \) with a constant transformation parameter \(\vartheta \). If this U(1) is an R-symmetry, \(\chi \) and F must transform accordingly.

Introducing the following invariant quantity,

$$\begin{aligned} {{\varvec{\Sigma }}}\equiv {\mathbf{S}}+\overline{\mathbf{S}}-2a, \end{aligned}$$
(10)

where the (real) constant a is included as a possible VEV of S, we impose the following cubic constraint,

$$\begin{aligned} {{\varvec{\Sigma }}}^3=0. \end{aligned}$$
(11)

In terms of the regular coordinates \(x^m\), \({\varvec{\Sigma }}\) has the expansion,

$$\begin{aligned} {{\varvec{\Sigma }}}= & {} \Sigma +\sqrt{2}\theta \chi +\sqrt{2}{\bar{\theta }}{\bar{\chi }} +i\theta \sigma ^m{\bar{\theta }}\partial _m(S-\overline{S})\nonumber \\&+\frac{i}{\sqrt{2}}\theta ^2{\bar{\theta }}\overline{\sigma }^m\partial _m\chi +\frac{i}{\sqrt{2}}{\bar{\theta }}^2\theta \sigma ^m\partial _m{\bar{\chi }}+\theta ^2 F\nonumber \\&+{\bar{\theta }}^2\overline{F}+\frac{1}{4}\theta ^2{\bar{\theta }}^2\Box (S+\overline{S}), \end{aligned}$$
(12)

where \(\Sigma \equiv S+{\bar{S}}-2a\).

The leading component of the constraint (11) implies the general form of \(\Sigma \),

$$\begin{aligned} \Sigma =\chi ^2\beta +{{\bar{\chi }}}^2{{\bar{\beta }}}+\chi \sigma ^m{{\bar{\chi }}}\,\omega _m, \end{aligned}$$
(13)

where \(\beta \) and \(\omega _m\) are some functions of \(\chi \), \({\bar{\chi }}\), \({\mathrm{Im}}(S)\), and their spacetime derivatives. Let us parametrize S as

$$\begin{aligned} S=\phi +i\varphi , \end{aligned}$$
(14)

so that the real part is related to \(\Sigma \) as \(\phi =a+\Sigma /2\), and the imaginary part \(\varphi \) is the would-be axion of the broken U(1) shift symmetry.

The \(\theta ^2{\bar{\theta }}^2\)-component of constraint (11) reads,

$$\begin{aligned} \left( A+\frac{1}{4}\Sigma \Box \Sigma \right) \Sigma =B, \end{aligned}$$
(15)

where

$$\begin{aligned} A\equiv & {} 2|F|^2-2\partial _m\varphi \partial ^m\varphi -i\chi \sigma ^m\partial _m{\bar{\chi }}+i\partial _m\chi \sigma ^m{\bar{\chi }},\end{aligned}$$
(16)
$$\begin{aligned} B\equiv & {} \chi ^2\overline{F}+{{\bar{\chi }}}^2 F+2\chi \sigma ^m{{\bar{\chi }}}\partial _m\varphi . \end{aligned}$$
(17)

Note that \(B^2={{{\mathcal {O}}}}(\chi ^2{\bar{\chi }}^2)\) and \(B^3=0\). This equation is similar to what was obtained in [40] for the deformed real linear superfield \({\mathbf{L}}\), because the superfield \({{\varvec{\Sigma }}}\) has the same component structure as \({\mathbf{L}}\), except that the auxiliary field F of the chiral superfield \({\mathbf{S}}\) is replaced by the deformation parameter m of the linear superfield, and \(\partial _m\varphi \) is replaced by the Hodge-dual of the field strength of a two-form field in \({\mathbf{L}}\). The two-form field can be dualized to a real scalar (axion) in four dimensions, so the field content is the same as that of a chiral superfield.

Equation (15) can be solved iteratively (because \(\chi ^3=0\)), and the general solution is,

$$\begin{aligned} \Sigma =\frac{B}{A}-\frac{B^2}{4A^4}\Box B. \end{aligned}$$
(18)

After expanding the solution in terms of \(\chi \) and \({\bar{\chi }}\) we arrive at,

$$\begin{aligned} \Sigma =\chi ^2\beta +{\bar{\chi }}^2{{\bar{\beta }}}+\frac{2}{U}\chi \sigma ^m{\bar{\chi }}\partial _m\varphi , \end{aligned}$$
(19)

where \(U\equiv 2(|F|^2-\partial _m\varphi \partial ^m\varphi )\) and Footnote 3

$$\begin{aligned} \begin{aligned} \beta&\equiv \frac{\overline{F}}{U}+\frac{i{\bar{\chi }}}{U^2}\left( \overline{F}\overline{\sigma }^m\partial _m\chi -\partial _m\varphi \partial ^m{\bar{\chi }}+2\partial _m\varphi \overline{\sigma }^{mn}\partial _n{\bar{\chi }}\right) \\&\quad -\frac{{\bar{\chi }}^2}{2U^3}\Big [ F\partial _m{\bar{\chi }}\overline{\sigma }^{mn}\partial _n{\bar{\chi }}+\overline{F}\partial _m\chi \sigma ^{mn}\partial _n\chi \\&\quad +\partial _m\varphi \partial _n\chi (2\sigma ^{m}\eta ^{nk}-\sigma ^{n}\eta ^{km}-\sigma ^{k}\eta ^{mn}-i\epsilon ^{mnkl}\sigma _l)\partial _k{\bar{\chi }}\Big ]. \end{aligned} \end{aligned}$$
(20)

Or, in terms of \(\phi \) we have,

$$\begin{aligned} \phi =a+\frac{\Sigma }{2}=a+\frac{\chi ^2}{2}\beta +\frac{{{\bar{\chi }}}^2}{2}{{\bar{\beta }}}+\frac{1}{U}\chi \sigma ^m{\bar{\chi }}\,\partial _m\varphi . \end{aligned}$$
(21)

Note that we do not have any restrictions on a and it can be consistently set to zero.

Of course, the solution is valid only if \(U= 2(|F|^2-\partial _m\varphi \partial ^m\varphi )\ne 0\). Here we assume that \(U>0\) for physically interesting models, in which case the fluctuations of \(\varphi \) are bounded from above by the SUSY breaking scale, which is not at all surprising since the effective theory is supposed to operate on scales well below it. We verified that the solution (19) is compatible with the rest of the components of the superfield equation (11).

Nilpotent chiral superfield as a special case Solution (19) can be checked for consistency by additionally imposing the quadratic constraint \(({\mathbf{S}}-a)^2=0\), in which case it reduces to \(S-a=\chi ^2/(2F)\) so that

$$\begin{aligned} \Sigma \equiv S+{\bar{S}}-2a=\frac{\chi ^2}{2F}+\frac{{\bar{\chi }}^2}{2\overline{F}}. \end{aligned}$$
(22)

This eliminates \(\varphi \) as

$$\begin{aligned} \varphi =-\,\frac{i}{2}(S-{\bar{S}})=-\frac{i}{4}\left( \frac{\chi ^2}{F}-\frac{{\bar{\chi }}^2}{\overline{F}}\right) . \end{aligned}$$
(23)

Inserting solution (23) into the general solution for \(\Sigma \) in (19) reproduces exactly Eq. (22), as it should.

3.2 Phase-symmetric case

Suppose a theory is symmetric (or approximately symmetric) w.r.t. U(1) phase rotations of a chiral superfield \({\mathbf{Z}}\rightarrow {\mathbf{Z}}e^{i\vartheta }\). We denote the components of \({\mathbf{Z}}\) as,

$$\begin{aligned} {\mathbf{Z}}=Z+\sqrt{2}\theta \,\psi +\theta ^2F^z, \end{aligned}$$
(24)

where the fermion \(\psi \) will be identified with the goldstino. Depending on whether the U(1) is an R-symmetry or not, \(\psi \) and \(F^z\) may transform differently from Z.

We can now define a U(1) invariant combination,

$$\begin{aligned} {{\varvec{\Lambda }}}\equiv {\mathbf{Z}}\overline{\mathbf{Z}}-b^2, \end{aligned}$$
(25)

where b is some real constant, and impose the cubic constraint,

$$\begin{aligned} {{\varvec{\Lambda }}}^3=0. \end{aligned}$$
(26)

It is convenient to parametrize the scalar Z as,

$$\begin{aligned} Z=|Z|e^{i\zeta }, \end{aligned}$$
(27)

where \(\zeta \) is the angular scalar (axion), and the VEV of the radial mode is \(\langle |Z|\rangle =b\). Again, we assume the U(1) is spontaneously broken.

The \(\theta ^2{\bar{\theta }}^2\)-component of the constraint (26) reads,

$$\begin{aligned} \left( G+\frac{1}{4}\Box \Lambda \right) \Lambda ^2+b^2H\Lambda =b^4I, \end{aligned}$$
(28)

where \(\Lambda \equiv {{\varvec{\Lambda }}}|=Z{\bar{Z}}-b^2\), and

$$\begin{aligned} G&\equiv 3|F^z|^2-5b^2\partial _m\zeta \partial ^m\zeta , \end{aligned}$$
(29)
$$\begin{aligned} H&\equiv 2|F^z|^2-2b^2\partial _m\zeta \partial ^m\zeta -i\psi \sigma ^m\partial _m{\bar{\psi }}+i\partial _m\psi \sigma ^m{\bar{\psi }}\nonumber \\&\quad -\tfrac{7}{2b}\psi ^2\overline{F}^ze^{-i\zeta }-\tfrac{7}{2b}{\bar{\psi }}^2F^ze^{i\zeta }-8\psi \sigma ^m{\bar{\psi }}\partial _m\zeta ,\nonumber \\ \end{aligned}$$
(30)
$$\begin{aligned} I&\equiv \tfrac{1}{b}\psi ^2\overline{F}^z e^{-i\zeta }+\tfrac{1}{b}{\bar{\psi }}^2F^z e^{i\zeta }+2\psi \sigma ^m{\bar{\psi }}\partial _m\zeta -\tfrac{2}{b^2}\psi ^2{\bar{\psi }}^2. \end{aligned}$$
(31)

Note that \(I^2={{{\mathcal {O}}}}(\psi ^2{\bar{\psi }}^2)\) and \(I^3=0\).

In the limit \(b=0\) the only meaningful solution is \(\Lambda ^2=Z^2\overline{Z}^2=0\) which means \(Z^2=0\), and eliminates both real scalar degrees of freedom in terms of the goldstino.

For a general solution to Eq. (28) we assume \(b\ne 0\). In this case (28) is solved by,

$$\begin{aligned} \Lambda =\frac{b^2I}{H}-\frac{b^2I^2}{H^3}\left( G+\frac{b^2}{4H}\Box I\right) , \end{aligned}$$
(32)

or after expanding in terms of \(\psi \) and \({\bar{\psi }}\) we get,

$$\begin{aligned} \Lambda =b\left( \psi ^2\gamma +{\bar{\psi }}^2{\bar{\gamma }}+\frac{2b}{Y}\psi \sigma ^m{\bar{\psi }}\partial _m\zeta +\frac{b}{Y^2}\psi ^2{\bar{\psi }}^2\partial _m\zeta \partial ^m\zeta \right) ,\nonumber \\ \end{aligned}$$
(33)

where \(Y\equiv 2(|F^z|^2-b^2\partial _m\zeta \partial ^m\zeta )\), and

$$\begin{aligned} \begin{aligned} \gamma&\equiv \frac{\overline{F}^z}{Y}e^{-i\zeta }+\frac{i{\bar{\psi }}}{Y^2}\left( \overline{F}^z e^{-i\zeta }\overline{\sigma }^m\partial _m\psi -b\partial _m\zeta \partial ^m{\bar{\psi }}\right. \\&\quad \left. +2b\partial _m\zeta \overline{\sigma }^{mn}\partial _n{\bar{\psi }}\right) \\&\quad -\frac{{\bar{\psi }}^2}{2Y^3}\Big [ F^z e^{i\zeta }\partial _m{\bar{\psi }}\overline{\sigma }^{mn}\partial _n{\bar{\psi }}+\overline{F}^ze^{-i\zeta }\partial _m\psi \sigma ^{mn}\partial _n\psi \\&\quad +b\partial _m\zeta \partial _n\psi (2\sigma ^{m}\eta ^{nk}-\sigma ^{n}\eta ^{km}-\sigma ^{k}\eta ^{mn}-i\epsilon ^{mnkl}\sigma _l)\partial _k{\bar{\psi }}\Big ]. \end{aligned} \end{aligned}$$
(34)

We have defined \(\gamma \) in a way such that it resembles \(\beta \) from Eq. (20) in its structure. The solution is well-defined as long as \(Y= 2(|F^z|^2-b^2\partial _m\zeta \partial ^m\zeta )\ne 0\).

Since \(\Lambda =|Z|^2-b^2\), the radial mode of Z can be written with the help of \(\Lambda ^3=0\) as,

$$\begin{aligned} |Z|=\sqrt{b^2+\Lambda }=b+\frac{\Lambda }{2b}-\frac{\Lambda ^2}{8b^3}, \end{aligned}$$
(35)

so that using (33) we obtain,

$$\begin{aligned} |Z|= & {} b+\frac{\psi ^2}{2}\gamma +\frac{{{\bar{\psi }}}^2}{2}{\bar{\gamma }}+\frac{b}{Y}\psi \sigma ^m{\bar{\psi }}\partial _m\zeta \nonumber \\&-\frac{\psi ^2{\bar{\psi }}^2}{4bY^2}\left( |F^z|^2-3b^2\partial _m\zeta \partial ^m\zeta \right) . \end{aligned}$$
(36)

There is an alternative way to derive solution (36): we could start from solution (21) for the shift-symmetric case, and identify \({\mathbf{S}}=\log {\mathbf{Z}}\) assuming \(\langle {\mathbf{Z}}\rangle \ne 0\). Using the same expansion for \({\mathbf{S}}\) and \({\mathbf{Z}}\) as before,

$$\begin{aligned} {\mathbf{S}}=S+\sqrt{2}\theta \chi +\theta ^2F,\quad {\mathbf{Z}}=Z+\sqrt{2}\theta \psi +\theta ^2F^z, \end{aligned}$$
(37)

the components of \({\mathbf{S}}=\log {\mathbf{Z}}\) read,

$$\begin{aligned} S=\log Z,\quad \chi =\frac{\psi }{Z},\quad F=\frac{F^z}{Z}+\frac{\psi ^2}{2Z^2}. \end{aligned}$$
(38)

Thus, the real (\(\phi \)) and imaginary (\(\varphi \)) parts of S can be written as \(\phi =\log |Z|\) and \(\varphi =\zeta \) (also notice that the VEV of the first relation above reads \(a=\log b\), where \(b\ne 0\) while a can be zero). Plugging (38) into solution (21) and expanding in powers of \(\psi \) and \({\bar{\psi }}\), we obtain exactly the solution (36) for |Z|. In other words, the solutions to the constraints \(({\mathbf{Z}}\overline{\mathbf{Z}}-b^2)^3=0\) and \([\log ({\mathbf{Z}}\overline{\mathbf{Z}}/b^2)]^3=0\) coincide.

Nilpotent chiral superfield as a special case Let us once again verify that applying the appropriate quadratic constraint consistently eliminates the axion. The relevant quadratic constraint in this case is \(({\mathbf{Z}}-b)^2=0\), and its solution \(Z-b=\psi ^2/(2F^z)\) can be written in terms of the radial and angular parts as,

$$\begin{aligned} |Z|= & {} b+\frac{\psi ^2}{4F^z}+\frac{{\bar{\psi }}^2}{4\overline{F}^z}+\frac{\psi ^2{\bar{\psi }}^2}{16b|F^z|^2}, \end{aligned}$$
(39)
$$\begin{aligned} \zeta= & {} -\frac{i}{2}\log \frac{Z}{\overline{Z}}=\frac{i}{4b}\left( \frac{{\bar{\psi }}^2}{\overline{F}^z}-\frac{\psi ^2}{F^z}\right) . \end{aligned}$$
(40)

Plugging \(\zeta \) from (40) into Eq. (36), we indeed get Eq. (39).

SUSY transformation laws for the independent fields after solving the cubic constraints can be found in Appendix A for both shift-symmetric and phase-symmetric cases.

4 Goldstino-axion interactions

To demonstrate the Lagrangian and interactions between the goldstino and the axion, let us choose the simplest setup with a single chiral superfield and non-vanishing F-term. We fix the Kähler potential and superpotential as in Sect. 2,

$$\begin{aligned} K={\mathbf{S}}\overline{\mathbf{S}},\quad W=\mu {\mathbf{S}}, \end{aligned}$$
(41)

which leads to the component action,

$$\begin{aligned} {{{\mathcal {L}}}}&=-\partial _m S\partial ^m\overline{S}-i\chi \sigma ^m\partial _m{{\bar{\chi }}}+\mu (F+\overline{F})+|F|^2\nonumber \\&=\phi \Box \phi +\varphi \Box \varphi -i\chi \sigma ^m\partial _m{{\bar{\chi }}}+\mu (F+\overline{F})+|F|^2,\nonumber \\ \end{aligned}$$
(42)

where we again parametrized \(S=\phi +i\varphi \).

The Lagrangian is invariant w.r.t. both constant shifts and constant phase transformations of S. We choose the shift-symmetry in \(\varphi \) direction as the relevant U(1) symmetry, so that \(\varphi \) is the axion which is exactly massless in this case (we assume that a microscopic theory spontaneously breaks the U(1) and generates a large mass for \(\phi \), which can always be arranged by e.g. considering higher order corrections to the minimal Kähler potential [6, 8]). Then, we apply the nilpotency constraint given by Eq. (11) and its solution in terms of \(\phi \) given by Eq. (21). Substituting this solution into the Lagrangian (42) and integrating out F and \(\overline{F}\) we get,

$$\begin{aligned} {{{\mathcal {L}}}}&=\,\,\varphi \Box \varphi -i\chi \sigma ^m\partial _m{{\bar{\chi }}}-\mu ^2\nonumber \\&\quad +\frac{\mu \chi ^2+\mu {{\bar{\chi }}}^2 -2\chi \sigma ^m{{\bar{\chi }}}\partial _m\varphi }{8(\mu ^2-\partial _n\varphi \partial ^n\varphi )^2}\left[ \mu \partial _k\chi \partial ^k\chi +\mu \partial _k{{\bar{\chi }}}\partial ^k{{\bar{\chi }}}\right. \nonumber \\&\quad \left. -2\partial _k\varphi \partial _l\chi \sigma ^k\partial ^l{{\bar{\chi }}}\right] +\cdots , \end{aligned}$$
(43)

where the ellipsis stands for the terms trilinear and quadrilinear in \(\chi \) and \({{\bar{\chi }}}\). The solution to the F-field equation of motion is given in Appendix B, and the full Lagrangian after eliminating F can be seen in Eq. (B.12).

The parameter \(\mu \) becomes an expectation value of F (\(\langle F\rangle =-\mu \)) and acts as the order parameter of SUSY breaking. Notice that the interaction term in (43) includes the factor of \((\mu ^2-\partial \varphi \partial \varphi )^{-2}\) which can be expanded to any given order in the derivatives \(\partial \varphi \partial \varphi \). Similarly, trilinear and quadrilinear terms contain negative powers of \((\mu ^2-\partial \varphi \partial \varphi )\), as can be seen from Eq. (B.12).

5 Relation to Komargodski–Seiberg model

Komargodski and Seiberg [6] showed that when a goldstino is accompanied by an axion at low energies, in addition to the goldstino chiral superfield satisfying \({\mathbf{X}}^2=0\), one can introduce a chiral superfield \({\mathbf{T}}\) containing the axion and satisfying the orthogonality constraints,

$$\begin{aligned} {\mathbf{X}}({\mathbf{T}}+\overline{\mathbf{T}})&=0, \end{aligned}$$
(44)
$$\begin{aligned} {\mathbf{X}}\overline{D}_{{\dot{\alpha }}}\overline{\mathbf{T}}&=0, \end{aligned}$$
(45)

where the latter constraint can derived from the former by acting on it with \(\overline{D}_{{\dot{\alpha }}}\). The role of these constraints is to eliminate all the components of \({\mathbf{T}}\) except its imaginary part which is identified with the axion. Let us denote the components of \({\mathbf{X}}\) and \({\mathbf{T}}\) as follows,

$$\begin{aligned} {\mathbf{X}}=X+\sqrt{2}\theta \lambda +\theta ^2F^x,\quad {\mathbf{T}}=T+\sqrt{2}\theta \kappa +\theta ^2F^t,\nonumber \\ \end{aligned}$$
(46)

where \(T=t+i\tau \), with \(\tau \) being the axion.

The constraint \({\mathbf{X}}^2=0\) sets \(X=\lambda ^2/(2F^x)\), where \(\lambda \) becomes the goldstino. Then the \(\theta ^2\)-component of constraint (45) eliminates \({{\bar{\kappa }}}\) by the relation,

$$\begin{aligned} {{\bar{\kappa }}}_{{\dot{\alpha }}}=-i\frac{\lambda ^\alpha }{F^x}\sigma ^m_{\alpha {\dot{\alpha }}}\partial _m(t-i\tau ), \end{aligned}$$
(47)

while the \(\theta ^2{{\bar{\theta }}}\)-component of the same constraint yields,

$$\begin{aligned} \overline{F}^t=\left[ -\frac{\lambda }{F^x}\sigma ^m\overline{\sigma }^n\partial _m\left( \frac{\lambda }{F^x}\right) \partial _n+\frac{\lambda ^2}{2(F^x)^2}\Box \right] (t-i\tau ).\nonumber \\ \end{aligned}$$
(48)

Finally, constraint (44) eliminates the saxion t as,

$$\begin{aligned} t&=-\,\frac{\lambda }{2F^x}\sigma ^m\frac{{{\bar{\lambda }}}}{\overline{F}^x}\partial _m\tau -\frac{i{{\bar{\lambda }}}^2}{8(\overline{F}^x)^2}\frac{\lambda }{F^x}\sigma ^n\overline{\sigma }^m\partial _n\left( \frac{\lambda }{F^x}\right) \partial _m\tau \nonumber \\&\quad + \frac{i\lambda ^2}{8( F^x)^2}\partial _n\left( \frac{{{\bar{\lambda }}}}{\overline{F}^x}\right) \overline{\sigma }^m\sigma ^n\frac{{{\bar{\lambda }}}}{\overline{F}^x}\partial _m\tau \nonumber \\&\quad -\frac{\lambda ^2{{\bar{\lambda }}}^2}{16|F^x|^4}\partial _m\left( \frac{{{\bar{\lambda }}}}{\overline{F}^x}\right) \left( \overline{\sigma }^k\eta ^{mn}+i\epsilon ^{kmnl}\overline{\sigma }_l\right) \partial _n\left( \frac{\lambda }{F^x}\right) \partial _k\tau .\nonumber \\ \end{aligned}$$
(49)

It was shown in [6] that if the two constrained chiral superfields \({\mathbf{X}}\) and \({\mathbf{T}}\) transform under a global U(1) as,

$$\begin{aligned} {\mathbf{X}}\rightarrow {\mathbf{X}}e^{i\vartheta },\quad {\mathbf{T}}\rightarrow {\mathbf{T}}+i\vartheta , \end{aligned}$$
(50)

they can form a U(1)-invariant combination satisfying the following (secondary) cubic constraint,

$$\begin{aligned} {\mathbf{P}}\equiv \alpha {\mathbf{X}}+e^{\mathbf{T}},\quad (|{\mathbf{P}}|^2-1)^3=0, \end{aligned}$$
(51)

where \(\alpha \) is some complex parameter.

Alternatively one can take the logarithm of \({\mathbf{P}}\),

$$\begin{aligned} {\mathbf{Q}}\equiv \log \left( \alpha {\mathbf{X}}+e^{\mathbf{T}}\right) , \end{aligned}$$
(52)

which satisfies the shift-symmetric constraint,Footnote 4

$$\begin{aligned} \left( {\mathbf{Q}}+\overline{\mathbf{Q}}\right) ^3=0, \end{aligned}$$
(53)

where \({\mathbf{Q}}\) transforms by imaginary shifts under the U(1) transformation. Constraint (53) is exactly of the form given by Eq. (11) which we solved in Sect. 3.1 (here we ignore the constant a). Therefore we can identify \({\mathbf{Q}}={\mathbf{S}}\) and express the independent components of \({\mathbf{S}}\) (which are \(\varphi \), \(\chi \), and F) in terms of the independent components of \({\mathbf{X}}\) and \({\mathbf{T}}\) (which are \(\tau \), \(\lambda \), and \(F^x\)),

$$\begin{aligned} \varphi&=\tau -\frac{i}{4}\left( \frac{\alpha \lambda ^2}{F^x}e^{-i\tau }-\frac{{{\bar{\alpha }}}{{\bar{\lambda }}}^2}{\overline{F}^x}e^{i\tau }\right) , \end{aligned}$$
(54)
$$\begin{aligned} \chi _\alpha&=\alpha \lambda _\alpha e^{-t-i\tau }+i\sigma ^m_{\alpha {\dot{\alpha }}}\frac{{{\bar{\lambda }}}^{{\dot{\alpha }}}}{\overline{F}^x}\partial _m(t+i\tau )\left( 1-\frac{\alpha \lambda ^2}{2F^x}e^{-i\tau }\right) , \end{aligned}$$
(55)
$$\begin{aligned} F&=\alpha e^{-t-i\tau }F^x\Bigg \{1+i\frac{\lambda }{F^x}\sigma ^m\frac{{{\bar{\lambda }}}}{\overline{F}^x}\partial _m(t+i\tau )\nonumber \\&\quad -\frac{\lambda ^2{{\bar{\lambda }}}^2}{4|F^x|^4}\partial ^m\tau \partial _m\tau \Bigg \}\nonumber \\&\quad -\left( 1-\frac{\alpha \lambda ^2}{2F^x}e^{-i\tau }\right) \left[ \partial _m\left( \frac{{{\bar{\lambda }}}}{\overline{F}^x}\right) \overline{\sigma }^n\sigma ^m\frac{{{\bar{\lambda }}}}{\overline{F}^x}\partial _n \right. \nonumber \\&\quad \left. -\frac{{{\bar{\lambda }}}^2}{2(\overline{F}^x)^2}\Box \right] (t+i\tau ), \end{aligned}$$
(56)

where t is given by Eq. (49), and we used the fact that \(e^{-t}=1+{{{\mathcal {O}}}}(\lambda \sigma ^m{{\bar{\lambda }}})\). Equations (54)–(56) make up the nonlinear field redefinitions relating our model of Sect. 3.1 to the Komargodski–Seiberg model [6] of goldstino-axion dynamics. Similarly, the superfield \({\mathbf{Z}}\) of the phase-symmetric case (Sect. 3.2) can be identified with the composite superfield \({\mathbf{P}}\) of Eq. (51) to obtain analogous redefinitions in terms of the components of \({\mathbf{Z}}\).

The superfield \({\mathbf{T}}\) constrained by (45) and (44) also satisfies the secondary constraint \(({\mathbf{T}}+\overline{\mathbf{T}})^3=0\). But it should be noted that the composite scalar \(t={\mathrm{Re}}T\) given by (49) does not satisfy our solution (18) to such cubic constraint because it was obtained under the assumption of non-vanishing F-term, while the auxiliary field \(F^t\) vanishes at the vacuum because it starts with the goldstino and spacetime derivatives (see Eq. (48)).

6 Conclusion

In this work we found non-trivial solutions to real cubic constraints on chiral superfields transforming under global U(1) symmetries. For a chiral superfield \({\mathbf{S}}\) transforming by an imaginary shift we considered the invariant constraint \(({\mathbf{S}}+\overline{\mathbf{S}}-2a)^3=0\), which is solved by Eq. (21) – this eliminates the real part of the scalar \(S\equiv {\mathbf{S}}|\) in terms of the goldstino and the axion which is the imaginary part of S. For a chiral superfield \({\mathbf{Z}}\) transforming under the U(1) by a phase we impose the constraint \((|{\mathbf{Z}}|^2-b^2)^3=0\), which is solved by Eq. (36) assuming \(b\ne 0\). In this case the radial mode of \(Z=|Z|e^{i\zeta }\) is eliminated, while the axion is identified with its angular part \(\zeta \). We showed that the cubic constraints are generalizations of the corresponding quadratic constraints \(({\mathbf{S}}-a)^2=0\) and \(({\mathbf{Z}}-b)^2=0\). In particular, each of the cubic constraints admits two solutions – one that eliminates saxion only (which is our main result), and one that eliminates both axion and saxion and satisfies the corresponding quadratic constraint as well. Along the way we proved that the constraints \((|{\mathbf{Z}}|^2-b^2)^3=0\) and \([\log (|{\mathbf{Z}}|^2/b^2)]^3=0\) lead to the same solution.

We constructed a minimal Lagrangian for the shift-symmetric cubic constraint, where the goldstino-axion interactions can be seen. The full solution to the F-term equation of motion as well as the full on-shell Lagrangian can be found in Appendix B.

Finally, we compared our results to the Komargodski–Seiberg [6] model of effective goldstino-axion dynamics, which uses the well-known quadratic constraint for the goldstino chiral superfield, \({\mathbf{X}}^2=0\), together with the axion chiral superfield \({\mathbf{T}}\) satisfying the orthogonality constraint \({\mathbf{X}}({\mathbf{T}}+\overline{\mathbf{T}})=0\), so that only the imaginary scalar component of \({\mathbf{T}}\) survives. We showed that the independent fields in our approach (axion, goldstino, and F, all in the same chiral superfield) can be related to the independent fields of the KS model (goldstino and F-term in \({\mathbf{X}}\), and axion in \({\mathbf{T}}\)) by non-linear field redefinitions. There are two notable differences between the two approaches. First, our approach uses a single independent chiral superfield satisfying a single constraint (not counting the chirality constraint), whereas in the KS approach we have two chiral superfields and two constraints. And second, solving the cubic constraints leads to the terms in the Lagrangian with negative powers of (\(|F|^2-\partial \varphi \partial \varphi \)), which explicitly shows the upper bound on the kinetic energy of the axion (in the KS model this upper bound is implicit) and suppresses higher-derivative terms in the Lagrangian, acting as a cut-off scale and effectively SUSY breaking scale. To be specific, the upper bound on the axion kinetic energy applies to its spatial derivatives and not temporal (we use “mostly plus” metric signature).

It should also be noted that although the two approaches can be related by non-linear field transformations, they do not necessarily represent the same physics since the actions have different structure. For example the aforementioned suppression scale (\(|F|^2-\partial \varphi \partial \varphi \)) is directly affected by the derivatives of \(\varphi \), in contrast to the KS model.

In future works it would be natural to generalize our results to local abelian symmetries, as well as local supersymmetry. On the other hand our cubic constraints can be applied in axion phenomenology, both in cosmology and particle physics, similarly to orthogonal nilpotent superfields but with possibly different implications.Footnote 5