1 Introduction

The fact that Supersymmetry (SUSY) remains still unobservable in the nature does not invalidate its importance in constructing theories beyond the Standard Model (SM). Most notably, the problem of SUSY breaking is currently under intense scrutiny – see e.g. Refs. [1,2,3,4,5,6,7,8,9,10,11,12,13,14,15,16,17,18,19,20,21,22] – and new settings are suggested within Supergravity (SUGRA) [23] aiming to obtain natural Minkowski and/or de Sitter vacua – the latter possibility is not so favored, though, because of the controversy [24, 25] surrounding this kind of (meta) stable vacua. A crucial question in this area of research concerns [2, 3] the specific forms of superpotential and the type of geometry which may be adopted.

Trying to reply to these questions in a recent paper [1], we employed as guideline an approximate R symmetry [23, 27] which totally fixes the form of the superpotential, \(W_{\mathrm{H}}\), as a function of the goldstino superfield, Z, – contrary to the well-known and widely adopted Polonyi model [26] where the same R symmetry is badly violated by a constant term in \(W_{\mathrm{H}}\). We show that if we associate \(W_{\mathrm{H}}\) with a Kähler potential, \(K_{\mathrm{H}}\), which parameterizes the SU(1, 1)/U(1) Kähler manifold with constant curvature \(-1/2\) we can obtain a Minkowski vacuum (i.e., SUSY breaking with vanishing classical vacuum energy) without unnatural fine tuning, and non-vanishing soft SUSY-breaking (SSB) parameters [34,35,36], of the order of the gravitino mass, \(m_{3/2}\), which is readily determined at the tree level. A solution to the \(\mu \) problem of Minimal Supersymmetric SM (MSSM) may be also achieved by suitably applying the Giudice–Masiero mechanism [37] – for an updated review see Refs. [38, 39]. The possibly problematic, from astrophysical point of view [40,41,42,43,44], (pseudo) Nambu–Goldstone boson named R axion [33] may acquire acceptably large mass by mildly violating the R symmetry in \(K_{\mathrm{H}}\) without any impact either to the minimization of the SUGRA potential or to the values of the SSB parameters.

In the present paper, motivated by several related models [2, 3, 20, 45,46,47], we extend our approach above, considering different R charges for Z and allowing, thereby, for other (not only linear) monomial forms of \(W_{\mathrm{H}}(Z)\) – see Sect. 2. We show that natural Minkowski solutions, similar to those found in Ref. [1], can be accommodated by conveniently selecting the geometry of the internal space. Contrary to Ref. [1], we here consider compact [28,29,30,31] and non-compact [2, 3, 32] Kähler manifolds, whose the curvature is determined as a simple function of the R charge of Z. In both cases, we specify new SUSY-breaking minima and show that the same quartic term introduced in Refs. [1,2,3] provides mass for the R axion without spoiling, though, the minimization of the SUGRA potential. Following the analysis of Ref. [1], we combine our hidden sectors with some sample observable ones and broaden our results as regards the SSB parameters and the derivation of the \(\mu \) term of MSSM – see Sect. 3. Our conclusions are further discussed in Sect. 4. An alternative formulation of our set-up is presented in Appendix A.

Unless otherwise stated, we use units where the reduced Planck mass \(m_{\mathrm{P}}=2.433\cdot 10^{18}~{\text{ GeV }}\) is taken to be unity and charge conjugation is denoted by a bar.

2 Hidden sector

Here we first – in Sect. 2.1 – specify the hidden sector of our model and then – in Sect. 2.2 – investigate the SUSY-breaking mechanism employing the curvature of the Kähler manifold as free parameter. Lastly, in Sect. 2.3, we introduce the R-symmetry violating term and compute the mass of the R axion.

2.1 Model set-up

Following our strategy in Ref. [1], we consider a hidden sector consisting of just one gauge-singlet superfield Z. The corresponding superpotential \(W_{\mathrm{H}}\) is fixed by imposing an R symmetry under which Z has R charge \(2/\nu \) – and not just 2 as in Ref. [1] – whereas that of \(W_{\mathrm{H}}\) is taken to be 2, i.e.,

$$\begin{aligned} W_{\mathrm{H}} = m Z^\nu , \end{aligned}$$
(1)

where m is a positive, free parameter with mass dimensions. Also \(\nu \ne 0\) is an exponent which may, in principle, acquire any real value. This is possible if we consider \(W_{\mathrm{H}}\) as an effective superpotential valid close to the non-zero vacuum of the theory. E.g., we can obtain \(\nu <0\) from dynamical breaking of a SU(N) continuous symmetry [48] assuming in addition [46] that this is not a gauge symmetry to avoid D-term contributions in the SUGRA potential. Moreover, \(\nu >0\) fractional numbers are already considered in Refs. [2, 3, 45] and, as discussed in Appendix A, support a complete form of \(W_{\mathrm{H}}\) invoking an appropriate field redefinition [47].

Contrary to the Polonyi model [26] we do not consider any R-symmetry violating constant term in \(W_{\mathrm{H}}\). Such a violation is accommodated in the Kähler potential which is

$$\begin{aligned} K_{\mathrm{H}}=N\ln \left( 1+\frac{|Z|^2-kZ_{-}^4}{N}\right) \end{aligned}$$
(2)

with \(Z_\pm =Z\pm {\bar{Z}}\). Here N and k are free parameters of either sign. Motivated by several superstring and D-brane models [50, 51] we consider the integer negative values of N as the most natural. However, positive N’s can not be disregarded if we limit our attention on SUGRA [28,29,30,31]. On the other hand, k is a parameter which mildly violates R symmetry. We call this violation “mild” since (a) k is involved in a higher-order (non quadratic) term in \(K_{\mathrm{H}}\) and (b) tiny k values endow R axion with phenomenologically acceptable masses – see Sect. 2.3. The presence of k in \(K_{\mathrm{H}}\) is totally natural, according to the argument [49] of ’t Hooft, since nullifying it the R symmetry becomes exact. In view of this fact, the positivity of the argument of logarithm in Eq. (2) implies

$$\begin{aligned} 1+|Z|^2/N \gtrsim 0~~\Rightarrow ~~-|Z|^2/N \lesssim 1\end{aligned}$$
(3)

which is true for any \(N>0\) but dictates

$$\begin{aligned} |Z|^2\lesssim -N~~\text{ for }~~N<0\,.\end{aligned}$$
(4)

The curved space parameterized by \(K_{\mathrm{H}}\) has metric

$$\begin{aligned} \mathrm{g}:=\partial _Z\partial _{\bar{Z}}K_{\mathrm{H}}=Nw\left( N+|Z|^2-kZ_{-}^4\right) ^{-2}\end{aligned}$$
(5a)

where we find it convenient to define

$$\begin{aligned} w=N(1+12kZ_{-}^2)+kZ_{-}^2(3Z_{+}^2+4kZ_{-}^4).\end{aligned}$$
(5b)

Substituting Eqs. (1) and (2) in the well-known formula

$$\begin{aligned} V:= e^{G}\left( G^{I{\bar{J}}} G_I G_{\bar{J}}-3\right) ~~\text{ with }~~G:= K_{\mathrm{H}}+ \ln |W_{\mathrm{H}}|^2,\nonumber \\ \end{aligned}$$
(6)

\(I,J=Z\), \(G^{Z{\bar{Z}}}=\mathrm{g}^{-1}\), \(G_I:= \partial _I G\) and \(G_{{\bar{J}}}:= \partial _{{\bar{J}}} G\), we find the hidden-sector (F-term) scalar potential

$$\begin{aligned} V_{\mathrm{H}}=\frac{m^2}{N}e^{K_{\mathrm{H}}}|Z|^{2(\nu -1)}\left( \frac{uv}{w}-3N|Z|^2\right) .\end{aligned}$$
(7)

Here w originates from the numerator in Eq. (5a) and we introduce the quantities

$$\begin{aligned} u&=N\nu +(N+\nu )|Z|^2-kZ_{-}^{3}\left( \nu Z_{-}+4NZ\right) ,\end{aligned}$$
(8a)
$$\begin{aligned} v&=N\nu +(N+\nu )|Z|^2-kZ_{-}^{3}\left( \nu Z_{-}-4N{\bar{Z}}\right) , \end{aligned}$$
(8b)

which arise from the numerators of \(G_Z\) and \(G_{{\bar{Z}}}\) in Eq. (6). Written in this form, \(V_{\mathrm{H}}\) may be directly compared to the one employed in Ref. [1].

2.2 The R-symmetric limit

If we set \(k=0\) in Eq. (2) we obtain the exactly R-symmetric form of \(K_{\mathrm{H}}\), \(K_{\mathrm{H0}}\), which parameterizes the coset spaces SU(1, 1)/U(1) for \(N<0\) or SU(2)/U(1) for \(N>0\) [28], with metric and constant scalar curvature respectively

$$\begin{aligned} \mathrm{g}^{(0)}=\partial _Z\partial _{\bar{Z}}K_{\mathrm{H0}} =\left( 1+\frac{|Z|^2}{N}\right) ^{-2}\text{ and }~~{{\mathcal {R}}}_\mathrm{H}^{(0)}=\frac{2}{N}\,.\end{aligned}$$
(9)

The last quantity reveals that the Kähler manifold is compact or non-compact if \(N>0\) or \(N<0\). Here and hereafter, the superscript (0) and the subscript 0 denote quantities corresponding to the totally R-symmetric case. Working in this simplified framework, it is more convenient to investigate the existence of Minkowski solutions, as done in Sect. 2.2.1, classify the relevant solutions – see Sect. 2.2.2 – and derive the relevant particle spectrum in Sect. 2.2.3.

Let us clarify here that the SU(1, 1)/U(1) manifold is also parameterized by the half-plane coordinates (usually notated by T and \({\bar{T}}\)) which are related to the disc coordinates (Z and \({\bar{Z}}\)) through a Cayley transform [1, 28, 32]. This parameterization, though, violates the R symmetry and so it is inappropriate for our purposes. Note that the parameterization of the SU(2)/U(1) space does not change under that transform.

2.2.1 Minkowski vacua

Substituting \(k=0\) into Eq. (7) and taking into account that

$$\begin{aligned} G^{(0)}_Z={\bar{G}}^{(0)}_{\bar{Z}}=\frac{1}{Z}\left( \sqrt{\mathrm{g}^{(0)}}|Z|^2+\nu \right) \,. \end{aligned}$$
(10)

we derive the corresponding SUGRA potential

$$\begin{aligned} V_{\mathrm{H0}}=\frac{m^2}{N^2}\frac{e^{K_{\mathrm{H0}}}}{|Z|^{2(1-\nu )}} \left( \left( \nu N + \left( N+\nu \right) |Z|^2\right) ^2-3N^2 |Z|^2\right) ,\nonumber \\ \end{aligned}$$
(11)

which, as expected, depends exclusively on \(|Z|^2\). The systematic search of a Minkowski vacuum – defined by the simultaneous fulfilment of the conditions

$$\begin{aligned} \mathsf{\small (a)}~~\left\langle {V_{\mathrm{H0}}} \right\rangle =0,~~\mathsf{\small (b)}~~\left\langle {V_{\mathrm{H0}}'} \right\rangle =0~~~\text{ and }~~~\mathsf{\small (c)}~~\left\langle {V_{\mathrm{H0}}''} \right\rangle >0,\nonumber \\ \end{aligned}$$
(12)

where the derivatives with respect to (w.r.t) \(|Z|^2\) are denoted by a prime – is facilitated if the expression in the parenthesis of Eq. (11) is expanded as follows

$$\begin{aligned} \left( 2\nu (N+\nu )-3N\right) N|Z|^2+\nu ^2N^2+|Z|^4(N+\nu )^2. \end{aligned}$$
(13)

From our experience in Ref. [1], we suspect that the attainment of a Minkowski vacuum is, in principle, possible if the expression in Eq. (13) equals to a perfect square trinomial including a minus in the double product. Since the last two terms consist a sum of squares, the desired perfect square trinomial is achieved, if the two first terms in Eq. (13) coincide with the minus double product of the square root of anyone of the two last terms. More explicitly, we impose the condition

$$\begin{aligned} -2|N||\nu ||N+\nu |=\left( 2\nu (N+\nu )-3N\right) N\end{aligned}$$
(14)

which, fortunately, is independent of \(|Z|^2\). We seek below the \(N=N_{\mathrm{c}}\) values as a function of \(\nu \) that satisfy the condition above. It is obvious that we need a combination of \(\nu \), \(N_{\mathrm{c}}\) and \(N_{\mathrm{c}}+\nu \) such that \(-2|N_{\mathrm{c}}||\nu ||N_{\mathrm{c}}+\nu |=-2N_{\mathrm{c}}\nu (N_{\mathrm{c}}+\nu )\) since, otherwise, this term is cancelled out in Eq. (14) and the only possible solution is \(N_{\mathrm{c}}=0\). As a consequence, we single out the following cases which yield \(N_{\mathrm{c}}\ne 0\):

(i):

\(\nu >0\) and \(N_{\mathrm{c}}>0\) resulting to \(\nu +N_{\mathrm{c}}>0\);

(ii):

\(\nu <0\), \(N_{\mathrm{c}}>0\) and \(\nu +N_{\mathrm{c}}<0\);

(iii):

\(\nu >0\), \(N_{\mathrm{c}}<0\) and \(\nu +N_{\mathrm{c}}<0\).

If we solve Eq. (14) w.r.t \(N_{\mathrm{c}}\), in all cases above, we can derive \(N_{\mathrm{c}}\) using \(\nu \) as free variable. Namely,

$$\begin{aligned} N_{\mathrm{c}}=\frac{4\nu ^2}{3-4\nu }~~\Rightarrow ~~N_{\mathrm{c}}\gtrless 0~~\text{ for }~~\nu \lessgtr 3/4\,.\end{aligned}$$
(15)

Turning the argument around, we can obtain two possible solutions \(\nu _{\pm }\) for every (preferably integer) \(N_{\mathrm{c}}\) value if we solve Eq. (15) w.r.t \(\nu \), i.e.,

$$\begin{aligned} \nu _{\pm }=\frac{1}{2}\left( -N_{\mathrm{c}}\pm \sqrt{N_{\mathrm{c}}(N_{\mathrm{c}}+3)}\right) \end{aligned}$$
(16)

and correspond to real numbers if

$$\begin{aligned} N_{\mathrm{c}}(N_{\mathrm{c}}+3)\ge 0~~\Rightarrow ~~ N_{\mathrm{c}}\ge 0&\text{ or }~~~N_{\mathrm{c}}\le -3.\end{aligned}$$
(17)

For \(N=N_{\mathrm{c}}\), \(V_{\mathrm{H0}}\) in Eq. (11) includes by construction the square of a binomial with a minus inside. Indeed, it reads

$$\begin{aligned} V_{\mathrm{H0c}}=\left( \frac{m}{4\nu }\right) ^2\left( 1+\frac{|Z|^2}{N_{\mathrm{c}}}\right) ^{N_{\mathrm{c}}}|Z|^{2(\nu -1)} \left( 3 |Z|^2 -4\nu ^2\right) ^2.\end{aligned}$$
(18)

Here and hereafter the subscript “c” denotes quantities computed for \(N=N_{\mathrm{c}}\). Enforcing Eq. (12b) we find a honest extremum of \(V_{\mathrm{H0c}}\) as follows

$$\begin{aligned} \left\langle {V_{\mathrm{H0c}}'} \right\rangle =0~\Rightarrow ~\left\langle {|Z|^2} \right\rangle =4|\nu |^2/3. \end{aligned}$$
(19)

This result is, initially, restricted to \(\nu \le 3/2\) for \(N_{\mathrm{c}}<0\) by virtue of Eqs. (4) and (17). It can be further constrained from the observation that Eq. (12a) is fulfilled for

$$\begin{aligned} N_{\mathrm{c}}\ne -3~\Leftrightarrow ~\nu \ne 3/2. \end{aligned}$$
(20)

Indeed, taking \(N_{\mathrm{c}}=-3\) and \(\nu =3/2\), \(V_{\mathrm{H0c}}\) in Eq. (18) is simplified as

$$\begin{aligned} V_{\mathrm{H0c}}=27m^2\sqrt{|Z|^2}/(12-4|Z|^2),\end{aligned}$$
(21)

which exhibits just a SUSY vacuum. Finally, in order to check the validity of Eq. (12c), we compute

$$\begin{aligned} \left\langle {V_{\mathrm{H0c}}''} \right\rangle =2^{2\nu -5}3^{3-\nu }|\nu |^{2(\nu -1)}m^2\langle {\mathrm{g}}\rangle ^{-N_{\mathrm{c}}/2},\end{aligned}$$
(22)

where we introduce the quantity

$$\begin{aligned} \langle {\mathrm{g}}\rangle =\langle {\mathrm{g}^{(0)}} \rangle =\frac{1}{4}\left( 1-\frac{2}{3}\nu \right) ^{-2}=\frac{9N_{\mathrm{c}}}{16\nu ^2(N_{\mathrm{c}}+3)}, \end{aligned}$$
(23)

which controls the stability of the vacuum in Eq. (19) – as we show below, even for \(k\ne 0\) in Eq. (2) the Minkowski vacuum lies along the direction \(Z=Z_{-}\) and so we do not apply the symbolic distinction mentioned below Eq. (9) regarding \(\langle {\mathrm{g}}\rangle \) and \(\langle {\mathrm{g}^{(0)}} \rangle \). Namely, \(\nu <3/2\) assures the validity of Eq. (12c) for any \(N_{\mathrm{c}}\). Taking into account also, the constraint below Eq. (19) for \(N_{\mathrm{c}}<0\) and Eqs. (15) and (17) we end up with the following allowed domains

$$\begin{aligned} \frac{3}{4}<\nu<\frac{3}{2}~~\text{ for }~~N_{\mathrm{c}}<0~~\text{ and }~~\nu <\frac{3}{4}~~\text{ for }~~N_{\mathrm{c}}>0.\end{aligned}$$
(24)

Note that the \(\nu \) range is rather limited for \(N_{\mathrm{c}}<0\) and ampler for \(N_{\mathrm{c}}>0\).

Table 1 Solutions to Eq. (15) for various \(\nu \)’s
Fig. 1
figure 1

Solutions to Eq. (16) for integer \(N_{\mathrm{c}}\) values and \(|N_{\mathrm{c}}|\le 10\). We take \(N_{\mathrm{c}}<0\) (gray and black triangles), or \(N_{\mathrm{c}}>0\) and \(\nu >0\) (black circles) or \(N_{\mathrm{c}}>0\) and \(\nu <0\) (light gray circles)

2.2.2 Classification of the solutions

To be more specific, we list in Table 1 some pairs \((\nu ,N_{\mathrm{c}})\) compatible with Eqs. (15) and (24). All of these correspond to \(\nu =\nu _{-}\) in Eq. (16) and yield integer \(N_{\mathrm{c}}\) for \(\nu =1\) and \(-3/2\). Confining ourselves on exclusively integer \(N_{\mathrm{c}}\) – favored by the string theories [50, 51] –, we may vary \(N_{\mathrm{c}}\) in the range \([-10, 10]\) and obtain \(\nu \) via Eq. (16). We depict the results in Fig. 1, where we present three families of points of different shapes corresponding to different types of \(V_{\mathrm{H0c}}\) as we explain below. Besides the \(\nu \) values corresponding to \(N_{\mathrm{c}}=-4\) and 1, the others are irrational numbers which, are used mainly for presentational purposes – see, e.g., Refs. [2, 3]. Also, for \(N_{\mathrm{c}}>0\) both outputs, \(\nu _{+}\) and \(\nu _{-}\), are possible and, in particular, the \(\nu _{-}\) values increase almost linearly with \(N_{\mathrm{c}}\) whereas the \(\nu _{+}\) values remain almost unchanged at the level of the values obtained for \(N_{\mathrm{c}}<0\).

The structure of \(V_{\mathrm{H0c}}\) in Eq. (18) for the various pairs \((\nu ,N_{\mathrm{c}})\) satisfying Eq. (15) can be inferred by Figs. 2 and 3, where we present \(V_{\mathrm{H0c}}/m^2m_{\mathrm{P}}^2\) as a function of \(|Z|^2\) for \(N_{\mathrm{c}}<0\) – in Fig. 2 – and for \(N_{\mathrm{c}}>0\) – in Fig. 3. Namely, we draw solid, dashed and dot-dashed lines for \((\nu _{-},N_{\mathrm{c}})=(1, -4)\), \((4/5, -64/5)\) and \((5/4, -25/8)\) respectively in Fig. 2. On the other hand, the dashed, solid and dot-dashed lines in Fig. 3 are obtained for \((\nu _{+},N_{\mathrm{c}})=(1/2, 1)\), \((\nu _{-},N_{\mathrm{c}})=(-3/2, 1)\) and \((\nu _{-}, N_{\mathrm{c}})=(-2, 16/11)\) respectively. The line \(V_{\mathrm{H0c}}=0\) and the values \(\left\langle {|Z|^2} \right\rangle \) at the non-SUSY minimum – see Eq. (19) – are also indicated in both figures. More explicitly, the specific values of \(\left\langle {|Z|^2} \right\rangle \) for each \(\nu \) value employed in the figures above are arranged in Table 2. We see that \(\left\langle {|Z|^2} \right\rangle \) decreases with \(|\nu |\) and may become even subplanckian, in sharp contrast to the well-known Polonyi model [26] and the case of Ref. [1].

Fig. 2
figure 2

The (dimensionless) hidden-sector potential \(V_{\mathrm{H0c}}/m^2m_{\mathrm{P}}^2\) in Eq. (11) as a function of \(|Z|^2\) for \((\nu _{-},N_{\mathrm{c}})=(4/5, -64/5)\), \((1, -4)\) and \((5/4, -25/8)\) (dashed, solid and dot-dashed line respectively). The line \(V_{\mathrm{H0c}}=0\) and the values \(\left\langle {|Z|^2} \right\rangle \) are also indicated

Fig. 3
figure 3

The same as in Fig. 2 but for \((\nu ,N_{\mathrm{c}})=(1/2,1)\), \((-3/2, 1)\), and \((-2, 16/11)\) (dashed, solid and dot-dashed line respectively)

Table 2 Minkowski vacua for the \(\nu \) values used in Figs. 2 and 3

Let’s focus first on Fig. 2. We remark that for \(\nu >1\) in the domain of Eq. (24), \(V_{\mathrm{H0c}}\) features a SUSY minimum at \(\left\langle {|Z|} \right\rangle =0\) apart from the non-SUSY Minkowski one – see dot-dashed line in Fig. 2. This is due to the presence of the factor \(|Z|^{2(\nu -1)}\) in Eq. (11) which yields another solution to the right equation in Eq. (19) for \(\nu >1\). Although less attractive than the other cases, the coexistence of SUSY and non-SUSY Minkowski vacua may be beneficial for applying the multi-point principle analyzed in Ref. [20]. We obtain the same shape of \(V_{\mathrm{H0c}}\) for \(1<\nu <3/2\). Since no integer \(N_{\mathrm{c}}\) value may be computed via Eq. (15) for \(\nu \) values into this domain, no point in Fig. 1 corresponds to this kind of \(V_{\mathrm{H0c}}\). For the value \((\nu , N_{\mathrm{c}})=(1,-4)\) – black triangle in Fig. 1 – we obtain the well-known form of \(V_{\mathrm{H0c}}\) presented in Ref. [1], whereas for the remaining (gray) triangles in Fig. 1 we obtain the type of \(V_{\mathrm{H0c}}\) drawn by dashed line in Fig. 2. It is remarkable that \(V_{\mathrm{H0c}}\) develops just one critical point in this case. The situation is rather different for \(N_{\mathrm{c}}>0\) as shown in Fig. 3. For \(\nu >0\) – e.g., see the \((\nu , N_{\mathrm{c}})\) values represented by black circles in Fig. 1 –, the type of \(V_{\mathrm{H0c}}\) corresponds to the dashed line in Fig. 3. On the contrary, the \(\nu <0\) values give rise to \(V_{\mathrm{H0c}}\) whose structure is designed by the solid and the dot-dashed lines. As \(N_{\mathrm{c}}\) increases, \(|N_{\mathrm{c}}|\) increases too – see e.g., the gray circles in Fig. 1 – and \(\left\langle {|Z|^2} \right\rangle \) is moved towards larger values.

2.2.3 Particle spectrum

If we analyze Z according to the description

$$\begin{aligned} Z=(z+i{\theta })/\sqrt{2}\end{aligned}$$
(25)

and expand \(V_{\mathrm{H0c}}\) in Eq. (18) about the configuration

$$\begin{aligned} \left\langle {z} \right\rangle =2\sqrt{\frac{2}{3}}|\nu |~~~\text{ and }~~\left\langle {{\theta }} \right\rangle =0,\end{aligned}$$
(26)

– cf. Eq. (19) –, we can work out the hidden-sector spectrum of the model. This is composed of:

(i) :

A (canonically normalized) real scalar field – the sgolstino or R saxion –, \({\widehat{z}}=\sqrt{\left\langle \mathrm{g} \right\rangle }z\), with mass squared

$$\begin{aligned} m^2_{{\widehat{z}}} =\left\langle {\partial ^2_{{\widehat{z}}}V_{\mathrm{H0c}}} \right\rangle =\left\langle {{2V_{\mathrm{H0c}}''|Z|^2}/{\mathrm{g}}} \right\rangle =\frac{9}{4}\frac{m_{3/2}^2}{|\nu |^{2}\langle {\mathrm{g}}\rangle }; \end{aligned}$$
(27a)

where we take into account Eqs. (22) and (23),

(ii) :

The gravitino, \(\widetilde{G}\), – which absorbs the fermionic partner of the R saxion, the R axino – with mass squared

$$\begin{aligned} m_{3/2}^2=m^2\left\langle {e^{K_{\mathrm{Hc}}}|Z|^{2\nu }} \right\rangle =4^{\nu } |\nu |^{2\nu }m^2/3^\nu \langle {\mathrm{g}}\rangle ^{N_{\mathrm{c}}/2}.\end{aligned}$$
(27b)
(iii) :

A massless Nambu–Goldstone boson, \({\theta }\), – referred [33] to as an R axion.

Besides the last one, the masses above keep their values even for \(k\ne 0\) and obey the super-trace formula [23] which, in the case of an hidden sector with one superfield, reads [1]

$$\begin{aligned} \mathsf{STr}M_0^2=m_{\widehat{z}}^2-4m_{3/2}^2=6m_{3/2}^2\left\langle {{{\mathcal {R}}}^{(0)}_\mathrm{Hc}} \right\rangle =\frac{12}{N_{\mathrm{c}}}m_{3/2}^2,\end{aligned}$$
(28)

where we make use of Eqs. (9), (15), (27b) and (27a).

2.3 Including the R-symmetry-violating term

The (canonically normalized) R axion, \({\widehat{\theta }}\), couples to the massless SM gauge bosons (gluons and photons) [53,54,55,56] due to \(SU(3)_{\mathrm{c}}\) and \(U(1)_{\mathrm{EM}}\) anomalies respectively. Non-perturbative \(SU(3)_{\mathrm{c}}\) instanton effects [52] result in a mass for \({\widehat{\theta }}\) which is expected to be at the KeV level and so it is severely constrained by astrophysical and laboratory probes. Here we focus on the most robust, lower bound derived from the fact that \({\widehat{\theta }}\) can be produced in the supernovae core via the bremsstrahlung process \(nn\rightarrow nn{\widehat{\theta }}\) with n being nucleons. The measurements of the neutrino burst from supernova 1987a set important constraints on sub-GeV axion-like particles. In particular, the energy loss rate due those particle should not exceed the one through neutrinos. Considering that the coupling between \({\widehat{\theta }}\) and gluons or photons is well suppressed, the restriction above is translated to the following bound on the mass of \({\widehat{\theta }}\) [40,41,42,43,44]

$$\begin{aligned} m_{\widehat{\theta }}\ge 10~\mathrm{MeV}, \end{aligned}$$
(29)

where we restore the units for convenience. On the other hand, we do not impose here an upper limit on \(m_{\widehat{\theta }}\) from a possible candidacy of \({\widehat{\theta }}\) for cold dark matter since this depends crucially on the cosmological evolution of \({\widehat{\theta }}\) which is not specified in our set-up [53, 54].

The phenomenological problems arising from the masslessness (at perturbative level) of \({\widehat{\theta }}\) for \(k=0\) in Eq. (2) may be evaded invoking several modifications of the setting in Sect. 2.2. E.g., if we consider Z as a nilpotent superfield [58] no sgoldstino appears in the particle spectrum and so no R axion too. Another way out of this problem is the gauging of the R symmetry [21, 45, 57], provided that it is anomaly-free. In this case, though, D-term contributions are introduced which destabilize the construction of Sect. 2.2.2. Following Ref. [1], we here adopt the simplest solution: we explicitly break the R symmetry via a subdominant quartic term in \(K_{\mathrm{H}}\) proportional to k – see Eq. (2). As we show below, even for \(k\ne 0\), the field configuration in Eq. (26) still corresponds to a Minkowski vacuum whereas a large enough mass for the R axion is generated. This is actually the reason for which we prefer to activate the R-violation in \(K_{\mathrm{H}}\) and not in \(W_{\mathrm{H}}\) as in the Polonyi model [26]. It is clear that no purely theoretical motivation exists for this term in \(K_{\mathrm{H}}\). Its presence, though, is “technically natural” thanks to the smallness of k – see below. Although not generalized here, the exponent, 4, of this term is the unique which offers the “facilities” above [1]. The same term is widely utilized for the stabilization of the imaginary direction of the sgoldstino within the no-scale models [2, 3].

Below we verify that \((\left\langle {z} \right\rangle , \left\langle {{\theta }} \right\rangle )\) defined in Eq. (26) represents a stable Minkowski vacuum of \(V_{\mathrm{H}}\) in Eq. (7) for \(k\ne 0\) and \(N=N_{\mathrm{c}}\). To embark on it, we express \(V_{\mathrm{H}}\) as a function of z and \({\theta }\) using the parametrization in Eq. (25). First of all, we show that \(V_{\mathrm{Hc}}\) at the point given by Eq. (26) vanishes, i.e.,

$$\begin{aligned} \left\langle {V_{\mathrm{Hc}}} \right\rangle =0~~\Rightarrow ~~\left\langle {u_{\mathrm{c}}v_{\mathrm{c}}/w_{\mathrm{c}}} \right\rangle =3N_{\mathrm{c}}\left\langle {|Z|^2} \right\rangle .\end{aligned}$$
(30)

Indeed, Eqs. (8a), (8b) and (5b) at the vacuum yield

$$\begin{aligned} \left\langle {u_{\mathrm{c}}} \right\rangle =\left\langle {v_{\mathrm{c}}} \right\rangle =\nu N_{\mathrm{c}}+(N_{\mathrm{c}}+\nu )\left\langle {|Z|^2} \right\rangle ~~\text{ and }~~\left\langle {w_{\mathrm{c}}} \right\rangle =N_{\mathrm{c}},\end{aligned}$$
(31)

and so Eq. (30) can be readily deduced. Then we minimize \(V_{\mathrm{Hc}}\) w.r.t z and \(\theta \). Given that \(Z_{-}=\sqrt{2}i{\theta }\), \(V_{\mathrm{Hc}}\) for \({\theta }=0\) coincides with the one in Eq. (18) , i.e.,

$$\begin{aligned} V_{\mathrm{Hc}}(z,{\theta }=0)=V_{\mathrm{H0c}}\end{aligned}$$
(32)

and therefore, \(\left\langle {z} \right\rangle \) (for \(k\ne 0\)) keeps its value in Eq. (26) (found for \(k=0\)). As regards the \({\theta }\) direction, we check the validity of the extremum condition computing the first derivative of \(V_{\mathrm{Hc}}\) w.r.t \({\theta }\) for \(\left\langle {{\theta }} \right\rangle =0\) with result

$$\begin{aligned} \nonumber \left\langle {\partial _{{\theta }}V_{\mathrm{Hc}}} \right\rangle&= {3m^2}\left\langle {e^{K_{\mathrm{Hc}}} |Z|^{2(\nu -1)}} \right\rangle \cdot \\ {}&\quad \times \left\langle {\left( \frac{\partial _{{\theta }}u_{\mathrm{c}}}{u_{\mathrm{c}}}+\frac{\partial _{{\theta }}v_{\mathrm{c}}}{v_{\mathrm{c}}}-\frac{\partial _{{\theta }}w_{\mathrm{c}}}{w_{\mathrm{c}}}\right) |Z|^2-{\theta }} \right\rangle =0. \end{aligned}$$
(33)

Here we take advantage of Eq. (30). We also observe that

$$\begin{aligned} \left\langle {\partial _{{\theta }}\left( e^{K_{\mathrm{H}}} |Z|^{2(\nu -1)}\right) } \right\rangle =0\end{aligned}$$
(34a)

since this is proportional at least to \({\theta }^2\) and, similarly,

$$\begin{aligned} \left\langle {\partial _{{\theta }}w_{\mathrm{c}}} \right\rangle =\left\langle {\partial _{{\theta }}u_{\mathrm{c}}} \right\rangle =\left\langle {\partial _{{\theta }}v_{\mathrm{c}}} \right\rangle =0, \end{aligned}$$
(34b)

since these are proportional to \({\theta }\) – see Eqs. (5b), (8a) and (8b).

From the results above, it is easy to compute the mixed second derivatives

$$\begin{aligned} \left\langle {\partial _{z}\partial _{{\theta }}V_{\mathrm{Hc}}} \right\rangle = \left\langle {\partial _{{\theta }}\partial _{z}V_{\mathrm{Hc}}} \right\rangle =0. \end{aligned}$$
(35)

which are also zeroed. Therefore, the matrix of the second derivatives of \(V_{\mathrm{H}}\) w.r.t z and \({\theta }\), \(\left\langle {\partial _{i}\partial _{j}V_{\mathrm{Hc}}} \right\rangle \) with \(i=z\) and \({\theta }\) has a diagonal form. For \(i=j=z\) we obtain the mass squared of \({\widehat{z}}\), given by Eq. (27a), since the z-dependent form of \(V_{\mathrm{H}}\) in Eq. (7) coincides with that of \(V_{\mathrm{H0c}}\) in Eq. (18). For \(i=j={\theta }\), we differentiate once more \(w_{\mathrm{c}}, u_{\mathrm{c}}\) and \(v_{\mathrm{c}}\) w.r.t \({\theta }\) with results

$$\begin{aligned} \left\langle {\partial ^2_{{\theta }}w_{\mathrm{c}}} \right\rangle&=-48k\left( \left\langle {|Z|^2} \right\rangle +N_{\mathrm{c}}\right) , \end{aligned}$$
(36a)
$$\begin{aligned} \left\langle {\partial ^2_{{\theta }}u_{\mathrm{c}}} \right\rangle&=\left\langle {\partial ^2_{{\theta }}v_{\mathrm{c}}} \right\rangle =(\nu +N_{\mathrm{c}}). \end{aligned}$$
(36b)

Armed with the results above we arrive at the following one,

$$\begin{aligned} \left\langle {\partial _{{\theta }}^2V_{\mathrm{Hc}}} \right\rangle&= {3m^2}\left\langle {e^{K_{\mathrm{Hc}}} |Z|^{2(\nu -1)}} \right\rangle \cdot \nonumber \\ {}&\quad \times \left( \left\langle {|Z|^2} \right\rangle \left\langle {\frac{\partial ^2_{{\theta }}u_{\mathrm{c}}}{u_{\mathrm{c}}}+\frac{\partial ^2_{{\theta }}v_{\mathrm{c}}}{v_{\mathrm{c}}}-\frac{\partial ^2_{{\theta }}w_{\mathrm{c}}}{w_{\mathrm{c}}}} \right\rangle -1\right) \end{aligned}$$
(37)

Making use of the identity

$$\begin{aligned} \left\langle {|Z|^2} \right\rangle \left( \frac{\partial ^2_{{\theta }}u_{\mathrm{c}}}{u_{\mathrm{c}}}+\frac{\partial ^2_{{\theta }}v_{\mathrm{c}}}{v_{\mathrm{c}}}\right) =1\end{aligned}$$
(38)

and canonically normalizing the relevant mode, we may achieve our final result

$$\begin{aligned} m_{\widehat{\theta }}^2=\left\langle {\partial ^2_{{\widehat{{\theta }}}}V_{\mathrm{Hc}}} \right\rangle =\left\langle {\partial ^2_{{\theta }}V_{\mathrm{Hc}}/\mathrm{g}} \right\rangle = 144km_{3/2}^2/\langle {\mathrm{g}}\rangle ^{3/2}, \end{aligned}$$
(39)

which coincides with the one obtained in Ref. [1] for \(\nu =1\). We observe that \(m_{\widehat{\theta }}^2>0\) for \(k>0\). Consequently, setting \(k>0\) in Eq. (2) does not modify \(V_{\mathrm{H}}\) from \(V_{\mathrm{H0c}}\) in the real direction but just allows for a non-vanishing R-axion mass. To fulfill Eq. (29) for, e.g., \(m=1~{\text{ TeV }}\), it is enough to take \(k\gtrsim 4\cdot 10^{-13}\) – this number corrects a typo in Ref. [1] – for \(\nu =\nu _{+}>0\) and \(k\gtrsim 2\cdot 10^{-14} - 3\cdot 10^{-5}\) for \(\nu =\nu _{-}<0\), where the minimal k increases as \(N_{\mathrm{c}}\) increases from 1 to 10 – see Fig. 2. Therefore, our argument related to the naturalness of the k term in Eq. (2) is compatible with the R-axion phenomenology.

Summarizing, the particle spectrum of the present version of our model comprises \(\widetilde{G}\), \({\widehat{z}}\) and \({\widehat{{\theta }}}\) whose the masses are given by Eqs. (27b), (27a) and (39), respectively. We can verify that these masses obey the supertrace relation – cf. Eq. (28)

$$\begin{aligned} \nonumber \mathsf{STr}M^2= & {} m_{\widehat{z}}^2+m_{\widehat{\theta }}^2-4m_{3/2}^2=6m_{3/2}\left\langle {{{\mathcal {R}}}_\mathrm{Hc}} \right\rangle \\ {}= & {} \frac{9}{\nu }\left( \frac{1}{\nu }-\frac{4}{3}\left( 1-12k\nu /\langle {\mathrm{g}}\rangle ^{3/2}\right) \right) m_{3/2}^2, \end{aligned}$$
(40)

where \(\left\langle {{{\mathcal {R}}}_{\mathrm{Hc}}} \right\rangle \) is geometrically [1] estimated as

$$\begin{aligned} \left\langle {{{\mathcal {R}}}_{\mathrm{Hc}}} \right\rangle =\left\langle {{\mathcal R}^{(0)}_{\mathrm{Hc}}} \right\rangle -24k/\langle {\mathrm{g}}\rangle ^{3/2},\end{aligned}$$
(41)

with \(\left\langle {{{\mathcal {R}}}^{(0)}_{\mathrm{Hc}}} \right\rangle \) given in Eq. (9). Contrary to the case with \(k=0\), here \({{\mathcal {R}}}_{\mathrm{Hc}}\) is not generically constant.

Checking the hierarchy of the various masses, we infer from Eqs. (27b) and (27a) that

$$\begin{aligned} m_{3/2}\le m_{\widehat{z}}~~\text{ for }~~\nu \left\langle \mathrm{g} \right\rangle ^{1/2}\le 3/2 \end{aligned}$$
(42a)

and from Eqs. (27b) and (39) that

$$\begin{aligned} m_{\widehat{\theta }}\le m_{3/2}~~\text{ for }~~k\le \left\langle \mathrm{g} \right\rangle ^{3/2}/144.\end{aligned}$$
(42b)

More specifically, we obtain \(m_{\widehat{z}}\le 2m_{3/2}\) for \(\nu >0\) and \(N_{\mathrm{c}}<0\) and so the decay of \({\widehat{z}}\) into a pair of \(\widetilde{G}\) [53, 54] is kinematically blocked whereas for \(N_{\mathrm{c}}>0\) this decay channel is open. Therefore, the decay of \({\widehat{z}}\) and \({\widehat{{\theta }}}\) into \(\widetilde{G}\) depends on the parameters (\(\nu \), \(N_{\mathrm{c}}\) and k) and may have an impact on the relic abundance of \(\widetilde{G}\) before nucleosynthesis.

Fig. 4
figure 4

The ratios \(m_{3/2}/m\), \(m_{\widehat{z}}/m\) and \(m_{\widehat{\theta }}/m\) for \(k=0.05\) (circles, squares and stars respectively) versus \(N_{\mathrm{c}}\). We take \(N_{\mathrm{c}}<0\) (gray symbols) or \(N_{\mathrm{c}}>0\) and \(\nu >0\) (black symbols) or \(N_{\mathrm{c}}>0\) and \(\nu <0\) (light gray symbols)

These conclusions are illustrated in Fig. 4, where we display the ratios \(m_{3/2}/m\) (circles), \(m_{\widehat{z}}/m\) (squares) and \(m_{\widehat{\theta }}/m\) (stars) for \(k=0.05\) versus \(N_{\mathrm{c}}\), for integer \(N_{\mathrm{c}}\) values and \(|N_{\mathrm{c}}|\le 10\). We take \(N_{\mathrm{c}}<0\) (gray symbols) or \(N_{\mathrm{c}}>0\) and \(\nu >0\) (black symbols) or \(N_{\mathrm{c}}>0\) and \(\nu <0\) (light gray symbols). For the selected k value \({\widehat{{\theta }}}\) is heavier than both \({\widehat{z}}\) and \(\widetilde{G}\) for any \((\nu ,N_{\mathrm{c}})\) whereas the hierarchy of \({\widehat{z}}\) and \(\widetilde{G}\) is k-independent and mildly dependents on \((\nu ,N_{\mathrm{c}})\) according to Eq. (42a).

Just for completeness, we mention that the R axion may acquire mass squared

$$\begin{aligned} m_{\widehat{\theta }}^2= 144km_{3/2}^2/\langle {\mathrm{g}}\rangle ^{2} \end{aligned}$$
(43)

similar to the one in Eq. (39), if we adopt one of the following Kähler potentials

$$\begin{aligned} K_{\mathrm{H}}&=N_{\mathrm{c}}\ln \left( 1+|Z|^2/N_{\mathrm{c}}\right) +N_k\ln \left( 1-kZ_{-}^4/N_k\right) , \end{aligned}$$
(44a)
$$\begin{aligned} K_{\mathrm{H}}&=N_{\mathrm{c}}\ln \left( 1+|Z|^2/N_{\mathrm{c}}\right) -kZ_{-}^4, \end{aligned}$$
(44b)

where \(N_k\) is an undetermined constant.

3 Observable sector

Our next task is to study the transmission of the SUSY breaking to the visible world. To implement this, we introduce the chiral superfields of the observable sector \(\Phi _{\alpha }\) and assume the following structure – cf. Refs. [1, 26] – for the total superpotential, W, and Kähler potential, K, of the theory

$$\begin{aligned} W= & {} W_{\mathrm{H}}(Z) + W_\mathrm{O}\left( \Phi _{\alpha }\right) , \end{aligned}$$
(45a)
$$\begin{aligned} K= & {} K_{\mathrm{Hc}}(Z)+{\widetilde{K}}(Z)|\Phi _{\alpha }|^2,\end{aligned}$$
(45b)

where \(W_{\mathrm{H}}\) and \(K_{\mathrm{Hc}}\) are given by Eqs. (1) and (2) with \(N=N_{\mathrm{c}}\) whereas \(W_{\mathrm{O}}\) and \({\widetilde{K}}\) are specified in Sect. 3.1 for a generic SUSY model and, in Sect. 3.2, for the MSSM. In the latter case, a solution to the \(\mu \) problem is also proposed.

3.1 Generic model

Here we adopt the following quite generic form of \(W_{\mathrm{O}}\)

$$\begin{aligned} W_{\mathrm{O}}=h \Phi _1\Phi _2\Phi _3+\mu \Phi _4\Phi _5, \end{aligned}$$
(46)

where we may easily select the appropriate R charge for each of \(\Phi _{\alpha }\) as in Ref. [1]. We try also similar \({\widetilde{K}}\), ensuring universal SSB parameters for \(\Phi _{\alpha }\), i.e.,

$$\begin{aligned} K_1&=K_{\mathrm{Hc}}+\hbox { }\ \sum _{\alpha }|\Phi _{\alpha }|^2,~~~ \end{aligned}$$
(47a)
$$\begin{aligned} K_2&=N_{\mathrm{c}}\ln \left( 1+\left( |Z|^2-kZ_{-}^4+\hbox { }\ \sum _{\alpha }|\Phi _{\alpha }|^2\right) /N_{\mathrm{c}}\right) , \end{aligned}$$
(47b)
$$\begin{aligned} K_3&=K_{\mathrm{Hc}}+N_{\mathrm{O}}\ln \left( 1+\hbox { }\ \sum _{\alpha }|\Phi _{\alpha }|^2/N_\mathrm{O}\right) , \end{aligned}$$
(47c)

where the specific value of \(N_{\mathrm{O}}\) is irrelevant for our purposes. If we expand the K’s above for low \(\Phi _{\alpha }\) values, these may assume the form of Eq. (45b) with \({\widetilde{K}}\) being identified as

$$\begin{aligned} {\widetilde{K}}={\left\{ \begin{array}{ll} 1&{}\text{ for }~~K=K_1, K_3;\\ \left( 1+(|Z|^2-kZ_{-}^4)/N_{\mathrm{c}}\right) ^{-1}&{}\text{ for }~~K=K_2\,.\end{array}\right. }\end{aligned}$$
(48)

Compared to the K’s adopted in Ref. [1], we remark that the denominator \(N_{\mathrm{c}}\) in the equations above is replaced by \(-4\) there. This is obvious since \(\nu =1\) in Eq. (1) is associated with \(N_{\mathrm{c}}=-4\) in Eq. (2).

Adapting the general formulae of Ref. [34] to the case with one hidden-sector field, as done in Ref. [1], we obtain the SSB terms in the effective low energy potential which can be written as

$$\begin{aligned} V_{\mathrm{SSB}}= {\widetilde{m}}_{\alpha }^2 |{\widehat{\Phi }}_{\alpha }|^2+\left( Ah {\widehat{\Phi }}_1{\widehat{\Phi }}_2{\widehat{\Phi }}_3+ B\mu {\widehat{\Phi }}_4{\widehat{\Phi }}_5+\mathrm{h.c.}\right) , \end{aligned}$$
(49)

where the canonically normalized fields \({\widehat{\Phi }}_{\alpha }=\langle {{\widetilde{K}}} \rangle ^{1/2}\Phi _{\alpha }\) are denoted by hats. In deriving the values of the SSB parameters above, we find it convenient to distinguish the cases:

(a) For \(K=K_1\) and \(K_3\), we see from Eq. (48) that \({\widetilde{K}}\) is constant and so the relevant derivatives are eliminated. Substituting

$$\begin{aligned} \left\langle {F^Z} \right\rangle&=\langle {\bar{F}^{\bar{Z}}} \rangle =\sqrt{3}m_{3/2}/\langle {\mathrm{g}}\rangle ^{1/2}, \end{aligned}$$
(50a)
$$\begin{aligned} ~\left\langle {e^{K_{\mathrm{Hc}}}} \right\rangle&=\langle {\mathrm{g}}\rangle ^{-N_{\mathrm{c}}/2}, ~\left\langle {\partial _ZK_{\mathrm{Hc}}} \right\rangle =\frac{2\nu }{\sqrt{3}}\langle {\mathrm{g}}\rangle ^{1/2}\end{aligned}$$
(50b)

into the relevant expressions [1] we arrive at

$$\begin{aligned} {\widetilde{m}}_{\alpha }=m_{3/2}~~~\text{ and }~~~A=B+m_{3/2}=2\nu m_{3/2}/\langle {\mathrm{g}}\rangle ^{N_{\mathrm{c}}/4}\,.\end{aligned}$$
(51)

(b) For \(K=K_2\), \({\widetilde{K}}\) in Eq. (48) is Z dependent with \(\langle {{\widetilde{K}}} \rangle =\langle {\mathrm{g}}\rangle ^{1/2}\) and the relevant derivatives are found to be

$$\begin{aligned} \left\langle {\partial _Z\ln {\widetilde{K}}^2} \right\rangle&=\frac{2}{3}\left\langle {\partial _Z\ln {\widetilde{K}}^3} \right\rangle =-\frac{4\nu }{\sqrt{3}N_{\mathrm{c}}}\langle {\mathrm{g}}\rangle ^{1/2}, \end{aligned}$$
(52a)
$$\begin{aligned} \left\langle {\partial _{\bar{Z}}\partial _Z\ln {\widetilde{K}}} \right\rangle&=\frac{4\nu ^2}{3N_{\mathrm{c}}^2}\langle {\mathrm{g}}\rangle -\frac{1}{N_{\mathrm{c}}}\langle {\mathrm{g}}\rangle ^{1/2}.\end{aligned}$$
(52b)

Inserting the expressions above into the general formulae [1] we end up with

$$\begin{aligned} {\widetilde{m}}_{\alpha }&=\frac{3}{4|\nu |}m_{3/2}/\langle {\mathrm{g}}\rangle ^{1/2}, \end{aligned}$$
(53a)
$$\begin{aligned} A&=\frac{9}{8\nu }m_{3/2}/\langle {\mathrm{g}}\rangle ^{(N_{\mathrm{c}}+7)/4}, \end{aligned}$$
(53b)
$$\begin{aligned} B&= \frac{3}{2\nu }(1-\nu )m_{3/2}/\langle {\mathrm{g}}\rangle ^{(N_{\mathrm{c}}+4)/4}, \end{aligned}$$
(53c)

where the last result reveals the reason for which B vanishes for \(\nu =1\) [1].

Fig. 5
figure 5

The ratios \(m_{\alpha }/m_{3/2}\), \(A/m_{3/2}\) and \(B/m_{3/2}\) for \(K=K_2\) (circles, squares and stars respectively) versus \(N_{\mathrm{c}}\). We take \(N_{\mathrm{c}}<0\) (gray symbols) or \(N_{\mathrm{c}}>0\) and \(\nu >0\) (black symbols) or \(N_{\mathrm{c}}>0\) and \(\nu <0\) (light gray symbols)

The hierarchy of the SSB parameters above w.r.t \(m_{3/2}\) is demonstrated in Fig. 5, where we display the ratios \({\widetilde{m}}_{\alpha }/m_{3/2}\) (circles), \(|A|/m_{3/2}\) (squares) and \(|B|/m_{3/2}\) (stars) versus \(N_{\mathrm{c}}\), for integer \(N_{\mathrm{c}}\) values and \(|N_{\mathrm{c}}|\le 10\). We take \(N_{\mathrm{c}}<0\) (gray symbols) or \(N_{\mathrm{c}}>0\) and \(\nu >0\) (black symbols) or \(N_{\mathrm{c}}>0\) and \(\nu <0\) (light gray symbols). Note that the \({\widetilde{m}}_{\alpha }\)’s corresponding to the same \(N_{\mathrm{c}}>0\) turn out to be equal for \(\nu <0\) and \(\nu >0\) since these are inverse proportional to \(\left\langle \mathrm{g} \right\rangle ^{1/2}\nu \) which is constant for the same \(N_{\mathrm{c}}\), as deduced from the last relation in Eq. (23). For this reason the black circles overlap the light grays ones in the graph. We observe that in all cases the \({\widetilde{m}}_{\alpha }\)’s are of the order of \(m_{3/2}\) whereas |A| and |B| may be much larger than it for \(\nu <0\) and \(N_{\mathrm{c}}>0\). This is due to the fact that the \(\nu =\nu _{-}\) values, for these \(N_{\mathrm{c}}\)’s, may be much larger than unity – see light gray circles in Fig. 2. Therefore, we expect that the produced sparicle spectrum will be rather heavier than \(m_{3/2}\) in these cases. Similar conclusions can be extracted for \(K=K_1\) too.

Let us emphasize, finally, that the presence of \(k\ne 0\) in Eq. (2) not only provides mass to the R axion but also breaks explicitly \(U(1)_R\) and so, no topological defects are generated when Z acquires its v.e.v in Eq. (26). Otherwise, \(\left\langle {z} \right\rangle \) could break the discrete subgroup to which \(U(1)_R\) is broken, due the SSB parameters in \(V_{\mathrm{SSB}}\), and the production of topological defects could be possible.

3.2 Generation of the \(\mu \) term of MSSM

Extending further the results in Ref. [1], we now check if the hidden-sector models introduced in this work offer an explanation of the \(\mu \) term of MSSM, following the recipe of Guidice and Masiero in Ref. [37]. To this end, we assign the R charges for the MSSM fields indicated in Ref. [1], which forbid terms of the form \({H_{u}}{H_{d}}\) in \(W_{\mathrm{O}}\) – see Eq. (46). Namely, we may write it as

$$\begin{aligned} W_{\mathrm{MSSM}} =h_{{\alpha }{\beta }\gamma } \Phi _{\alpha }\Phi _{\beta }\Phi _\gamma /6\,,\end{aligned}$$
(54)

where we closely follow the notation established in Ref. [1]. The mixing term between \({H_{u}}\) and \({H_{d}}\) may emerge in the part of the potential including the SSB terms

$$\begin{aligned} \nonumber V_{\mathrm{SSB}}= & {} {\widetilde{m}}_{\alpha }^2 |{\widehat{\Phi }}_{\alpha }|^2+\left( \frac{1}{6} A_{{\alpha }{\beta }\gamma } h_{{\alpha }{\beta }\gamma } {\widehat{\Phi }}_{\alpha }{\widehat{\Phi }}_{\beta }{\widehat{\Phi }}_\gamma \right. \\&\left. + {\widetilde{B}}\mu {\widehat{H}}_u{\widehat{H}}_d+\mathrm{h.c.}\right) , \end{aligned}$$
(55)

if we incorporate (somehow) into the K’s of Eqs. (47a) – (47c) the following higher order terms, inspired by Ref. [37],

$$\begin{aligned} \Delta K_{\mu }=\lambda _{\mu }\frac{{\bar{Z}}^{2\nu }}{m_{\mathrm{P}}^{2\nu }}{H_{u}}{H_{d}}\ +\ \mathrm{h.c.},\end{aligned}$$
(56)

where \(\lambda _{\mu }\) is a real constant and the hatted fields in Eq. (55) are related to the unhatted ones as shown below Eq. (49). Due to the adopted R symmetry, the terms in Eq. (56) do not coincide with those proposed in the original paper [37]. It is, therefore, non-trivial to find out the magnitude of the resulting \({\widetilde{B}}\mu \).

We consider several Kähler potentials to exemplify our approach, as done in Ref. [1]. Namely, we select

$$\begin{aligned} K_{11}&=K_1+\Delta K_{\mu },~~~ \end{aligned}$$
(57a)
$$\begin{aligned} K_{21}&=K_2+\Delta K_{\mu },\end{aligned}$$
(57b)
$$\begin{aligned} K_{22}&=N_{\mathrm{c}}\ln \Bigg (1+\left( |Z|^2-kZ_{-}^4+\hbox { }\ \sum _{\alpha }|\Phi _{\alpha }|^2+\Delta K_{\mu }\right) \nonumber \\&\qquad /N_{\mathrm{c}}\Bigg ),~~~~~~\end{aligned}$$
(57c)
$$\begin{aligned} K_{23}&=N_{\mathrm{c}}\ln \left( 1+\left( |Z|^2-kZ_{-}^4+\Delta K_{\mu }\right) /N_{\mathrm{c}}\right) \nonumber \\&\qquad +\hbox { }\ \sum _{\alpha }|\Phi _{\alpha }|^2,~~~~~~~~ \end{aligned}$$
(57d)

where \(K_1\) and \(K_2\) are defined in Eqs. (47a) and (47b) respectively. The K’s above may be brought into the form

$$\begin{aligned} K_{\mathrm{MSSM}}=K_{\mathrm{Hc}}(Z)+{\widetilde{K}}(Z)|\Phi _{\alpha }|^2+\big (C_{H}{H_{u}}{H_{d}}+\mathrm{h.c.}\big ),\nonumber \\ \end{aligned}$$
(58)

where \({\widetilde{K}}\) is determined as shown in Eq. (48). Namely the branch of the definition of \({\widetilde{K}}\) in Eq. (48) for \(K=K_1\) and \(K_3\) corresponds to \(K=K_{11}\) and \(K_{23}\), whereas that one which is valid for \(K=K_2\) corresponds to \(K=K_{21}\) and \(K_{22}\). As a consequence, the derived \({\widetilde{m}}_{\alpha }\) and \(A_{{\alpha }{\beta }\gamma }\) in Eq. (51) remain the same for \(K=K_{11}\) and \(K_{23}\), whereas those shown in Eqs. (53a) and (53b) are found also for \(K=K_{21}\) and \(K_{22}\).

For the computation of \({\widetilde{B}}\mu \) in Eq. (55) it is crucial to find out \(C_{H}\) in Eq. (58). This is done by expanding the K’s in Eqs. (57a) – (57d) for low \({H_{u}}\) and \({H_{d}}\) values. Our result is

$$\begin{aligned} C_{H}=\lambda _{\mu }\frac{{\bar{Z}}^{2\nu }}{m_{\mathrm{P}}^{2\nu }}{\left\{ \begin{array}{ll} 1&{}\text{ for }~~K=K_{11}, K_{21};\\ \left( 1+\frac{|Z|^2-kZ_{-}^4}{N_{\mathrm{c}}}\right) ^{-1}&{}\text{ for }~~K=K_{22},K_{23}.\end{array}\right. }\nonumber \\ \end{aligned}$$
(59)

Thanks to the non-vanishing \(C_{H}\), we expect that the effective coefficient \({\widetilde{B}}\mu \) in Eq. (55) assumes a non-vanishing value which may be found by applying the relevant formula in Refs. [1, 34]. In particular, we obtain

$$\begin{aligned} \frac{{\widetilde{B}}\mu }{m_{3/2}^2}= \lambda _{\mu }\left( \frac{4\nu ^2}{3}\right) ^\nu \cdot {\left\{ \begin{array}{ll}4(\nu -1)&{}\text{ for }~~K=K_{11}; \\ 3(6\nu -5)/2\nu \langle {\mathrm{g}}\rangle &{}\text{ for }~~K=K_{21};\\ 9(1-2\nu )/8\nu ^2\langle {\mathrm{g}}\rangle &{}\text{ for }~~K=K_{22}; \\ 3(2-3\nu )/\nu &{}\text{ for }~~K=K_{23}, \end{array}\right. }\nonumber \\ \end{aligned}$$
(60)

where we take into account the following:

(a) For \(K=K_{11}\) and \(K_{21}\), \(C_{H}\) in Eq. (59) is only \({\bar{Z}}\) dependent and so we have

$$\begin{aligned} \left\langle {C_{H}} \right\rangle =\left\langle {\partial _{\bar{Z}}C_{H}} \right\rangle /\sqrt{3}=\lambda _{\mu }\left( 4\nu ^2/3\right) ^\nu \,.\end{aligned}$$
(61a)

As regards \({\widetilde{K}}\), this is trivial for \(K=K_{11}\), whereas for \(K=K_{21}\), its derivatives w.r.t Z can be computed with the aid of Eq. (52b). Note that for \(K=K_{11}\) and \(\nu =1\) we obtain \({\widetilde{B}}\mu =0\) and so no \(\mu \) term arises – cf. Ref. [1]. This result though is not generic.

(b) For \(K=K_{22}\) and \(K_{23}\), \(C_{H}\) in Eq. (59) is both Z and \({\bar{Z}}\) dependent and so the results in Eq. (60) can be reproduced making use of the following identities

$$\begin{aligned} \left\langle {C_{H}} \right\rangle&=-\frac{\sqrt{3}N_{\mathrm{c}}}{2\nu \sqrt{\left\langle \mathrm{g} \right\rangle }}\left\langle {\partial _ZC_{H}} \right\rangle =\lambda _{\mu }\left( \frac{4\nu ^2}{3}\right) ^\nu \left\langle \mathrm{g} \right\rangle ^{1/2}; \end{aligned}$$
(61b)
$$\begin{aligned} \left\langle {\partial _{\bar{Z}}C_{H}} \right\rangle&=\left\langle {C_{H}} \right\rangle \left( \sqrt{3}/\left\langle \mathrm{g} \right\rangle ^{1/2}-2\nu /\sqrt{3}N_{\mathrm{c}}\right) ; \end{aligned}$$
(61c)
$$\begin{aligned} \left\langle {\partial _{\bar{Z}}\partial _ZC_{H}} \right\rangle&=\frac{1}{N_{\mathrm{c}}}\left\langle {C_{H}} \right\rangle \left\langle \mathrm{g} \right\rangle ^{1/2}\left( \frac{8\nu ^2}{3N_{\mathrm{c}}}\left\langle \mathrm{g} \right\rangle ^{1/2}-(2\nu +1)\right) .\end{aligned}$$
(61d)

The derivatives of \({\widetilde{K}}\) w.r.t Z do not vanish only for \(K=K_{22}\) and are computed by employing Eq. (52b), as previously.

Fig. 6
figure 6

The ratios \({\widetilde{B}}\mu /\lambda _{\mu }m_{3/2}^2\) for \(K=K_{11}, K_{22}\) and \(K_{23}\) (circles, squares and stars respectively) versus \(N_{\mathrm{c}}\). We take \(N_{\mathrm{c}}<0\) (gray symbols) or \(N_{\mathrm{c}}>0\) and \(\nu >0\) (black symbols) or \(N_{\mathrm{c}}>0\) and \(\nu <0\) (light gray symbols)

The magnitude of the derived \({\widetilde{B}}\mu \)’s above w.r.t \(m_{3/2}^2\) is demonstrated in Fig. 6, where we present the ratios \({\widetilde{B}}\mu /\lambda _{\mu }m_{3/2}^2\) for \(K=K_{11}\) (circles), \(K_{22}\) (squares) and \(K_{23}\) (stars) versus \(N_{\mathrm{c}}\), for integer \(N_{\mathrm{c}}\) values and \(|N_{\mathrm{c}}|\le 10\). We take \(N_{\mathrm{c}}<0\) (gray symbols) or \(N_{\mathrm{c}}>0\) and \(\nu >0\) (black symbols) or \(N_{\mathrm{c}}>0\) and \(\nu <0\) (light gray symbols). We observe that the \({\widetilde{B}}\mu \) values, for \(\lambda _{\mu }\)’s of order unity, are comparable to \(m_{3/2}^2\) for any \(\nu \) in the two first cases, whereas for \(N_{\mathrm{c}}>0\) and \(\nu <0\) these decrease as \(N_{\mathrm{c}}\) (and \(|\nu |\)) increase. In his context, therefore, besides the much heavier than \(m_{3/2}\) sfermion spectrum – see Fig. 5 – we may obtain \(\mu \ll m_{3/2}\). The latter result is welcome, since it enhances the electroweak naturalness [39] of MSSM.

4 Conclusions

We completed our approach, initiated in Ref. [1], to the problem SUSY breaking with a mildly violated R symmetry. Our setting is relied on the super- and Kähler potentials given in Eqs. (1) and (2) respectively. We revealed that the solution \((\nu , N)=(1,-4)\) presented in Ref. [1] is just one from a class of solutions obtained by considering various values for the R character of the goldstino superfield, Z, and letting free the geometry of the internal space. In particular, we found “magic” pairs \((\nu , N)=(\nu , N_{\mathrm{c}})\), with \(N_{\mathrm{c}}\) satisfying Eq. (15) and \(\nu \) in the domains of Eq. (24), which allow for a naturally vanishing cosmological constant. Specific examples are presented in Table 1 and in Figs. 2 and 3 for the cases of SU(1, 1)/U(1) and SU(2)/U(1) Kähler manifolds.

The presence of the cosmologically dangerous R axion in the spectrum of the model can be eluded switching on a quartic term in the Kähler potential – i.e., setting \(k\ne 0\) in Eq. (2) – which mildly breaks the R symmetry without modifying the SUGRA potential, along its real direction, and the position of the Minkowski vacua in Eq. (26). The mass hierarchy between the R saxion and axion and \(\widetilde{G}\) is given in Eqs. (42a) and (42b) and deviates from the one found in Ref. [1]. Our scheme not only provides non-vanishing SSB (i.e., soft SUSY-breaking) parameters, independent from k, but also offers an explanation of the \(\mu \) problem of MSSM inspired by the Giudice–Masiero mechanism. Broadening the outputs of Ref. [1] – according to which the SSB and \(\mu \) parameters turn out to be of the order of \(m_{3/2}\) – here we specify patterns of SUSY breaking which lead to values for the aforementioned parameters much larger or lower than \(m_{3/2}\).

The crucial difference of our proposal compared to the no-scale models [2, 3] is that here the SUSY breaking vacuum is obtained for a specific value of Z, whereas in those models the minimization of the SUGRA potential occurs along a Z-flat direction, i.e., for any values of Z. As a consequence, \(\widetilde{G}\) mass varies with Z remaining thereby undetermined, whereas it depends explicitly on \(\left\langle {z} \right\rangle \) and the fundamental parameters \(\nu \) and m within our models. In the majority of the no-scale models the SSB masses turn out to be zero contrary to what happens in our setting.

Unfortunately, our scheme does not support viable inflationary solutions driven by Z for any of the \((\nu ,N_{\mathrm{c}})\) values examined. Namely, the distinct values of \(\nu \) and \(N_{\mathrm{c}}\), entailed by Eqs. (15) or (16), can not be reconciled with the current inflationary data. However, the hidden sector analyzed in our work may coexist with an inflationary one provided that both sectors respect the same R symmetry – for R-symmetry-compatible inflationary models see, e.g., Refs. [6, 59,60,61,62]. The variety of the R charges of Z allowed within our framework is expected to facilitate the “marriage” of both sectors. For some recent attempts towards this direction see Refs. [4,5,6,7,8,9,10,11,12,13,14,15,16,17,18,19].

Finally, it would be interesting to investigate the low-energy consequences of the relations between the SSB mass parameters which are usually imposed at a high scale as boundary conditions for the evolution of the relevant renormalization group equations. To implement this program we have to specify the SSB masses for gauginos besides those found in Sect. 3.1 for the scalars. As explained in Ref. [1], the safest option is the assumption that the gauginos acquire SSB masses from gauge anomalies so that the R-symmetry violation is conserved mild enough. This conclusion remains intact from the modifications made in the present work. The derived SUSY spectrum and the relevant cosmo-phenomenological constraints could assist us to constrain further the parameters of our models and help us to distinguish which of them is the most compelling.