1 Introduction

The mass of the Higgs boson [1, 2] at around 126 GeV is near the upper limit of the predictions in commonly studied models of the supersymmetric standard model such as the constrained minimal supersymmetric standard model (CMSSM) [313].Footnote 1 This rather large Higgs boson mass and the null results of the sparticle searches at the LHC [1722] seem to hint to rather heavy sparticles [2334], and possibly heavy top squarks with masses in the range of tens to hundreds of TeV.

Among the models with heavy sparticles, models with a mild hierarchy between sfermion and gaugino masses such as pure gravity mediation (PGM) [3541] and models with strongly stabilized moduli [4249] are very successful not only phenomenologically but also cosmologically, and they have spectra which are characteristic of split supersymmetry [5054] with anomaly mediation [5560]. In these models, the sfermions obtain tree-level masses of the order of the gravitino mass, \(m_{3/2}\), while the gaugino masses are dominated by one-loop masses from anomaly mediation [5560].

In the original version of PGM [35, 36], it was assumed that the supersymmetric Higgs mixing term, the \(\mu \)-term, is generated via the tree-level couplings to an \(R\)-symmetry breaking sector which generates the non-vanishing vacuum expectation value of the superpotential [6163]. In this case, the \(\mu \) term and the supersymmetry breaking bilinear \(B\) term are two independent parameters and are both of the order of the gravitino mass, \(m_{3/2}\). In previous papers, we showed that models based on PGM with [39] and without [40] scalar mass universality, could explain virtually all experimental constraints with successful radiative electroweak symmetry breaking (EWSB). In this case, one can impose the supergravity boundary condition \(B_0 = A_0 - m_{3/2}\) which is essentially \(-m_{3/2}\) since \(A_0\) is also determined by anomalies and hence small compared with \(m_{3/2}\). The universal scalar mass, \(m_0\) is fixed by \(m_{3/2}\). The \(\mu \) term is determined by the minimization of the Higgs potential. The ratio of the Higgs vacuum expectation values (VEVs) can be treated as a free parameter so long as a Giudice–Masiero coupling, \(c_H\), is included in the Kähler potential, \(K\). The value of \(c_H \lesssim 1\) is then also fixed by the minimization of the Higgs potential.

In this paper, we discuss another version of PGM, in which the \(\mu \)-term originates from the breaking of a Peccei–Quinn (PQ) symmetry [64] via a dimension five operator [65, 66]. The size of the \(\mu \)-term is determined by the PQ-breaking scale, \(f_{ PQ}\), and hence it can be related to the axion dark matter density. In practice, the \(\mu \) term and \(f_{ PQ}\) are then determined by the EWSB boundary conditions at the weak or SUSY scale. As in [39], the \(B\)-term is fixed to \(-m_{3/2}\) at the input UV scale. However, because the Higgs fields carry PQ charge in this model, a Giudice–Masiero coupling is not allowed in \(K\), and therefore some departure of scalar mass universality is required [40]. With \(c_H = 0\), \(\tan \beta \) must also be determined by the EWSB boundary conditions at the weak or SUSY scale. We will explore a three-parameter version of the PGM where the three free parameters are chosen to be the gravitino mass, \(m_{3/2}\), and the two soft Higgs masses, \(m_1\) and \(m_2\).

We show further that successful phenomenological models can be constructed if only the soft mass of the up-type Higgs, \(m_2\), is non-universal. Thus the family of PQ-symmetric PGM models can be expressed in terms of only two parameters, \(m_{3/2}\) and \(m_2\). In fact, viable solutions are possible with \(m_1 = m_{3/2}\) and \(m_2=0\). Thus, if the up-type Higgs multiplet is associated with a pseudo Nambu–Goldstone boson [6769] or if the Higgs soft mass is protected by a (partial) no-scale structure [70, 71] of the Kähler potential as discussed in [40], we are reduced to a theory with one single free parameter! Finally we will show that an alternative model for breaking the PQ-symmetry based on [72] may allow full universality to be restored.

As in previous studies of PGM, we also find that the lightest supersymmetric particle (LSP) is the neutral wino. In the \(R\)-parity conserving case, the model allows several dark matter scenarios, including one with pure wino dark matter with possible non-thermal sources such as the late time decay of the gravitino. As we will see, even if the wino contribution to the relic density is negligible (as is the case for the thermal contribution when \(\mu >0\)), it is possible that axions make up the entire dark matter, or of course it is also possible that the dark matter is an axion–wino mixture. In the scenario where a significant portion of the dark matter is the axion, the wino can be much lighter than the 3 TeV as required by thermal dark matter constraints. In fact, it could have a mass below 1 TeV and fall within the reach of the LHC at 14 TeV [38]. While it is well known that models with axion dark matter generally overproduce isocurvature fluctuations [7378], it is also known that the problem may be resolved if the axion decay constant takes on values during inflation which are large compared to the nominal low-energy value [79]. We show several ways, this can be implemented in the models presented.

The organization of the paper is as follows. In Sect. 2, we summarize PGM with universal and non-universal Higgs masses. We then generalize the model to include a PQ-symmetry in which the \(\mu \)-term originates from PQ-symmetry breaking. In Sect. 3, we briefly review the properties of axion dark matter. There, we also propose several solutions to the problem of isocurvature fluctuations including a novel model of dynamical PQ-breaking. In Sect. 4, we demonstrate that this version of PGM achieves successful EWSB. Here, we will consider the case with two non-universal Higgs soft masses, that is, the three-parameter version of the model. We will see, however, that the down-type Higgs soft mass may remain universal, and successful models are still obtained. We will further demonstrate that even in the special case where \(m_2 = 0\), viable models are possible so long as \(300~\hbox {TeV}\lesssim m_{3/2} \lesssim 850~\hbox {TeV}\). Before concluding, we will briefly describe in Sect. 5 an alternative model for breaking the PQ-symmetry in which full universality may be restored. The final section is devoted to discussions.

2 \(\mu \)-term from \(P\!Q\)-symmetry breaking

2.1 Pure gravity mediation

In pure gravity mediation models, it is assumed that all fields in the supersymmetry (SUSY) breaking sector are charged under some symmetry (e.g. an R-symmetry). Under this assumption, the gaugino masses and the \(A\)-terms of the chiral multiplets are suppressed at the tree level in supergravity and they are dominated by the anomaly-mediated contributions [5560]. The soft squared masses of the scalar bosons are, on the other hand, generated at the tree level and are expected to be of the order \(m_{3/2}^2\) with a generic Kähler potential. If we optionally assume a flat Kähler manifold for all the MSSM fields, all of the MSSM scalar fields obtain the universal soft mass squared equal to \(m_{3/2}^2\). In our analysis, we take the model with universal scalar masses as our starting point. We will assume universality at the grand unified (GUT) scale and run all quantities down to the weak scale through 2-loop RGEs and minimize the Higgs potential. As in the CMSSM, minimization allows us to solve for \(\mu \) and the bilinear supersymmetry breaking \(B\)-term. As we will see in our later analysis, we will be required to consider a slightly relaxed assumption where the Kähler manifold is flat for all the MSSM fields except for the Higgs doublets, which leads to non-universal Higgs soft masses (NUHM) [8095].

In the original PGM model, we assumed that the holomorphic bilinear of the two-Higgs doublets, \(H_uH_d\), has a vanishing \(R\)-charge and appears in the Kähler potential and the superpotential,

$$\begin{aligned} K|_{H_uH_d}&= c_H H_u H_d + \hbox {h.c.}, \end{aligned}$$
(1)
$$\begin{aligned} W|_{H_uH_d}&= c_H'm_{3/2} H_u H_d, \end{aligned}$$
(2)

where \(c_H\) and \(c_H'\) are \(O(1)\) coefficients [6163]. From these terms, the \(\mu \) and the \(B\)-parameters are given by

$$\begin{aligned} \mu&= c_H m_{3/2} + c_H' m_{3/2}, \end{aligned}$$
(3)
$$\begin{aligned} B\mu&= 2 c_H m_{3/2}^2 - c_H' m_{3/2}^2. \end{aligned}$$
(4)

In minimal supergravity, we would take \(c_H = 0\), so that \(\mu = c_H' m_{3/2}\) and \(B = -m_0\), neglecting the small contribution from anomaly-mediated \(A\)-terms. Minimization of the Higgs potential in this case allows one to solve for \(\mu \) (\(c_H'\)) and \(\tan \beta \). Inclusion of \(c_H\) allows one to keep \(\tan \beta \) as a free parameter solving instead for \(\mu \) (\(c_H'\)) and \(c_H\) [96]. Thus, there are two independent parameters, which can be chosen as \(m_{3/2}\) and \(\tan \beta \).Footnote 2

2.2 PQ-symmetric PGM

Now, let us move on to the PQ-symmetric PGM model where the \(\mu \)-term is generated by the breaking of the PQ-symmetry. In this case, instead of the above potentials in Eqs. (1) and (2), the \(\mu \)-term is generated via,

$$\begin{aligned} W|_{H_uH_d} = k \frac{P^2}{M_P} H_u H_d, \end{aligned}$$
(6)

where \(k\) denotes a dimensionless constant and \(P\) is a PQ-symmetry breaking field with PQ-charge \(+1\). \(M_P = 2.4 \times 10^{18}\) GeV refers to the reduced Planck scale. We assumed that the holomorphic bilinear \(H_uH_d\) has a PQ charge of \(-2\) and the charges of other MSSM matter fields are appropriately assigned. As a result, the Kähler term in Eq. (1) is not allowed.

From this potential, we obtain the \(\mu \)- and the \(B\)-parameters,

$$\begin{aligned} \mu&= k \frac{\langle {P}\rangle ^2}{M_P}, \end{aligned}$$
(7)
$$\begin{aligned} B\mu&= - m_{3/2}\mu . \end{aligned}$$
(8)

Therefore, the \(\mu \)-parameter is determined by the PQ-breaking scale and, for example, \(\mu ={\mathcal O}(100)\) TeV is realized for \(\langle P \rangle = {\mathcal O}(10^{12})\) GeV. It should be noted that \(\mu \) cannot be much larger than the gravitino mass for successful EWSB.Footnote 3 The \(B\)-parameter is, on the other hand, fixed to \(-m_{3/2}\) (at the GUT scale), which should be contrasted to the above original PGM where the \(B\)-parameter (through \(c_H\)) is an independent parameter which for convenience can be exchanged with \(\tan \beta \) as described above. As we will discuss later, this restricted parameter set conflicts with full scalar mass universality.

In summary, the PQ-symmetric PGM model is more restricted than the original PGM model and has effectively only one parameter:

$$\begin{aligned} m_{3/2} \end{aligned}$$
(9)

with full scalar mass universality. Since \(B\) is fixed, the EWSB conditions amount to solving for \(\tan \beta \) and the constant \(k\) in Eq. (6), which has no solution. Here, we have traded the size of \(\mu \)-term with the \(Z\)-boson mass, fixed to \(m_Z \simeq 91.2\) GeV. Even with the NUHM2 [91, 92], the model has only three parameters:

$$\begin{aligned} m_{3/2}, \quad m_1, \quad m_2, \end{aligned}$$
(10)

where \(m_{1,2}\) denote the soft masses of the two Higgs doublets, \(H_d, H_u\). However, as we shall see, viable solutions are possible with non-universality extended only to the up-type Higgs, i.e., we will be able to keep \(m_1 = m_{3/2}\). Furthermore, we will also see that the special case of \(m_2 = 0\) also yields acceptable solutions, so if \(H_2\) originates as a pseudo Nambu–Goldstone boson, or \(H_2\) is part of a no-scale structure in \(K\), we are again reduced to a one-parameter theory.

3 Supersymmetric axion model

3.1 Brief review of the PQ-breaking model

Before going on to study the parameter space for the PQ-symmetric PGM model, let us discuss the supersymmetric axion model [97] in a little more detail. As a simple example, we may take the following model of spontaneous PQ-symmetry breaking:

$$\begin{aligned} W_\mathrm{PQ} = \lambda X(PQ-v_{ PQ}^2), \end{aligned}$$
(11)

where \(\lambda \) is a dimensionless coupling and \(v_{ PQ}\) a dimensionful parameter. The superfields \(X\), \(P\), and \(Q\) have PQ-charges, \(0\), \(+1\), and \(-1\), respectively, with vacuum values,

$$\begin{aligned} \langle P \rangle = \langle Q \rangle \simeq v_{ PQ}, \quad \langle X \rangle \simeq m_{3/2}/\lambda , \end{aligned}$$
(12)

so that the PQ-symmetry is spontaneously broken,Footnote 4 and we are left with an axion, saxion, and axino at an energy scale much lower than \(v_{ PQ}\).

Due to SUSY breaking effects, the saxion and the axino obtain masses of the order of the gravitino mass, and hence they decay quickly to a pair of the gluinos and a gluino/gluon pair, respectively, and cause no cosmological problems. The axion, on the other hand, remains very light and has a very long lifetime, and it can be a good dark matter candidate with a relic density [100],

$$\begin{aligned} \Omega _ah^2 = 0.18\left( \frac{F_{ PQ}}{10^{12}~\mathrm{\mathrm GeV}}\right) ^{1.19}\left( \frac{\Lambda }{400~\mathrm{\mathrm MeV}}\right) \end{aligned}$$
(13)

where \(\Lambda \) denotes the QCD scale and the decay constant \(F_{ PQ}\) is defined by \(v_{ PQ}\) and determined by a domain wall number \(N_\mathrm{DW} = 6\) in this case. We assumed the initial axion amplitude \(a \simeq F_{ PQ}\). Thus, the axion can be the dominant dark matter component for \(F_{ PQ}\simeq 7 \times 10^{11}\) GeV, assuming that the mis-alignment angle is of the order of \(\pi \).

It should be noted that axion dark matter models are suspect to problems with domain walls and isocurvature fluctuations. The former problem is, however, solved relatively easily if the PQ-symmetry is broken before the primordial inflation starts and is never restored after the end of inflation. In this case, the domain walls are not formed after inflation, and hence there is no domain wall problem.

The latter problem, the overproduction of isocurvature fluctuations [7378], on the other hand, puts a severe constraint on the Hubble scale during inflation. In fact, in order to suppress the isocurvature mode in the axion dark matter enough to be consistent with the constraint set by CMB observations [101], the Hubble scale during inflation is required to be rather small (i.e. \(H \lesssim 2 \times 10^7\) GeV), which is much smaller than the one in more conventional models of inflation, where \(H\sim 10^{13}\) GeV.

It is, however, possible to relax the constraint on \(H\), if the axion decay constant, \(F_{ PQ}\), were larger than its low-energy value during inflation [79]. Roughly, we would require \(H/F_{ PQ} \lesssim 3 \times 10^{-5}\) to resolve the problem, and hence \(F_{ PQ} \gtrsim 3 \times 10^{17}\) GeV during inflation for \(H\sim 10^{13}\) GeV. Such a large PQ-breaking scale during inflation can be achieved, for example, when \(P\) in the above model picks up a negative mass squared contribution along the \(PQ\) flat direction, in a similar way that Affleck–Dine fields [102] pick up large VEVs to generate a baryon asymmetry (for a concrete model in this context see [103]).

The PQ-breaking scale can also be large during inflation when the radial component of the PQ-breaking field, \(\sigma \equiv P=Q\) flows into the so-called attractor solution during inflation [79]. In this case, the PQ-breaking field scales as \(\lambda ^{-1/2}\) where \(\lambda \) is the coupling in Eq. (11) [104]. This may require couplings as small as \(10^{-8}\) to \(10^{-12}\) depending on the particular model of inflation. Because the amplitude of the \(P\) and \(Q\) fields are large during inflation, it is possible that subsequent large oscillations could pass through \(P=Q=0\) which would amount to the restoration of the PQ-symmetry and could lead to domain wall formation. To determine if domain walls form, a detailed analysis of the relaxation of the \(P\) and \(Q\) fields is needed. It is important to note that the amount of relaxation depends on the model of inflation. For inflation models quadratic in the inflaton, the PQ fields relax too slowly and domain walls are formed; see [105].

It is also possible to relax this problem by considering a Giudice–Masiero-like term involving the PQ fields in the Kähler potential,

$$\begin{aligned} K\supset -c_{ PQ}\left( PQ+P^\dagger Q^\dagger \right) . \end{aligned}$$
(14)

Since the product of \(P\) and \(Q\) have \(R\) and \(PQ\) charge zero, this additional term cannot be forbidden. During inflation, the Kähler potential now gives a correction to the scalar potential

$$\begin{aligned} \Delta V&= -c_IH^2c_{ PQ}\left( PQ+P^\dagger Q^\dagger \right) \nonumber \\&+c_IH^2\left( |P|^2+|Q|^2\right) , \end{aligned}$$
(15)

where \(c_I \approx 3\) and \(H\) is Hubble’s constant. Adding this to the scalar potential for the \(PQ\) fields from the superpotential, we find

$$\begin{aligned} V&= |\lambda |^2\left| PQ-\frac{c_{ PQ}c_I}{|\lambda |^2}H^2-\Lambda ^2\right| ^2\nonumber \\&+c_IH^2\left( |P|^2+|Q|^2\right) +V_0, \end{aligned}$$
(16)

where \(V_0\) keeps track of the constant pieces in the potential. To get PQ-breaking during inflation, we need \(c_{ PQ}>1\). During inflation, the new minimum is around

$$\begin{aligned} P=Q \sim \left( \frac{c_{ PQ}c_I}{|\lambda |^2}\right) ^{1/2} H. \end{aligned}$$
(17)

To sufficiently suppress the isocurvature perturbations, we need a somewhat small coupling,

$$\begin{aligned} \frac{|\lambda |}{\sqrt{c_{ PQ}c_I}}\lesssim 3 \times 10^{-5}, \end{aligned}$$
(18)

which is effectively the constraint of \(\lambda \lesssim 10^{-4}\).

In this scenario, after inflation the \(P\) and \(Q\) fields will relax back to the true minimum set by \(\Lambda \). The masses of the \(P\) and \(Q\) fields during inflation are of the order of \(\sqrt{(c_{ PQ}-1)c_I}H\). Therefore, during their relaxation back to the true minimum, \(P\) and \(Q\) will track the minimum (so long as \(c_{ PQ}\) is not very close to 1) and no domain walls will be formed, that is, the PQ-symmetry remains broken.

3.2 Dynamical PQ-symmetry breaking

We may also consider an alternative mechanism for realizing a large PQ-breaking scale during inflation. If PQ-breaking scale is generated dynamically it is possible to get this dynamical scale large during inflation. To illustrate this mechanism, let us consider a supersymmetric \(SU(2)\) gauge theory with four fundamental chiral fields, \(P_{1,2}\) and \(Q_{1,2}\), and four singlet fields \(S_{ij}\, (i,j=1\cdots 2)\). The superfields \(P\), \(Q\), and \(S\) have PQ-charges \(+1\), \(-1\), and \(0\), respectively, and they are coupled through the superpotential,

$$\begin{aligned} W=\sum _{i,j=1,2}\lambda S_{ij}P_{i}Q_{j}, \end{aligned}$$
(19)

where \(\lambda \) is again a dimensionless coupling. Below the dynamical scale of \(SU(2)\), \(\Lambda _{ PQ}\), the model can be described by the six composite mesons,

$$\begin{aligned}&M_{PP} \simeq P_1P_2/\Lambda _{ PQ}, \quad M_{QQ} \simeq Q_1Q_2/\Lambda _{ PQ}, \quad \nonumber \\&\quad {M_{ PQ}}_{ij} \simeq P_iQ_j/\Lambda _{ PQ}, \end{aligned}$$
(20)

whose effective superpotential terms are roughly given by

$$\begin{aligned} W_\mathrm{eff} \simeq \sum _{ij} \lambda \Lambda _{ PQ} S_{ij} {M_{ PQ}}_{ij} + X(\mathrm{Pf}(M) - \Lambda _{ PQ}^2). \end{aligned}$$
(21)

Here, \(X\) denotes the Lagrange multiplier to impose the quantum deformed moduli constraint on the mesons [106],

$$\begin{aligned} \mathrm{Pf}(M) = M_{PP} M_{QQ} + \frac{1}{2} \sum _{}\epsilon ^{ij}\epsilon ^{kl} {M_{ PQ}}_{ik}{M_{ PQ}}_{jl} = \Lambda _{ PQ}^2. \end{aligned}$$

By noting that the \(M_{ PQ}\) obtain masses of \({\mathcal O}(\lambda \Lambda _{ PQ})\) from the first term in Eq. (21), we find that the PQ-symmetry is spontaneously broken by

$$\begin{aligned} \langle M_{PP}\rangle = \langle M_{QQ}\rangle \simeq \Lambda _{ PQ}, \quad \langle X\rangle \simeq m_{3/2}, \quad \langle M_{ PQ}\rangle =0,\nonumber \\ \end{aligned}$$
(22)

Therefore, this model is a dynamical realization of the previous model defined in Eq. (11).Footnote 5

Now let us assume that the gauge coupling constant of \(SU(2)\) is given by a VEV of a singlet field \(\phi \) through the gauge kinetic function, \(f = (1/g_0^2 + \phi ^2)\), assuming \(g_0 = 4 \pi \) and \(\langle \phi \rangle =O(1) \) at the Planck scale. Then the effective gauge coupling at the Planck scale is perturbative, \(1/g^2=(1/g_0^2 + \langle \phi \rangle ^2)\), with \(\langle \phi \rangle \) chosen so that the condensation scale is \(\Lambda _{ PQ} \simeq 10^{12}\) GeV. However, if \(\phi \) gets a large positive mass during inflation, the VEV of \(\phi \) will be suppressed, \(\langle \phi \rangle \ll 1\). Since the VEV independent part of the gauge coupling is already strong at the Planck scale, we get \(\Lambda _{ PQ}=M_P\) during inflation. In this way, we can easily realize a large PQ-breaking scale during inflation, and hence the isocurvature fluctuations in axions can be suppressed. \(\phi \) can be given a mass by adding the following interactions:

$$\begin{aligned} W=\lambda ^\prime Y\left( \phi ^2-M_P^2\right) . \end{aligned}$$
(23)

At the minimum of the potential, \(\phi \) has a Planck scale VEV which generates a mass for \(\phi \) of the order of \(\lambda ^\prime M_P\), avoiding a moduli problem.Footnote 6

4 Results

We are now in a position to explore the necessary parameter space for the PQ-symmetric PGM model. We begin with the three-parameter version of the model defined by the gravitino mass, and the two soft Higgs masses. All other scalars are assumed to be universal at the GUT scale (the renormalization scale where the two electroweak couplings are equal). All masses and couplings are run down to the weak scale, where the Higgs potential is minimized, thus determining \(\mu \) (or \(k\) in this context) and \(\tan \beta \). Gaugino masses and \(A\)-terms assume their anomaly-mediated values and \(B_0 = -m_{3/2}\). As we have seen previously [40], PGM solutions with \(c_H = 0\) are possible so long as we allow the Higgs soft masses to depart from universality.

In Fig. 1, we show examples of the \(m_1, m_2\) plane for fixed values of \(m_{3/2} = 60, 150, 300\), and 400 TeV. The red dot-dashed curves show contours of the light Higgs mass, \(m_h\) from 122–130 GeV in 1 GeV intervals. The region with 124 GeV \(< m_h<\) 128 GeV is shaded green. In all cases, we have assumed the supergravity boundary condition of \(B_0 = -m_{3/2}\) and \(c_H = 0\), and calculate \(\mu \) and \(\tan \beta \). The solid black contours show \(\mu /m_{3/2}\) and as one can see, this ratio is close to one over much of the displayed planes. The exception occurs when \(m_2^2\) is large and positive causing \(\mu \) to become small. In the figure, when \(m_{3/2} = 60\) TeV, the region shaded blue at the top right of the figure has \(\mu ^2 <0\) indicating the lack of an EWSB solution. The region at low and negative \(m_1^2\) is also excluded as there the Higgs pseudo-scalar mass, \(m_A^2 < 0\). Note the sign of the soft Higgs masses in the figure refers to the sign of the mass squared. The gray dotted curves show the calculated values of \(\tan \beta \) which are typically around 4–5 when \(m_{3/2} = 60\) TeV, and which are closer to 2 at larger \(m_{3/2}\).

Fig. 1
figure 1

The \((m_1, m_2)\) plane for fixed \(m_{3/2}=60,\) 150, 300, and 400 TeV. Shown are the contours for the light Higgs mass, \(m_h\) (red, dot dashed) from 122–130 GeV in 1 GeV intervals. The region with a Higgs mass between 124 and 128 GeV is shaded green. The gray dotted curves show the calculated values of \(\tan \beta \). The range spans 2–15 when \(m_{3/2} = 60\) TeV, and 2–3 when \(m_{3/2} = 400\) TeV. Also shown are contours of \(\mu /m_{3/2}\) (solid, black) which are typically close to 1. The blue shaded regions (when shown) correspond to regions where no EWSB is possible

As one can see all of the viable solutions displayed in the figures require some degree of non-universality. In each case displayed, forcing Higgs mass universality would either require a value of \(m_1\) too small corresponding to a light Higgs with mass \({<}124\) GeV, or a value of \(m_2\) too large to allow EWSB solutions. As one can see, \(m_1\) can be made universal, when the gravitino mass is \({\gtrsim }300\) TeV. For \(m_{3/2} = 300\) TeV as shown in the lower left panel, \(m_1 = m_{3/2}\) is very close to the \(m_h = 124\) GeV contour when \(m_2\) is relatively small. When \(m_{3/2} = 400\) TeV as shown in the lower right panel, There is no problem is obtaining a suitable Higgs mass. But for \(m_2 \gtrsim 300\) TeV, \(\mu ^2\) quickly runs negative and we lose the ability to satisfy the EWSB boundary conditions. At smaller, \(m_2\) the results are in fact quite insensitive to the particular value of \(m_2\). Therefore, in the following, we will set \(m_2 = 0\).

The vanishing of the up-type Higgs soft mass at the universality scale can be explained [40] if either this Higgs field was part of a no-scale structure [70, 71] of the Kähler potential as in

$$\begin{aligned} K = y y^* - 3 \log \left( 1 - \frac{1}{3}(H_2 H_2^*+ K^{(Z)}) \right) + \log |W|^2\nonumber \\ \end{aligned}$$
(24)

where \(Z\) is the field(s) which breaks supersymmetry, and \(y\) represents all other fields including \(H_1\). The resulting soft masses for the Higgs doublets in this case is \(m_2^2 = 0\). Alternatively, it is possible that the up-type doublet appears as a Nambu–Goldstone boson described by the coset space, \(U(3)/SU(2)\times U(1)\) [6769].

Having fixed \(m_2 = 0\), it is possible to display the parameter space on a single two-dimensional \(m_1, m_{3/2}\) plane, as in Fig. 2. As one can see, for low(er) values of \(m_{3/2}\), the value of \(m_1\) needed to obtain \(m_h\) between 124 and 128 GeV (shown as the shaded green region) requires non-universality in the Higgs soft masses and \(m_1 > m_{3/2}\). When \(m_{3/2} \gtrsim 300\) TeV, solutions with \(m_1 = m_{3/2}\) become possible. In all cases, we find \(2 \lesssim \tan \beta \lesssim 4\).

Fig. 2
figure 2

The \((m_1, m_{3/2})\) plane for fixed \(m_{2}= 0\). Shown are the contours for the light Higgs mass, \(m_h\) (red, dot dashed) from 122–132 GeV in 4 GeV intervals. The region with a Higgs mass between 124 and 128 GeV is shaded green. The gray dotted curves show the calculated values of \(\tan \beta \). The black solid line shows the down-type universal case, where \(m_1 = m_{3/2}\). The blue shaded regions (when shown) correspond to regions where no EWSB is possible

We can go further and insist on universality of the down-type Higgs soft mass. Results for this restrictive case are shown in Fig. 3 where we plot \(\mu /m_{3/2}\), \(\tan \beta \) (left) and \(m_h\) (right) as a function of \(m_1 = m_{3/2}\) for \(m_2 = 0\). As one can see, in this case, for a wide range of values of \(m_{3/2}\), \(\mu /m_{3/2} \simeq 1\) and \(\tan \beta \simeq 2.2\). However, in order to obtain \(m_h > 124\) GeV, we need 300 TeV \(\lesssim m_{3/2} \lesssim 850 \) TeV.

Fig. 3
figure 3

Solutions for \(\mu /m_{3/2}\) and \(\tan \beta \) (left) and for \(m_h\) (right) in the one-parameter version of the PQ-symmetric PGM model. Here \(m_1 = m_{3/2}\) and \(m_2 = 0\)

5 Universality with PQ-symmetry breaking

By generalizing the model for PQ-symmetry breaking, it may be possible to restore full scalar mass universality. The particular model we consider was first presented in [72] and relates the PQ scale with see-saw scale for generating neutrino masses. The superpotential for this model is

$$\begin{aligned} W_{ PQ}= \frac{f}{M_P}P^3Q+\frac{g}{M_P}H_u H_d PQ+\frac{1}{2}h_{ij}PN_iN_j, \end{aligned}$$
(25)

where \(P,Q\) are the fields responsible for breaking the PQ-symmetry with charges \((-1,3)\), \(N_i\) are the right handed neutrinos with PQ charge \(1/2\), and \(H_{u,d}\) have PQ charge \(-1\). In this model the right handed neutrino masses are generated by PQ-symmetry breaking, which is of the order of \(10^{12}\) GeV as discussed above. In pure gravity mediation models, these fields will also get supersymmetry breaking parameters. The relevant supersymmetry breaking soft masses and \(A\)-terms are

(26)

with

$$\begin{aligned}&m_Q^2(\mu _\mathrm{GUT})=m_P^2(\mu _\mathrm{GUT})=m_N^2(\mu _\mathrm{GUT})=m_{3/2}^2 \quad \nonumber \\&\quad m_A(\mu _\mathrm{GUT})=m_{3/2}. \end{aligned}$$
(27)

The RG equations for these soft masses are dominated by the couplings \(h_{ij}\). In fact, if the number of neutrinos is large enough and the \(h_{ij}\) are large enough, \(m_P^2\) will be driven negative and the PQ-symmetry is broken. The VEVs of \(\langle P \rangle \) and \(\langle Q \rangle \) will effectively be free parameters determined by \(h_{ij}\) and \(f\). Breaking the PQ-symmetry in this manner will generate independent VEVs for \(\langle Q\rangle \) and \(\langle P\rangle \). Because both \(\langle P\rangle \) and \(\langle Q \rangle \) are none zero, the \(F\)-terms for \(P,Q\) will also be non-zero and independent. With this set up, \(\mu \) and \(B\mu \) are linearly independent as in Eqs. (3) and (4) and in contrast to the case considered in Eqs. (7) and (8). This additional freedom in \(\mu \) and \(B\mu \) makes universal soft mass for the MSSM fields possible. For this scenario, the parameter space, defined by \(m_{3/2}\) and \(\tan \beta \), will be identical to that considered in [39].

There is another way to get universality for the scalar masses with the PQ-symmetry breaking. If we again include the Giudice–Masiero mixing term for the \(P\) and \(Q\) in the Kähler potential, the relationship for \(B\) of the Higgs bosons changes to

$$\begin{aligned} B\mu =-m_{3/2}\left( 1-2c_{ PQ}\right) \mu . \end{aligned}$$
(28)

Due to the freedom in \(B\) from the \(c_{ PQ}\) term, we can independently define \(\mu \) and \(B\). This allows for solutions to the EWSB conditions to be found by solving for \(k\) and \(c_{ PQ}\) leaving \(\tan \beta \) as a free parameter.Footnote 7

6 Discussions

The relatively large Higgs mass determined at the LHC coupled with the lack of discovery of any superpartners indicates that the scale of supersymmetry must be higher than originally thought if it is realized at low energy at all. While it remains possible that the discovery of supersymmetry is around the corner at scales close to 1 TeV and well within reach on a 14 TeV LHC collider [16, 33, 34], it is also possible that the supersymmetry scale is significantly higher and sits in the range of 100–1000 TeV as expected in PGM models [3541], models with strongly stabilized moduli [4249], or in so-called models of mini-split supersymmetry [107].

In either case, we are forced to address the question regarding the scale of supersymmetry breaking. In models where the SUSY scale is upwards of 100 TeV, gaugino masses are generally generated at the 1-loop level through anomalies. In that case, the lightest supersymmetric particle is usually the wino. For \(\mu > 0\), the wino relic density is too small to account for the dark matter of the universe, but the wino might be a viable candidate when \(\mu < 0\) and the SUSY scale is of the order of 500 TeV (see e.g. [39]). However, even in that case, there are strong constraints against wino dark matter from higher-energy gamma-ray observations [109, 110]. In this context the axion becomes an attractive dark matter candidate.

In addition to the dark matter problems, most low-energy supersymmetric models suffer from the \(\mu \)-problem. Even in models where the scale of supersymmetry breaking is generated spontaneously, the \(\mu \) term, being supersymmetric, is typically put in by hand as a bilinear in the superpotential. Therefore, the dynamical generation of the \(\mu \) term, through the coupling of the MSSM Higgs doublets to Standard Model singlets with non-zero PQ charge [65, 66] offers an attractive solution to, potentially, both the \(\mu \) term and the dark matter problems.

From Eq. (13), it is clear that a PQ scale of close to \(10^{12}\) GeV is needed, if axions are to be the dominant form of dark matter in the universe. Thus if axions make up any significant component of the dark matter (say, at least 10 % of the dark matter), the \(\mu \) term is expected to be at least several TeV. If axions are the dominant form of dark matter, then the \(\mu \) term is expected to be of the order of 100 TeV or more.

In this paper, we have shown how a model of PGM can be constructed in the context of supergravity with a PQ coupling to the MSSM Higgs doublets. Because of this coupling, a Giudice–Masiero-like term in the Kähler potential is not allowed and we must deviate slightly from pure scalar mass universality. Allowing for the possibility of non-universal Higgs masses, we have shown that viable models exist with (non-Higgs) scalar mass universality which respect minimal supergravity boundary conditions for the \(B\)-term. Gaugino masses and \(A\)-terms are assumed to arise from anomaly-mediated contributions. All soft terms and couplings are run down from the universality scale (assumed to be the GUT scale), and minimization of the Higgs potential is used to determine the \(\mu \) term and \(\tan \beta \). If axions are the dominant form of dark matter, the coupling of the Higgs doublets to the PQ fields thus generates a \(\mu \) term of the order of 100 TeV, which sets the scale for supersymmetry breaking. While it is possible to find solutions with \(\mu \ll m_{3/2}\) (for example when \(m_2\) is large and \(\mu ^2\) is driven to 0), it is not possible to find solutions with \(\mu \gg m_{3/2}\). Thus fixing \(\mu \gtrsim {\mathcal O}(100)\) TeV in order to obtain a significant axion relic density forces us into the domain where supersymmetry is broken at a similarly high scale.Footnote 8

Fortuitously, it also possible to find solutions of the type just described with a Higgs mass in the range determined at the LHC. For relatively low \(m_{3/2} \lesssim 300\) TeV, we require \(m_1 \gtrsim m_{3/2}\). While for larger \(m_{3/2}\) up to 850 TeV, solutions with \(m_1 = m_{3/2}\) are possible. Our results are not particularly sensitive to \(m_2\), though \(m_2 < m_{3/2}\) is quite generic. If the up-type Higgs is a pseudo Nambu–Goldstone boson or is part of a no-scale structure so that \(m_2 = 0\), we are left with a particularly simple model with one free parameter, \(m_{3/2}\). For \(m_{3/2} > 300\) TeV, we have \(m_h > 124\) GeV, \(\mu /m_{3/2} \simeq 1\), and \(\tan \beta \simeq 2.2\).

We have also shown several mechanisms which suppress isocurvature fluctuations despite having a dominant component of axion dark matter. The simplest possibility, which may not apply here, is that described in [79] and requires only a small coupling \(\lambda \) in Eq. (11). The constraint on \(\lambda \) can be significantly relaxed if a Giudice–Masiero-like term is added to the Kähler potential as in Eq. (14). Finally, we have proposed a novel dynamical mechanism for the generation of the axion decay constant which allows \(F_{ PQ} \simeq M_P\) during inflation, and smaller values (\({\mathcal O}(10^{12})\) GeV) at low energy.

7 Note added

After this paper was submitted, BICEP2 [111] reported detection of \(B\) modes giving \(r=0.20^{+0.07}_{-0.05}\). With \(r\) this large, isocurvature perturbations are even more problematic. However, the mechanism proposed in Sect. 3 is still viable for solving the isocurvature perturbations if \(F_{ PQ}\) is close to the Planck scale during inflation. With \(F_{ PQ}\) close to the Planck scale, explicit breaking of the PQ-symmetry from gravity-induced Planck suppressed operators could be important. It is possible that these higher dimensional operators, which are negligible in the vacuum state, could give the axion a large mass during inflation, further suppressing the isocurvature perturbations.