1 Introduction

In the Minimal Supersymmetric Standard Model (MSSM) the scalar boson discovered by ATLAS and CMS [1, 2] is identified with the lightest neutral Higgs boson of the model. Its mass has been determined to be 125.10 GeV with a precision of 0.14 GeV when combining the measurements of ATLAS and CMS [3,4,5,6].

Given the precision of the measurement it is tempting to express model parameters of the MSSM as a function of this measurement, as it is customarily done for the fermion masses. This choice is also similar to e.g, the almost universally adopted convention of expressing electroweak BSM model parameters as functions of the Z-boson pole mass input, \(m_Z\), after its precise determination at LEP1 in the early 1990s [7]. This procedure was analytically nontrivial beyond tree-level as it necessarily involved the radiative corrections contributing to the Z-boson pole mass (for the state of the art see [8] and references therein). In the MSSM, the relation of the lightest Higgs boson mass \(m_h\) to the basic model parameters is even more involved when including the radiative corrections.

The inversion of relations between parameters of a model and a physical observable is facilitated by approximations which are more easily amenable to such a procedure. In the MSSM this approach has been studied in the gaugino-Higgsino sector [9,10,11] and in the Higgs sector [11, 12]. By construction the precision of the approach depends on the precision of the approximation which has to be compared to the precision of the experimental measurement. Thus the development of an algorithm to cope with the highest available precision is developed in this paper.

There is a second motivation for the replacement of the model parameter. If a full exploration of the MSSM parameter space is performed, a large fraction of the parameter sets studied will not predict a lightest neutral Higgs boson mass in agreement with the experimental measurement. Using \(m_h\) as parameter has the potential to lead to a more efficient exploration of the MSSM.

In the following the MSSM with \(m_h\) as parameter is referred to as \(m_h\)MSSM to differentiate the model from the standard MSSM. It is well known that \(m_h\) depends non-trivially and strongly on the trilinear coupling \(A_t\) as well as on the third generation squark sector soft breaking masses via its leading radiative corrections. Because of the \(A_t \leftrightarrow m_h\) connection, it is natural to choose to develop the inversion with \(A_t\) replaced by \(m_h\) as model parameter in the \(m_h\)MSSM. The study is carried out within the CP-conserving version of the MSSM where \(A_t\) is real-valued, and relies on the fixed-order loop approximations. We will comment on departure from these assumptions towards the end of the paper.

Inverting a relationship between two parameters can be performed in multiple ways. A brute force approach would be to calculate \(m_h\) as function of all MSSM parameters as well as their SM inputs, taking into account the experimental and systematic errors. This is time consuming and inefficient as in most of the parameter space the predicted \(m_h\) is too light with respect to the experimental measurement. Machine Learning algorithms are more efficient but need extensive training and validation. For each update of the calculations the full procedure of determination and validation has to be performed again. For these reasons an algorithmically simple procedure, the fixed point algorithm, is used. It has the advantage that its preparation is analytical work, i.e., an appropriate function has to be derived and the convergence criteria have to be fulfilled. The guiding principle behind this choice is that the additional calculations should add minimal overhead to the calculation of the spectrum.

The paper is organized as follows. In Sect. 2 the basic generic expressions for the scalar Higgs boson masses in the MSSM, including higher order radiative corrections, are recalled and the notations are defined. The MSSM parameter set to serve as test case for our general approach is defined. In Sect. 3 the inversion \(A_t(m_h)\) is first illustrated using an analytical approximation at one-loop level, whose purpose is to serve as first guess within a subsequent more elaborate construction. Then the exact full one-loop expression is given together with a description of the strategy for the inversion algorithm, as well as the consistent implementation of the dominant two-loop contributions. In Sect. 4 the full algorithm is assembled by combining the analytical approximation with the full one-loop and dominant two-loop calculations. The results of its application to the parameter set are discussed, as well as possible extensions. We conclude in Sect. 5 and provide technical material in the appendices.

2 Setting up the MSSM Higgs mass

In this section we briefly recall the main content of the fixed-order (diagrammatic) calculation of \(m_h\). Then we define a parameter set to illustrate numerical results.

2.1 Calculation of the Higgs boson mass

In the standard ‘top-down’ procedure one assumes that all the SM and MSSM parameters are taken as input before determining the light (and heavy) CP-even Higgs boson masses through the diagonalization of the corresponding (momentum dependent) squared mass matrix. The latter has the following generic form:

$$\begin{aligned}&{\mathcal {M}}^2_s(p^{2}) = \left( \begin{array}{ll} \overline{m}_{11}^2-\Pi _{11}(p^{2})+\frac{t_1}{v_1} &{} \overline{m}_{12}^2- \Pi _{12}(p^{2})\\ \overline{m}_{12}^2- \Pi _{12}(p^{2}) &{} \overline{m}_{22}^2 -\Pi _{22}(p^{2})+\frac{t_2}{v_2} \end{array}\right) \end{aligned}$$
(1)

where

$$\begin{aligned}&\overline{m}_{11}^2 =\overline{m}_{Z}^{2} \cos ^{2}\beta + \overline{m}_{A}^{2} \sin ^{2}\beta , \end{aligned}$$
(2)
$$\begin{aligned}&\overline{m}_{22}^2 =\overline{m}_{Z}^{2} \sin ^{2}\beta + \overline{m}_{A}^{2} \cos ^{2}\beta , \end{aligned}$$
(3)
$$\begin{aligned}&\overline{m}_{12}^2 =-\frac{1}{2}(\overline{m}_{Z}^{2} + \overline{m}_{A}^{2}) \sin 2\beta . \end{aligned}$$
(4)

Here \(\overline{m}_{Z}^{2}\) and \(\overline{m}_{A}^{2}\) denote the running Z-boson and CP-odd Higgs boson squared masses, \(\beta \) the angle defined by \(\tan \beta =v_2/v_1\) where \(v_1,v_2\) are the two Higgs vacuum expectation values, and \(\Pi _{ij}(p^{2})\) and \(t_i\) designate respectively the renormalized self-energy and tadpole loop contributions, formally to arbitrary orders in perturbation theory. In practice we rely on the \(\overline{DR}\) renormalization scheme. For reviews on radiative corrections to \(m_h\) in the MSSM and original references see e.g. [13, 14], and [15] for a recent up-to-date review.

In Eq. (1) the tree-level contributions involve \(\overline{m}_{A}\) evaluated at a given electroweak symmetry breaking (EWSB) scale \(Q_{EWSB}\), and the MSSM parameter \(\tan \beta \). If we ignore momentarily the complication that the running parameters in the expressions above have actually a non-trivial implicit dependence on the other MSSM parameters, the “tree-level” masses of the two CP-even MSSM Higgs states and their mixings are described by only these two unknown MSSM parameters as well as \(\overline{m}_Z\).

Beyond the tree-level the squared mass matrix depends on the external squared momentum \(p^2\) through the self-energies, as shown in Eq. (1). The actual pole masses, \(m_h\) and \(m_H\), are then obtained by determining the two solutions \(p^2=p_h^2\) and \(p^2=p_H^2\) to the equation

$$\begin{aligned} \det \left( p^{2} {\mathbf {1}} - {\mathcal {M}}^{2}_s(p^{2}) \right) =0, \end{aligned}$$
(5)

and taking \(m_{h,H}=\sqrt{{\text {Re}}( p_{h,H}^2)}\). Obviously, the loop contributions in \({\mathcal {M}}^2_s(p^{2})\) depend also on the chosen (\(\overline{DR}\)) renormalization scale.

In this paper we focus on the lighter CP-even Higgs with mass \(m_h\). The self-energies and tadpoles contained in \({\mathcal {M}}^2_s(p^{2})\) have been known to one-loop order exactly since the 1990s [16,17,18,19], as well as the QCD [20, 21] and other dominant two-loop corrections in the on-shell scheme [22,23,24] or in the \(\overline{\mathrm{DR}}\) scheme [25,26,27,28,29,30,31,32,33]. The (almost) complete two-loop contributions are also available [34,35,36,37], as well as the dominant higher order contributions [38,39,40,41]. These have been included in some analyses (see [15] for details). In practice, the determination of the pole mass \(m_h\) is achieved by iterating on its implicit expression Eq. (5), until a sufficient accuracy is reached.

Let us now recall some important features of the scalar sector parameter relationship that will be relevant for our construction. Depending on the phenomenological context, \(\overline{m}_{A}\) may be either an input or a derived quantity. In the first case, typical for low-energy model-independent applications, \(\overline{m}_{A}\) can be a direct input at a given EWSB scale \(Q_{EWSB}\), or inferred from the pole mass \(m_A\) taken as input. In the second case, typical for top-down approaches, it is obtained from the Supersymmetry (SUSY) soft-breaking running Higgs mass parameters \(\overline{m}_{H_{u}}\) and \(\overline{m}_{H_{d}}\), evolved by the renormalization group equations (RGE) down to a scale \(Q_{EWSB}\) where the EWSB constraints are imposed:

$$\begin{aligned} \overline{m}_{A}^{2}(Q_{EWSB})&= \frac{1}{\cos 2 \beta }\left( \hat{m}_{H_{u}}^{2}-\hat{m}_{H_{d}}^{2}\right) -\overline{m}_Z^2 , \end{aligned}$$
(6)
$$\begin{aligned} \mu ^2(Q_{EWSB})&=\frac{1}{2} \left( \left( \hat{m}_{H_{u}}^{2} \tan \beta - \hat{m}_{H_{d}}^{2} \cot \beta \right) \right. \nonumber \\&\quad \times \left. \tan 2 \beta -\overline{m}_Z^2 \right) . \end{aligned}$$
(7)

Here \(\mu \) denotes the running supersymmetric Higgs mixing parameter, and \(\hat{m}_{H_{i}}^{2} \equiv \overline{m}_{H_{i}}^{2} - t_i/v_i\).

The self-energies and tadpole loop contributions in Eq. (1) depend implicitly on all MSSM parameters through their sensitivity to the couplings and masses of (s)particles entering the loops. In particular, the dominant radiative correction originates from the top quark mass \(m_{t}\) as well as the stop masses and mixing, \(m_{\tilde{t}_{i}}\), \(X_t\equiv A_t-\mu /\tan \beta \). A further implicit dependence on these parameters occurs when Eqs. (6) and (7) are imposed. This will be instrumental for the identification of the functional dependence of \(m_h\) on the trilinear stop coupling \(A_t\), in order to set up an efficient inversion algorithm leading to \(A_t(m_h)\), i.e. in the \(m_h\)MSSM.

2.2 Stop Cliff

Table 1 The Stop Cliff set of MSSM parameters is given. The scale dependent parameters are defined at the EWSB scale with the exception of \(\tan \beta \) which is defined at the Z boson scale. The resulting masses of the Higgs, lightest stop and LSP are listed as well

In order to illustrate the general procedure with a concrete example, an MSSM benchmark parameter set is used in the study. The parameters are listed in Table 1.

The EWSB scale is fixed as suggested in [42]. The numerical value chosen is close to the geometric mean of the top squark masses \(m_{\tilde{t}_{1}},m_{\tilde{t}_{2}}\). In the following we will use as input parameter either \(A_t\) in the MSSM or alternatively \(m_h\) in the \(m_h\)MSSM. Many parameters of the table have little influence on the value of \(m_h\). They are set to values which are sufficiently large to evade the lower bounds on supersymmetric particle masses determined at the LHC [43,44,45,46,47].

The value of \(M_1\) is chosen to obtain a dominantly Bino LSP \(\chi _1^\circ \). Of the two soft breaking masses \(m_{\tilde{q}3_L}\) is much greater than \(m_{\tilde{t}_R}\), therefore the lightest top squark is essentially of type R. \(m_{\tilde{t}_R}\) dominates the determination of its mass as the mixing angle is small. The resulting masses of the lightest top squark and the LSP are shown in Table 1 as well. The top squark mass of 1.3 TeV was chosen to be close to the exclusion bounds determined at the LHC by ATLAS [48] and CMS [49]. We will refer to the benchmark point of Table 1 as the Stop Cliff.

In order to accurately compare the \(m_h\)MSSM to the MSSM determination, we have first adjusted \(A_t\) such that \(m_h(A_t)\) is driven to its experimental value. The resulting \(m_h\) values are rounded to MeV. This is two orders of magnitude more precise than the current experimental precision.

3 From the Higgs boson mass to the trilinear stop coupling

The dependence of \(m_h\) on \(A_t\) is non-trivial. First a simple well-known approximation will be studied, followed by a full one-loop expression derivation. Finally the two-loop contributions will be included.

The strategy to determine \(A_t(m_h)\) relies on identifying first the algebraic dominant dependencies on \(A_t\) at the given perturbative order or approximation. Then either the approximation is solved for \(A_t\) or the resulting equation at given order is transformed into a fixed-point problem. The latter, via the intrinsically iterative structure of the determination, will account also for the residual non-algebraic dependencies, leading to the exact determination of \(A_t(m_h)\).

3.1 Approximate one-loop inversion

Several approximate expressions have been developed in the past for the dominant radiative corrections to \(m_h\) in the MSSM, ranging from simple to sophisticated [50,51,52,53,54,55,56,57]. While the latter cannot compete with the full one-loop plus two-loop calculations available nowadays, some of the approximations including dominant two-loop contributions can be rather precise, depending on the considered MSSM parameter range. In our construction there is no need to rely on the most elaborate approximations. The specific expression that we will use is the well-known one originally derived in [53, 54], see also [56], obtained from considering only the dominant top and stop contributions to the one-loop MSSM effective potential, and using renormalization group properties to resum the leading logarithms of decoupled “heavy” stops (relative to \(\sim m_t\)):

$$\begin{aligned} m^{2}_{h} = \overline{m}_{h}^{2} +\dfrac{3\overline{g}_{2}^{2}\overline{m}_{t}^{4}}{8 \pi ^{2} \overline{m}_{W}^{2}} \left[ \ln \left( \dfrac{M_{S}^{2}}{\overline{m}_{t}^{2}}\right) +\dfrac{X_{t}^{2}}{M_{S}^{2}}-\dfrac{X_{t}^{4}}{12 M_{S}^{4}}\right] , \end{aligned}$$
(8)

with the noteworthy quartic and quadratic dependencies on the stop mixing parameter,

$$\begin{aligned} X_{t}\equiv A_{t}- \mu \cot \beta . \end{aligned}$$
(9)

In Eq. (8), \(\overline{m}_h\) is the (running) tree-level Higgs mass

$$\begin{aligned} \overline{m}_{h}^{2} = \dfrac{1}{2}\left[ \overline{m}_{Z}^{2} {+} \overline{m}_{A}^{2} {-} \sqrt{(\overline{m}_{A}^{2}-\overline{m}_{Z}^{2})^2 {+} 4 \overline{m}_{Z}^{2} \overline{m}_{A}^{2} \sin ^2 (2\beta ) } \right] , \end{aligned}$$
(10)

\(\overline{m}_{t}\) the running top mass, \(\overline{g}_{2}\) the running SU(2) gauge coupling, and \( \tan \beta \) the running vev ratio, and we define

$$\begin{aligned} M_{S}^{2}=\,\,&\sqrt{(\overline{m}_{\tilde{q}3_L}^{2} +(\frac{1}{2}-\frac{2}{3} s_W^2 ) \overline{m}_Z^2 \cos {2\beta }+ \overline{m}_t^2)}\nonumber \\&\cdot \sqrt{(\overline{m}_{\tilde{t}_{R}}^{2} +\frac{2}{3}s_W^2 \overline{m}_Z^2\cos {2\beta } + \overline{m}_t^2)} \end{aligned}$$
(11)

where \(\overline{m}_{\tilde{q}3_L}^{2}\) and \(\overline{m}_{\tilde{t}_{R}}^{2}\) are the running soft SUSY-breaking parameters of the third generation associated to the left doublet and the stop singlet.

We choose to evaluate all the running quantities in Eq. (8) at the EWSB scale \(Q_{EWSB}\simeq (m_{\tilde{t}_1} m_{\tilde{t}_2})^{1/2}\), including \(\overline{m}_t\). While this choice for \(\overline{m}_t\), and the definition of \(M_S\) in Eq. (11), are somewhat at variance with the literature, we emphasize here that we seek a sufficiently accurate but simple expression whose sole purpose is to serve as a first guess for our genuine algorithm, the latter giving consistently a (perturbatively) “exact” \(A_t(m_h)\). To put things in perspective, let us enumerate some important features related to Eq. (8), referring to [53,54,55,56] for details:

  • Strictly speaking, Eq. (8) is valid for large \(\tan \beta \), neglecting \(\mathcal{O}(g^4_i)\) terms, and in the limit \(M_S \gg m_t\) as an expansion in \(\frac{|m_t X_t|}{M_S^2}\). In particular, among other necessary steps [53] in the derivation of Eq. (8) from the MSSM effective potential, the \(X^4_t\) term arises only after expanding to second order in \(X_t^2\) the terms with logarithmic dependence in the stop masses, \( \ln m^2_{\tilde{t}_i}\simeq \ln (M_S^2 \pm \overline{m}_t X_t)\) (in the limit \(M_S\simeq \sqrt{m_{\tilde{q}3_L}^2 + m_t^2} \simeq \sqrt{m_{\tilde{t}_{R}}^2 + m_t^2}\)). This approximation was thus a priori expected to be valid only for rather moderate \(|X_t/M_S|\), large \(\tan \beta \) and for \(m_A \gg m_Z\), but turned out to be reasonably good in an extended range.

  • In [53,54,55,56], the D-terms \(\propto m_Z\) in Eq. (11) were accordingly neglected, moreover, universal soft masses, \(m_{\tilde{q}3_L} =m_{\tilde{t}_{R}}\simeq M_{susy}\), were also assumed for simplicity. Actually in the effective field theory (EFT) framework appropriate to derive Eq. (8), \(M_S\) is not very precisely defined as long as it is identified as the scale at which the top squarks are decoupled, and the matching to the EFT is done: \(M_S\) is assumed in the literature to be of order the average stop masses. In a more refined treatment (or to account for \(m_{\tilde{t}_1} \ll m_{\tilde{t}_2}\)) one would need to decouple the two stop masses separately, which is beyond the scope of the approximation Eq. (8). Our slightly different definition in Eq. (11) makes minor differences, the important practical feature for our purpose being that \(M_S\) in Eq. (11) does not involve an extra dependence on \(X_t\).

  • The scale \(Q_t\) at which the running top mass \(\overline{m}_t(Q_t)\) is evaluated in Eq. (8) is quite relevant due to the \(\overline{m}_t^4\) dependence. While the EFT one-loop calculation in [53, 54] involves \(\overline{m}_t(m_t)\), in [55] it was shown that the leading (EFT) two-loop contributions are essentially absorbed by the one-loop expression Eq. (8), if setting \(\overline{m}_t(Q_t\equiv (\overline{m}_t M_S)^{1/2})\) and \(\overline{m}_t(Q_t\equiv M_S)\), respectively for the term \( \sim \ln M^2_S/\overline{m}_t^2\) and mixing terms \(\sim \overline{m}_t X_t\). Given that we do not seek the best possible approximation, the choice of a (unique) EWSB scale \(Q_{EWSB}\simeq (m_{\tilde{t}_1} m_{\tilde{t}_2})^{1/2}\sim M_S\) for all running parameters (or any fixed scale sufficiently close to the latter, as often conveniently chosen in Suspect and similar codes [58]) appears to be a reasonable compromise.

  • Equation (8) can also be derived from the diagrammatically obtained \(\Pi _{ij}(0)\) self-energies and \(t_i\) tadpoles, provided one carefully identifies [56] approximations at the same level as described above.

There exists several refinements of Eq. (8), e.g. including sbottom [55], stau and QCD leading effects [57]. In the following we stick to Eq. (8), referring to it as approximate one-loop whose simplicity is important for our construction, as explained next.

In order to determine \(A_t\) as a function of the “physical” mass \(m_h\), Eq. (8) is solved as being a quadratic equation in \(X^2_t\).

Fig. 1
figure 1

The prediction of \(m_h\) at one- and two-loop precision as well as for the approximate one-loop is shown as function of \(A_t\) together with the measured mass

The procedure captures an essential feature of the inversion for \(A_t\): in principle there are two \(X^2_t\) solutions which, when real-valued and positive, lead to up to four different \(A_t\) solutions. This well-known feature is illustrated in Fig. 1 where we show the \(m_h\) prediction for the full one-loop, two-loop and the approximate one-loop calculations. For the \(m_h(A_t)\) full one-loop and two-loop results, \(A_t\) was varied and for each \(A_t\) the EWSB minimisation was performed to derive the running mass of the CP-odd Higgs boson and the \(\mu \) parameter. All other parameters were fixed to their cliff point values. For the approximate one-loop, the coefficients of Eq. (8) were calculated for the cliff point, thus fixing \(\overline{m}_A^2\) and \(\mu \) and varying only \(A_t\) in the square root of Eq. (8) shown in the figure. For the measured Higgs boson there are four intersections with the prediction. The third intersection (from left to right) with the two-loop prediction is consistent with the \(A_t\) cliff point value of 3.61 TeV.

Table 2 The result of the \(A_{t}\) determination from \(m_{h}\) using Eq. (8) is shown. s3 corresponds to the nominal solution: the cliff point

We turn now to the inverse approach of determining \(A_t\) from Eq. (8) when the measured \(m_h\) is taken as input. The \(A_{t}\) determination is illustrated in Table 2. Four solutions are determined for \(A_t\) as expected. The third solution s3 corresponds to the nominal solution: the cliff point. In each one of the solutions \(\overline{m}_A^2\) and \(\mu \) are those corresponding to the intersection of the two-loop Higgs mass with the measured \(m_h\).

For the nominal \(A_t\) in s3 Eq. (8) leads to a Higgs mass larger than the two-loop Higgs mass prediction (see Fig. 1). This is compensated by an \(A_t\) value smaller than the nominal one: 2.87 TeV instead of 3.61 TeV. The use of the measured \(m_h\) in the calculation therefore inevitably leads to a shift in \(A_t\). Its calculated value is within 25% of the nominal value, a clear improvement compared to a blind guess. At the same time, it clearly illustrates that an accurate \(A_t(m_h)\) determination requires a more elaborate construction as compared to Eq. (8), which we will describe next.

3.2 Full one-loop: the HiggsMolar function

For the full one-loop inversion the starting point is the (formally exact) Eq. (5) which we rewrite here as an essentially quadratic equation in the CP-even Higgs squared masses,

$$\begin{aligned}&m_{h,H}^{4}- m_{h,H}^{2}\left( ({\mathcal {M}}^2_s)_{{}_{11}}(p^2)+({\mathcal {M}}^2_s)_{{}_{22}}(p^2)\right) \nonumber \\&\quad +({\mathcal {M}}^2_s)_{{}_{11}}(p^2) ({\mathcal {M}}^2_s)_{{}_{22}}(p^2)\nonumber \\&\quad - \left( ({\mathcal {M}}^2_s)_{{}_{12}}(p^2)\right) ^{2} \vert _{p^2 = m^2_{h,H}}=0 , \end{aligned}$$
(12)

of which \(m_h^2\) corresponds to the lighter mass solution. Here \(({\mathcal {M}}^2_s)_{{}_{ij}}(p^2)\) denotes the real parts of the matrix element, and the solutions \(p^2=m_{h,H}^2\) are real-valued.Footnote 1 The \(({\mathcal {M}}^2_s)_{{}_{ij}}(p^2)\)’s extracted from Eq. (1) include perturbatively complete loop corrections where the full one-loop Higgs boson self-energies, \(\Pi _{ij}\), and tadpoles \(t_{i}\) are taken from [19]. In the \(\overline{\mathrm{DR}}\) scheme these expressions include contributions from all (s)particles (running) masses, resulting in a highly nonlinear dependence on \(A_{t}\) from the stop sector. In particular the finite part of the one-loop scalar function \(A_{0}\), occurring in the tadpoles and self-energies, has a rather involved \(A_{t}\) dependence when its argument is the stop mass:

$$\begin{aligned} A_{0}(m_{\tilde{t}_{i}})= m_{\tilde{t}_{i}}^{2}\left( 1-\ln \left( \frac{m_{\tilde{t}_{i}}^{2}}{Q^{2}}\right) \right) \end{aligned}$$
(13)

where Q is the \(\overline{\mathrm{DR}}\)-scheme renormalization scale, and where from the stop sector diagonalization, the \(\overline{\mathrm{DR}}\)-scheme running stop masses can be written as

$$\begin{aligned} m_{\tilde{t}_{1,2}}^2 = \frac{1}{2} \left( M^2 \mp \sqrt{a_s A_{t}^{2}+b_s A_{t} +c_s} \right) , \end{aligned}$$
(14)

where

$$\begin{aligned}&M^2 = m_{\tilde{q}3_L}^{2} + m_{\tilde{t}_{R}}^{2} + 2 m_t^2 + \frac{1}{2} m_Z^2 \cos {2\beta }, \nonumber \\&a_s = 4 m_t^2, \nonumber \\&b_s = -8 m_t^2 \mu \cot \beta , \nonumber \\&c_s = \left( m_{\tilde{q}3_L}^{2} - m_{\tilde{t}_{R}}^{2} + \left( \frac{1}{2}-\frac{4}{3} s_W^2 \right) m_Z^2 \cos {2\beta }\right) ^2 \nonumber \\&\quad \quad + 4 m_t^2 \mu ^2 \cot ^2\beta , \end{aligned}$$
(15)

are \(A_t\) independent.Footnote 2 The strategy is to rewrite Eq. (12) as an equation for \(A_t\) by extracting from Eq. (1) the explicit polynomial \(A_{t}\) dependencies or “power counting” within each \(\Pi _{ij}\) and \(t_{i}\). On close inspection of the various contributions in e.g. [19] we identify:

  1. 1.

    terms depending linearly on \(A_{t}\) (with coefficients identified by a superscript ‘1’), originating from the \(s_k\)-\(\tilde{t}_i\)-\(\tilde{t}_j\) couplings, \(g_{s_2\tilde{t}_1\tilde{t}_1}\), \(g_{s_2\tilde{t}_2\tilde{t}_2}\), \(g_{s_2\tilde{t}_1\tilde{t}_2}\) which are given by

    $$\begin{aligned} \begin{aligned} g_{s_2\tilde{t}_1\tilde{t}_1}&= c_t^{2} \, g_{s_2{\tilde{t}_L}{\tilde{t}_L}} + 2 c_t s_t \, g_{s_2{\tilde{t}_L}{\tilde{t}_R}} + s_t^{2} \, g_{s_2{\tilde{t}_R}{\tilde{t}_R}},\\ g_{s_2\tilde{t}_2\tilde{t}_2}&= s_t^{2} \, g_{s_2{\tilde{t}_L}{\tilde{t}_L}} - 2c_t s_t \, g_{s_2{\tilde{t}_L}{\tilde{t}_R}} + c_t^{2} \, g_{s_2{\tilde{t}_R}{\tilde{t}_R}}, \\ g_{s_2\tilde{t}_1\tilde{t}_2}&= s_t c_t \, (g_{s_2{\tilde{t}_R}{\tilde{t}_R}} - g_{s_2{\tilde{t}_L}{\tilde{t}_L}}) + (c_t^{2}-s_t^{2}) \, g_{s_2{\tilde{t}_L}{\tilde{t}_R}},\\ g_{s_2{\tilde{t}_L}{\tilde{t}_R}}&= \frac{y_{t}}{\sqrt{2}}A_{t}, \end{aligned} \end{aligned}$$
    (16)

    with \(y_t\) the top quark Yukawa coupling, \(s_t\equiv \sin \tilde{\theta }_t\), \(c_t\equiv \cos \tilde{\theta }_t\), where \(\tilde{\theta }_t\) denotes the stop mixing angle, \(\tilde{t}_1, \tilde{t}_2\) are the stop mass eigenstates, \(\tilde{t}_L\), \(\tilde{t}_R\) the gauge eigenstates, and \(s_k\) are the neutral scalar states in the basis corresponding to Eq. (1). Note that \(g_{s_2{\tilde{t}_R}{\tilde{t}_R}}\), \(g_{s_2{\tilde{t}_L}{\tilde{t}_L}}\) do not depend explicitly on \(A_t\), their expressions can be found e.g. in [19] (denoted by \(\lambda _{s_2{\tilde{u}_{L,R}}{\tilde{u}_{L,R}}}, \lambda _{s_2{\tilde{u}_L}{\tilde{u}_R}}\) therein).

  2. 2.

    terms depending quadratically on \(A_{t}\) (with coefficients identified by a superscript ‘2’), originating from the \(g_{s_2t_it_i}^2\) that appear solely in \(\Pi _{22}(p^2)\);

  3. 3.

    terms depending on \(\sqrt{a_s A_{t}^{2}+b_s A_{t} +c_s}\) (with coefficients identified by ‘s’) originating from the \(m_{\tilde{t}_{i}}^{2}\) in \(A_{0}(m_{\tilde{t}_{i}})\);

  4. 4.

    a term containing \(A_{t}\cdot \sqrt{a_s A_{t}^{2}+b_s A_{t} +c_s}\) (with coefficient identified by ‘1s’) resulting from the occurrence of the product \(g_{s_2t_it_i}\times A_{0}(m_{\tilde{t}_{i}})\) appearing solely in \(t_2\);

  5. 5.

    finally all remnant contributions with no (explicit) dependence or with logarithmic dependence on \(A_{t}\), are identified by a ‘0’ superscript. It is an important part of our strategy that in our \(A_t\)-power counting any “logarithmic” dependence \(\sim \ln (M^2 \mp \sqrt{a_s A_t^2+\cdots })\) on \(A_t\) (such as in the second term of Eq. (13), and in the one-loop scalar function \(B_0\)), as well as the other algebraic dependence on \(A_t\) in \(B_0\), are incorporated exactly as they stand within the coefficients of the above listed relevant \(A_t\) powers. In the following we dub these dependencies “residual”. The dependence of the stop mixing angle \(\tilde{\theta }_t\) on \(A_t\) is also treated as residual, since it enters in Eq. (16) through \(c_t\) and \(s_t\) that remain obviously bounded functions of \(A_t\).

According to the previous \(A_t\) power counting, within the relevant (one-loop) \(\Pi _{ij}\) and \(t_i\) individual contributions, there are no higher degree monomials in \(A_t\) than the \(A_t^k\) identified above with \(k \le 2\). This gives the following formal decomposition of the tadpoles and self-energies:

$$\begin{aligned}&\frac{t_{1}}{v_1}= t_{1}^{ (s) }\sqrt{a_s A_{t}^{2}+b_s A_{t} +c_s}+ t_{1}^{ (0) } \nonumber \\&\frac{t_{2}}{v_2}= t_{2}^{ (1s) } A_t \sqrt{a_s A_{t}^{2}+b_s A_{t} +c_s}+ t_{2}^{ (1) } A_t \nonumber \\&\quad + t_{2}^{ (s) } \sqrt{a_s A_{t}^{2}+b_s A_{t} +c_s} + t_{2}^{ (0) }\nonumber \\&\Pi _{11}= \pi _{11}^{(s)} \sqrt{a_s A_{t}^{2}+b_s A_{t} +c_s}+ \pi _{11}^{ (0) } \nonumber \\&\Pi _{12}= \pi _{12}^{ (1) } A_t + \pi _{12}^{ (0) }\nonumber \\&\Pi _{22}= \pi _{22}^{ (2) } A_t^2+ \pi _{22}^{ (1) } A_t\nonumber \\&\quad + \pi _{22}^{ (s) } \sqrt{a_s A_{t}^{2}+b_s A_{t} +c_s} +\pi _{22}^{ (0) }. \end{aligned}$$
(17)

Equation (17) is simply a convenient rewriting of already available exact one-loop expressions, no contributions are ignored. Using Eq. (17) to display the algebraic dependence on \(A_t\) in Eq. (12), the following molar-shaped function \(\mathrm {HiggsMolar}(A_{t})\), which should consistently vanish for any \(A_{t}\) solution, is obtained:

$$\begin{aligned}&\mathrm {HiggsMolar}(A_{t})= C_{3} A_{t}^{3} + C_{2} A_{t}^{2} + C_{1} A_{t} + C_{0} \nonumber \\&\quad + (R_{2} A_{t}^{2} + R_{1} A_{t}+ R_{0}) \sqrt{a_s A_{t}^{2} +b_s A_{t}+ c_s}=0 . \end{aligned}$$
(18)

The \(\mathrm {HiggsMolar}\) is shown as function of \(A_t\) in Fig. 2. For each \(A_t\) the coefficients are recalculated. The four other curves show the behavior of the function if the coefficients determined at the intersections of the \(\mathrm {HiggsMolar}\) with the line \(\mathrm {HiggsMolar}= 0\) are used in the vicinity of the intersections. The curves illustrate the variation of the coefficients.

This equation can in principle be used separately either for the lighter or for the heavier CP-even Higgs masses, as clear from Eq. (12). Hereafter we are only considering the lighter Higgs mass at \(\sim 125~\text {GeV}\) as input. The \(C_i\) and \(R_i\) coefficients are easily identified upon use of Eqs. (1), (12) and (17). They contain all the residual \(A_t\) dependencies quoted above, neglecting all imaginary parts as justified in footnote 1.

Fig. 2
figure 2

The distribution of the \(\mathrm {HiggsMolar}\) is shown as function of \(A_t\): calculating the coefficients of Eq. (18) for each \(A_t\) (solid blue curve); fixing the coefficients of Eq. (18) at the four \(A_t\) solutions (red curves). For the four solutions, given by the intersections with the dotted blue line, only the \(A_t\) range around the solution is shown by the red curves

The cubic term results from the product \((t_1-\Pi _{11}) \times t_{2} \) present in \(({\mathcal {M}}^2_s)_{11} ({\mathcal {M}}^2_s)_{22}\), with a coefficient \(C_3\) given by

$$\begin{aligned} C_{3}= 4\overline{m}_t^2 ( t_{1}^{(s)} - \pi _{11}^{(s)} ) t_{2}^{(1s)}. \end{aligned}$$
(19)

In particular, in contrast with Eq. (8) and unless performing in Eq. (18) an expansion in \(X_t\), explicit \(X_t^4\sim A_t^4\) terms cannot occur within the exact Eq. (12) as the equation does not involve squares of terms of the type ‘2’ or ‘1s’.

The general form of the other \(C_{i}\) and \(R_{i}\) coefficients is given in Appendix A. These coefficients have a more involved dependence on (differences of) the quantities \(\pi _{ij}^{(0,s,1,2)}\) and \(t_{i}^{(0,s,1,1s)}\) entering Eq. (17). The relevant one-loop expressions of the latter are also given in Appendix A. They allow to track the residual dependence on \(A_t\) and the absence of some finite combinations in relation to the expected cancellation of the quadratic divergences in softly-broken SUSY. The full one-loop explicit dependence on \(A_t\) is thus included in Eq. (18). A further implicit dependence on \(A_t\) will originate from the two EWSB conditions Eqs. (6), (7) when imposed beyond the tree-level, due to the presence of \(\overline{m}_A^2\) as well as \(\mu \) in the \(C_{i}\) and \(R_{i}\) coefficients. In particular, the \(t_2\)-tadpole dependence in \(\overline{m}_A^2\) will induce, through the term \(\overline{m}_{11}^2 \times t_2\) appearing in the cross-product \(({\mathcal {M}}^2_s)_{11} ({\mathcal {M}}^2_s)_{22}\) in Eq. (12), an effectively quartic dependence on \(A_t\) for large \(A_t\), not explicit in Eq. (18). This entails solving simultaneously Eq. (18) and the EWSB constraints Eqs. (6) and (7), which we will perform numerically in a consistent way as described in Sect. 4.1. Other implicit dependencies on \(A_t\) are discussed in Sect. 3.5. Hereafter we ignore momentarily these issues and focus solely on the resolution of Eq. (18).

3.3 Full one-loop inversion: the fixed point algorithm

To solve Eq. (18) for \(A_t\), a fixed point iterative method is used. For this purpose we define

$$\begin{aligned} \mathrm {C_{FP}}(A_t)= & {} -\frac{1}{C_{3}} \Big ( C_{2} A_{t}^{2} + C_{1} A_{t} + C_{0}\nonumber \\&+ (R_{2} A_{t}^{2} + R_{1} A_{t}+ R_{0})\sqrt{a_s A_{t}^{2} +b_s A_{t}+ c_s} \Big ), \nonumber \\ \end{aligned}$$
(20)

and rewrite Eq. (18) as

$$\begin{aligned} A_t = \root 3 \of {\mathrm {C_{FP}}(A_t)}. \end{aligned}$$
(21)

It is then clear that finding all the real-valued solutions of Eq. (18) is equivalent to determining all the fixed points \(A_t=A_{t}^{FP}\), satisfying \(\mathrm {L_{FP}}(A_{t}^{FP})= A_{t}^{FP}\), of the function \(\mathrm {L_{FP}}\) defined by

$$\begin{aligned} \mathrm {L_{FP}}(A_t) \equiv \root 3 \of {\mathrm {C_{FP}}(A_t)}, \end{aligned}$$
(22)

with only real-valued cubic roots allowed.

Fig. 3
figure 3

The distributions of the functions \(\mathrm {C_{FP}}\) and \(\mathrm {L_{FP}}\) are shown as function of \(A_t\): calculating the coefficients of Eq. (18) for each \(A_t\) (solid blue curve); fixing the coefficients of Eq. (18) at the four \(A_t\) solutions (red curves). For the four solutions, given by the intersections with the dotted blue line, only the \(A_t\) range around the solution is shown by the red curves

To determine the fixed points one starts from a guess value \(A_{t, 0} = A_{t}^{Guess}\) and studies the convergence of the sequence \(A_{t, i+1} = \mathrm {L_{FP}}(A_{t_{i}})\). The iterations return unambiguously real-valued \(A_t\) as a consequence of the definition of \(\mathrm {L_{FP}}\), Eq. (22). As we will specify in more detail in Sect. 4, appropriate \(A_t\)-guess values, not too far from the exact solutions, are those obtained from our approximate one-loop Eq. (8) that already captures the multi-solution structure. However, even if starting relatively close to the exact solutions, the method will catch only the attractive fixed points and can thus miss some, otherwise acceptable, \(A_t\) solutions.

One expects typically four distinct solutions as illustrated in Fig. 3 for the stop cliff benchmark: The functions \(\mathrm {C_{FP}}(A_t)\) and \(\mathrm {L_{FP}}(A_t)\), in blue, intersect the dotted blue lines at the fixed points of these functions, corresponding to the four \(A_t\) solutions that are consistent with \(m_h=125~\text {GeV}\). For the blue curve the coefficients \(C_i\) and \(R_i\) of the two fixed-point functions were recalculated at each \(A_t\). For the four red curves these coefficients are frozen at their values calculated at the intersection of the blue curve with the blue dotted line, for \(A_t\) varying in the vicinity of the intersection.

The dotted blue line on Fig. 3-right being the bisector, it follows that the slope of \(\mathrm {L_{FP}}\) at its fixed points is alternatively greater or smaller than one. We label the solutions s1, s2, s3, s4 ordered in ascending in \(A_t\). Given that \(\mathrm {L_{FP}}\) takes negative values for very large \(|A_t |\), the slopes of \(\mathrm {L_{FP}}\) at s1 and s3 are necessarily greater than one and those at s2 and s4 less than one. s1 and s3 are thus repulsive fixed points. In contrast, s2 and s4 will be either attractive if the slope is between \(-1\) and 1, or alternating/repulsive if the slope is less than \(-1\). The slope at s2 is typically less than zero due to the strong variation of \(\mathrm {L_{FP}}\) in the vicinity of \(|A_t |\sim 0\). It follows that the iterative procedure on \(\mathrm {L_{FP}}\) as described above will always miss solutions s1 and s3 while capturing at best two solutions, namely s2 and s4, provided furthermore the initial guess values \(A_{t}^{Guess}\) close enough to s2 and s4 to feel their attracting character. If a guess point is chosen anywhere above s3 the iterations will repel it from s3 and converge to s4. If chosen below s1 it will be repelled to the left and the procedure will never converge. Finally, if chosen between s1 and s3 then, depending on the behaviour around s2, the convergence to s4 (and occasionally to s2 if its slope is greater than \(-1\)) may or may not occur.

In order to ensure convergence and be able to capture all solutions, we consider a generalized family \(\mathrm {L_{FP}}_{\tau }\) of functions parametrized by \(\tau \ne 0\) as follows:

$$\begin{aligned} \mathrm {L_{FP}}_\tau (A_t) = \frac{1}{\tau } (\mathrm {L_{FP}}(A_t) -A_t) + A_t. \end{aligned}$$
(23)

Any fixed point of \(\mathrm {L_{FP}}\) is also a fixed point of \(\mathrm {L_{FP}}_\tau \) and vice versa, for any value of \(\tau \).

Moreover from

$$\begin{aligned} \mathrm {L_{FP}}'_{\tau }(A_{t})= 1 + \frac{\mathrm {L_{FP}}'(A_{t}) -1}{\tau }, \end{aligned}$$
(24)

follows that one can always choose \(\tau \) in such a way that \(\mathrm {L_{FP}}'_{\tau }(A_{t}^{FP})\) lies in the interval \([-1,1]\) for any given value of \(\mathrm {L_{FP}}'(A_{t}^{FP})\). Thus the advantage of \(\mathrm {L_{FP}}_\tau \) is that all fixed points of \(\mathrm {L_{FP}}\) can be made attractive with respect to \(\mathrm {L_{FP}}_\tau \) for appropriately chosen values of \(\tau \). In this case an iteration over the sequence \(A_{t, i+1} = \mathrm {L_{FP}}_{\tau } (A_{t_{i}})\) is guaranteed to converge on \(A_{t}^{FP}\), at least when the initial guess values are not too far from the solution. However, \(A_{t}^{FP}\) is not known in advance, even less \(\mathrm {L_{FP}}'(A_{t}^{FP})\). Without this knowledge, a rough strategy to converge on a given solution could be to choose:

  • \(\tau > rsim 1\) for s4

  • \(\tau \ll -1\) for s1, s3

  • \(\tau \gg 1\) for s2.

Actually one can do better by determining optimal values of \(\tau \) from the knowledge of the local variation of \(\mathrm {L_{FP}}\) during the iterative procedure. A numerical estimate \(\Delta \mathrm {L_{FP}}/\Delta A_t\) of the first derivative of \(\mathrm {L_{FP}}\) with respect to \(A_t\) can in principle be calculated at any given step during the iterations on \(A_t\) at moderate computational cost. Then choosing the \(\tau \) parameter as follows

$$\begin{aligned}&\frac{\Delta \mathrm {L_{FP}}}{\Delta A_t} - 1 < 0 : \tau = + \left|\frac{\Delta \mathrm {L_{FP}}}{\Delta A_t} - 1 \right|, \nonumber \\&\frac{\Delta \mathrm {L_{FP}}}{\Delta A_t} - 1 > 0 : \tau = - \left|\frac{\Delta \mathrm {L_{FP}}}{\Delta A_t} - 1 \right|, \end{aligned}$$
(25)

ensures that an estimate of the derivative of \(\mathrm {L_{FP}}_\tau \) is close to zero, cf. Eq. (24). Since this estimate is in practice not at the fixed point, Eq. (25) is not sufficient to guarantee convergence. However, it approximates a necessary condition, fulfilling a convergence criterion discussed in Appendix B, see Eq. (B.17). This allows to make any fixed point attractive, provided initial guess values are not too far from that fixed point. A simple and efficient algorithm will be implemented along these lines, as described in Sect. 4.

3.4 Dominant two-loop inversion

The extension of the above exact one-loop method to the two-loop contributions is rather straightforward. The latter corrections depend dominantly on the strong, weak and third generation Yukawa couplings \(y_i\), leading to terms of \({\mathcal {O}}\)(\( \alpha _{s,i}\))\(\times {\mathcal {O}}\)(One-loop), where \(\alpha _i\equiv y_i^2/(4\pi )\). In the standard fixed-order (diagrammatic) \(m_h\) calculations, \(\Pi _{ij}^{(2\text {-}loop)}\) and \(t_i^{(2\text {-}loop)}\) contributions are added to the \({\mathcal {M}}^2_s\) matrix elements which enter the squared Higgs boson mass equation shown in Eq. (12).

Due to the extra loop suppression factor, at any scale relevant for the MSSM spectrum calculation where either \(\alpha _s(Q)\) or \(\alpha _i(Q)\) remain moderate, the two-loop contributions are a moderate correction relative to the one-loop contributions (even if including those is obviously very relevant for a more precise comparison between the MSSM prediction and the measured \(m_h\) value. For instance typically for the benchmark cliff point, restricting to exact one-loop would give \(m_h\sim 118\) GeV instead of \(\sim 125\) GeV as in Table 1.) Thus, for the \(A_t(m_h)\) inversion, rather than trying to extract specific quite involved \(A_{t}\) dependencies from the \(\Pi _{ij}^{(2\text {-}loop)}\) and \(t_i^{(2\text {-}loop)}\) contributions, the latter are incorporated just as they contribute to Eq. (1): more precisely, within the above algorithm the available two-loop contributions are formally treated as if they were independent of \(A_{t}\), therefore concretely incorporated as additional contributions to either \(t_i^{(0)}\) or \( \pi _{ij}^{(0)}\) in Eq. (17):

$$\begin{aligned} t_i^{(0)} \rightarrow t_i^{(0)} + t_i^{(2\text {-}loop)},\quad \pi _{ij}^{(0)} \rightarrow \pi _{ij}^{(0)} + \Pi _{ij}^{(2\text {-}loop)}. \end{aligned}$$
(26)

This is then corrected iteratively for the true \(A_t\) dependence. The seven coefficients entering the \(\mathrm {HiggsMolar}\)(\(A_{t}\)) function of Eq. (18) can now be computed incorporating consistently two-loop contributions.

3.5 Including higher orders and refinements

At this stage the inversion is formally ‘exact’ at the considered (perturbative) level of theoretical precision taken for the \(t_i\) tadpoles and self-energy contributions \(\Pi _{ij}\), namely full one-loop and only the dominant two-loop contributions. It is a straightforward matter to incorporate either more complete two-loop and/or higher (3-loop) contributions, by considering those contributions similarly \(A_t\) independent, since their actual \(A_t\) dependence, independent of its complexity, is screened by tiny perturbative expansion coefficients. As long as higher order corrections are obtained diagrammatically in the form of self-energy or tadpole contributions, these could be included explicitly by adding them to the \(t^{(0)}_i\) and \(\pi ^{(0)}_{ij}\) contributions.

It is well known that sizeable theoretical uncertainties in \(m_h\) determinations (customarily taken as \(\sim \pm 2-3\) GeV in phenomenological analyses) are due to presently unknown higher order contributions, discrepancies between different renormalization schemes, etc (see e.g. [33, 59], or for more recent analyses [60, 61], as well as the recent updated discussion in [15]). Given these uncertainties, one might question the importance of devising a very accurate inversion procedure. The answer is obvious: not to introduce artificially further uncertainties in the determination of \(A_t(m_h)\) than there are from a given content of higher order contributions included in the standard \(m_h\) determination. Related to this, there remains one subtlety to consider: Even at one-loop level, there are extra implicit \(A_t\) dependencies that would not be accounted for by the previous algorithm, if one relied solely on the procedure leading to Eq. (18). Indeed, the self-energies and tadpoles also depend typically on SM-like gauge and Yukawa couplings, as well as other running \(\overline{\mathrm{DR}}\) parameters (\(s_W\), \(m_W\)), which are affected by threshold corrections, depending themselves on the MSSM parameters, therefore depending on \(A_t\) in a highly nontrivial way in this case. While all these threshold corrections give contributions that are formally of higher (at least two-loop) order within the loop self-energy and tadpole expressions, they can induce a numerical inconsistency bias if not incorporated in the inversion, slightly shifting the resulting \(A_t(m_h)\) with respect to its actual ‘reference’ value in a standard top-down calculation. This, as well as the other residual or implicit dependencies on \(A_t\) already mentioned in Sect. 3.2, are, however, consistently taken into account in the full algorithm as we explain next.

4 The full inversion algorithm

The algorithm has been implemented in SuSpect3 [62, 63]. SuSpect3 is a public spectrum calculator for multiple supersymmetric models that includes, within Eq. (1) for \(m_h\), the \(\overline{\mathrm{DR}}\) radiative corrections at full one-loop and dominant two-loop orders (involving for the latter the QCD and third family Yukawa contributions, but at vanishing \(p^2\)). Other MSSM spectrum calculators are, non-exhaustively, SOFTSUSY [64], SPHENO  [65,66,67], FeynHiggs [22, 57, 68, 69], and FlexibleSUSY [70]. Note that on top of fixed-order calculations including some contributions beyond the above mentioned two-loop order, some of these codes (FeynHiggs, SPHENO, FlexibleSUSY) also include resummations of large logarithms in an EFT approach for the Higgs mass calculations, thus with an a priori increased precision for large soft-supersymmetry breaking mass scenarios.

Before describing the full inversion algorithm, let us first briefly recall the procedure to determine the spectrum in the MSSM. This involves solving the RGE to evolve the parameters between the EWSB scale (\(Q_{\mathrm {EWSB}}\)) and the scale given by the mass of the Z boson (\(m_Z\)), as well as solving the EWSB equations Eqs. (6) and (7) at \(Q_{\mathrm {EWSB}}\). Radiative corrections to the supersymmetric particle and Higgs boson masses are calculated at \(Q_{\mathrm {EWSB}}\). Supersymmetric radiative corrections to Standard Model parameters, the most important in the present study being the top Yukawa coupling, are calculated at \(m_Z\).

For EWSB three variants have been implemented. If \(\overline{m}^2_{H_d}\), \(\overline{m}^2_{H_u}\) and the sign of \(\mu \) are given as input, the running mass squared \(\overline{m}_A^2\) and the Higgs mass parameter \(\mu \) are calculated. Alternatively \(\mu \) and either the pole mass \(m_A\) or the tree-level running mass squared \(\overline{m}_A^2\) can be given to calculate \(\overline{m}^2_{H_d}\), \(\overline{m}^2_{H_u}\).

Both the RGE evolution and the EWSB calculations are implemented as iterations. The convergence is tested on the stability of \(\overline{m}_A^2\) or \(\overline{m}_{H_u}^2\) between successive iterations. The choice depends on the input parameter set chosen.

We recall that for a precise calculation of the MSSM spectrum in the standard top-down approach, it is essential that some of the relevant running parameters at a given scale, and consequently the physical (pole) masses, are calculated iteratively, as these parameters are nontrivially modified by radiative corrections, which in turn depend on potentially all MSSM parameters.Footnote 3 There is also an iteration between the EWSB and \(m_Z\) scales, since important radiative corrections, depending themselves on the MSSM spectrum, are incorporated to extract the \(\overline{\mathrm{DR}}\)-scheme gauge and Yukawa couplings upon matching their experimentally measured values. The convergence criterion for the RGE iteration depends on the choice for the EWSB parameters.

4.1 Algorithm

In the \(m_h\)MSSM the \(A_t(m_h)\) determination algorithm extends the previously described calculation of the relevant EWSB parameters of the MSSM in SuSpect3. The determination of \(A_t\) has been added to this already necessarily iterative structure as the determination of an additional parameter. The algorithm to determine \(A_t\) is independent of the parameter input choice. In the following the numerical examples are given for an input of \(\overline{m}^2_{H_d}\), \(\overline{m}^2_{H_u}\) and the sign of \(\mu \). The calculation starts with the RGE evolution from the Z boson scale to a high scale. \(A_t\) is initialized arbitrarily to a fixed value (10 GeV) as the parameter will be determined after the RGE evolution to \(Q_{\mathrm {EWSB}}\). The procedure is similar to the initialization of \(\mu \).

First the MSSM EWSB calculations are performed, i.e., \(\mu \) and \(\overline{m}_A^2\) are determined, and then \(A_t\) is determined. This procedure is repeated until convergence is reached according to the MSSM criteria, i.e., \(\mu \) is stable and therefore \(\overline{m}_A^2\) is stable as well.

At the first and second RGE iteration the approximate one-loop Eq. (8) with the measured \(m_h\) as input is solved to extract a new \(A_t\) as explained in Sect. 3.1. Taking into account the newly determined \(A_t\), the RGE evolution to the Z mass scale is performed. Radiative corrections are calculated, in particular to the top Yukawa terms. The parameters are then RGE evolved to the high scale. The second RGE iteration therefore starts at the high scale with the \(A_t\) value derived from the approximate one-loop algorithm in the first iteration.

For the cliff point, Table 2 shows that the correct \(A_t\) is within 25% of the calculated value. The use of Eq. (8) is preferred over the fixed point algorithm as the radiative corrections used depend on \(y_t \cdot \mu \). This allows to stabilize quickly \(y_t\) with an approximate \(A_t\) at low computational cost. Using the full radiative calculations at this stage would lead to longer iterations as the variations of both \(\mu \) and \(A_t\) take longer to stabilize.

For the third and all following RGE iterations the full radiative calculations are used for EWSB, combined with the fixed point algorithm described in Sects. 3.3 and 3.4 to determine \(A_t\). For each new \(A_t\), obtained from \(\mathrm {L_{FP}}_\tau \), Eq. (23), the tree-level stop sector and the Higgs sector including radiative corrections are recalculated. This has the advantage that not only the leading terms of \(\mathrm {L_{FP}}\), Eq. (22), are taken into account, but also both the residual and implicit \(A_t\) dependencies in the coefficients \(C_i\) and \(R_i\), as explained previously. The recalculation of the stop sector and the Higgs sector for each iteration brings the function from the red curves closer to the blue (nominal) curves in Figs. 2 and 3.

The convergence depends on the \(\tau \) parameter. When the full radiative corrections are used in the algorithm, an estimate \(\Delta \mathrm {L_{FP}}/\Delta A_t\) of the first derivative of \(\mathrm {L_{FP}}\) with respect to \(A_t\) is calculated numerically at the end of the iteration on \(A_t\). The iterations are stopped when the relative change of \(A_t\) between the last and the current value is smaller than a threshold. Numerical values are given below. The \(\tau \) parameter is then adjusted according to Eq. (25) which is used in the next determination of \(A_t\).

4.2 Proof of concept

Table 3 The result of the Higgs inversion algorithm is shown for the stop cliff. The true point is s3. As a cross check the \(m_h\) values shown are the result of the standard \(m_h(A_t)\) calculation with the \(A_t\) obtained from the input \(m_h\) parameter

As proof of concept the calculation is performed for all four possible solutions using the full algorithm. The \(A_t\) is first determined via the approximate one-loop calculation for the first two RGE iterations and then the fixed point algorithm is used for all following iterations. \(A_t\) is refined at each step. To converge on the spectrum calculation the RGE iterations are stopped once \(\overline{m}_A\) is stabilized to the permil level. The EWSB iterations are stopped when \(\mu \) has converged at permil level. The iterations on \(A_t\) are run with a convergence criterion of permil.

The results of the full algorithm are shown in Table 3. Four solutions are obtained as expected. The calculation of \(A_t\) and the calculated \(m_h\) are in excellent agreement with the expected values.

In point s3 the deviation of the calculated \(A_t\) value from the expected value in Table 1 is far smaller than convergence criterion on \(A_t\) suggests. This is the consequence of the hierarchical structure of the iterations. The iteration on \(A_t\) is at the lowest level, therefore the calculation of \(A_t\) is also refined for each EWSB and (times) RGE iteration until the running mass of the A boson and the \(\mu \) parameter have converged. This leads to a higher precision than naively expected.

4.3 Scan settings

If the algorithm is used in a multidimensional scan of supersymmetric parameters, an example using SuSpect3 is in [71], the \(m_h\)MSSM ensures that no spectrum calculation will be performed for parameter sets incompatible with the measured central value of \(m_h\). This leads to a reduction of the parameter space allowed for \(A_t\). However such a calculation, due to the iteration on \(A_t\), has a calculational overhead compared to an MSSM spectrum calculation.

Given that the experimental uncertainty on the \(m_h\) measurement is of the order of 0.1 GeV and additionally the theoretical uncertainty is about 2 GeV, the following calculations have been performed with reduced precision: the RGE iterations are stopped when percent level convergence on \(\overline{m}_A\) has been reached. All other convergence definitions remain unchanged. These are the standard settings used in Suspect.

To illustrate the calculational overhead the MSSM calculation is compared to the \(m_h\)MSSM calculation for s3. The number of RGE iterations is increased from four to six. The algorithm typically adds one additional iteration to each EWSB calculation on top of the three for the standard algorithm, i.e., a total of 27 iterations is necessary compared to 13 for the standard settings. The calculation of \(A_t\) at the first two RGE iterations as a direct calculation of the approximate solution is not computationally intensive. When the fixed point method is used, for the first two RGE iterations, at the first EWSB iteration six and four calculations are necessary to converge on the fixed point to the required accuracy. For the subsequent EWSB iterations typically only one or two iterations on \(A_t\) are necessary. For the last two RGE iterations for all EWSB iterations only one or two \(A_t\) calculations are necessary. For half of EWSB iterations, a single calculation of \(A_t\) is sufficient. The reduction of the number of iterations on \(A_t\) as the RGE and EWSB iterations progress illustrates the convergence of the algorithm.

Table 4 The result of the Higgs inversion algorithm for the three definitions of EWSB input variables is shown. s3 is the true point. The calculated \(A_t\) and the resulting \(m_h\) are listed for all calculations

In Table 4 the results on \(A_t\) and the calculated \(m_h\) are shown for the reduced precision setting of the algorithm. The maximal difference between the calculated Higgs masses is less than 1 MeV, i.e., largely sufficient given the experimental error.

The two EWSB calculations with the \(\mu \) parameter as input lead to almost identical results for all \(A_t\) solutions. The two EWSB calculations with \(\mu \) use \(\overline{m}_{H_u}^2\) as variable to test convergence whereas the other EWSB calculation uses \(\overline{m}_A^2\) to stop the RGE iterations. The calculation is performed with a fixed convergence precision. The impact on the \(m_h\) value for a change of \(\overline{m}^2_{H_u}\) and \(\overline{m}_A^2\) is not identical. Therefore the results for EWSB with the A boson mass, tree level or pole, can be different with respect to the calculation with the Higgs mass parameters. For the true solution s3, the maximal deviation for all EWSB variants is only two tenth of a permil, at 0.7 GeV with respect to the true \(A_t\) of the cliff point 1.

4.4 Beyond the cliff

The cliff point is a favorable situation as four distinct solutions exist. A parameter set can lead to a situation where the local minimum of the Higgs boson mass in the vicinity of \(A_t=0\) GeV is larger than the \(m_h\) parameter input. Alternatively the \(m_h\) parameter could be higher than either one or both of the maxima of Fig. 1.

To test the validity of the algorithm beyond the proof of concept in the cliff point, the input \(m_h\) parameter was varied. The algorithm was slightly extended for \(m_h\) values close to the maxima and the local minimum by a bisection algorithm. To ensure a logical coherence of the results, the regions of validity for \(A_t\) are defined for the four solutions:

  • s1: \(-\infty \) to \(A_t(m_h^{max}(A_t<0))\)

  • s2: \(A_t(m_h^{max}(A_t<0))\) to \(A_t(m_h^{min}(A_t\sim 0))\)

  • s3: \(A_t(m_h^{min}(A_t\sim 0))\) to \(A_t(m_h^{max}(A_t>0))\)

  • s4: \(A_t(m_h^{max}(A_t>0))\) to \(\infty .\)

If the \(m_h\) parameter is out of reach, the closest \(A_t\) in the region is used.

Fig. 4
figure 4

The result of the determination of \(A_t\) is shown as function of the chosen \(m_h\). The determination is performed separately for the four solutions. The histograms are stacked from s1 (bottom) to s4 (top), the horizontal lines at constant \(A_t\) at the transition between the regions show only the top color

The result of a scan of \(m_h\) values between 112 and 128 GeV is shown in Fig. 4. The four solutions are correctly reconstructed over the full range. For s2 and s3 the lowest parameter values cannot be reached, therefore the \(A_t\) value of the local minimal mass is determined. A similar feature is observed when the parameter is greater than the maximal \(m_h\) at 2-loop precision shown in Fig. 1. In this case \(A_t\) corresponding to the maximal mass achievable for the parameters is determined, resulting in a line parallel to the x-axis. The steps correspond to cases where the \(\mu \) parameter and the CP-odd scalar Higgs boson mass have converged to a slightly different value which leads to a step in \(A_t\) and an offset in the predicted \(m_h\), smaller than the systematic of 2 GeV typically used in scans of the parameter space. In total 256 inversions are shown of which only 3 have not converged correctly. These points are all located in regions where a larger \(m_h\) than possible was requested (horizontal lines at \(A_t \approx -4.95\) TeV and \(A_t\approx 4.85\) TeV) and for two the algorithm returned an error for non convergence. Since the step size of the scan of 0.25 GeV is smaller than the error on the Higgs mass of about 2 GeV, posterior distributions of \(A_t\) also give an additional handle on the outliers.

Figure 4 shows \(A_t\) as multi-valued function of \(m_h\) covering the same parameter space as Fig. 1. This shows that the inversion, based on the exact calculation, works, validating the use of the approximate one-loop calculation as a first guess. However, there could be problematic parameter sets for which the guess points are not sufficiently close to the fixed points to ensure convergence. An alternative algorithm relying on the knowledge of \(A_t(m_h^{max}(A_t<0)), A_t(m_h^{min}(A_t\sim 0))\) and \(A_t(m_h^{max}(A_t>0))\), combined with global rather than local criteria, could then be used.

4.5 Beyond the fixed-order approximation

Our inversion strategy is built upon fixed-order diagrammatic results, in particular upon the knowledge of the analytical form of the exact one-loop contribution to the Higgs mass. In more recent developments of MSSM Higgs mass radiative corrections, there are, however, configurations where an EFT approach is better suited for a reliable estimate of the Higgs mass, like typically when considering an MSSM setup with (very) heavy scalars (see e.g. [15] for a recent review). In this case, if for instance a heavy stop sector is integrated out from the low-energy EFT, a direct analytical relation between \(m_h\) and \(A_t\) will be essentially lost. Nevertheless, an inversion procedure can still be carried out: The needed relation will reside now in the matching condition at the boundary scale between the light and heavy sectors for the quartic Higgs coupling, from which the multi-\(A_t\) solutions can be retrieved analytically.

4.6 Beyond the CP-conserving case

If complex phases are allowed for the MSSM parameters, the dependence of \(m_h\) on \(A_t\) will be in general significantly modified due to loop induced CP-violating mixings among all the neutral Higgs states. The ensuing \(3 \times 3\) mass matrix after identifying the Goldstone boson, implies an analytical expression of \(m_h^2\) given by a root of a cubic equation, quite different from that obtained from the diagonalization of Eq. (1). Our approach can be easily extended to this case in the (experimentally likely) configuration where the MSSM spectrum, and in particular the charged Higgs mass, are much heavier than the \(125~\text {GeV}\) state. As well known, in this case the CP-violating mixing is essentially confined to the two heavy neutral states sector, see e.g. [72], and the phenomenology (mass, couplings) of the lightest would-be CP-even state becomes essentially the same as in the CP-conserving MSSM. In this limit, taking \(A_t\) and \(\mu \) complex (which is sufficient to account for the physically relevant phases, denoted \(\phi _{A_t}\) and \(\phi _{\mu }\)), the functional dependence of \(m_h\) on \(|A_t|\) will be the same as that on \(A_t\) in the CP-conserving case. The only difference is the reduction of some coefficients by cosine factors, (typically occurrences of \((\mu A_t)^n\) become \((\left|\mu \right|\left|A_t\right|)^n \cos n (\phi _{\mu } +\phi _{A_t})\), otherwise even powers \(A_t^{2n}\) become \(\left|A_t\right|^{2n}\), etc.). Modulo these modifications in Eqs. (8) and (18), our algorithm will work exactly as before, to determine \(\left|A_t\right|\) (and/or \(-\left|A_t\right|\)) solutions from \(m_h\), given \(\phi _{A_t}\) (and or \(\phi _{A_t}\pm \pi \)), a complex \(\mu \), and the other parameters of the MSSM. Beyond this (relative decoupling) limit, the algorithm can still provide an educated guess for \(\pm |A_t|\) input, within a more general numerical algorithm for the CP-violating MSSM.

5 Conclusion

The discovery and precise measurement of the Higgs boson motivates the redefinition of the MSSM as \(m_h\)MSSM. The algorithm presented in this paper replaces the trilinear coupling of the stop sector \(A_t\) with the measured \(m_h\) as input parameter.

The simplified expression of the radiative corrections to \(m_h\) and the exact full one-loop with the leading two-loop corrections are assembled in an algorithm to calculate \(A_t\). The algorithm has been applied to a benchmark point showing that the four solutions are determined with the expected precision. A single parameter scan with \(m_h\) as scan parameter shows that the \(A_t\) dependence is correctly reconstructed.

The general structure of the algorithm could also be applied to other parameters. Using the \(m_h\)MSSM may speed up the exploration of supersymmetric parameter space by ensuring the compatibility of all calculated spectra with the experimentally measured \(m_h\). Future work will center on applying the algorithm in multi-parameter scans.