B ->K* mu+ mu- optimised observables in the MSSM

We provide a detailed analysis of the impact of the newly measured optimised observables in the B ->K* mu+ mu- decay by the LHCb experiment. The analysis is performed in the MSSM, both in the context of the usual constrained scenarios, and in the context of a more general set-up where the SUSY partner masses are independent. We show that the global agreement of the MSSM solutions with the data is still very good. Nevertheless, especially in the constrained scenarios, the limits from B ->K* mu+ mu- are now very strong, and are at the same level as the well-known b ->s gamma constraints. We describe the implications of the B ->K* mu+ mu- measurements both on the Wilson coefficients and on the SUSY parameters.


Introduction
Rare b → s transitions provide a powerful probe of the flavour sector of the Standard Model (SM). The effective Hamiltonian formulation of these processes allow for a convenient low energy description in terms of short distance Wilson coefficients C i . Deviations from the SM predictions can then be consistently parametrised in terms of New Physics (NP) contributions to the Wilson coefficients (δC i ≡ C i − C SM i ). In addition, model predictions for the coefficients δC i can be computed by matching the model to the effective theory at the electroweak scale; thus these coefficients provide a link between the high energy model and the low energy phenomenology. In the case of b → s transitions, the relevant d = 6 Hamiltonian is given by Recently, a significant amount of work has been devoted to the study of b → s processes [1][2][3][4][5][6][7][8][9][10], where experimental progress has been outstanding (see e.g. Refs. [11][12][13][14][15] ). These processes are the most sensitive to H sl eff , and together with b → sγ processes, pose strong conditions on the corresponding Wilson coefficients. More specifically, H sl eff is given by 9,10  , while their primed counterparts as well as the scalar and pseudo-scalar Wilson coefficients vanish to a good approximation. Global fits to all available b → sγ and b → sµ + µ − data constrain significantly the allowed values for the NP contributions δC i [24][25][26][27], specially when the latest B → K * µ + µ − data is included [1][2][3][4].
It is precisely the decay B → K * (→ Kπ)µ + µ − that has attracted most of the attention recently. Its angular distribution [28] (see also e.g. [16,29]) provides a plethora of observables sensitive to different helicity structures in the decay amplitude. However, theoretical predictions for the most obvious observables inherit large uncertainties from not-so-well-known hadronic form factors. This has led to the construction of a number of optimised observables as appropriate ratios of angular coefficients where most of the dependence on the form factors cancels, while having high sensitivity to NP effects [27,[30][31][32][33][34]. A complete minimal set of such observables is given by the {P [27,32]. Rigorous SM predictions for these and other B → K * µ + µ − observables can be found in Ref. [35], and are also implemented within SuperIso [36].
While all previous experimental results from rare |∆B| = |∆S| = 1 processes were pointing to SM like values for the Wilson coefficients, the bounds on dileptonic operators were relatively mild still allowing |δC 9,10 ( ) /C SM 9,10 ( ) | ∼ O(1) [27]. First measurements of the full angular distribution by the LHCb collaboration [11,12] have changed the situation, presenting a tension in several binned observables, such as P 2 and P 5 .
In Ref. [1], the first model independent global fit for the Wilson coefficients using the recent LHCb results on B → K * µ + µ − combined with other existing b → s data was performed, hinting to a rather large negative NP contribution δC 9 ∼ −1.2. This was followed by a global fit in Ref. [2] indicating a value close to δC 9 ∼ −0.9 (in agreement with Ref. [1]), and a global Bayesian fit in Ref. [3] with a best fit value for δC 9 of either −1.3 or −0.3 depending on the set of observables used. An independent check has been given in Ref. [5]: a fit to B → K * µ + µ − and B s → φµ + µ − observables at low hadronic recoil only, based on the recent unquenched lattice results for the relevant form factors [6], leads to a similar value for δC 9 ∼ −1. In Ref. [4] it was shown that a consistent new physics explanation of theB →K * µ + µ − results in the context of minimal flavour violation is possible.
The model-independent global fits require C 7 and C 10 to be close to their SM values (in order to satisfy the strong B → X s γ and B s → µ + µ − constraints). This has promoted NP scenarios where δC ( ) 7 and δC ( ) 10 are negligible while δC 9 ∼ −1, as is the case with models with a Z gauge boson coupling to left-and right-handed leptons with equal strength [1,7,37] (arising for example in the 331 models [8,38]) or scenarios with effective four-quark scalar interactions [9]. It has also been argued that within SUSY models or in models with partial compositeness, such large values for δC 9 can hardly be obtained [2]. The analysis in Ref. [4] shows that the set of solutions with flipped sign SM Wilson coefficients has only a slightly larger χ 2 , although this scenario is considerably more aggressive in terms of New Physics.
TheB →K * µ + µ − decay has already been used to constrain the MSSM parameters (see e.g. [39,40]). In this work we study the implications of recent LHCb results for theB →K * µ + µ − decay (combined with existing experimental results for other b → s transitions) on the MSSM. Our numerical analysis is performed with SOFTSUSY [41] and SuperIso [36] and we study two classes of scenarios within the MSSM: the constrained MSSM models CMSSM and the Non Universal Higgs Mass (NUHM) model and a more general set-up known as phenomenological MSSM (pMSSM) which enables access to regions of the MSSM parameter space otherwise inaccessible with the aforementioned constrained scenarios. To investigate the viability of the MSSM scenarios considering recent experimental results onB →K * µ + µ − observables, we perform a model-dependent analysis on the Wilson coefficients using the other relevant b → s modes as well. Using this global analysis, constraints are obtained for several SUSY parameters within each scenario. This paper is organised as follows: in section 2 we provide the SM predictions and errors forB →K * µ + µ − observables as well as the other b → s processes considered in this work. In section 3 we present the parameter ranges within which the MSSM scenarios that we have considered are investigated. Our statistical treatment is described in Section 4, and Section 5 contains our results on the global analysis. Conclusions are given in section 6.

Optimised observables
Angular observables inB →K * (→ Kπ)µ + µ − arise from the differential decay rate: J 1s sin 2 θ K + J 1c cos 2 θ K + (J 2s sin 2 θ K + J 2c cos 2 θ K ) cos 2θ l +J 3 sin 2 θ K sin 2 θ l cos 2φ + J 4 sin 2θ K sin 2θ l cos φ + J 5 sin 2θ K sin θ l cos φ +(J 6s sin 2 θ K + J 6c cos 2 θ K ) cos θ l + J 7 sin 2θ K sin θ l sin φ + J 8 sin 2θ K sin 2θ l sin φ +J 9 sin 2 θ K sin 2 θ l sin 2φ , (2.4) where notation and conventions follow Refs. [16,32,33]. Here we focus on CP-averaged quantities, for which the angular distribution dΓ of the CP-conjugated process B → K * µ + µ − must also be considered. This is obtained from Eq. (2.4) by replacing J 1,2,3,4,7 →J 1,2,3,4,7 and J 5,6,8,9 → −J 5,6,8,9 , whereJ is equal to J with all weak phases conjugated (see e.g. Ref. [28]). The basic observables are the functions J i (q 2 ) integrated in q 2 -bins (q 2 is the squared invariant mass of the muon pair). From these a number of optimised observables can be constructed, where some hadronic uncertainties are minimised by taking appropriate ratios. A set of such optimised observables have been measured recently by the LHCb collaboration [11,12], these are defined as [27,32,35]: , , where the normalisation N bin is given by In addition, several non-optimised observables can be introduced in the analysis which are theoretically independent from the previous ones. In this paper we will consider the branching ratio and the longitudinal polarisation fraction: and analogously forΓ. We do not consider the forward-backward asymmetry, which is significantly correlated to the observable P 2 [4]. The set of SM predictions and experimental measurements for all these observables are summarised in Table 1.
For the computation of these B → K * µ + µ − observables in terms of the Wilson coefficients we follow Ref. [35]. At large recoil (low q 2 ) the theoretical tools are those of QCDF/SCET as described in Refs. [42,43]. In this kinematical region several form-factor relations [44,45] allow to build the various optimised observables [27,32,35]. At low recoil (large q 2 ) the computation relies on an OPE for the relevant non-local hadronic matrix element, either in full QCD [46] or within the HQET [47]. Form-factor relations arise in HQET, allowing to construct optimised observables in this kinematic region [33][34][35], but the observables in Eq. (2.5) are not optimised at low recoil. The full form factors needed as an input are taken from the large-recoil LCSR computation in Ref. [48]. These are extrapolated to the low recoil region following the procedure detailed in Ref. [35], and are consistent with lattice QCD results in this region [6,49], as well as with HQET form factor relations [35,47], for which a 20% error from 1/m b power corrections is included. Concerning power corrections to the hadronic contribution at low recoil, a 10% error is added to each amplitude in an uncorrelated manner and with an arbitrary phase. The computation of these observables is implemented within SuperIso [36], and marginal numerical differences on the SM predictions with respect to Ref. [35] are due to slightly different choice of central values for input parameters.
A visualisation of how these observables depend on the various Wilson coefficients can be obtained by linearising with respect to the NP contributions to the coefficients. In addition, New Physics contributions to the primed coefficients C 7,9,10 are always negligible in the scenarios considered in this paper, being suppressed by a factor m s /m b like in the SM. Within these approximations, we have:   and P 1 depends (almost) only on right-handed currents. In addition, the observables P 5 and P 6 depend formally on scalar operators through the combination (C Q 1 − C Q 1 ). However the coefficient in the linearised expression is smaller than 0.01 and has been neglected. We remind that the coefficients δC i are defined at the scale m b . The strongest constraint on δC 7 is provided by the branching ratio BR(B → X s γ), which strongly favours values around δC 7 ∈ (−0.07, 0.04), with preference for negative values. Looking at Eqs. (2.9) one can see that such negative small contributions to C 7 tend to increase the theory predictions for P 2 [ and P 4 go in the opposite direction, although still compatible within errors (experimental uncertainties on P 4 are still rather large). Therefore, these observables will generally drift δC 7 towards small negative values in agreement with the expectations from BR(B → X s γ) (see Ref. [1] and Fig. 5 therein).
However, in order to explain the observable P 5 [4.3,8.68] , and to a lesser extent also P 2 [4.3,8.68] and P 5 [2,4.3] , contributions to other Wilson coefficients are necessary. As can be seen from Eqs. (2.9), the sensitivity to δC 10 is lower than to δC 9 , so the latter is the best candidate. In addition, δC 10 is constrained by BR(B s → µ + µ − ) to an extent that depends on the pattern of scalar contributions. In the MSSM (where C Q 1 ≈ −C Q 2 ) this constraint is very effective. In this context it was shown in Ref. [1] that the most favourable scenario involves a significant negative contribution to C 9 .
Such large NP contributions to C 9 are not possible in general within the MSSM (see also Ref. [2]), and as we shall see, moderately large negative values for δC 9 -achievable within corners of the pMSSM -are typically correlated to large values of other coefficients that lead to tensions with other flavour data. However, we will also see that all B → K * µ + µ − observables including P

MSSM scenarios
We consider two classes of scenarios: the constrained MSSM scenarios CMSSM and NUHM, and a more general set-up with no universality assumptions for the sparticle masses at a high scale: the phenomenological MSSM (pMSSM) [50]. We scan over the parameters of the MSSM   Table 2: Parameter ranges adopted in the scans (in GeV when applicable) for the CMSSM, NUHM and pMSSM scenarios. scenarios using SOFTSUSY 3.3.10 [41] in the ranges given in Table 2, and generate several millions of random points for each scenario. For each generated point, we then calculate the flavour observables with SuperIso v3.4 [36]. We apply a few loose cuts for some of the SUSY masses to avoid regions of the parameters that are excluded by direct SUSY searches. In particular we impose the gluino and squark masses to be above 500 GeV, and the chargino masses above 150 GeV. These limits could be much stronger in the CMSSM, but since the strong CMSSM limits are falsified in the case of pMSSM (see Ref. [51]) we consider the same loose cuts for all the scenarios we analyse here.

Statistical method
In order to study the overall compatibility of the MSSM scenarios with the B → K * µ + µ − measurements, including all other relevant flavour constraints, we perform a statistical χ 2 analysis of the data. We consider all 45 observables collected in Table 1, and construct the χ 2 distribution: where the central value of the experimental result and theoretical prediction of observable i are given in O exp i and O th i respectively. The first term contains the contribution to the χ 2 from the B → K * µ + µ − observables. Here we include the experimental correlations among the B → K * µ + µ − observables, and the inverse of the correlation matrices (σ (bin) ) −1 for each bin are taken from [4,52]. The second term is the standard χ 2 without correlations for all the other observables. σ exp i and σ th i are the experimental and theoretical errors respectively. Each MSSM point (defining a particular model ) is therefore assigned a χ 2 value, which is directly related to the p-value of the model: the probability that another measurement of these observables leads to worse agreement with the model predictions, assuming the model is true. All models with p-values greater thanp lie inside a (1 −p) × 100% C.L. region. One-, two-, and threesigma regions are defined as the 68.3%, 95.4% and 99.7% C.L. regions respectively. Note that this approach is different from the one used in e.g. Refs. [1,27] as well as from the Bayesian procedure in Refs. [3,24,25] where the difference of the χ 2 for each point with the χ 2 of the best fit point is used to define the different allowed regions.

MSSM predictions for
We study first the reach of each MSSM scenario for all B → K * µ + µ − observables, without applying any flavour or Higgs mass constraint. In this case the parameter space in each model is large enough to contribute significantly to each observable. The most interesting observables, P 2 , P 4 and P 5 , are shown in Fig. 1, where the bands show the span of values allowed in this set-up. For comparison also the experimental results are shown (black data points), together with the SM central values (red line). As can be seen from the figure all MSSM scenarios can explain, within 1σ, each individual tension for all observables 1 . Any tension between the MSSM and B → K * µ + µ − data can only come from correlations among this data, or from the combination with other constraints.
One such independent constraint is the Higgs mass. In order to see the effect of this constraint, we impose the condition 121 < M h < 129 GeV [53, 54] on the scan points in each MSSM scenario. The MSSM contributions to B → K * µ + µ − observables are slightly reduced, as can be seen in Fig. 1, where the light purple bands show the span of values allowed from MSSM points satisfying the Higgs mass constraint. This reduction is inconsequential for all observables (again, barring possible correlations), except for P 5 [4.3,8.68] , where one-sigma agreement is impossible in the constrained scenarios. The pMSSM on the other hand is general enough to provide solutions at the 1σ level.
On the basis of these results, the relevant issue is how these observables are correlated in the different MSSM scenarios, among themselves and with other flavour observables. Even though Fig. 1 looks promising, the region in MSSM parameter space where a good agreement is found for e.g. P 2 [2,4.3] , could be predicting unacceptable values for P 5 [2,4.3] , or for other observables such as BR(B → X s γ). In this respect this is the main characteristic feature of a model-dependent analysis in contrast to a model-independent one where the various Wilson We consider first several interesting examples of correlations among different observables within the MSSM. Even in a model-independent set-up, where all Wilson coefficients are treated as independent free parameters, strong correlations among different observables are present. In the MSSM, and in the case of small contributions to δC 9 , sufficiently large values of δC 7 can enhance the relevant B → K * µ + µ − observables enough to match the data. Irrespectively of whether these values are correlated to other Wilson coefficients, these large values of δC 7 are in tension with the measurement of BR(B → X s γ). This is a (known) model-independent statement [1,27]. With values for the Wilson coefficients reachable in the MSSM, the observables P 2 [2,4.3] , P 5 [2,4.3] and P 5 [4.3,8.68] can acquire large values in agreement with data; however this is mainly due to contributions to δC 7 that enhance BR(B → X s γ) -up to unacceptable levels in the case of P 5 [4.3,8.68] (see Fig. 2). One has then to reach a statistical compromise between these two competing tensions, in order to decide whether δC 7 is more likely enhanced or not. Nevertheless, this tension will lead to a global enhancement of the χ 2 , and again the statistical framework will tell if this enhancement is admissible or not, in the latter case ruling out the corresponding MSSM scenario according to some statistical criteria. We should note however that both B → X s γ and B → K * µ − µ + observables show a preference for negative values of δC 7 (albeit to different degrees) which is by itself an intriguing fact.
In the model-independent framework, this tension can be cured in part with sizeable contributions to C 9 , although it can never provide a good description of P 5 [4.3,8.68] , at least without  Table 1, including the Higgs mass constraint.
contributions to C 9 [1]. In the MSSM, not only we have no means of generating any sizeable contribution to the coefficient C 9 , but also any significant contribution to C 9 is correlated to contributions to other Wilson coefficients affecting the other observables. Some of these interesting patterns are shown in Fig. 2.

Constraints on the Wilson coefficients
The flavour observables depend on the MSSM parameters only through the Wilson coefficients, and so is the case for the χ 2 distribution. It is therefore quite convenient to look at the models and their C.L. regions in the space of Wilson coefficients. This is shown in Fig. 3, where all the points in the scans are shown (grey), indicating the values for the Wilson coefficients allowed within the MSSM, as well as the points satisfying the global 1,2,3 σ constraints (red, yellow, green). The Higgs mass constraint has also been imposed, all the points being consistent with a Higgs mass 121 < M h < 129 GeV.
The comparison with the model-independent fit in Ref. [1] as well as with Refs. [2,3] is not straightforward. In particular, the results in Fig. 3 do not contradict the result of Ref. [1] that a scenario with δC 9 ∼ −1.2 and δC 7 ∼ −0.1 is the most favourable one and with respect to which the SM hypothesis has a pull of ∼ 3-4 σ. Notice that: (a) No such large values for δC 9 are allowed within these MSSM scenarios, and (b) even though the χ 2 grows for δC 9 decreasing from ∼ −0.2 (at least within the limited number of scan points, see the center plot in the last row of Fig. 3), these points lead simultaneously to values for other Wilson coefficients away from zero, and therefore cannot be compared with a model-independent analysis where δC 9 can be decreased independently of the values of the other coefficients. As mentioned earlier, this is the result of strong model-dependent correlations absent within the model-independent approach.
As a conclusion, the most favourable MSSM models given the B → K * µ + µ − data lead only to small deviations for the Wilson coefficients with respect to their SM values. The best scenarios in terms of Wilson coefficients, known from model-independent analyses, are not reachable within the constrained MSSM models, or even within the more generous pMSSM. However, the overall agreement is fairly good, with regions in SUSY parameter space where the absolute χ 2 is sufficiently small. Within this statistical criteria the MSSM is still well compatible with the data.
It is instructive to go back to Fig. 1 and impose the constraints derived from the global fit. This is shown as the over-imposed green bands, showing the span of models that survive the 1 σ constraints from the global fit. We see that good agreement with P 2 [2,4.3] and P 5 [2,4.3] can be obtained, improving the situation with respect to the SM. On the other hand, the tension in P 5 [4.3,8.68] cannot be explained within the pMSSM. This is compatible with the previous observation that, without significant contributions to C 9 , a large value for P 5 [4.3,8.68] requires extreme values for other Wilson coefficients that, while reachable within the MSSM, are ruled out by other flavour bounds.

Implications for MSSM parameters
The results of the global fit can be used to put constraints on the supersymmetric parameters. This is done by computing such quantities from the input parameters within each MSSM scenario which satisfies the global constraints in the fit.
First, in order to see the impact of the newly measured B → K * µ + µ − observables in a simple framework, in Fig. 4 we compare the constraining power of the fit to B → K * µ + µ − observables with some of the other flavour observables (i.e. the Branching Ratios of B s → µ + µ − , B → X s γ and B → τ ν) in the CMSSM. We have kept tan β to the fixed values of 20 and 40 and the trilinear soft breaking parameter A 0 = −2m 0 (this choice allows us to have a better agreement with the Higgs mass). In this way, the results in the two dimensional (m 1/2 , m 0 ) plane can be easily understood. The constraints from the combination of B → K * µ + µ − observables are shown in red and can be compared to other flavour constraints shown in different colours according to the legends in the plots. It is remarkable that the constraints from B → K * µ + µ − using the recent measurements are at the same level as the ones from the well-known BR(B → X s γ) constraints. Also, the constraints from B s → µ + µ − while being very strong at large tan β, are less restrictive for lower values of tan β where the B → K * µ + µ − constraints dominate. For comparison we show also in the figure the direct SUSY search limits in the same plane from ATLAS with 20.3 fb −1 of data at √ s = 8 TeV [55], as well as the region where the Higgs mass is above 122 GeV. It is also very interesting to notice that the flavour constraints start being stronger than the direct search limits for large values of tan β 20.
Next we consider the MSSM scenarios in full generality without fixing any parameters. The constraints from the global fit on the physical parameters Mt 1 , M A and tan β are shown in Fig. 5. Several interesting features can be observed. First, while the constraints are very strong in the CMSSM, they can be easily relaxed in the more general scenarios NUHM or pMSSM. In the case of the CMSSM, the pseudo-scalar Higgs mass is constrained quite generally to be above ∼ 1 TeV. This strong limit from flavour physics is independent of the value of tan β. Another interesting feature is the impact on the lightest stop mass. In the constrained scenarios, flavour constraints require the lightest stop to be heavier than ∼ 500 GeV. On the other hand, in the pMSSM the flavour constraints are much weaker, excluding only scenarios where M A and Mt 1 are both small, or with light M A and large tan β.
Finally, we devote a special attention to P 5 , as this is the only observable presenting more than 2σ tension in the MSSM, and study how the value of the observable P 5 [1,6] changes as a function of the MSSM parameters. The results are displayed in Fig. 6. As can be seen, the tension with respect to the experimental result is reduced for small chargino or stop masses, while the other SUSY parameters are less relevant.

Conclusions
The angular distribution of the decay B → K * µ + µ − provides a large number of observables, probing NP in a complementary way to other related flavour processes. This exclusive mode is under good theoretical control, with the main hadronic uncertainties stemming from the form factors and (at large recoil) unknown power corrections. In this context the use of optimised observables such as the set {P ( ) i } is very convenient in phenomenological analyses. Recent LHCb measurements of these optimised observables have shown certain discrepancies with SM predictions, specifically in some of the P 2 and P 5 bins. Model-independent global fits for the Wilson coefficients have shown that the tensions can be relaxed if δC 9 ∼ −1. In this paper we have studied the implications of these new results along with other relevant b → s transitions on several MSSM scenarios: the constrained CMSSM and NUHM scenarios and also the more general pMSSM framework. While large negative contributions to C 9 are not accessible within the MSSM, it is interesting to note that the MSSM has the potential to reproduce each of the measurements individually at the 1 σ level. Correlations among the different observables are then the central issue.
In order to study the impact of the recent LHCb results, we have first performed a global fit to the Wilson coefficients, including most of the available b → s modes. The comparison of this analysis with previous model-independent analyses is not straightforward, but can be understood in terms of model-dependent correlations and a different statistical set-up. In our analysis, the best fit values for the Wilson coefficients not only depend on the correlations among the different flavour observables but also on the correlations between the various Wilson coefficients in each specific MSSM scenario. Even though the small value of C 9 ∼ 3 favourable from model-independent analyses is not accessible within MSSM, there are regions of the MSSM parameter space that (within some statistical criteria) are still well compatible with the experimental data.
We have translated the constraints on different SUSY parameters in the three aforementioned scenarios. These constraints provide complementary information to direct searches, and show that the data on B → K * µ + µ − has become competitive with the traditional modes such as B → X s γ and B s → µ + µ − .
The observable P 5 remains nevertheless a challenge for SUSY models. We have shown its dependence on some of the more relevant SUSY parameters, making it possible to easily look for the preferred region of parameter space within the mentioned MSSM scenarios when experimental updates on P 5 are available. We are looking forward to the experimental update including the full 3 fb −1 data-set collected at LHCb.