1 Introduction

The Belle II experiment recently announced the first measurement of \(B^+\rightarrow K^+ +\textrm{inv}\) with a branching ratio of \(\textrm{BR}(B^+\rightarrow K^+ +\textrm{inv}) = (2.4\pm 0.7)\times 10^{-5}\) [1], which is \(2.8\sigma \) above the Standard Model (SM) prediction BR(\(B^+\rightarrow K^+ \nu \bar{\nu })_\textrm{SM}=(5.06\pm 0.14\pm 0.28)\times 10^{-6}\) [2]. The result assumes 3-body decay kinematics as predicted by the SM and is based on two independent analyses, one inclusive analysis and one with a hadronic tag. While the hadronic tag analysis is consistent with the SM prediction within \(1\sigma \), the more sensitive inclusive tag analysis shows a \(3\sigma \) deviation from the SM prediction. Together with the previous searches by Belle [3, 4], BaBar [5] and Belle II [6], a simple weighted average results in BR\((B^+\rightarrow K^++\textrm{inv}) = (1.4\pm 0.4)\times 10^{-5}\) [1]. Motivated by the result, Refs. [7, 8] studied the implication within Standard Model effective field theory (SMEFT) and Ref. [9] demonstrated that non-universal lepton-flavour-conserving \(Z'\) models are not able to accommodate the excess and thus proposed the introduction of lepton-flavour-violating couplings. Several earlier works [10,11,12,13,14,15] also proposed explanations for an excess in \(B^+\rightarrow K^++\textrm{inv}\) motivated by the result of the 2021 Belle II analysis [6], which first used the inclusive tagging method.

One attractive explanation of the excess in \(B^+\rightarrow K^++\textrm{inv}\) is an additional decay channel with undetected final states, like sterile neutrinos [11, 12], dark matter [13, 16, 17] or more generally long-lived particles [14, 18,19,20,21,22]. Additional light sterile neutrinos are particularly well motivated, and occur in numerous minimal extensions of the SM, see e.g. [23,24,25,26,27,28]. Light sterile neutrinos have been proposed as an explanation of the deviation from lepton flavour universality in \(R(D^{(\star )})\) [25, 29,30,31,32,33,34,35] by introducing new decay channels, \(B\rightarrow D^{(\star )}\tau N\), with sterile neutrinos N. Light GeV-scale sterile neutrinos may also explain neutrino masses via the seesaw mechanism [36] like in the \(\nu \)MSM [37]. The lifetime of sterile neutrinos is constrained from big bang nucleosynthesis (BBN) to be shorter than 0.02 s [38]. Together with searches for sterile neutrinos at beam dump experiments, this translates into a lower bound on their mass in the minimal seesaw model of \(\mathcal {O}(350)\) MeV apart from a small mass window at 120–140 MeV [39]. Although the lower bound is well below the \(B^+\) meson mass, the sterile neutrino mass will become important for a relevant part of the parameter space and thus should be included in the analysis.

B meson decays are best described within low energy effective theory (LEFT). Most studies focus on vector operators [17, 40,41,42], because they are predicted in the SM, but sterile neutrinos motivate to extend the analysis to scalar and tensor operators. The scenario of massless sterile neutrinos has been discussed in [7, 10, 43,44,45] and the general case with massive neutrinos has been considered by some of us in [12] in terms of helicity amplitudes [46]. There are also recent studies of \(B\rightarrow K^{(\star )} \nu \bar{\nu }\) decays within SMEFT, which discuss the complementarity with other observables [7, 45, 47] including comments on scalar operators with (massless) right-handed neutrinos [7, 45].

In this work, we will build on a previous analysis within LEFT [12], but work within SM effective field theory extended by sterile neutrinos (\(\nu \)SMEFT). Within the framework of \(\nu \)SMEFT we consider additional decay channels to sterile neutrinos including the full sterile neutrino mass dependence and identify the viable regions of parameter space which explain the excess in \(B^+\rightarrow K^++\textrm{inv}\) and are consistent with the non-observation of the other three \(B\rightarrow K^{(\star )} +\textrm{inv}\) decay modes, invisible \(B_s\) decays and measurements of charged current \(B\rightarrow D^{(\star )}\ell \bar{\nu }\) decays. We also point out ultraviolet (UV) completions for the viable \(\nu \)SMEFT operators.

The remainder of the paper is structured as follows. In Sect. 2 we introduce the effective field theory (EFT) framework, discuss both \(\nu \)SMEFT and LEFT and how they are connected via renormalisation group (RG) running. The relevant observables are introduced in Sect. 3 and the results discussed in Sect. 4. Finally, possible UV completions are presented in Sect. 5 before concluding in Sect. 6. A few technical details are relegated to the Appendix.

2 Effective field theory framework

Starting at dimension-6, there are operators in \(\nu \)SMEFT which contribute to \(b\rightarrow s\nu \nu \) at tree level. We focus on the four semi-leptonic 4-fermion operators with sterile neutrinos [48]

$$\begin{aligned} \mathcal {L}{} & {} = C^\textrm{QN} (\bar{Q} \gamma _\mu Q) (\bar{N} \gamma ^\mu N) + C^\textrm{dN} (\bar{d} \gamma _\mu d) (\bar{N}\gamma ^\mu N)\nonumber \\{} & {} \quad + C^\textrm{LNQd} (\bar{L}^\alpha N)\epsilon _{\alpha \beta } (\bar{Q}^\beta d)\nonumber \\{} & {} \quad + C^\textrm{LNQdT} (\bar{L}^\alpha \sigma ^{\mu \nu }N) \epsilon _{\alpha \beta } (\bar{Q}^\beta \sigma _{\mu \nu } d), \end{aligned}$$
(1)

where Q, L and d are the SM left-handed quark doublet, left-handed lepton doublet and right-handed down-type quark singlet, respectively, and N denotes right-handed neutrinos, i.e. right-handed SM singlet fermions. Flavour and colour indices are suppressed, \(\sigma ^{\mu \nu }=\tfrac{i}{2}[\gamma ^\mu ,\gamma ^\nu ]\) and the Levi-Civita tensor is defined with \(\epsilon _{12}=1\). Assigning lepton number \(+1\) to N, all four operators are lepton number conserving. In contrast to [48], we introduce a tensor operator. This operator basis is convenient as \(\mathcal {O}^\textrm{LNQd}\) and \(\mathcal {O}^\textrm{LNQdT}\) do not mix under 1-loop QCD renormalisation group (RG) running. The tensor operator \(\mathcal {O}^\textrm{LNQdT}\) is related to the scalar operators in [48] via a Fierz transformation

$$\begin{aligned}{} & {} (\bar{L}^\alpha \sigma ^{\mu \nu } N)\epsilon _{\alpha \beta }(\bar{Q}^\beta \sigma _{\mu \nu } d)\nonumber \\{} & {} \quad = -8(\bar{L}^\alpha d)\epsilon _{\alpha \beta } (\bar{Q}^\beta N) -4 (\bar{L}^\alpha N) \epsilon _{\alpha \beta } (\bar{Q}^\beta d). \end{aligned}$$
(2)

For this analysis we define all \(\nu \)SMEFT operators at the scale \(\mu =1\) TeV. At the electroweak scale \({\varLambda }_\textrm{EW} = m_Z\), the \(\nu \)SMEFT operators are matched onto LEFT.

Only a few operators in LEFT are generated at tree level from the set of \(\nu \)SMEFT operators introduced above. The relevant interactions for \(b\rightarrow s \nu \nu \) and \(b\rightarrow c \ell \nu \) processes are described by the Lagrangian [49, 50]

$$\begin{aligned} \begin{aligned} \mathcal {L}&= \sum _{X=L,R} C^{\text {VLX}}_{\nu d} \mathcal {O}^{\text {VLX}}_{\nu d} + \Big ( C_{\nu d}^{\text {SLL}} \mathcal {O}_{\nu d}^{\text {SLL}} + C_{\nu d}^{\text {TLL}} \mathcal {O}_{\nu d}^{\text {TLL}}\\&\quad + C_{\nu edu}^{\text {VLL}} \mathcal {O}_{\nu edu}^{\text {VLL}} + C_{\nu edu}^{\text {SLL}} \mathcal {O}_{\nu edu}^{\text {SLL}} + C_{\nu edu}^{\text {TLL}} \mathcal {O}_{\nu edu}^{\text {TLL}}+\mathrm {h.c.} \Big ) \end{aligned} \end{aligned}$$
(3)

with the effective operators

$$\begin{aligned} \begin{aligned} \mathcal {O}_{\nu d}^{\text {VLX}}&= ( \overline{\nu _L}\gamma _\mu \nu _L)(\overline{d_X} \gamma ^\mu d_X),\\ \mathcal {O}_{\nu d}^{\text {SLL}}&= ( \overline{\nu ^c_L} \nu _L)(\overline{d_R} d_L),\\ \mathcal {O}_{\nu d}^{\text {TLL}}&= (\overline{\nu ^c_L} \sigma _{\mu \nu } \nu _L) (\overline{d_R} \sigma ^{\mu \nu } d_L),\\ \mathcal {O}_{\nu e du}^{\text {VLL}}&= ( \overline{\nu _L}\gamma _\mu e_L)(\overline{d_L} \gamma ^\mu u_L),\\ \mathcal {O}_{\nu edu}^{\text {SLL}}&= ( \overline{\nu ^c_L} e_L)(\overline{d_R} u_L),\\ \mathcal {O}_{\nu e du}^{\text {TLL}}&= (\overline{\nu ^c_L} \sigma _{\mu \nu } e_L) (\overline{d_R} \sigma ^{\mu \nu } u_L). \end{aligned} \end{aligned}$$
(4)

Right-handed neutrinos N are expressed in terms of left-handed Weyl spinors \(\nu _L = N^c \equiv C \overline{N} ^T\) with the charge conjugation matrix \(C=i\gamma ^2\gamma ^0\). SM neutrinos carry the generation index \(\alpha =1,2,3\) and sterile neutrinos are labelled with \(\alpha \ge 4\).

Note that the scalar operator \(\mathcal {O}_{\nu d}^{\text {SLL}}\) is symmetric in the neutrino flavours and the tensor operator \(\mathcal {O}_{\nu d}^{\text {TLL}}\) is antisymmetric in the neutrino flavours, which can be straightforwardly derived from

$$\begin{aligned} \overline{\psi _i^c} {\varGamma }\psi ^{cj}&= \eta _{\varGamma } \overline{\psi ^j} {\varGamma } \psi _i, \;\; C^{-1}{\varGamma } C = \eta _{\varGamma } {\varGamma }^T, \nonumber \\ \eta _{\varGamma }&= {\left\{ \begin{array}{ll} +1 &{} \textrm{for}\;\;{\varGamma }=1,\gamma _5,\gamma ^\mu \gamma _5\\ -1 &{} \textrm{for} \;\; {\varGamma } = \gamma ^\mu ,\sigma ^{\mu \nu }, \sigma ^{\mu \nu }\gamma _5 \end{array}\right. }. \end{aligned}$$
(5)

The other operators do not exhibit any manifest symmetry properties.Footnote 1

The matching conditions have been obtained by translating the existing matching results in the literature [50,51,52] to the operator basis we are using. All LEFT operators are given in the mass eigenstate basis, while the operators in \(\nu \)SMEFT are defined in the interaction basis, where the charged leptons and down-type quarks are mass eigenstates. Up-type quark mass eigenstates \(\hat{u}_i\) and neutrino mass eigenstates \(\hat{\nu }_i\) are related to the interaction basis by

$$\begin{aligned} u_\alpha&= (V^\dagger )_{\alpha i} \hat{u}_i,&\nu _\alpha&= U_{\alpha i} \hat{\nu }_i, \end{aligned}$$
(7)

where V denotes the Cabibbo–Kobayashi–Maskawa (CKM) matrix, and U is the leptonic mixing matrix. The top-left \(3\times 3\) sub-block of the leptonic mixing matrix U is approximately given by the Pontecorvo–Maki–Nakagawa–Sakata (PMNS) matrix. For a large part of the parameter space, (sterile) neutrino masses are negligible compared to the B meson mass, \(m_\alpha \ll m_B\), and thus (sterile) neutrinos can be treated as massless and the PMNS matrix is unphysical in this limiting case. Even for sterile neutrinos with GeV-scale masses, CHARM [53], NuTeV [54] and DELPHI [55] constrain the active-sterile mixing to be small, \(|U|^2\lesssim 10^{-5}\), for the relevant region in parameter space. Thus, in the following we neglect leptonic mixing and treat the sterile neutrinos as mass eigenstates, and so drop the hats from the mass eigenstates for simplicity.

At tree level, we find the following non-vanishing down-quark flavour-violating Wilson coefficients for \(b\rightarrow s\nu \nu \) processes and vector operators with \(\alpha ,\beta \ge 4\)

$$\begin{aligned} \begin{aligned} C_{\nu d,\alpha \beta sb}^{\text {VLL}}&= -C^\textrm{Q N}_{sb (\beta -3)(\alpha -3) },\\ C_{\nu d,\alpha \beta sb}^{\text {VLR}}&= -C^\textrm{d N}_{sb (\beta -3)(\alpha -3)}, \end{aligned} \end{aligned}$$
(8)

and for the scalar and tensor operators

$$\begin{aligned} \begin{aligned} C^{\text {SLL}}_{\nu d, \alpha \beta sb}&= {\left\{ \begin{array}{ll} \frac{1}{2} C^\mathrm{L N Qd*}_{\beta (\alpha -3) bs} &{} \alpha \ge 4;\; 1\le \beta \le 3\\ \frac{1}{2} C^\mathrm{L N Qd*}_{\alpha (\beta -3) bs} &{} 1\le \alpha \le 3;\; \beta \ge 4 \end{array}\right. },\\ C^{\text {SLL}}_{\nu d, \alpha \beta bs}&= {\left\{ \begin{array}{ll} \frac{1}{2} C^\mathrm{L N Qd*}_{\beta (\alpha -3) sb} &{} \alpha \ge 4;\; 1\le \beta \le 3\\ \frac{1}{2} C^\mathrm{L N Qd*}_{\alpha (\beta -3) sb} &{} 1\le \alpha \le 3;\; \beta \ge 4 \end{array}\right. }, \\ C^{\text {TLL}}_{\nu d, \alpha \beta sb}&= {\left\{ \begin{array}{ll} \frac{1}{2} C^\mathrm{LNQdT*}_{\beta (\alpha -3) bs} &{} \alpha \ge 4;\; 1\le \beta \le 3\\ - \frac{1}{2} C^\mathrm{LNQdT*}_{\alpha (\beta -3) bs} &{} 1\le \alpha \le 3;\; \beta \ge 4 \end{array}\right. }.\\ C^{\text {TLL}}_{\nu d, \alpha \beta bs}&= {\left\{ \begin{array}{ll} \frac{1}{2} C^\mathrm{LNQdT*}_{\beta (\alpha -3) sb} &{} \alpha \ge 4;\; 1\le \beta \le 3\\ - \frac{1}{2} C^\mathrm{LNQdT*}_{\alpha (\beta -3) sb} &{} 1\le \alpha \le 3;\; \beta \ge 4 \end{array}\right. }. \end{aligned} \end{aligned}$$
(9)

Note, we explicitly (anti-)symmetrised the scalar (tensor) Wilson coefficients. Furthermore, at tree level the non-vanishing LEFT operators for \(b\rightarrow c\ell \nu \) processes are given by scalar and tensor operators which are matched to \(\nu \)SMEFT operators

$$\begin{aligned} \begin{aligned} C^{\text {SLL}}_{\nu edu, \alpha \beta bc}&= -V_{cm}^*C^\mathrm{L N Qd*}_{\beta (\alpha -3) mb},\\ C^{\text {TLL}}_{\nu edu, \alpha \beta bc}&= - V_{cm}^*C^\mathrm{LNQdT*}_{\beta (\alpha -3) mb}&\textrm{with}\,&\,\, \alpha \ge 4. \end{aligned} \end{aligned}$$
(10)

In addition to the contribution from \(\nu \)SMEFT dimension-6 operators, the SM contribution to the LEFT operators relevant for \(b\rightarrow s,c\) processes can be summarised as

$$\begin{aligned} \begin{aligned} C^\textrm{VLL, SM}_{\nu d, \alpha \beta sb}&= - \frac{4 G_F}{\sqrt{2}} \frac{\alpha }{2\pi } V_{ts}^* V_{tb} \frac{X}{\sin ^2\theta _w}\delta _{\alpha \beta },\\ C^\textrm{VLL,SM}_{\nu e du, \alpha \beta bc}&= -\frac{4 G_F}{\sqrt{2}} V_{cb}^*\delta _{\alpha \beta } \end{aligned} \end{aligned}$$
(11)

in terms of the function X [56]. The Wilson coefficient for \(b\rightarrow s\nu \bar{\nu }\) includes electroweak corrections induced by top quarks at 2-loop order, which are numerically given by \(X=6.3772 \sin ^2\theta _w\) [57,58,59].

The dominant RG corrections originate from strong interactions. While the (axial-)vector current operators do not receive QCD corrections at 1-loop order, scalar and tensor operators receive RG corrections. The 1-loop RG equations for the scalar and tensor Wilson coefficients \(C^S\) and \(C^T\) are

$$\begin{aligned} \frac{d \ln C^S}{d\ln \mu }&= -3C_F \frac{\alpha _s}{2\pi },&\frac{d \ln C^T}{d\ln \mu }&= C_F \frac{\alpha _s}{2\pi }, \end{aligned}$$
(12)

with the second Casimir invariant \(C_F=(N_c^2-1)/2N_c=4/3\) for \(N_c=3\) and \(\alpha _s=g_s^2/4\pi \). Taking into account 1-loop QCD running, we arrive at

$$\begin{aligned} \begin{aligned} C^S(\mu _1)&= \left( \frac{\alpha _s(\mu _2)}{\alpha _s(\mu _1)}\right) ^{3C_F/b} C^S(\mu _2),\\ C^T(\mu _1)&= \left( \frac{\alpha _s(\mu _2)}{\alpha _s(\mu _1)}\right) ^{-C_F/b} C^T(\mu _2), \end{aligned} \end{aligned}$$
(13)

where \(b=-11+\tfrac{2}{3} n_f\) and \(n_f\) is the number of active quark flavours. As there is no operator mixing for 1-loop QCD running in the chosen operator basis, there is a one-to-one correspondence between a given LEFT Wilson coefficient at the hadronic scale \(\mu =4.8\) GeV and the \(\nu \)SMEFT Wilson coefficient at \(\mu =1\) TeV: All scalar LEFT operators are rescaled with the factor 1.608 and tensor operators with 0.853 compared to the result from matching \(\nu \)SMEFT to LEFT.

3 Phenomenology

Belle II measured [1] the decay \(B^+\rightarrow K^++\textrm{inv}\) with a branching ratio of BR(\(B^+\rightarrow K^+ +\textrm{inv})=(2.4\pm 0.7)\times 10^{-5}\) assuming a 3-body decay with massless neutrinos, which is above the 90% CL exclusion limit of BR(\(B^+\rightarrow K^+ +\textrm{inv})<1.6\times 10^{-5}\) [5]. Together with the previous searches for \(B^+\rightarrow K^++\textrm{inv}\) [4, 5, 60], the Belle II measurement results in a simple weighted average of BR(\(B^+\rightarrow K^++\textrm{inv})=(1.4\pm 0.4)\times 10^{-5}\) [1]. The other searches for \(B\rightarrow K^{(\star )}+\textrm{inv}\) so far only resulted in upper limits of BR(\(B^0\rightarrow K^0+\textrm{inv})<2.6\times 10^{-5}\) [4],Footnote 2 BR(\(B^0\rightarrow K^{\star 0} +\textrm{inv})<1.8\times 10^{-5}\) [4], and BR(\(B^+\rightarrow K^{\star +} +\textrm{inv})<4.0\times 10^{-5}\) [3].

The short-distance contributions to the differential decay rates of semi-leptonic B meson decays are straightforwardly expressed in terms of helicity amplitudes

$$\begin{aligned} \begin{aligned} \frac{d{\varGamma }(\bar{B}\rightarrow \bar{K} \nu _\alpha \nu _\beta )}{dq^2}&= \frac{1}{4(1+\delta _{\alpha \beta })} G^{(0)}(q^2),\\ \frac{d{\varGamma }(\bar{B}\rightarrow \bar{K}^\star (\rightarrow \bar{K}\pi ) \nu _\alpha \nu _\beta )}{dq^2}&= \frac{3}{4(1+\delta _{\alpha \beta })} G^{0,0}_0(q^2), \end{aligned} \end{aligned}$$
(14)

where the coefficients \(G^{(0)}(q^2)\) and \(G^{0,0}_0(q^2)\) of the Wigner-D functions are defined in [46] with the replacements for neutrinos as discussed in [12] and \(q^2\) denotes the missing invariant mass squared carried away by the two neutrinos.Footnote 3 Kinematics constrains \(q^2\) to lie in the range \((m_\alpha +m_\beta )^2\le q^2\le (m_{B^+}-m_{K^+})^2\). We use the recently published \(B\rightarrow K^{(\star )}\) form factors from [61] with the parametrisation defined in [62]. Following [63], we increase the \(B\rightarrow K^\star \nu \nu \) branching ratios by 20% to account for finite-width effects.

Fig. 1
figure 1

Differential branching ratio as a function of the missing invariant mass squared \(q^2\) for vector (blue), scalar (red) and tensor (black) operators. All Wilson coefficients are fixed to \(C=0.01\, \textrm{TeV}^{-2}\) at \(\mu =1\) TeV and the SM contribution is taken into account. The solid lines are for massless sterile neutrinos and the dashed line for a sterile neutrino of mass \(m_1=1.5\) GeV

Note that massive sterile neutrinos and the Lorentz structure of the operators result in modifications to the \(q^2\) distribution compared to the SM prediction, see e.g. [12], which is illustrated in Fig. 1 for \(B^+\rightarrow K^+\nu \nu \). The \(q^2\) distributions of \(B\rightarrow K^{(\star )}\nu \nu \) decays are also dependent on the sterile neutrino mass and Lorentz structure of the operators. Heavy sterile neutrinos do not contribute below \(q^2<4m_1^2\) for the vector operators \(\mathcal {O}^\textrm{QN}\) and \(\mathcal {O}^\textrm{dN}\) and \(q^2<m_1^2\) for scalar operators \(\mathcal {O}^\textrm{LdQN}\) and tensor operators \(\mathcal {O}^\textrm{LdQNT}\). Below the kinematic cutoff for sterile neutrinos, the dashed lines in Fig. 1 approach the SM prediction. In comparison to vector operators, scalar and tensor operators are enhanced for large \(q^2\). The experimental efficiency however is largest for small \(q^2\) and reduced for large \(q^2\). As there is currently no publicly available \(q^2\) distribution which could be used for recasting the Belle II result [1], we only consider branching ratios and do not take into account modifications to the \(q^2\) distribution. Viable regions of parameter space which explain the branching ratio measured by Belle II can only be taken as indicative and we would like to stress the need for a more detailed discussion of sterile neutrino final states based on the full \(q^2\) distribution.

In addition to the short-distance contribution, there is a long-distance tree-level contribution to \(B^+\rightarrow K^{(\star )+} \nu \bar{\nu }\) in the SM. [64] It is mediated by a \(\tau \) lepton, \(B^+\rightarrow \tau ^+ (\rightarrow K^{(\star )+} \bar{\nu }_\tau )\nu _\tau \). The interference between the long-distance and short-distance contributions is negligible due to the narrow \(\tau \) resonance [64] and thus the total branching ratio for \(B^+\rightarrow K^{(\star )+} \nu \bar{\nu }\) is approximately obtained by summing the short- and long-distance contributions. Although the long-distance contribution is subdominant, since it only amounts to about 10% of the short-distance contribution, we nevertheless include it in the numerical analysis.

Another relevant decay channel is the invisible decay of \(B_s\) mesons. Reference [65] recently derived a first upper bound on the branching ratio of invisible \(B_s\) decays using LEP data with \(\textrm{BR}(B_s\rightarrow \textrm{inv}) <5.9\times 10^{-4}\). It may also be probed at the Belle II experiment: Using an integrated luminosity of \(5\,\textrm{ab}^{-1}\) allows to probe BR(\(B_s\rightarrow \textrm{inv})>1.1\times 10^{-5}\) [66]. The branching ratio for identical final state neutrinos is given by

$$\begin{aligned} \begin{aligned}&\textrm{BR}(B_s\rightarrow \nu _\alpha \nu _\alpha ) = \frac{\tau _{B_s}m_{B_s}^3 f_{B_s}^2}{32\pi } \\&\quad \times \Bigg (\left| C_{\nu d,\alpha \alpha sb}^{\text {VLR}}-C_{\nu d,\alpha \alpha sb}^{\text {VLL}}\right| ^2 x_\alpha ^2\sqrt{1-4x_\alpha ^2}\\&\quad +\left| C_{\nu d,\alpha \alpha bs}^{\text {SLL}*} -C_{\nu d,\alpha \alpha sb}^{\text {SLL}}\right| ^2 \frac{(1-4x_\alpha ^2)^{3/2}}{4\,x_b^2}\\&\quad +\left| C_{\nu d,\alpha \alpha bs}^{\text {SLL}*} +C_{\nu d,\alpha \alpha sb}^{\text {SLL}}\right| ^2 \frac{\sqrt{1-4x_\alpha ^2}}{4\,x_b^2}\\&\quad +\textrm{Re}\left[ (C_{\nu d,\alpha \alpha sb}^{\text {VLR}}-C_{\nu d,\alpha \alpha sb}^{\text {VLL}}) (C_{\nu d,\alpha \alpha bs}^{\text {SLL}} +C_{\nu d,\alpha \alpha sb}^{\text {SLL}*})\right] \\&\quad \times \frac{x_\alpha \sqrt{1-4x_\alpha ^2}}{x_b} \Bigg ), \end{aligned} \end{aligned}$$
(15)

with \(x_\alpha =m_\alpha /m_{B_s}\) and \(x_b=m_b/m_{B_s}\). The general expression for different neutrino final states is presented in App. B. Vector contributions are helicity suppressed and thus in the limit of vanishing sterile neutrino masses, there is only a contribution from scalar operators [7, 45]

$$\begin{aligned} \textrm{BR}(B_s\rightarrow \nu _\alpha \nu _\alpha ){} & {} = \frac{\tau _{B_s}m_{B_s}^5 f_{B_s}^2}{64\pi m_b^2} \nonumber \\{} & {} \quad \times \left( \left| C_{\nu d,\alpha \alpha bs}^{\text {SLL}}\right| ^2 + \left| C_{\nu d,\alpha \alpha sb}^{\text {SLL}}\right| ^2 \right) . \end{aligned}$$
(16)

Directly related to the discussed processes are the rare decays \(B_s\rightarrow \phi \nu \nu \) and \({\varLambda }_b\rightarrow {\varLambda }^{(\star )} \nu \nu \) which provide independent information. The differential branching ratio for \(B_s\rightarrow \phi \nu \nu \) has the same form as the one for \(B\rightarrow K^*\nu \nu \) with the form factors replaced by the \(B_s\rightarrow \phi \) form factors presented in [61]. See [67] for a calculation in SMEFT. The differential branching ratio for \({\varLambda }_b\rightarrow {\varLambda }^{(\star )} \nu \bar{\nu }\) has been calculated in [68] within SMEFT. Within \(\nu \)SMEFT it may be obtained from the differential branching ratio for the rare \({\varLambda }_b\) decay with charged leptons, see [69], similar to the discussion of \(B\rightarrow K^{(\star )}\nu \nu \) in [12]. These channels are inaccessible at the Belle II experiment running at the \({\varUpsilon }(4S)\) resonance, but a Tera-Z experiment such as FCC-ee or CEPC may be able to measure these channels as pointed out in [70, 71]. Given the uncertain experimental situation, we leave the discussion of these channels for future work. If the anomaly persists, it will be highly interesting to measure \(B_s\rightarrow \phi \nu \nu \) and \({\varLambda }_b\rightarrow {\varLambda }^{(\star )} \nu \nu \) to pin down the Lorentz structure of the effective operators.

Additional constraints arise from light-lepton flavour universality ratios for charged current processes \(b\rightarrow c\ell \bar{\nu }\). The differential branching ratios are obtained by exchanging \(\nu _\alpha \rightarrow e_\alpha \) in Eq. (14) and dropping the factors \((1+\delta _{\alpha \beta })\) for identical final state particles, see also [72]. The relevant form factors for \(B\rightarrow D^{(\star )}\) are taken from [73]. Three different ratios have been measured: Belle measured \(\frac{\text {BR}(B\rightarrow D\mu \nu )}{\text {BR}(B\rightarrow De\nu )}=0.995\pm 0.045\) [74] which is consistent with the SM prediction of 0.997. Similarly, the measurement of \(\frac{\text {BR}(B^0\rightarrow D^{\star -}e^+\nu )}{\text {BR}(B^0\rightarrow D^{\star -}\mu ^+\nu )}=1.01\pm 0.032\) by Belle [66] is consistent with the SM prediction of 1.005. Most recently, Belle II measured the inclusive ratio \(\frac{\text {BR}(B\rightarrow X e\nu )}{\text {BR}(B\rightarrow X \mu \nu )}=1.007\pm 0.009\pm 0.019\) [75] which also agrees with the SM prediction of \(1.006\pm 0.001\) [76]. The deviation from unity in the three ratios originates from phase space suppression due to the muon mass.

Fig. 2
figure 2

The red (green) contour line stands for the present bound on BR(\(B^0\rightarrow K^{\star 0}(K^0)+\textrm{inv}\)), and the respective regions above these lines are therefore ruled out. The light-blue band symbolises the simple weighted average for BR(\(B^+\rightarrow K^++\textrm{inv}\)), and the hatched light-blue region is compatible with the 2023 Belle II measurement

Although the charged current processes are well constrained, they do not provide competitive constraints. In our analysis we find that they are always subdominant compared to other rare B meson decays \(B\rightarrow K^{(\star )}+\textrm{inv}\) and are thus not included in the discussion of Sect. 4. This can be understood from the different sizes of the SM contributions to the different processes: While charged current processes are induced at tree level in the SM via W boson exchange, rare decays are only induced at loop level and are further suppressed by the Glashow–Iliopoulos–Maiani (GIM) mechanism. The new physics contributions from the operators in Eq. (1) are however of the same size for both charged-current and neutral-current processes. Finally, colliders may be used to search for new charged-current semi-leptonic operators, see e.g. [77, 78], which results in constraints on the new physics scale in the \(O(1-10)\) TeV range. Long-lived sterile neutrinos from meson decays may also be searched for at the proposed far-forward detectors at the (high-luminosity) LHC, SHiP and Belle II, see e.g. [79,80,81]. There are currently no constraints from leptonic \(B_c\) decays.

4 Results

In the following we focus on the rare B meson decay \(B\rightarrow K^{(\star )} +\textrm{inv}\) and present first an analysis with one operator being switched on at a time, which is followed by a discussion of correlations between pairs of operators.

In Fig. 2 we illustrate how a single operator may explain the observed decay \(B^+\rightarrow K^++\textrm{inv}\) as a function of the sterile neutrino mass \(m_1\). The light-blue band indicates the \(1\sigma \)-allowed region of parameter space which explains the simple weighted average for BR(\(B^+\rightarrow K^++\textrm{inv}\)). The hatched light-blue region indicates the parameter space explaining the Belle II measurement. Wilson coefficients above the solid red and green contour lines are excluded at 90% CL by the non-observation of \(B^0\rightarrow K^{\star 0} +\textrm{inv}\) and \(B^0\rightarrow K^0 +\textrm{inv}\), respectively. The results do not depend on the neutrino flavour. In particular, the results are unchanged, when changing from first generation lepton doublets to second or third generation lepton doublets. We do not show plots for the scalar and tensor operators with a right-handed strange quark. They yield identical results as the operators with a left-handed strange quark, because the branching ratios are invariant under the exchange \(s\leftrightarrow b\) for the LEFT scalar and tensor operators.

Fig. 3
figure 3

The red (green) line stands for the present bound on BR(\(B^0\rightarrow K^{\star 0}(K^0)+\textrm{inv}\)), and the respective regions to the right of these lines are therefore ruled out. The light-blue band symbolises the simple weighted average for BR(\(B^+\rightarrow K^++\textrm{inv}\)), and the hatched light-blue region is compatible with the 2023 Belle II measurement. The plots in the top row also apply to the operator \(\mathcal {O}^\textrm{dN}\) and the plots in the bottom row also apply to \(\mathcal {O}^\textrm{LNQd}_{2132}\)

Figure 2 (bottom) clearly shows that tensor operators are not able to explain the observed excess in \(B^+\rightarrow K^++\textrm{inv}\). The constraint from \(B\rightarrow K^\star +\textrm{inv}\) is weakened for large sterile neutrino masses \(m_1\sim 4\) GeV because of the larger mass of the \(K^\star \) meson and thus reduced phase space. As large sterile neutrino masses modify the \(q^2\) distribution and the experimental efficiency is generally lower for large \(q^2\), a more careful analysis using the full \(q^2\) distribution is required in this case. The result is independent of the lepton flavour and the same result applies to the tensor operator with right-handed strange quarks.

While vector operators \(\mathcal {O}^\textrm{QN}\) with massless sterile neutrinos are clearly excluded as explanation, Fig. 2 (top left) suggests the ability to explain the observed excess for sterile neutrino masses of \(m_1 > rsim 1.5\) GeV. This feature is again due to the smaller phase space for decays to \(K^\star \) mesons and requires a more detailed analysis using the \(q^2\) distribution to arrive at a conclusive result. The same conclusion holds for the vector operator \(\mathcal {O}^\textrm{dN}\) with right-handed down-type quarks.

As shown in Fig. 2 (top right), the most promising explanation of the observed excess is provided by scalar operators \(\mathcal {O}^\textrm{LNQd}\) whose contribution to \(B\rightarrow K^\star \nu \nu \) is suppressed. This opens up the possibility to explain the observed excess for the whole allowed sterile neutrino mass range. Note, although scalar operators with massless neutrinos do not have a sharp cutoff in the \(q^2\) distribution, they dominantly contribute to the large \(q^2\) region as illustrated in Fig. 1. As it has been pointed out in [7, 45], the Belle II experiment is able to constrain them by searching for invisible \(B_s\) decays which are currently unconstrained.

Fig. 4
figure 4

Branching ratios of several decay modes as a function of the scalar (left) and vector (right) Wilson coefficients for fixed sterile neutrino mass. The dashed horizontal contours indicate the current upper bounds for \(B^0\rightarrow K^{\star 0} +\textrm{inv}\) (red), \(B^+\rightarrow K^{\star +}+\textrm{inv}\) (orange), \(B^0\rightarrow K^0+\textrm{inv}\) (green), and \(B_s\rightarrow \textrm{inv}\) (black). The light-blue band symbolises the simple weighted average for BR(\(B^+\rightarrow K^++\textrm{inv}\)) and the hatched light-blue region is compatible with the 2023 Belle II measurement

In the following we focus on scalar and vector operators which are most promising to explain the observed excess in \(B^+\rightarrow K^++\textrm{inv}\). Figure 3 shows the dependence of the (total) branching ratio BR(\(B^+\rightarrow K^+\nu \nu \)) as a function of the Wilson coefficient for two benchmark masses of \(m_1=0\,(1.5)\) GeV on the left (right). The vertical red (green) line indicates the most stringent constraint from \(B\rightarrow K^\star +\textrm{inv}\) (\(B\rightarrow K + \textrm{inv}\)), which excludes the region to the right of the line. The horizontal light-blue bands again indicate the region preferred by \(B^+\rightarrow K^++\textrm{inv}\). Figure 3 (top row) shows the branching ratio as a function of the Wilson coefficient for the vector operator \(\mathcal {O}^\textrm{QN}_{2311}\) with left-handed quark doublets. The top-left figure clearly illustrates the conflict between the measurement of \(B^+\rightarrow K^++\textrm{inv}\) and the non-observation of \(B^0\rightarrow K^{\star 0}+\textrm{inv}\) for massless sterile neutrinos. For \(m_1=1.5\) GeV, \(B^0\rightarrow K^{\star 0}+\textrm{inv}\) constrains \(C_{2311}^\textrm{QN}\le 0.037\,\textrm{TeV}^{-2}\), which limits the excess in \(B^+\rightarrow K^++\textrm{inv}\) to the region allowed by the simple weighted average, but precludes an explanation of the larger branching ratio measured by Belle II. The same result applies for the vector operator \(\mathcal {O}^\textrm{dN}\) with right-handed down-type quarks. Similarly, Fig. 3 (bottom row) illustrates the dependence of BR(\(B^+\rightarrow K^+\nu \nu \)) on the scalar Wilson coefficient \(C^\textrm{LNQd}_{2123}\). The region to the left of the vertical green and red lines is consistent with the neutral decay modes \(B^0\rightarrow K^{\star 0}\nu \nu \) and thus the scalar operators are able to explain the observed excess.

In order to compare how future experimental results may impact the conclusions, we show in Fig. 4 the branching ratios for invisible \(B_s\) decay (black), \(B^0\rightarrow K^{\star 0}\nu \nu \) (red), \(B^+\rightarrow K^{\star +}\nu \nu \) (orange), \(B^0\rightarrow K^0\nu \nu \) (green), and \(B^+\rightarrow K^+\nu \nu \) (blue) as a function of the scalar \(C^\textrm{LNQd}_{2123}\) and vector \(C^\textrm{QN}_{2311}\) Wilson coefficient as solid lines and the current upper limits as dashed lines. The region to the right of the intersection of the dashed and solid lines of the same colour is already excluded. The branching ratios of the charged and neutral B meson decay channels are approximately equal due to isospin symmetry. Belle II provided a detailed study of future sensitivities in [82]: In particular with an integrated luminosity of \(1\,\textrm{ab}^{-1}\), Belle II is sensitive to the signal strength of 2.06 for \(B^0\rightarrow K^0 +\textrm{inv}\) and thus is able to probe the entire region favoured by \(B^+\rightarrow K^++\textrm{inv}\). An improved analysis may even reach a signal strength of 1.37. Similarly, the expected sensitivities for the decays \(B\rightarrow K^\star +\textrm{inv}\) are 1.08(0.72) and 2.04(1.45) for the neutral and charged B meson decays, where we indicated the expected improved sensitivities in parentheses. Hence, also the searches for \(B^0\rightarrow K^{\star 0}+\textrm{inv}\) are able to probe most of the preferred parameter space for scalar operators, while \(B^+\rightarrow K^{\star +}+\textrm{inv}\) is less sensitive. For the vector operator explanation, the decays \(B\rightarrow K^\star \nu \nu \) are not expected to be sensitive to the whole preferred parameter space.

There is currently only a weak upper limit on the branching ratio of invisible \(B_s\) decays based on LEP data [65]. Belle II’s ability to probe BR(\(B_s\rightarrow \textrm{inv}) > rsim 1.1\times 10^{-5}\) with an integrated luminosity of \(5\, \textrm{ab}^{-1}\) [66] will provide a conclusive test of the described explanations of \(B^+\rightarrow K^+\nu \nu \) for both scalar and vector operators. The scalar operator contribution is not helicity suppressed, as outlined in Sect. 3, and thus the invisible \(B_s\) decay is sensitive to the full sterile neutrino mass range. The vector operator contribution is helicity suppressed, but the explanation of the excess in \(B^+\rightarrow K^++\textrm{inv}\) requires large sterile neutrino masses \(m_1 > rsim 1.5\) GeV and thus invisible \(B_s\) decays are also able to probe the vector operator scenario.

Fig. 5
figure 5

The red (green) contour line stands for the present bound on BR(\(B^0\rightarrow K^{\star 0}(K^0)+\textrm{inv}\)), and the respective regions outside of these lines are therefore ruled out. The light-blue band symbolises the simple weighted average for BR(\(B^+\rightarrow K^++\textrm{inv}\)), and the hatched light-blue region is compatible with the 2023 Belle II measurement

We finally turn our attention to correlations between different operators. In Fig. 5, we present the 90% CL exclusion contours for \(B^0\rightarrow K^{\star 0} +\textrm{inv}\) (\(B^0\rightarrow K^0 +\textrm{inv}\)) in red (green) using the current upper limits [3, 4]. Larger Wilson coefficients are excluded. The constraint from \(B^+\rightarrow K^{\star +}+\textrm{inv}\) is weaker than \(B^0\rightarrow K^{\star 0}+\textrm{inv}\) and thus not shown for clarity. Similar to above, the blue-shaded region indicates the simple weighted average for BR(\(B^+\rightarrow K^++\textrm{inv}\)) and the hatched region the 2023 Belle II measurement of \(B^+\rightarrow K^++\textrm{inv}\).

Figure 5 (top row) illustrates the correlation between the vector operators with right-handed down-type quarks, \(\mathcal {O}^\textrm{dN}_{2311}\), and left-handed quark doublets, \(\mathcal {O}^\textrm{QN}_{2311}\). Their contribution to the branching ratio of \(B\rightarrow K\nu \nu \) is proportional to \(\left| C^\textrm{QN}_{2311} + C^\textrm{dN}_{2311}\right| ^2\), while for the branching ratios of \(B\rightarrow K^\star \nu \nu \) there are two independent contributions proportional to the square of the sum and difference of the two Wilson coefficients. This explains the shape of the contour lines: While the non-observation of \(B\rightarrow K^\star +\textrm{inv}\) results in an elliptical exclusion region, the decay \(B\rightarrow K\nu \nu \) is insensitive to \(C^\textrm{QN}_{2311} = -C^\textrm{dN}_{2311}\) and thus the search for \(B\rightarrow K+\textrm{inv}\) is unable to probe this direction in parameter space. Increasing the sterile neutrino mass to \(m_1=1.5\) GeV, see Fig. 5 (top right), reduces the contribution to \(B\rightarrow K^\star \nu \nu \) and thus weakens the \(B\rightarrow K^\star +\textrm{inv}\) constraint. Consequently, the allowed region inside the red ellipse widens.

The plots in Fig. 5 (bottom row) show the correlation between the scalar operator \(\mathcal {O}^\textrm{LNQd}_{2123}\) and the vector operator \(\mathcal {O}^\textrm{QN}_{2311}\) with left-handed quark doublets. The elliptical exclusion regions are due to the absence of complete cancellations between the two operators. In fact, there is no interference between the two operators in the massless limit. The bottom-left plot clearly shows the preference for the pure scalar operator scenario for massless sterile neutrinos. For massive sterile neutrinos, the constraint from \(B\rightarrow K^\star +\textrm{inv}\) weakens. The bottom-right plot shows the case with \(m_1=1.5\) GeV, where the dominant constraint originates from \(B^0\rightarrow K^{\star 0}+\textrm{inv}\) and the constraint from \(B\rightarrow K^0+\textrm{inv}\) is subdominant. The same conclusions holds, if the quark flavours of the scalar operator are exchanged, \(s\leftrightarrow b\), or if the vector operator \(\mathcal {O}^\textrm{dN}\) with right-handed down-type quarks is considered.

5 UV completions

We briefly outline possible UV completions for the scalar and vector operators discussed in the previous section, see [83] for a detailed discussion of tree-level UV completions of dimension-6 operators in \(\nu \)SMEFT. The most straightforward UV completion of the scalar operator \(\mathcal {O}^\textrm{LNQd}\) is provided by a second electroweak doublet scalar \(\eta \sim (\textbf{1},\textbf{2},-\tfrac{1}{2})\) with its Yukawa interactions \(\bar{L} N \eta \) and \(\bar{Q} i\sigma _2\eta ^* d\). Integrating out the electroweak doublet scalar \(\eta \) results in the scalar operator \(\mathcal {O}^\textrm{LNQd}\), but it also generates the 4-fermion operators \((\bar{L} \gamma _\mu L) (\bar{N} \gamma ^\mu N)\) and \((\bar{Q} \gamma _\mu Q) (\bar{d} \gamma _\mu d)\). The latter contributes to \(B_s-\overline{B_s}\) mixing and the former to charged lepton decays \(\ell _i\rightarrow \ell _j +\textrm{inv}\) which both have to be considered in a UV-complete model. The required size of the scalar Wilson coefficient \(C^\textrm{LNQd}\sim 0.02\, \textrm{TeV}^{-2}\) requires the electroweak doublet scalar to have a mass of at most \(\mathcal {O}(7)\) TeV for Yukawa couplings of order unity.

The vector operators are straightforwardly generated using leptoquarks, see e.g. [84] for a review. The \(S_1\sim (\bar{\textbf{3}},\textbf{1},\tfrac{1}{3})\) leptoquark leads to the operator \(\mathcal {O}^\textrm{dN}\) via its Yukawa interaction \(\overline{N^c} S_1 d\) and thus does not contribute to \(B_s-\overline{B_s}\) mixing at tree level. Its other leptoquark couplings generate the SMEFT operators \(\mathcal {O}^\textrm{eu}\), and \(\mathcal {O}^\mathrm{LeQu(1,3)}\), and \(\mathcal {O}^\mathrm{LQ(1,3)}\), see e.g. [85]. Similarly, the leptoquark \(\tilde{R}_2\sim (\textbf{3},\textbf{2},\tfrac{1}{6})\) may generate the operator \(\mathcal {O}^\textrm{QN}\) via its Yukawa interaction \(\overline{Q} \tilde{R}_2 N\), which does not contribute to \(B_s-\overline{B_s}\) mixing at tree level. Its other Yukawa interaction \(\bar{d} \tilde{R}_2 L\) contributes to the SMEFT operator \(\mathcal {O}^\textrm{Ld}\) [85]. The required size of the vector Wilson coefficient \(C^\textrm{QN,dN}\sim 0.03\, \textrm{TeV}^{-2}\) requires the leptoquark to have a mass of at most \(\mathcal {O}(6)\) TeV for Yukawa couplings of order unity. The operators \(\mathcal {O}^\mathrm{LQ(1,3)}\) and \(\mathcal {O}^\textrm{Ld}\) may also contribute to \(B\rightarrow K^{(\star )}\nu \nu \), but they are strongly constrained by searches for \(B\rightarrow K^{(\star )} \ell \bar{\ell }\) for light charged leptons \(\ell \), see e.g. [7, 45].

6 Conclusions

We studied how the recently observed excess in \(B^+\rightarrow K^++\textrm{inv}\) may be explained within \(\nu \)SMEFT. Within \(\nu \)SMEFT there are four 4-fermion operators with sterile neutrinos which may contribute to \(B^+\rightarrow K^++\textrm{inv}\): two vector, one scalar and one tensor operator. We would like to stress that massive sterile neutrinos and scalar and tensor operators modify the \(q^2\) distribution with respect to the SM prediction based on massless neutrinos contributing via a vector operator. The present analysis is entirely based on the branching ratios and we only comment on the \(q^2\) distribution, because it is currently not available. The tensor operator is strongly constrained by \(B^0\rightarrow K^{\star 0} +\textrm{inv}\) and thus is not able to significantly contribute to \(B^+\rightarrow K^+ +\textrm{inv}\).

We find that the excess in \(B^+\rightarrow K^+ +\textrm{inv}\) is most straightforwardly explained by scalar operators with sterile neutrinos irrespective of their mass. This scenario can be conclusively probed using invisible \(B_s\) decays at Belle II and the rare semi-leptonic decay \(B^0\rightarrow K^{(\star ) 0}+\textrm{inv}\) in the near future using \(1\,\textrm{ab}^{-1}\) of data. Vector operators may be able to explain the excess of \(B^+\rightarrow K^+ + \textrm{inv}\) for intermediate sterile neutrino masses of \(m_1 > rsim 1.5\) GeV assuming no significant changes to the \(q^2\) distribution. While \(B\rightarrow K^\star +\textrm{inv}\) strongly constrains this possibility for light sterile neutrinos, the decay \(B\rightarrow K^\star +\textrm{inv}\) is less sensitive for sterile neutrino masses \(m_1\sim 1.5\) GeV due to phase space suppression, but still provides a constraint. Most of the parameter space of this solution can similarly be probed with the invisible \(B_s\) decay and \(B^0\rightarrow K^{(\star )0}+\textrm{inv}\).

We finally outline tree-level UV completions for the scalar and vector operators. The scalar operator may be generated using an electroweak doublet scalar, which however also contributes to \(B_s-\overline{B_s}\) mixing at tree level. The vector operators can be straightforwardly generated using leptoquarks: the \(S_1\) leptoquark may result in \(\mathcal {O}^\textrm{dN}\) and the \(\tilde{R}_2\) leptoquark in \(\mathcal {O}^\textrm{QN}\). Generally, all UV completions also generate other \(\nu \)SMEFT operators, but for the two leptoquarks it is in principle possible to avoid them. It however requires to set all other Yukawa couplings to zero, which may be considered unnatural. This motivates to consider the full set of relevant \(\nu \)SMEFT operators in a future study and to investigate explanations of the excess in UV complete models together with other observables.