1 Introduction

The experimental measurements of the \(\mu /e\) universality ratios \(R_K\) and \(R_{K^*}\) in \(B \rightarrow K^{(*)} \ell ^+\ell ^-\) decays [1,2,3,4] indicate a violation of Lepton Flavor Universality (LFU) of about \(20\%\) in the decay rates, well above the Standard Model (SM) expectation [5, 6]. The statistical significance of each measurement does not exceed the \(3\sigma \) level. However, as pointed out first in [7], these results are consistent with the tension between data and SM predictions in the \(B \rightarrow K^{*} \ell ^+\ell ^-\) differential distribution [8, 9], as well as with the suppression of \(\mathcal {B}(\bar{B}_s\rightarrow \mu ^+\mu ^-)\) [10] compared to the SM expectation [11, 12]. When combined, the \(b\rightarrow s\ell \ell \) data points towards a non-standard phenomenon of short-distance origin, with a statistical significance exceeding \(4\sigma \) (see Refs. [13,14,15] for recent combined analyses).

Interestingly enough, an independent indication of LFU violation occurs in \(\bar{B} \rightarrow D^{(*)} \ell \nu \) decays, when comparing \(\tau \) and light-lepton modes [16,17,18,19,20]. These two hints of LFU violation, generically referred to as B-physics anomalies, can be addressed by a combined description within an effective theory approach to physics beyond the SM based on two main hypotheses:(i) the new dynamics affect predominantly semi-leptonic operators, and (ii) they couple in a non-universal way to the different fermion generations [21,22,23,24,25,26]. In particular, the new dynamics should have dominant couplings to third-generation fermions and smaller, but non-negligible, couplings to second-generation fermions. This non-trivial flavor structure resembles the hierarchies observed in the SM Yukawa couplings, opening the possibility of a common origin of B-physics anomalies and flavor hierarchies, as hypothesized in Refs. [27, 28].

In order to shed light on this phenomenon, it would be very important to establish evidence of the same underlying non-standard dynamics in different channels. In particular, most frameworks addressing B-physics anomalies predict a sizable violation of \(\mu /e\) universality also in processes based on the \(b\rightarrow d \ell \ell \) transitions. The connection between \(b\rightarrow s\) and \(b\rightarrow d\) flavor-changing neutral-current (FCNC) amplitudes is a firm prediction of all models based on a minimally broken \(U(2)^5\) flavor symmetry [29,30,31], such as the ones proposed in Refs. [27, 28], and the wider class models discussed in Refs. ***[32,33,34,35] (the so-called non-universal 4321 models). This connection has been discussed in general terms in Ref. [36], where it has been shown that the \(U(2)^5\) symmetry implies an identical relative breaking of LFU in \(b\rightarrow s \ell \ell \) and \(b\rightarrow d \ell \ell \) amplitudes at the short-distance level [36].

On general grounds, testing the SM precisely in \(b\rightarrow d \ell \ell \)-mediated processes is challenging for two reasons. First, the rates are both loop- and CKM-suppressed, making the decay extremely rare. Second, hadronic intermediate states introduce sizable long-distance contributions, which are difficult to predict. A promising channel in this regard is \(B \rightarrow \pi \ell ^+\ell ^-\): about 20 events of \(B \rightarrow \pi \mu ^+\mu ^-\) have already been observed by LHCb in the LHC run-I [37], and a significantly larger sample can be anticipated from run-II data. Recent theoretical studies of \(B \rightarrow \pi \ell ^+\ell ^-\) decays, analyzing the relative impact of short- and long-distance contributions within the SM, have been presented in Refs. [38,39,40,41]. A key point to notice is that long-distance contributions cannot induce violations of LFU. Hence, as in \(B \rightarrow K \ell ^+\ell ^-\) decays, ratios of the type [5]

$$\begin{aligned} R_\pi [q^2_\text {min}, q^2_\text {max}] =\frac{ \int _{q^2_\text {min}}^{q^2_\text {max}} dq^2\frac{d\mathcal {B}}{dq^2}(B^+\rightarrow \pi ^+\mu ^+\mu ^-)}{ \int _{q^2_\text {min}}^{q^2_\text {max}}dq^2\frac{d\mathcal {B}}{dq^2}(B^+\rightarrow \pi ^+e^+e^-)}\,, \end{aligned}$$
(1)

with \(q^2=m^2_{\ell \ell }\) being the dilepton invariant mass, are expected to provide powerful tests of LFU violation of short-distance origin. The purpose of this paper is to precisely estimate the sensitivity of these ratios to short-distance dynamics.

While long-distance contributions cannot induce a violation of LFU, they can dilute a possible LFU-violating contribution of short-distance origin in kinematical regions where they are dominant. Our main goal is therefore to identify the regions of the dilepton invariant mass spectrum where the decay rate is dominated by short-distance dynamics and to estimate the sensitivity to LFU-violating amplitudes in those regions. We will do so using a general data-driven parameterization of the long-distance contributions induced by light-quark and charm re-scattering into \(\ell ^+\ell ^-\) pairs. The approach employed closely resembles the analysis of the \(B \rightarrow K \mu ^+\mu ^-\) dilepton-spectrum presented in Ref. [48], where \(e^+ e^- \rightarrow q \bar{q}\) data are used to identify the possible discontinuities of the amplitude. The first attempt to use \(e^+ e^- \rightarrow q \bar{q}\) data to describe long-distance effects in rare B decays dates back to Ref. [42]: in that case the approach was based on the hypothesis of naïve factorization for the matrix element of the four-quark operators (see also [44, 45]). As discussed in Refs. [43, 46, 48], the analytic structure of the amplitude is independent of the factorization hypothesis and this method can be indeed generalized, via appropriate dispersion relations, beyond naïve factorization. Using this method we will show that in a wide range of the dilepton-spectrum \(R_\pi \) can allow us to extract precise information on LFU-violating dynamics.

2 Theoretical description of \(B\rightarrow \pi \ell ^+\ell ^-\) decays

The starting point to describe \(B\rightarrow \pi \ell ^+\ell ^-\) decays is the \(b\rightarrow d \ell ^+\ell ^-\) effective Hamiltonian. Since we are interested in the comparison between \(B\rightarrow \pi \) and \(B\rightarrow K\) modes, we keep d as a generic label for down-type quarks whenever possible, and we generically denote the light final-state meson by P.

The effective Hamiltonian is

$$\begin{aligned}&\mathcal {H}(b\rightarrow d\ell ^+\ell ^-)= \frac{4G_F}{\sqrt{2}} \left\{ \lambda ^{(d)}_c(\mathcal {C}_1\mathcal {O}_1^c+\mathcal {C}_2\mathcal {O}_2^c) \right. \nonumber \\&\qquad \left. +\lambda ^{(d)}_u(\mathcal {C}_1\mathcal {O}_1^u+\mathcal {C}_2\mathcal {O}_2^u)-\lambda ^{(d)}_t\sum _{i=3}^{10}\mathcal {C}_i\mathcal {O}_i\right\} \,, \end{aligned}$$
(2)

where \(\lambda ^{(d)}_i=V^*_{id}V_{ib}\) and, due to CKM unitarity, \(\lambda ^{(d)}_t+\lambda ^{(d)}_u+\lambda ^{(d)}_c=0\). The leading FCNC operators are defined as

$$\begin{aligned} \begin{aligned} \mathcal {O}_{7}=&\ \frac{e}{(4\pi )^2}(\bar{d} \sigma ^{\mu \nu } (m_b P_{R} + m_{d} P_{L}) b)F_{\mu \nu } \,, \\ \mathcal {O}_{9}=&\ \frac{e^2}{(4\pi )^2}(\bar{d}_L\gamma _\mu b_L)(\bar{\ell }\gamma ^\mu \ell ) \,, \\ \mathcal {O}_{10}=&\ \frac{e^2}{(4\pi )^2}(\bar{d}_L\gamma _\mu b_L)(\bar{\ell }\gamma ^\mu \gamma _5\ell ) \, , \end{aligned} \end{aligned}$$
(3)

while the leading four-quark operators read (\(q=u,c\)):

$$\begin{aligned} \begin{aligned} \mathcal {O}_1^q=&\ (\bar{d}_L^{\alpha } \gamma _\mu q_L^\beta )(\bar{q}_L^\beta \gamma ^\mu b_L^\alpha ) \,, \\ \mathcal {O}_2^q=&\ (\bar{d}_L \gamma _\mu q_L)(\bar{q}_L\gamma ^\mu b_L) \, . \end{aligned} \end{aligned}$$
(4)

The NNLO expressions for the Wilson coefficients of the operators in Eqs. (3)–(4) can be found in Ref. [49]. For reference, with this normalization we use \(\mathcal {C}_{9}^\mathrm{SM} \approx 4.1\), \(\mathcal {C}_{10}^\mathrm{SM} \approx -4.3\) and \(\mathcal {C}_7^\text {SM}\approx -0.29\).

The hadronic matrix elements of quark bilinears in \(\bar{B}\rightarrow P\) decays, where P is a pseudoscalar meson containing a d quark, can be parametrized as

$$\begin{aligned} \begin{aligned} \langle P(k)|\bar{d} \gamma ^\mu b |\bar{B}(p)\rangle =&\left[ (p+k)^\mu -\frac{m_B^2-m_P^2}{q^2}q^\mu \right] f_+(q^2) \\&+\frac{m_B^2-m_P^2}{q^2}q^\mu \,f_0(q^2)\,, \\ \! \langle P(k)|\bar{d} \sigma ^{\mu \nu }q_\nu b |\bar{B}(p)\rangle =&\frac{i}{m_{B}+m_P}\Big [2 q^2 p^\mu \\&-(m_{B}^2-m_P^2+q^2)q^\mu \Big ]f_{T}(q^2)\,, \end{aligned} \end{aligned}$$
(5)

where \(q^\mu = p^\mu -k^\mu \) is the momentum transfer. The form factors \(f_+(q^2)\), \(f_0(q^2)\) and \(f_T(q^2)\) can be extracted from Lattice QCD and Light-Cone Sum Rules (LCSR) techniques. Concerning \(B\rightarrow \pi \) decays, we use the results of Ref. [50], which combines LCSR estimates with Lattice QCD calculations from Ref. [51].

Taking into account only the contribution of operators with non-vanishing tree-level matrix elements, the differential decay width for the semi-leptonic \(\bar{B}\rightarrow P\ell ^+\ell ^-\) decay reads:

$$\begin{aligned} \begin{aligned} \frac{d\Gamma }{dq^2}&= \frac{\alpha ^2 G_F^2 \left| \lambda _{t}^{(d)}\right| ^2}{1024 \pi ^5 m_B^3} \beta (q^2 )\lambda ^{1/2}(q^2)\, \\&\quad \times \left\{ \lambda (q^2)\left[ 1-\frac{\beta ^2(q^2)}{3}\right] \right. \\&\quad \times \left| \mathcal {C}_9^\ell f_+(q^2) +2 \frac{m_b+m_d}{m_B+m_P}\mathcal {C}^\ell _7f_T(q^2)\right| ^2 \\&\quad +\frac{2}{3}\lambda (q^2) \beta ^2(q^2) \, |\mathcal {C}^\ell _{10} f_+(q^2)|^2 \\&\quad \left. +4 m_\ell ^2 \frac{(m_B^2-m_P^2)^2}{q^2 }|\mathcal {C}_{10}^\ell f_0(q^2)|^2\right\} \,, \end{aligned} \end{aligned}$$
(6)

where

$$\begin{aligned} \begin{aligned} \lambda (q^2)&= (m_{B}^2-m_P^2-q^2)^2- 4 m_P^2 q^2\,, \\ \beta (q^2)&=\sqrt{1-4m_\ell ^2/q^2}\,. \end{aligned} \end{aligned}$$
(7)

The superscript \(\ell \) in the Wilson coefficients denotes the lepton flavor in the final state. We recall that, in the SM, the Wilson coefficients are universal for the three lepton generations, i.e. \(\mathcal {C}_i^{\tau }=\mathcal {C}_i^{\mu }=\mathcal {C}_i^{e} \), where the index i runs over all the possible Wilson coefficients.

2.1 Non-local contributions

In order to account for the contribution of the four-quark operators and describe the \(B\rightarrow P\ell ^+\ell ^-\) spectrum also in the resonance region, we modify the Wilson coefficient \(\mathcal {C}_9^\ell \) as follows [43, 48]:

$$\begin{aligned} \mathcal {C}_9^\ell \rightarrow \mathcal {C}_9^{\ell ,\text {eff}}(q^2)= & {} \mathcal {C}_9^\ell - \frac{\lambda ^{(d)}_c}{\lambda ^{(d)}_t} Y_{c\bar{c}}(q^2) - \frac{\lambda ^{(d)}_u}{\lambda ^{(d)}_t} Y_{u\bar{u}}(q^2) \nonumber \\&+Y_{d\bar{d}}(q^2)+Y_{s\bar{s}}(q^2)~, \nonumber \\&\equiv \mathcal {C}_9^\ell - \frac{\lambda ^{(d)}_c}{\lambda ^{(d)}_t} Y_{c\bar{c}}(q^2) + Y_\mathrm{light}(q^2)~. \end{aligned}$$
(8)

Here \(Y_{q\bar{q}}(q^2)\) denotes the non-local contribution due to intermediate hadronic states with \(q\bar{q}\) valence quarks. For later convenience, we have grouped the contribution induced by light quarks into a single function \(Y_\mathrm{light}(q^2)\). We express the Y functions as

$$\begin{aligned} \begin{aligned} Y_{q\bar{q}}(q^2)&= \frac{16\pi ^2}{f_+(q^2)}\mathcal {H}_{q\bar{q}}(q^{2})\,, \\ Y_\mathrm{light}(q^2)&= \frac{16\pi ^2}{f_+(q^2)}\mathcal {H}_\mathrm{light}(q^{2})~. \end{aligned} \end{aligned}$$
(9)

In principle, \(\mathcal {H}_{c\bar{c}}(q^{2})\) could be evaluated through the correlation function [43]

$$\begin{aligned} \begin{aligned}&i\int d^4x e^{iqx}\langle P(k)|T\left\{ j_\mu ^\text {em}(x), \sum _{i=1,2} \mathcal {C}_i^c\mathcal {O}_i^q(0) \right\} |\bar{B}(p)\rangle \\&\quad =\,[(p\cdot q)q_\mu -q^2 p_\mu ]\mathcal {H}_{c\bar{c}}(q^2)\,, \end{aligned} \end{aligned}$$
(10)

with \(j_\mu ^\text {em}= \sum _{q=u,d,s,c,b}Q_q \bar{q}\gamma _\mu q\). A similar expression can be derived for \(\mathcal {H}_\mathrm{light}(q^{2})\) in terms of the charmless operators. In practice, we are unable to evaluate these expressions from first principles and we estimate them from data using dispersion relations [43,44,45, 47, 48].

2.1.1 Estimate via dispersion relations

In full generality, we can write a subtracted dispersion relation for \(\mathcal {H}_{q\bar{q}}(q^{2})\)

$$\begin{aligned} \Delta Y_{q\bar{q}}(q^{2}) = \frac{16 \pi ^{2}}{f_+(q^2)} \Delta \mathcal {H}_{q \bar{q}}(q^{2})\, , \end{aligned}$$
(11)

with

$$\begin{aligned} \begin{aligned} \Delta \mathcal {H}_{q \bar{q}}(q^{2})&= \frac{q^2- q_{0}^2}{\pi } \int _{s_0}^\infty ds \frac{ \text {Im}[\mathcal {H}_{q\bar{q}}(s)]}{(s-q_{0}^{2})(s-q^2)} \\&\equiv \frac{q^2-q_{0}^2}{\pi }\int _{s_0}^\infty ds \frac{\rho _{q\bar{q}}(s)}{(s-q_{0}^{2})(s-q^2)}\,. \end{aligned} \end{aligned}$$
(12)

The function \(\rho _{q\bar{q}}(s)\) is the spectral density for an intermediate hadronic state with valence quarks \(q\bar{q}\) and invariant mass s, and the parameter \(s_0\) is the energy threshold where the state can be created on-shell. The parameter \(q_0^2\) is the subtraction point that we choose for the different \(q\bar{q}\) states (\(q_{0}^{2}<s_{0}\)).

The leading contribution to \(\rho _{q\bar{q}}(s)\) is provided by single-particle intermediate states. We can describe them as a sum of Breit–Wigner distibutions:

$$\begin{aligned} \begin{aligned} \Delta \mathcal {H}_{q\bar{q}}^{\text {1P} }&= \sum _{V_j}\eta _j e^{i\delta _j}\frac{(q^2-q_0^2)}{(m_j^2-q_0^2)} A_j^\text {res}(q^2)\,, \\ A_j^\text {res}(q^2)&= \frac{m_j \Gamma _j}{m_j^2-q^2-i m_j \Gamma _j} \,, \end{aligned} \end{aligned}$$
(13)

where the sum runs over all the possible vector states associated with the the \(q\bar{q}\) valence quarks. The parameters \(\eta _j\) and \(\delta _j\) have to be determined from data. For the charmonium resonances, which have a high invariant mass, we use dispersion relations subtracted at \(q^2=0\), yielding

$$\begin{aligned} \Delta \mathcal {H}^\mathrm {1P}_{c\bar{c}} = \sum _{V_j =J/\psi , \psi (2S),\ldots }\eta _j e^{i\delta _j} \frac{q^2}{m^2_{\psi _j}} A_j^\mathrm {res}(q^2)\,. \end{aligned}$$
(14)

For the light resonances we use unsubtracted dispersion relations, which is equivalent to assuming a vanishing long-distance contribution from light quarks in the large-\(q^2\) limit. In this case we do not separate the various flavors explicitly, obtaining

$$\begin{aligned} \mathcal {H}^\mathrm {1P}_\mathrm{light} = \sum _{V_j= \rho , \omega , \phi }\eta _j e^{i\delta _j} A_j^\mathrm {res}(q^2)\,. \end{aligned}$$
(15)
Table 1 Experimental inputs for the determination of resonance contributions in \(B\rightarrow \pi \ell ^+\ell ^-\), from Ref. [52]

An estimate of \(\eta _j\) can be obtained from the decay \(\bar{B}_q\rightarrow P V_j\rightarrow P\ell ^+\ell ^-\). Focussing on the \(P=\pi \) case, we can write

$$\begin{aligned} \begin{aligned}&\mathcal {B}(B^{+} \rightarrow \pi ^{+} V_{j}) \times \mathcal {B}(V_{j} \rightarrow \ell ^+\ell ^-) \\&\quad \approx \tau _{B^{+}} \frac{G_{F}^{2} \alpha ^{2} \left| V_{tb}V^{*}_{t d} \right| ^{2}}{1024 \pi ^{5} m_{B}^{3}} \int ^{(m_{B}-m_{\pi })^{2}}_{4 m_{\ell }^{2}} d q^{2}\\&\qquad \times \lambda ^{3/2}(m_{B}^{2}, m_{\pi }^{2},q^{2}) \beta (q^{2})\left[ 1 - \frac{{\beta ^2(q^{2})}}{3} \right] \\&\qquad \times (16 \pi ^{2})^{2} \left| A^{\mathrm {res}}_{j}(q^{2}) \right| ^{2} \eta _{j}^{2} \\&\qquad \times \left\{ \begin{array}{ll} \frac{q^{2}}{m_{j}^{2} } \left| \frac{\lambda ^{(d)}_c}{\lambda ^{(d)}_t} \right| ^{2} &{} [ V_j = J/\psi , \psi (2S) ]~,\\ 1 &{} [ V_j = \rho , \omega , \phi ]~, \end{array} \right. \end{aligned} \end{aligned}$$
(16)

where we have explicitly separated charmonia and light resonances. In the narrow-width approximation (NWA) we have

$$\begin{aligned} \left| A_{j}^{\mathrm {res}}(q^{2}) \right| ^{2} = \frac{m_{j}^{2} \Gamma _{j}^{2}}{(q^{2}-m_{j}^{2})^{2} +m_{j}^{2} \Gamma _{j}^{2}}\rightarrow m_{j} \Gamma _{j} \, \pi \, \delta (q^{2} - m_{j}^{2 })\,, \end{aligned}$$
(17)

which, setting also \(m_{\ell }=0\), allows us to further simplify (16) to

$$\begin{aligned} \begin{aligned}&\mathcal {B}(B^{+} \rightarrow \pi ^{+} V_{j}) \times \mathcal {B}(V_{j} \rightarrow \ell ^+ \ell ^-) \approx \frac{ \ \tau _{B^{+}} G_{F}^{2} \alpha ^{2} }{6 m_{B}^{3} } \\&\times \left| V_{tb}V^{*}_{t d} \right| ^{2} \lambda ^{3/2}(m_{B}^{2}, m_{\pi }^{2},m_{j}^{2})\, \eta _{j}^{2}\, m_{j} \Gamma _{j} \,. \end{aligned} \end{aligned}$$
(18)

Using this expression and the inputs in Table 1 we find the \(\eta _{j}\) values reported in the second column of Table 2. We checked explicitly that relaxing the NWA by considering a variable width for broader resonances, like the \(\rho \), does not affect our results significantly.

2.1.2 Constraints on the charmonium states from \(B\rightarrow K \ell ^+ \ell ^-\)

By definition, the Y functions in (9) are process-dependent. However, the \(Y_{c\bar{c}}(q^2)\) function for \(B\rightarrow \pi \) is expected to be very close to the one for \(B\rightarrow K\) decays, analyzed recently in Ref. [48]. Due to the different CKM structure, in the latter case \(Y_{c\bar{c}}(q^2)\) enters the decay rate via the combination

$$\begin{aligned} \left. \mathcal {C}_9^{\ell ,\text {eff}}(q^2) \right| _{b \rightarrow s} = \left. \mathcal {C}_9^\ell \right| _{b \rightarrow s} +Y_{c\bar{c}}(q^2)~, \end{aligned}$$
(19)

where we have used \(\lambda ^{(s)}_c\approx - \lambda ^{(s)}_t\).

The \(Y_{c\bar{c}}(q^2)\) functions for \(B\rightarrow \pi \ell ^+\ell ^-\) and \(B\rightarrow K\ell ^+\ell ^-\) decays are expected to coincide in the limit where we neglect SU(3)-breaking effects in the subleading spectator-quark contributions. This expectation is well supported by the comparison \(\mathcal {B}(B^{+} \rightarrow \pi ^{+} V_{j})\) vs. \(\mathcal {B}(B^{+} \rightarrow K^{+} V_{j})\), which exhibits a universal scaling,

$$\begin{aligned} \frac{ \mathcal {B}(B^{+} \rightarrow \pi ^{+} V_{j}) }{ \mathcal {B}(B^{+} \rightarrow K^{+} V_{j}) } \approx \left| \frac{ \lambda ^{(d)}_c}{ \lambda ^{(s)}_c} \right| ^{2} \approx 0.04\,, \end{aligned}$$
(20)

in the well-measured cases of the first two charmonium states. This allows us, within our \(B\rightarrow \pi \ell ^+ \ell ^- \) analysis, to use the magnitudes \(\eta _j\) and phases \(\delta _j\) extracted from \(B\rightarrow K\ell ^+ \ell ^- \) in Ref. [53] for all the charmonia, reducing the uncertainty significantly. Note that a similar argument cannot be made for the light resonances due to the different relative weight of the (different) light-quark operators in \(b\rightarrow s\) and \(b\rightarrow d\) transitions.

As far as the analysis of Ref. [53] is concerned, one might be worried that this analysis is performed allowing possible non-standard contributions to the Wilson coefficients \(C_9\) and \(C_{10}\), and that the best fit values thus extracted turn out to be far from the ones obtained from more reliable analyses of short-distance dominated observables. The latter outcome is a consequence of the (unwise) choice of using un-subtracted dispersion relations to describe the charm resonances adopted in Ref. [53]. As already pointed out in Ref. [48], extrapolating the tails of the charm resonances at low \(q^2\) (via the Breit-Wigner shape), i.e. far from the peak region where they are well determined from data, leads to an unreliable description of the amplitude at low \(q^2\). The low \(q^2\) region is the most relevant for the extraction of the short-distance component of the amplitude: this is why the values of \(C_9\) and \(C_{10}\) determined in Ref. [53] are not reliable. This problem is avoided using dispersion relations subtracted at \(q^2=0\) for the charm contributions, as we do in this paper. On the other hand, different choices of the subtraction point do not affect the extraction of the resonance parameters (\(\eta _j\) and \(\delta _j\)), whose values are completely determined by the peak region (\(\eta _j\)) and the near-by-peak region (\(\delta _j\)): these parameters are unaffected by the presence of non-standard contributions of short-distance origin (which necessarily leads to a smooth impact in the \(q^2\) spectrum). For this reason, we can safely use the values for \(\eta _j\) and \(\delta _j\) determined in Ref. [53], and consider them as the best estimates of the charm re-scattering in the SM, near the resonance region, which is what we need for our analysis.Footnote 1

The applicability of \(B\rightarrow K\) data from Ref. [53] is even crucial for the charmonia above the \(\psi (2S)\), where no \(B\rightarrow \pi V_j\) data is available. The \(B\rightarrow K V_j\) branching ratios reported in [53] have a slight dependence on the sign of the \(J/\psi \) and \(\psi (2S)\) phases. However, this is within uncertainties. For this reason we use a weighted average as reference value. Performing the CKM-rescaling illustrated above, we estimate

$$\begin{aligned} \begin{aligned}&\mathcal {B}(B\rightarrow \pi \psi _j) \times \mathcal {B}(\psi _j \rightarrow e^+e^-) \\&\qquad = \left\{ \begin{array}{ll} (8.16\pm 1.54)\times 10^{-11} &{} \psi _j = \psi (3770)\,, \\ (2.26\pm 0.75)\times 10^{-11} &{} \psi _j =\psi (4040)\,, \\ (1.43\pm 0.22)\times 10^{-10} &{} \psi _j =\psi (4160)\,, \\ (3.52\pm 1.15)\times 10^{-11} &{} \psi _j =\psi (4415)\,, \end{array} \right. \end{aligned} \end{aligned}$$
(21)

which yield the \(\eta _V\) reported in the last column of Table 2.

Table 2 Numerical results for the moduli of the \(\eta _V\) parameters controlling long-distance effects in \(B^+\rightarrow \pi ^+\ell ^+\ell ^-\). The uncertainties include only the experimental errors on the \(\mathcal {B}(B^+\rightarrow \pi ^+ V)\) and \(\mathcal {B}(V\rightarrow e^+e^-)\) values used to determine a given \(\eta _V\)

3 Numerical analysis

3.1 Dilepton spectrum

Fig. 1
figure 1

Dilepton spectrum in \(B^+\rightarrow \pi ^+ \mu ^+\mu ^-\) within the SM. The dotted-dashed line indicate the perturbative contribution (ignoring its parametric uncertainty). The gray error band represents the 68% interval after sampling over magnitudes and phases of the vector resonance contributions. The vertical dashed lines denote the three short-distance dominated regions where we provide precise estimates of \(R_\pi \) (see text for details)

Having discussed the general decomposition of the \(B^+\rightarrow \pi ^+ \ell ^+\ell ^-\) decay amplitude, we are ready to present numerical predictions within the SM and beyond. We begin with an analysis of the dilepton invariant mass spectrum in the SM to identify viable \(q^2\)-regions in which to perform precise LFU tests.

In Sect. 2 we discussed how to estimate the parameters \(\eta _V\), which control the magnitude of long-distance contributions, obtaining the results summarized in Table 2. We also pointed out that, in the cases of the charmonia, we can use the results for the strong phases determined from \(B^+\rightarrow K^+\mu ^+\mu ^-\) in Ref. [53]. However, we have no constraints on the strong phases for the light-quark resonances, which we treat as free parameters.

In Fig. 1 we show the differential branching fraction for the \(B^+\rightarrow \pi ^+\mu ^+\mu ^-\) decay obtained adopting the following procedure:

  • (i) sampling the magnitudes \(\eta _V\) using independent Gaussian distributions, with central values and standard deviations defined by the figures in Table 2;

  • (ii) randomly sampling the light-quark phases in the interval \([0,2\pi )\);

  • (iii) randomly choosing one of the four possible sets of solutions in Ref. [53] for \(J/\psi \) and \(\psi (2S)\) phases and, within that set, using a multi-dimensional Gaussian distribution according to the central values and errors.

The gray band represents the 68% confidence interval resulting from the sampling. The dashed-dotted black line indicates the contribution obtained using only the perturbative value of \(\mathcal {C}_9\). We checked explicitly that the size of the uncertainties depends mainly on the lack of information on the strong phases and on the form factors uncertainties, while the errors on the \(\eta _V\) have a subleading impact. We stress that the uncertainties related to the form factors cancel in the LFU ratio.

From the plot in Fig. 1 we identify three regions where the rate is dominated by perturbative contributions: (i) the low-\(q^2\) region, \(q^2 \in [1.1,8]\,\text {GeV}^2\); (ii) the region between the two narrow charmonium states, \(q^2 \in [11,12.5]\,\text {GeV}^2\), and (iii) the high-\(q^2\) region, \(q^2 > 15.5\,\text {GeV}^2\).

3.2 The LFU ratio

Fig. 2
figure 2

Predictions for the parameters \(b_V\) (left) and \(b_L\) (right) in small \(q^2\) bins in the low \(q^2\) region (see main text). Numerical results for the different bins and their correlation matrix are given in Appendix 1

We are now ready to analyze the LFU ratio \(R_\pi \), defined in Eq. (1), in the three regions identified above. To estimate the sensitivity to the class of new physics (NP) models we are interested in, we modify the perturbative values of \(\mathcal {C}^\ell _{9,10}\) as follows:

$$\begin{aligned} \begin{aligned} \mathcal {C}_{9,10}^e&= \left. \mathcal {C}_{9,10}^e \right| _\mathrm{SM}\,, \\ \mathcal {C}_{9,10}^\mu&= \left. \mathcal {C}_{9,10}^\mu \right| _\mathrm{SM} + \Delta \mathcal {C}_{9,10}\,. \end{aligned} \end{aligned}$$
(22)

This allows us to expand \(R_\pi \) in the (small) NP contributions \(\Delta \mathcal {C}_{9,10}\) as

$$\begin{aligned} \begin{aligned} R_\pi [q^2_\text {min}, q^2_\text {max}]\, =\,&R_\pi ^\mathrm{SM} + b_V \Delta \mathcal {C}_9 + (b_V - b_L) \Delta \mathcal {C}_{10} \\&+ \mathcal {O}\left( \Delta \mathcal {C}_9^2 ,\Delta \mathcal {C}_9 \Delta \mathcal {C}_{10} , \Delta \mathcal {C}_{10}^2 \right) \,, \end{aligned} \end{aligned}$$
(23)

where \(R_\pi ^\mathrm{SM}\) and \(b_{V,L}\) are adimensional numerical coefficients which, in general, depend on the \(q^2\) region of interest. The definition of \(b_{V,L}\) is such that \(b_{V}\) (\(b_L\)) controls the effect of a pure vectorial (left-handed) NP contribution in the lepton current.

Performing the same sampling discussed in Sect. 3.1, with non-vanishing \(\Delta \mathcal {C}_{9,10}\), we estimated the numerical values of \(R_\pi ^\mathrm{SM}\) and \(b_{V,L}\), as well as the size of the quadratic terms in Eq. (23). In all regions of interest we find

$$\begin{aligned} \left| R_\pi ^\mathrm{SM} - 1 \right| < 0.01~. \end{aligned}$$
(24)

This is not surprising since we have not included QED corrections and, in this limit, the only breaking of universality within the SM is due to tiny phase-space corrections. The result in Eq. (24) remains true also when QED effects are taken into account, provided \(R_\pi ^\mathrm{SM}\) is defined in a photon-inclusive way [6, 54].Footnote 2

As far as the NP coefficients are concerned, we first note that quadratic terms in \(\Delta \mathcal {C}_{9,10}\) lead to corrections of at most \(1\%\) in \(R_\pi \), i.e. around or below the level of QED corrections, if \(|\Delta \mathcal {C}_{9,10}| < 1.0\). This condition is what we expect in the most plausible NP scenarios. For instance, in models based on a minimally broken \(U(2)^5\) symmetry, \(b\rightarrow s\ell ^+\ell ^-\) data implies \(\Delta \mathcal {C}_{9} = - \Delta \mathcal {C}_{10} = -0.43 \pm 0.11\) [36].

Focusing on the linear terms only, and proceeding as in Sect. 3.1, we find the following results for the three regions of interest:

$$\begin{aligned} \begin{aligned} b_V=&\, 0.22\pm 0.01 \,, \qquad&q^2 \in&\, [1.1,8.0]\,\text {GeV}^2\,, \\ b_V=&\, 0.19\pm 0.03 \,, \qquad&q^2 \in&\, [11.0, 12.5]\,\text {GeV}^2\,, \\ b_V=&\,0.249\pm 0.004\,, \qquad&q^2 >&\, 15.5\,\text {GeV}^2\,, \end{aligned} \end{aligned}$$
(25)

and

$$\begin{aligned} \begin{aligned} b_L=&\, 0.49\pm 0.04 \,, \qquad&q^2 \in&\, [1.1,8.0]\,\text {GeV}^2\,, \\ b_L=&\, 0.47\pm 0.04 \,, \qquad&q^2 \in&\, [11.0, 12.5]\,\text {GeV}^2\,, \\ b_L=&\,0.50\pm 0.01 \,, \qquad&q^2 >&\, 15.5\,\text {GeV}^2\,. \end{aligned} \end{aligned}$$
(26)

In Fig. 2 we show the results of a closer inspection of the low-\(q^2\) region. We divide the region into finer \(q^2\) bins of width \(0.5\, \text {GeV}^2\), except for the first bin, which is defined as \([1.1,1.5]~\text {GeV}^2\), and compute the \(b_{V,L}\) for each bin. We find that the relative uncertainties span from \(\sim 1\) to \(\sim 10\%\), the largest value being for the bins closer to the resonances. The predictions are highly correlated, especially for neighboring bins, as is expected for a slowly-changing function. Central values, uncertainties and the full correlation matrices are reported in Appendix 1.

The figures in Eqs. (25) and (26) indicate that we can perform very precise tests of LFU in \(B\rightarrow \pi \ell ^+ \ell ^-\) decays, irrespective of the sizable long-distance contributions affecting these modes. For instance, in the class of NP models analyzed in Ref. [36], we predict

$$\begin{aligned} \left. R_\pi \right| ^\mathrm{NP}_{U(2)^5} = 0.79 \pm 0.05_\mathrm{NP} \pm 0.02_\mathrm{LD}\,, \end{aligned}$$
(27)

in the low-\(q^2\) region. Here the first error is due to the NP model (or better the NP Wilson coefficients extracted from \(b\rightarrow s\) data), while the second (subleading) error is due to long-distance effects.

Experimental prospects for the measurement of \(R_\pi \) in the various phases of the LHCb Upgrade II are listed in Ref. [55]. Considering only the \([1.1,6.0]~\text {GeV}^2\) bin, and using the prediction in Eq. (27) as a reference value, we should expect to observe a deviation from the SM (i.e. \(R_\pi \not =1\)) exceeding the \(\sim 3\sigma \) level only with \(300\, \text {fb}^{-1}\). However, as we have shown, the low-\(q^2\) bin can be extended up to \(8.0~\text {GeV}^2\) with a significant statistical gain. Moreover, also the high-\(q^2\) region is theoretically clean. Extending the measurement of \(R_\pi \) in both these directions, a similar level of significance could be reached already with \(50\, \text {fb}^{-1}\).

4 Conclusions

The evidence of LFU violation, accompanied by the other anomalies observed in semi-leptonic B decays, may represent the first hint of physics beyond the SM. While it is premature to draw conclusions, the pattern of anomalies is tantalizingly coherent and, when combined, consistently points towards new dynamics of common short-distance origin.

In order to understand the flavor structure of such new dynamics, it is very important to collect additional indications of LFU violation in other low-energy processes. As we have shown in this letter, one candidate is the \(R_\pi \) ratio defined in Eq. (1). By means of a general analysis of long-distance contributions in \(B\rightarrow \pi \ell ^+\ell ^-\) decays, we have shown that \(R_\pi \) can be predicted with high accuracy, both within and beyond the SM, in large regions of the dilepton invariant mass spectrum. The numerical coefficients reported in Sect. 3.2 allow for the extraction of precise short-distance information from this observable. The combination of future data on \(R_\pi \) and \(R_{K^{(*)}}\) would be an extremely valuable tool to determine the orientation of the new dynamics in quark-flavor space, possibly confirming the link between LFU anomalies and Yukawa hierarchies.