1 Introduction

Charged-current semileptonic decays of b hadrons are a precious source of information about flavor physics, both within and beyond the standard model (SM). They are the primary source of information on the elements \(|V_{cb}|\) and \(|V_{ub}|\) of the Cabibbo–Kobayashi–Maskawa (CKM) mixing matrix [13] and, at the same time, they offer the possibility of interesting tests of physics beyond the SM via appropriate Lepton Flavor Universality (LFU) ratios. In this paper we concentrate on the simplest of such LFU ratios, namely

$$\begin{aligned} R_P = \frac{\mathcal {B}(\bar{B}\rightarrow P\tau \bar{\nu })}{\mathcal {B}(\bar{B}\rightarrow P\mu \bar{\nu })}, \end{aligned}$$
(1)

where \(P=D,\pi \).

The theoretical estimate of \( R_P\) within the SM relies dominantly on the hadronic form factors \(f_+\) (the vector form factor) and \(f_0\) (the scalar form factor); see Appendix A for their definitions. For both final states, precise lattice QCD result of these form factors have recently been published [4, 5]. In addition, Light-Cone Sum Rule (LCSR) results for the \(B\rightarrow \pi \) vector form factor and two of its derivatives have been obtained, which complement the lattice QCD results. According to these studies the SM prediction for \(R_D\) [5] is

$$\begin{aligned} R_D^\text {SM} = 0.300 \pm 0.008. \end{aligned}$$
(2)

On the experimental side, measurements of the ratio \(R_D\) have been published by both BaBar [6] and, more recently, by Belle [7],

$$\begin{aligned} R_D^\text {BaBar}= & {} 0.440 \pm 0.058 \pm 0.042,\nonumber \\ R_D^\text {Belle}= & {} 0.375 \pm 0.064 \pm 0.026, \end{aligned}$$
(3)

for while only upper experimental bounds on \(R_\pi \) are available [8]. Combining Babar and Belle results and normalizing them to the SM lead to

$$\begin{aligned} \Delta R_{D} = \frac{R_D^\mathrm{exp}}{R_D^\text {SM}} - 1 = 0.35 \pm 0.17. \end{aligned}$$
(4)

This deviation from the SM is not particularly significant; however, a similar effect has been observed also in the \(R_{D^*}\) ratios [6, 7, 9]. Combining the two deviations, which are compatible with a universal enhancement of semileptonic \(b\rightarrow c\tau \nu \) transitions over \(b\rightarrow c\mu \nu \) ones, the discrepancy with respect to the SM raises to about \(\sim 4\sigma \). This fact has stimulated several studies on possible New Physics (NP) explanations (see e.g. Ref. [1013]). As pointed out in Ref. [12], because of \(\tau \rightarrow \ell \bar{\nu }\nu \) decays, a possible enhancement of semileptonic \(b\rightarrow c\tau \nu \) transitions may have a non-trivial impact in the extraction of \(|V_{cb}|\) from the corresponding \(b\rightarrow c\ell \nu \) modes, and this impact is likely to be different for exclusive and inclusive modes.

Our main goal is to analyze how leptonic \(\tau \rightarrow \mu \bar{\nu }\nu \) decays affect the determination of \(R_P\) and, more generally, the kinematical distribution of \(\bar{B}\rightarrow P \mu \bar{\nu }\) decays via the decay chain \(\bar{B}\rightarrow P\tau (\rightarrow \mu \bar{\nu }\nu )\bar{\nu }\) in experimental analyses where there is no precise information available on the missing mass (or the initial B momentum). As we will discuss, our results provide a first attempt toward new strategies to improve the determination of \(R_P\) from data and, possibly, also the determination of \(|V_{cb}|\) and \(|V_{ub}|\). At first glance, leptonic \(\tau \) decay modes might seem unimportant, since they occur at the expense of an additional power of the Fermi coupling \(G_F\) at the amplitude level. However, this process occurs on-shell and the suppression of the \(\tau \) decay amplitude is compensated by the inverse of the \(\tau \) lifetime appearing in the \(\tau \) propagator. This becomes already apparent in the \(\tau \rightarrow \mu \bar{\nu }_\mu \nu _\tau \) branching fraction: \(\mathcal {B}(\tau \rightarrow \mu \bar{\nu }_\mu \nu _\tau ) = (17.41 \pm 0.04)~\%\) [14]. It is therefore interesting to calculate the rate for the decay chain \(\bar{B}\rightarrow P\tau (\rightarrow \mu \bar{\nu }\nu )\bar{\nu }\) and compute numerically its impact on the observable rate of \(\bar{B}\rightarrow P\mu X_{\bar{\nu }}\), \(X_{\bar{\nu }} = \lbrace \bar{\nu }, {\bar{\nu }\nu \bar{\nu }}\rbrace \), to which we will henceforth refer as the “neutrino-inclusive” decay.

The layout of this article is as follows. We continue in Sect. 2 with definitions and the bulk of our analytical results. Numerical results and their implications are presented in Sect. 3, and we summarize in Sect. 4. The appendix A contains details as regards the form factors, details concerning the kinematic variables are given in Appendix B, and the numeric results of the \(3\nu \) PDFs are reported in Appendix C.

2 Setup

2.1 Kinematics

As anticipated in the introduction, in this article we assume that experiments cannot distinguish between the semileptonic decay \(\bar{B}\rightarrow P\mu \bar{\nu }\) and \(\bar{B}\rightarrow P\tau (\rightarrow \mu \bar{\nu }\nu )\bar{\nu }\) using the missing-mass information. This assumption certainly holds for analyses performed at hadron colliders (e.g., by the LHCb experiment).Footnote 1 On the other hand, it does not hold for analyses performed at \(e^+e^-\) colliders with flavor tagging based on the full reconstruction of the opposite B decay, where \(\bar{B}\rightarrow P\mu \bar{\nu }\) and \(\bar{B}\rightarrow P\tau (\rightarrow \mu \bar{\nu }\nu )\bar{\nu }\) will be clearly distinguished using the missing-mass information. The latter type of analyses will certainly provide precise results in the future; however, they cannot be performed at present and will require high statistics. It is therefore useful to discuss the case where there is no (or poor) missing-mass information.

We write for the neutrino-inclusive differential decay width to one muon

$$\begin{aligned} \frac{{\mathrm {d}^{}}\Gamma \,(\bar{B}\rightarrow P \mu X_{\bar{\nu }})}{{\mathrm {d}^{}}q^2\, {\mathrm {d}^{}}\!\cos \vartheta _{[\mu ]}\,}\equiv & {} \frac{{\mathrm {d}^{}}\Gamma \,(\bar{B}\rightarrow P \mu \bar{\nu }_\mu )}{{\mathrm {d}^{}}q^2\, {\mathrm {d}^{}}\!\cos \vartheta _{[\mu ]}\,}\nonumber \\&+ \frac{{\mathrm {d}^{}}\Gamma \,(\bar{B}\rightarrow P \tau (\rightarrow \mu \bar{\nu }_\mu \nu _\tau ) \bar{\nu }_\tau )}{{\mathrm {d}^{}}q^2\, {\mathrm {d}^{}}\!\cos \vartheta _{[\mu ]}\,}\nonumber \\\equiv & {} \frac{{\mathrm {d}^{}}\Gamma \,_1}{{\mathrm {d}^{}}q^2\, {\mathrm {d}^{}}\!\cos \vartheta _{[\mu ]}\,} + \frac{{\mathrm {d}^{}}\Gamma \,_3}{{\mathrm {d}^{}}q^2\, {\mathrm {d}^{}}\!\cos \vartheta _{[\mu ]}\,}. \end{aligned}$$
(5)

In the above, we introduce the shorthand \(\Gamma _n\) for the specific decay width with \(n=1\) or \(n=3\) neutrinos in the final state.Footnote 2 The kinematic variables are defined as follows.

  • We define \(q^\mu \) as the momentum transfer away from the \(\bar{B}\)P system, i.e.: \(q^\mu \equiv p^\mu - k^\mu \), where p and k are the momenta of the \(\bar{B}\) and \(P=D,\pi \) mesons, respectively. For \(\Gamma _1\) this implies that \(q^\mu \) coincides with the momentum of the lepton pair \(\mu \bar{\nu }_\mu \). We stress that this does not hold for \(\Gamma _3\).

  • We define the angle \(\vartheta _{[\mu ]}\) via

    $$\begin{aligned} \cos \vartheta _{[\mu ]}\equiv 2\frac{\left( q - 2q_{[\mu ]}\right) \cdot k}{\sqrt{\lambda }}. \end{aligned}$$
    (6)

    We abbreviate the Källén function \(\lambda \equiv \lambda (M_B^2, M_P^2, q^2)\) here and throughout this article. For \(\Gamma _1\), the above formula coincides with

    $$\begin{aligned} \cos \vartheta _{[\mu ]}= 2\frac{\left( q_{[\bar{\nu }_\mu ]} - q_{[\mu ]}\right) \cdot k}{\sqrt{\lambda }}, \end{aligned}$$
    (7)

    and the physical meaning of \(\vartheta _{[\mu ]}\) is the helicity angle of the muon in the \(\mu \bar{\nu }_\mu \) rest frame, with \(-1 \le \cos \vartheta _\mu \le +1\). We stress that for \(\Gamma _3\) this physical interpretation is no longer valid. Yet, we find it convenient to keep using \(\cos \vartheta _{[\mu ]}\) for the description of the neutrino-inclusive rate \(\Gamma (\bar{B}\rightarrow P \mu X_{\bar{\nu }})\). We emphasize also that the phase space boundaries for \(\cos \vartheta _{[\mu ]}\) in \(\Gamma _3\) differ from those in \(\Gamma _1\), and they implicitly depend on the full kinematics of the \(3\nu \) decays.

For the description of \(\Gamma _3\), we need to define further kinematic variables, which will be integrated over at a later point. We choose \(q_{[\tau ]}^2\), the invariant-mass square of the \(\tau \) decay products, \(q_{[\nu _\tau \bar{\nu }_{\mu }]}^2 \equiv (q_{[\nu _\tau ]} + q_{[\bar{\nu }_{\mu }]})^2\), the invariant-mass square of the two neutrinos produced in the \(\tau \) decay, and the five following angles:

  1. 1.

    \(\vartheta _{[\tau ]}\), the helicity angle of the \(\tau \) in the \(\tau \bar{\nu }_\tau \) rest frame:

    $$\begin{aligned} \cos \vartheta _{[\tau ]}= \frac{(q - 2 q_{[\tau ]})\cdot k}{\beta _\tau \sqrt{\lambda }} + \frac{(1 - 2\beta _\tau )}{\beta _\tau } \frac{(M_B^2 - M_P^2 - q^2)}{2 \sqrt{\lambda }}, \end{aligned}$$
    (8)

    where \(2 \beta _\tau \equiv 1 - q_{[\tau ]}^2 / q^2\),

  2. 2.

    \(\phi \), the azimuthal angle between the \(\mu \)\(\nu _\tau \bar{\nu }_\mu \) plane and the \(\bar{B}\)\(\tau \bar{\nu }_\tau \) plane,

    $$\begin{aligned} \varepsilon (p, q, q_{[\mu ]}, q_{[\nu _\tau \bar{\nu }_\mu ]})= & {} -\frac{1}{2} \beta _{\nu \bar{\nu }} \sqrt{1 - 2 \beta _\tau } \beta _\tau q^2 \nonumber \\&\sqrt{\lambda } \sin \phi \sin \vartheta ^{*}_{[\mu ]}\sin \vartheta _{[\tau ]}, \end{aligned}$$
    (9)
  3. 3.

    \(\vartheta ^{*}_{[\mu ]}\), the polar angle of the \(\mu \) momentum in the \(\tau \) rest frame with respect to \(q_{[\nu _\tau \bar{\nu }_\mu ]}\) in the \(\tau \) rest frame:

    $$\begin{aligned} \cos \vartheta ^{*}_{[\mu ]}= & {} \frac{1}{2\beta _{\nu \bar{\nu }} \beta _\tau } \bigg [(1 - 2\beta _{\nu \bar{\nu }}) (1 - \beta _\tau )\nonumber \\&+ \frac{(q_{[\mu ]}- q_{[\nu _\tau \bar{\nu }_\mu ]}) \cdot q}{q^2}\bigg ], \end{aligned}$$
    (10)

    where \(2 \beta _{\nu \bar{\nu }} \equiv 1 - q_{[\nu _\tau \bar{\nu }_\mu ]}^2 / q_{[\tau ]}^2\),

  4. 4.

    \(\vartheta ^{**}_{[\bar{\nu }_\mu ]}\), the polar angle of the \(\bar{\nu }_\mu \) momentum in the \(\nu _\tau \bar{\nu }_\mu \) rest frame with respect to the \(\mu \) momentum in the \(\nu _\tau \bar{\nu }_\mu \) rest frame:

    $$\begin{aligned} \cos \vartheta ^{**}_{[\bar{\nu }_\mu ]}= \frac{(q_{[\nu _\tau \bar{\nu }_\mu ]}- 2 q_{[\bar{\nu }_\mu ]}) \cdot q_{[\mu ]}}{\beta _{\nu \bar{\nu }} q_{[\tau ]}^2}, \end{aligned}$$
    (11)
  5. 5.

    \(\phi ^{**}\), the azimuthal angle between the \(\tau \)\(\mu \) and \(\bar{\nu }_\mu \)\(\nu _\tau \) decay planes in the \(\tau \) rest frame,

    $$\begin{aligned} \varepsilon (q_{[\tau ]}, q_{[\bar{\nu }_\tau ]}, q_{[\mu ]}, q_{[\nu _\tau \bar{\nu }_\mu ]})= & {} \frac{1}{2} \beta _{\nu \bar{\nu }} \beta _\tau \sqrt{1 - 2 \beta _{\nu \bar{\nu }}} q^2 q_{[\tau ]}^2\nonumber \\&\sin \vartheta ^{*}_{[\mu ]}\sin \vartheta ^{**}_{[\bar{\nu }_\mu ]}\sin \phi ^{**}.\nonumber \\ \end{aligned}$$
    (12)

In general, we denote the solid angle in the \(\tau \bar{\nu }_\tau \) rest frame without any asterisks, the solid angle within the \(\tau \) rest frame with one asterisk, and the solid angle in the \(\bar{\nu }_\mu \nu _\tau \) rest frame with two asterisks.

With the above definitions of the kinematics in mind, we can now begin discussing phenomenological applications. We wish to first address the case, in which a \(3\nu \) event is misinterpreted as a 1-neutrino event. In such a case, the misreconstructed \(\cos \vartheta _{[\mu ]}\) reads

$$\begin{aligned} \cos \vartheta _{[\mu ]}\Big |_{3\nu }= & {} 2 \beta _{\nu \bar{\nu }} \Bigg \lbrace \left( \frac{(1 - 2\beta _{\nu \bar{\nu }})}{\beta _{\nu \bar{\nu }}} + 2 \beta _\tau \right) \frac{M_B^2 - M_P^2 - q^2}{2 \sqrt{\lambda }}\nonumber \\&+ \beta _\tau \cos \vartheta _{[\tau ]}\nonumber \\&- \left( 2\beta _\tau \frac{M_B^2 - M_P^2 - q^2}{2 \sqrt{\lambda }} - (1 {-} \beta _\tau ) \cos \vartheta _{[\tau ]}\right) \cos \vartheta ^{*}_{[\mu ]}\nonumber \\&- \sqrt{1 - 2 \beta _\tau } \sin \vartheta ^{*}_{[\mu ]}\sin \vartheta _{[\tau ]}\cos \phi \Bigg \rbrace . \end{aligned}$$
(13)

As an alternative to \(\cos \vartheta _{[\mu ]}\) we also consider \(E_\mu \), the muon energy in the B rest frame. It is defined in terms of Lorentz invariants as

$$\begin{aligned} E_\mu \equiv \frac{p \cdot q_{[\mu ]}}{M_B}. \end{aligned}$$
(14)

In the \(1\nu \) decay, \(E_\mu \) is not independent from our nominal choice of kinematic variables \(q^2\) and \(\cos \vartheta _\mu \). The expression for \(E_\mu \) reads

$$\begin{aligned} E_\mu \Big |_{1\nu } = \frac{1}{4 M_B}\left[ \left( M_B^2 - M_P^2 + q^2\right) - \sqrt{\lambda } \cos \vartheta _\mu \right] , \end{aligned}$$
(15)

and it attains its maximal value at \(q^2 = 0\) and \(\cos \vartheta _\mu = -1\). Its full range reads

$$\begin{aligned} m_\mu \le E_\mu \Big |_{1\nu } \le \frac{M_B^2 - M_P^2}{2 M_B}. \end{aligned}$$
(16)

However, for a misreconstructed \(3\nu \) event we obtain instead

$$\begin{aligned} E_\mu \Big |_{3\nu }= & {} \frac{\beta _{\nu \bar{\nu }}}{2 M_B}\Big [\left( M_B^2 - M_P^2 + q^2\right) \left( (1 - \beta _\tau ) + \beta _\tau \cos \vartheta ^{*}_{[\mu ]}\right) \nonumber \\&- \sqrt{\lambda }\left( \beta _\tau + (1 - \beta _\tau ) \cos \vartheta ^{*}_{[\mu ]}\right) \cos \vartheta _{[\tau ]}\nonumber \\&+ \sqrt{1 - 2 \beta _\tau } \sqrt{\lambda }\sin \vartheta ^{*}_{[\mu ]}\sin \vartheta _{[\tau ]}\cos \phi \Big ], \end{aligned}$$
(17)

which now exhibits an additional dependence on the kinematics variables \(\cos \vartheta ^{*}_{[\mu ]}\) and \(\phi \), as well as \(q_{[\nu _\tau \bar{\nu }_\mu ]}^2\). We find for its range

$$\begin{aligned} m_\mu \le E_\mu \Big |_{3\nu } \le \frac{M_B^2 - M_P^2 + m_\tau ^2 + \sqrt{\lambda \left( M_B^2, M_P^2, m_\tau ^2\right) }}{4 M_B}. \end{aligned}$$
(18)

2.2 Decay rate

In order to proceed, we require an analytic expression for the neutrino-inclusive differential decay rate. The result for \(\Gamma _1\) is known for some time in the literature (see e.g. [15, 16] for reviews in the presence of model-independent NP contributions). However, \(\Gamma _3\) has not been calculated to the best of our knowledge. We begin the computation with the matrix element for the \(\bar{B}(p) \rightarrow P(k) \tau (q_{[\tau ]}) \bar{\nu }(q_{[\bar{\nu }_\tau ]})\) transition:

$$\begin{aligned} i \mathcal {M}= & {} -i \frac{G_FV_{cb}}{\sqrt{2}} \left[ f_+(q^2) \left\{ (p + k)^\mu - \frac{M_B^2 - M_P^2}{q^2} q^\mu \right\} \right. \nonumber \\&\left. + f_0(q^2) \frac{M_B^2 - M_P^2}{q^2} q^\mu \right] L^{(V-A)}_\mu , \end{aligned}$$
(19)

with \(q \equiv p - k = q_{[\tau ]} + q_{[\bar{\nu }_\tau ]}\). In the above, we abbreviate the leptonic currents as

$$\begin{aligned} L_\mu ^{(V-A)}&\equiv \left[ \bar{u}(q_{[\tau ]}) \gamma _\mu (1 - \gamma _5) v(q_{[\bar{\nu }_\tau ]})\right] . \end{aligned}$$
(20)

The contributions to \(\Gamma _3\) then arise from the leptonic decay of the \(\tau \). The corresponding matrix elements can be readily obtained through the replacement

(21)

where \(m_\tau \) and \(\Gamma _\tau \) denote the mass and the total width of the \(\tau \) lepton, respectively.

The fully-differential rate for the 3-neutrino final state can then be expressed as:

$$\begin{aligned}&\frac{{\mathrm {d}^{7}}\Gamma _3\,}{{\mathrm {d}^{}}q^2\,{\mathrm {d}^{}}q_{[\nu _\tau \bar{\nu }_\mu ]}^2\,{\mathrm {d}^{2}}\Omega \, {\mathrm {d}^{}}\Omega ^*\, {\mathrm {d}^{2}}\Omega ^{**}\,}\nonumber \\&\quad = -\frac{3 G_F^2 |V_{cb}|^2 \sqrt{\lambda } (q^2 - m_\tau ^2) \left( m_\tau ^2 - q_{[\nu _\tau \bar{\nu }_\mu ]}^2\right) \mathcal {B}(\tau \rightarrow \mu \bar{\nu }\nu )}{2^{17} \pi ^5 m_\tau ^8 M_B^3 q^2} \nonumber \\&\qquad \times \,\Big [ |f_+|^2 \left( T_1 - \frac{M_B^2 - M_D^2}{q^2} T_2 + \frac{(M_B^2 - M_D^2)^2}{q^4} T_3\right) \nonumber \\&\qquad +\, \,\mathrm{Re}\,{(f_+\,f_0)} \left( \frac{M_B^2 - M_D^2}{q^2} T_2 - 2 \frac{(M_B^2 - M_D^2)^2}{q^4} T_3\right) \nonumber \\&\qquad +\, |f_0|^2 \frac{(M_B^2 - M_D^2)^2}{q^4} T_3 \Big ], \end{aligned}$$
(22)

with auxiliary quantities

$$\begin{aligned} T_1\equiv & {} (p + k)^\mu (p + k)^\nu \sum _\text {spins} \tilde{L}^{(V-A)}_\mu \tilde{L}^{*,(V-A)}_\nu ,\nonumber \\ T_2\equiv & {} \left( (p + k)^\mu q^\nu + (p + k)^\nu q^\mu \right) \sum _\text {spins} \tilde{L}^{(V-A)}_\mu \tilde{L}^{*,(V-A)}_\nu ,\nonumber \\\\ T_3\equiv & {} q^\mu q^\nu \sum _\text {spins} \tilde{L}^{(V-A)}_\mu \tilde{L}^{*,(V-A)}_\nu .\nonumber \end{aligned}$$
(23)

In the above we abbreviate \({\mathrm {d}^{2}}\Omega \, = {\mathrm {d}^{}}\!\cos \vartheta _{[\tau ]}\, {\mathrm {d}^{}}\phi \,\), \({\mathrm {d}^{}}\Omega ^*\, = {\mathrm {d}^{}}\!\cos \vartheta ^{*}_{[\mu ]}\,\), and \({\mathrm {d}^{2}}\Omega ^{**}\, = {\mathrm {d}^{}}\!\cos \vartheta ^{**}_{[\bar{\nu }_\mu ]}\, {\mathrm {d}^{}}\phi ^{**}\,\), and we emphasize that the integration range over \({\mathrm {d}^{}}\cos \vartheta \,\) goes from \(-1\) to \(+1\). The full expressions for \(T_{1,2,3}\) are quite cumbersome to typeset. Instead, we opt to publish them as ancillary files within the arXiv preprint of this article. We also find that the integration of Eq. (22) over \(\Omega ^{**}\), \(\Omega ^*\), \(\phi \), and \(q_{[\nu _\tau \bar{\nu }_\mu ]}^2\) yields \(\mathcal {B}(\tau \rightarrow \mu \bar{\nu }_\mu \nu _\tau ) \times {\mathrm {d}^{2}}\Gamma (\bar{B}\rightarrow P\mu \bar{\nu })\,/{\mathrm {d}^{}}q^2\, {\mathrm {d}^{}}\!\cos \vartheta _{[\tau ]}\,\) as required. This is a successful crosscheck of our calculation.

In order to carry out our phenomenological study of the quantities \(\cos \vartheta _{[\mu ]}\) in Eq. (13) and \(E_\mu \) in Eq. (17) in the decay chain \(\bar{B}\rightarrow P\tau (\rightarrow \mu \bar{\nu }\nu )\bar{\nu }\), we do not require any dependence on the \(\nu \bar{\nu }\) solid angle \(\Omega ^{**} = (\cos \vartheta ^{**}_{[\bar{\nu }_\mu ]}, \phi ^{**})\). We therefore integrate over the latter, and thus obtain the five-differential rate

$$\begin{aligned}&\frac{{\mathrm {d}^{5}}\Gamma _3\,}{{\mathrm {d}^{}}q^2\,{\mathrm {d}^{}}q_{[\nu _\tau \bar{\nu }_\mu ]}^2\,{\mathrm {d}^{2}}\Omega \, {\mathrm {d}^{}}\Omega ^*\,}\nonumber \\&\quad = \frac{\tilde{\Gamma }_3}{\pi m_\tau ^8 q^6} \left[ A + B \cos \vartheta _{[\tau ]}+ C \cos ^2\vartheta _{[\tau ]}\right. \nonumber \\&\qquad \left. +\, \left( D \sin \vartheta _{[\tau ]}+ E \sin \vartheta _{[\tau ]}\cos \vartheta _{[\tau ]}\right) \cos \phi \right] , \end{aligned}$$
(24)

with normalization

$$\begin{aligned} \tilde{\Gamma }_3 = \frac{|V_{cb}|^2G_F^2 \mathcal {B}(\tau \rightarrow \mu \nu \bar{\nu })}{2^{9} \pi ^3 M_B^3}. \end{aligned}$$
(25)

The angular coefficients in Eq. (24) read

$$\begin{aligned} A= & {} \left[ (q^2 - m_\tau ^2) \left( m_\tau ^2 - q_{[\nu _\tau \bar{\nu }_\mu ]}^2\right) \right] ^2 \sqrt{\lambda } \Big [\left( m_\tau ^2 + 2 q_{[\nu _\tau \bar{\nu }_\mu ]}^2\right) \nonumber \\&\times \,\left( |f_0|^2 (M_B^2 - M_P^2)^2 m_\tau ^2 + |f_+|^2 q^2 \lambda \right) \nonumber \\&-\,\left( m_\tau ^2 - 2 q_{[\nu _\tau \bar{\nu }_\mu ]}^2\right) \left( |f_0|^2 \left( M_B^2 - M_P^2\right) ^2 m_\tau ^2 \right. \nonumber \\&\left. - |f_+|^2 q^2 \lambda \right) \cos \vartheta ^{*}_{[\mu ]}\Big ],\nonumber \\ B= & {} 2|f_0||f_+|m_\tau ^2\left( M_B^2 {-} M_P^2\right) \lambda \left[ (q^2 - m_\tau ^2) \left( m_\tau ^2 - q_{[\nu _\tau \bar{\nu }_\mu ]}^2\right) \right] ^2\nonumber \\&\times \,\Big [(m_\tau ^2 + 2 q_{[\nu _\tau \bar{\nu }_\mu ]}^2)-(m_\tau ^2 - 2 q_{[\nu _\tau \bar{\nu }_\mu ]}^2) \cos \vartheta ^{*}_{[\mu ]}\Big ],\nonumber \\ C= & {} -|f_+|^2\lambda ^{3/2}\left[ \left( q^2 - m_\tau ^2\right) \left( m_\tau ^2 - q_{[\nu _\tau \bar{\nu }_\mu ]}^2\right) \right] ^2\nonumber \\&\times \,\left[ \left( q^2 - m_\tau ^2\right) \left( m_\tau ^2 + 2 q_{[\nu _\tau \bar{\nu }_\mu ]}^2\right) \right. \nonumber \\&\left. + \left( q^2 + m_\tau ^2\right) \left( m_\tau ^2 - 2 q_{[\nu _\tau \bar{\nu }_\mu ]}^2\right) \cos \vartheta ^{*}_{[\mu ]}\right] ,\\ D= & {} 2 m_\tau \sqrt{q^2}|f_0||f_+|(M_B^2 - M_P^2)\left[ \left( q^2 \right. \right. \nonumber \\&\left. \left. - m_\tau ^2\right) \left( m_\tau ^2 - q_{[\nu _\tau \bar{\nu }_\mu ]}^2\right) \right] ^2\left( m_\tau ^2 - 2 q_{[\nu _\tau \bar{\nu }_\mu ]}^2\right) \lambda \sin \vartheta ^{*}_{[\mu ]},\nonumber \\ E= & {} 2 m_\tau \sqrt{q^2}|f_+|^2[(q^2 - m_\tau ^2)\nonumber \\&(m_\tau ^2 - q_{[\nu _\tau \bar{\nu }_\mu ]}^2)]^2(m_\tau ^2 - 2 q_{[\nu _\tau \bar{\nu }_\mu ]}^2)\lambda ^{3/2}\sin \vartheta ^{*}_{[\mu ]}.\nonumber \end{aligned}$$
(26)

We can now proceed to the production of pseudo-events that are distributed as Eq. (24), which is a necessary prerequisite for our phenomenological applications in the following section.

3 Numerical results

Our numerical results are based on a Monte Carlo (MC) study of the decays \(\bar{B}\rightarrow P \mu \bar{\nu }\) and \(\bar{B}\rightarrow P \tau (\rightarrow \mu \nu \bar{\nu }) \bar{\nu }\). For this purpose, we added the signal PDFs for both decays to the EOS library of flavor observables [17]. The relevant form factors \(f_+\) and \(f_0\) are taken in the BCL parametrization [18]. The BCL parameters are fitted from a recent lattice QCD studies [4, 5], and additionally LCSR results in the case of \(\bar{B}\rightarrow \pi \) [19]; see Appendix A for details.

Fig. 1
figure 1

Histograms of \(4\times 10^5\) pseudo-events for the neutrino-inclusive decay \(B \rightarrow D \mu X_{\bar{\nu }}\) (a, b), as well as for the decay \(B \rightarrow D \tau (\rightarrow \mu \nu \bar{\nu })\bar{\nu }\) (c). We show histograms of distributions in the (misreconstructed) angle \(\cos \vartheta _{\mu }\) (a), and \(E_\mu \), the muon energy in the \(\bar{B}\) rest frame (b). The red areas correspond to the neutrino-inclusive decay, while the blue areas highlight the contributions stemming only from \(B\rightarrow D\tau (\rightarrow \mu \nu \bar{\nu })\bar{\nu }\). We also show the histogram of \(E_\mu \big |_{3\nu }\) and its compatibility with our ansatz Eq. (35) (c)

In order to obtain pseudo-events for the neutrino-inclusive decay, we carry out the following steps:

  1. 1.

    We draw \(4.8\times 10^6\) samples \(\lbrace \vec {X}^{(1)}_i\rbrace = \lbrace (q^2, \cos \vartheta _{[\mu ]})_i \rbrace \), which are distributed as their signal PDF \(P_1\),

    $$\begin{aligned} P_1(q^2, \cos \vartheta _{[\mu ]}) \equiv \frac{1}{\Gamma _1} \frac{{\mathrm {d}^{2}}\Gamma _1\,}{{\mathrm {d}^{}}q^2\, {\mathrm {d}^{}}\cos \vartheta _{[\mu ]}\,}. \end{aligned}$$
    (27)
  2. 2.

    We draw \(4.8\times 10^6\) samples \(\lbrace \vec {X}^{(3)}_i\rbrace = \lbrace (q^2, q_{[\nu _\tau \bar{\nu }_\mu ]}^2, \cos \vartheta _{[\tau ]}, \phi , \cos \vartheta ^{*}_{[\mu ]})_i \rbrace \), which are distributed as their signal PDF \(P_3\),

    $$\begin{aligned}&P_3(q^2, q_{[\nu _\tau \bar{\nu }_\mu ]}^2, \cos \vartheta _{[\tau ]}, \phi , \cos \vartheta ^{*}_{[\mu ]})\nonumber \\&\quad \equiv \frac{1}{\Gamma _3} \frac{{\mathrm {d}^{5}}\Gamma _3\,}{{\mathrm {d}^{}}q^2\, {\mathrm {d}^{}}q_{[\nu _\tau \bar{\nu }_\mu ]}^2\, {\mathrm {d}^{}}\cos \vartheta _{[\tau ]}\, {\mathrm {d}^{}}\phi \, {\mathrm {d}^{}}\cos \vartheta ^{*}_{[\mu ]}\,}. \end{aligned}$$
    (28)
  3. 3.

    We combine the two sets of samples with weights \(\omega _1 = \Gamma _1 / (\Gamma _1 + \Gamma _3)\) and \(\omega _3 = 1 - \omega _1\), respectively. The weights can be expressed in terms of \(R_P\) and \(\mathcal {B}(\tau \rightarrow \mu \nu \bar{\nu })\):

    $$\begin{aligned} \omega _1 = \frac{1}{1 + R_P \mathcal {B}(\tau \rightarrow \mu \nu \bar{\nu })}. \end{aligned}$$
    (29)

All samples are obtained from a Markov Chain Monte Carlo setup, which implements the Metropolis–Hastings algorithm [20, 21]. The first \(8\times 10^5\) samples per set are discarded, in order to minimize the impact from the Markov Chains’ starting values. In order to avoid correlations from rejection of proposals, we only take every tenth sample. The effective sample size is therefore \(4\times 10^5\). We provide the so-obtained pseudo-events online [22] in the binary HDF5 format.Footnote 3

3.1 \(\bar{B}\rightarrow D \mu X_{\bar{\nu }}\)

Distribution in \(\cos \vartheta _{[\mu ]}\) In the neutrino-inclusive decay, the misreconstructed observable \(\cos \vartheta _{[\mu ]}\) as given in Eq. (13) is no longer bounded by \(+1\). We find that it attains its maximal value

$$\begin{aligned} \max \cos \vartheta _{[\mu ]}\big |_{3\nu }&\simeq 56.7\quad \text {for} \quad q^2 = (M_B - M_D)^2,\quad q_{[\nu _\tau \bar{\nu }_\mu ]}^2\nonumber \\&= m_\tau ^2, \quad \cos \vartheta _{[\tau ]}= -\cos \vartheta ^{*}_{[\mu ]}= 1. \end{aligned}$$
(30)

The distribution of \(\cos \vartheta _{[\mu ]}\) in the neutrino-inclusive decay is shown in Fig. 1a, where we also disentangle the individual \(1\nu \) and \(3\nu \) contributions. We find that \(\cos \vartheta _{[\mu ]}\) exceeds 1 for \(\sim 23~\%\) of the \(3\nu \) events, and exceeds 2 for \(\sim 1.3~\%\) of \(3\nu \) events. As a consequence, we decide against a parametrization of the neutrino-inclusive PDF \(P(\cos \vartheta _\mu )\) in terms of Legendre polynomials (or any other orthonormal polynomial basis).

Fig. 2
figure 2

Histograms of \(4\times 10^5\) pseudo-events for the neutrino-inclusive decay \(B \rightarrow \pi \mu X_{\bar{\nu }}\) (a, b), as well as for the decay \(B \rightarrow \pi \tau (\rightarrow \mu \nu \bar{\nu })\bar{\nu }\) (c). We show histograms of distributions in the (misreconstructed) angle \(\cos \vartheta _{\mu }\) (a), and \(E_\mu \), the muon energy in the \(\bar{B}\) rest frame (b). The red areas correspond to the neutrino-inclusive decay, while the blue areas highlight the contributions stemming only from \(B\rightarrow \pi \tau (\rightarrow \mu \nu \bar{\nu })\bar{\nu }\). We also show the histogram of \(E_\mu \big |_{3\nu }\) and its compatibility with our ansatz Eq. (35) (c)

On the other hand, our findings imply that the \(\cos \vartheta _{[\mu ]}\) distribution can be used to extract the product \( R_D \mathcal {B}(\tau \rightarrow \mu \bar{\nu }\nu )\) from data. We can indeed write

$$\begin{aligned} R_D \mathcal {B}(\tau \rightarrow \mu \bar{\nu }\nu ) = \frac{\rho ^\mathrm{exp}_{D}}{\rho _D^0 - \rho ^\mathrm{exp}_{D}} \end{aligned}$$
(31)

where

(32)

Based on our MC pseudo-events, we find

$$\begin{aligned} \rho ^0_D = {0.234 \pm 0.001}\, \end{aligned}$$
(33)

where the error is dominantly statistical, and arises from our limited number of MC samples. We explicitly cross check our uncertainty estimate by re-running the simulations with modified inputs on the \(B\rightarrow D\) form factors. We find that shifting any single individual constraint in Table 1 by \(1\sigma \) yields results that are compatible with the interval given in Eq. (33).

The distribution in \(E_\mu \)  The distribution of \(E_\mu \) in the neutrino-inclusive decay is shown in Fig. 1b. We find that a lower cut \(E_\mu > 1.0\,\,\mathrm{GeV}\,\) can reduce the rate of misidentified \(3\nu \) events by a factor of \(\sim 4\), while \(\sim 76~\%\) of the \(1\nu \) events (the signal) remain. This corresponds to a reduction of the rate of background events in the neutrino-inclusive decay from its maximum value of \(R_D \mathcal {B}(\tau \rightarrow \mu \nu \bar{\nu }) \approx 5.2~\%\) down to \(1.3~\%\).

Alternatively, one can subtract the \(3\nu \) background from the neutrino-inclusive rate. For this purpose we proceed to obtain the relevant PDF of \(3\nu \) events. Since the ranges of \(E_\mu \Big |_{1\nu }\) and \(E_\mu \Big |_{3\nu }\) are very similar, we can remap their union to a new kinematic variable y,

$$\begin{aligned}&y \equiv \frac{2 E_\mu }{E_\mu ^\text {max}} - 1, \quad \text {with }E_\mu ^\text {max}\nonumber \\&\quad = \max \left( E_\mu \Big |_{1\nu },\,E_\mu \Big |_{3\nu }\right) \simeq 2.31\,\,\mathrm{GeV}\,,\nonumber \\&\qquad \text {so that } -1 \le y \le +1. \end{aligned}$$
(34)

We then make an ansatz for the PDF \(P_3(y) \equiv {\mathrm {d}^{}}\Gamma _3\,/{\mathrm {d}^{}}y\,\) by expanding in Legendre polynomials \(p_k(y)\):

$$\begin{aligned} P_3(y) = \frac{1}{2} + \sum _{k=1}^{12} c^{(3)}_k p_k(y). \end{aligned}$$
(35)

Since the Legendre polynomials form an orthogonal basis of function on the support \([-1, +1]\), the coefficients \(c^{(3)}_k\) are independent of the degree of \(P_3(y)\). Their mean values and covariance are obtained using the method of moments; see [23] for a recent review. We find that our ansatz Eq. (35) describes the PDF exceptionally well, and refer to Fig. 1c for the visualization. Our results for the mean values and covariance matrix of the moments are compiled in Table 3. They can be used in upcoming experimental studies in order to cross check the signal/background discrimination.

3.2 \(\bar{B}\rightarrow \pi \mu X_{\bar{\nu }}\)

Based on the \(\bar{B}\rightarrow \pi \) form factors parameters as described in Appendix A, we obtain

$$\begin{aligned} R_\pi ^\text {SM} = 0.70 \pm 0.01, \end{aligned}$$
(36)

which is in good visual agreement with the plot of \(R_\pi \) in figure 8 of Ref. [24]. This result implies a potentially larger impact of the \(3\nu \) decays as a background in the extraction of both \(R_\pi \) and \(|V_{ub}|\).

Distribution in \(\cos \vartheta _{[\mu ]}\) As in the case of \(\bar{B}\rightarrow D\mu X_{\bar{\nu }}\), the misreconstructed observable \(\cos \vartheta _{[\mu ]}\) is no longer bounded from above by \(+1\). However, we find that its maximal value is much smaller for \(\bar{B}\rightarrow \pi \) transitions than it is for \(\bar{B}\rightarrow D\) transitions:

$$\begin{aligned}&\max \cos \vartheta _{[\mu ]}\big |_{3\nu } \simeq 3.75\quad \text {for } \quad q^2 = (M_B - M_\pi )^2,\nonumber \\&\quad q_{[\nu _\tau \bar{\nu }_\mu ]}^2 = m_\tau ^2,\quad \cos \vartheta _{[\tau ]}= -\cos \vartheta ^{*}_{[\mu ]}= 1. \end{aligned}$$
(37)

A consequence of this smaller upper bound in \(\bar{B}\rightarrow \pi \) transitions, the tail of \(3\nu \) events is much lighter; see Fig. 2a. This is also reflected in our numerical result for the ratio \(\rho ^0_\pi \),

$$\begin{aligned} \rho ^0_{\pi } = {(2.89 \pm 0.03)} \times 10^{-2}. \end{aligned}$$
(38)

We can therefore not recommend to extract the ratio \(R_\pi \) through a lower cut on \(\cos \vartheta _{[\mu ]}\). Our result also shows that more than \(97~\%\) of \(3\nu \) events fall in the physical region of \(1\nu \) events.

Distribution in \(E_\mu \) We find that a lower cut \(E_\mu > 1.5\,\,\mathrm{GeV}\,\) can reduce the rate of of misidentified \(3\nu \) events by a factor of \({\sim }10\), while \({\sim }69~\%\) of the \(1\nu \) events (the signal) remain. This corresponds to a reduction of the rate of background events in the neutrino-inclusive decay from its maximum value of \(R_\pi \mathcal {B}(\tau \rightarrow \mu \nu \bar{\nu }) \simeq 12.1~\%\) down to \({\sim }1.2~\%\).

For the range of \(E_\mu \) we find

$$\begin{aligned} \max \left( E_\mu \Big |_{1\nu }, E_\mu \Big |_{3\nu }\right) \simeq 2.64\,\,\mathrm{GeV}\,, \end{aligned}$$
(39)

and the energy ranges are overlapping given our numerical precision. Thus, the description of the neutrino-inclusive rate though \(E_\mu \), or equivalently y, should work even better for \(\bar{B}\rightarrow \pi \) transitions than for \(\bar{B}\rightarrow D\) transitions. Our results for the mean values and covariance matrix of the Legendre moments \(c_k^{(3)}\) are compiled in Table 4. We refer to Fig. 2c for a comparison of \(P_3(y)\) with our MC pseudo-events.

3.3 Implications for the extraction of \(|V_{cb}|\) and \(|V_{ub}|\)

Using the above results we can finally draw some semi-quantitative conclusions about the error in the extraction \(|V_{cb}|\) and \(|V_{ub}|\) from \(b\rightarrow c(u) \ell \nu \) decays. The presence of the \(\tau \rightarrow \mu \bar{\nu }\nu \) background in those processes can be dealt with, experimentally, in different ways. The two extreme cases we can envisage are the following: (i) reduction of the background via explicit cuts; (ii) fully inclusive subtraction. The first method can be applied to exclusive decays such as those discussed in the present paper. As shown above, combining cuts in \(E_\mu \) and \(\cos \vartheta _{[\mu ]}\) leads to a significant reduction of the \(\tau \rightarrow \mu \bar{\nu }\nu \) contamination in \(\bar{B}\rightarrow D\mu X_{\bar{\nu }}\), with negligible implications for the extraction of \(|V_{cb}|\). However, this procedure cannot be applied to fully inclusive modes. In the latter case, the \(\tau \rightarrow \mu \bar{\nu }\nu \) contamination is more likely to be simply subtracted from the total number of events. If this subtraction is made assuming the SM expectation of \(R_D\) (and \(R_{D*}\)), it leads to systematic error if \(\Delta R_{D} \not =0\), i.e. in the presence of New Physics [12]. The maximal value of this error is

$$\begin{aligned} \frac{ \Delta |V_{cb}|^{(\mathrm incl.)} }{ |V_{cb}| } = \frac{1}{2} \Delta R_{D} \mathcal {B}(\tau \rightarrow \mu \bar{\nu }\nu ) \approx 0.9~ \%, \end{aligned}$$
(40)

which is not far from the combined theory and experimental error presently quoted for \(|V_{cb}|\) [14]. We thus conclude that the \(\tau \rightarrow \mu \bar{\nu }\nu \) contamination must be carefully analyzed in the determination of \(|V_{cb}|\).

The impact of the \(\tau \rightarrow \mu \bar{\nu }\nu \) contamination is more difficult to be estimated in the \(|V_{ub}|\) case. On the one hand, the large value of \(R_\pi \) leads to a potentially larger impact. On the other hand, even in inclusive analyses some cut on \(E_\mu \) is unavoidable in order to reduce the \(b\rightarrow c \ell \nu \) background: as shown above, this naturally leads to a significant reduction of the \(\tau \rightarrow \mu \bar{\nu }\nu \) contamination. Given the present large experimental errors, the \(\tau \rightarrow \mu \bar{\nu }\nu \) contamination is likely to be a subleading correction in the extraction of \(|V_{ub}|\), but it is certainly an effect that has to be properly analyzed in view of future high-statistics data.

4 Summary

Lepton Flavor Universality tests in charged-current semileptonic B decays provide a very interesting window on possible physics beyond the SM. In the paper we have analyzed how the leptonic \(\tau \rightarrow \mu \bar{\nu }\nu \) decays affect the determination of the LFU ratios \(R_P\), where \(P=D,\pi \). In particular, we have presented a complete analytical determination of the observable distributions (energy spectrum and helicity angle of the muon) of the \(\bar{B}\rightarrow P\tau (\rightarrow \mu \bar{\nu }\nu )\bar{\nu }\) decay chain. This result has allowed us to identify clean strategies both to extract \(R_P\) from measurements of the \(\bar{B}\rightarrow P\mu X_{\bar{\nu }}\) neutrino-inclusive rate, and also to minimize the impact of the \(\tau \rightarrow \mu \bar{\nu }\nu \) decay in the three-body \(\bar{B}\rightarrow P \mu \nu \) modes. Finally, this study has also allowed us to conclude that the \(b \rightarrow c \tau (\rightarrow \ell \bar{\nu }\nu ) \nu \) background in \(b\rightarrow c \ell \nu \) decays represents a non-negligible source of uncertainty for the extraction of \(|V_{cb}|\) in the presence of NP modifying \(R_D\): its impact could reach the \(\sim 1~\%\) level and has to be analyzed with care mode by mode.