1 Introduction

It is reported by several experimental collaborations, e.g., by CMS [3], ATLAS [4], LHCb [5], BABAR [6,7,8,9] and Belle [10], that the branching ratios for the charged lepton flavour violating (CLFV) decays \(\tau ^{\pm } \rightarrow \ell ^{\pm } \ell '^{+} \ell '^{-}\), with \(\ell , \ell ' = e, \mu \), can be as large as \(10^{-14}\) in the Standard Model extended with either a Dirac or a Majorana mass term for neutrinos. This follows from a claim by Pham [2] that for these decays the GIM mechanism [11] produces a suppression of the form \(|\sum _i U_{\ell i} U^*_{Li} \log x_i|^2\), where \(x_i = m_{\nu i}^2/M_W^2\), \(i=1,2,3\), \(m_{\nu i}\) and \(M_W\) are the masses of the three neutrinos and the W boson, and U is the Pontecorvo–Maki–Nakagawa–Sakata (PMNS) mixing matrix [12, 13]. The result in [2] is in sharp contrast with the first evaluation by Petcov [1], which showed that these CLFV decays are instead power suppressed by \(\vert \sum _i U_{\ell i} U^*_{L i} \, x_i \log x_i\vert ^2\), so that the smallness of the ratios \(m_{\nu i}/M_W\) crushes the branching fractions well below \(10^{-54}\), far beyond the sensitivity of any foreseeable experiment.

The calculations in [1] and [2] differ as follows. Reference [1] employed for the evaluation of the one-loop diagrams the zero-momentum-limit (ZML) approximation, which assumes vanishing momenta and masses of the external particles while it retains the dependence on the internal masses of neutrinos and the W boson. The ZML implicitly assumes the mass scale hierarchy

$$\begin{aligned} \mathrm{(ZML)} \quad {\mathcal {P}} \ll m_{\nu i} \ll M_W, \end{aligned}$$

where \({\mathcal {P}}\) generically stands for any of the external particle momenta and masses, e.g. \({\mathcal {P}} \sim m_L\) or \({\mathcal {P}} \sim m_\ell \). This approximation, even if far from the physical situation, allows a substantial simplification of the one-loop integrals, as in this way they depend only on \(x_i\). Ref. [2], on the contrary, argued that once the external momentum dependence is taken into account, the GIM cancellation in \(L \rightarrow \ell \ell '\ell '\), with \(L=\tau \) or \(\mu \), becomes actually much milder, with a suppression only of the form \(|\sum _i U_{\ell i} U^*_{Li} \log x_i|^2\), which leads to branching ratio values of the order of \(10^{-14}\).

If the prediction in [2] were true, it would imply, with current values for neutrino mixing angles and mass splittings [14], that the branching ratio of \(\mu \rightarrow eee\) could reach \(10^{-17}\), for a lightest-neutrino mass of the order of \(10^{-10}~\hbox {eV}\) or smaller. This would be just around the corner for the Mu3e experiment currently under development at the Paul Scherrer Institute in Switzerland, which aims to reach a sensitivity of Br\((\mu \rightarrow e e e) \sim 10^{-16}\) [15]. For the tau, the rates would be in the range \(10^{-16} - 10^{-13}\), still several orders of magnitude smaller than the current world averages, Br\((\tau \rightarrow \ell \ell ' \ell ') \lesssim 10^{-8}\) [16], and the expected sensitivity of Belle II, Br\((\tau \rightarrow \ell \ell ' \ell ') \lesssim 10^{-10}\) [17], and the HL-LHC, Br\((\tau \rightarrow \ell \ell ' \ell ') \lesssim 10^{-9}\) [18,19,20], but more than 40 orders of magnitude larger than the prediction of [1].

Therefore, the question we address in this letter is if the branching ratios of \(L\rightarrow \ell \ell '\ell '\) can really change so dramatically once one assumes the physical limit (PL), i.e. the hierarchy

$$\begin{aligned} (\mathrm {PL}) \quad m_{\nu i} \ll {\mathcal {P}} \ll M_W, \end{aligned}$$

instead of the ZML, and keeps full dependence of external momenta and masses in the loop diagrams.

Fig. 1
figure 1

One-loop diagrams contributing to the CLFV decay \(L^- \rightarrow \ell ^- \ell '^+\ell '^-\) in the unitary gauge: the Z penguins (a, b), the photon penguin (c) and the box (d). Wave function corrections (not depicted here) must be included as well

In [2] it is argued that in the Z-penguin diagram shown in Fig. 1a there are two propagators of nearly massless fermions, which give rise to a \(\log x_i\) when the momentum q of the Z boson approaches \(q^2 = 0\). This argument is supported by a computation of the Z-penguin as an expansion in \(q^2/M_W^2\) taking the form \(f(q^2,x_i) = f_0(x_i) + (q^2/M_W^2) f_1(x_i) + \cdots \). By noticing that \(f_0 \sim x_i \log x_i\) is suppressed—the term computed in [1]—while \(f_1 \sim \log x_i\) is not, Ref. [2] concludes that \((q^2/M_W^2) \log x_i \) must dominate the branching ratio in the \(x_i \rightarrow 0\) limit.

This conclusion is flawed. First of all, in [2] the loop integrals written in terms of Feynman parameters are computed via a simple Taylor expansion of the denominator appearing inside. Such approximation is not legitimate for arbitrary values of the Feynman parameters, so it does not lead to an expansion of the integral itself. A proper asymptotic expansion of a Feynman integral can be obtained, for instance, via the expansion-by-regions method [21, 22], in which one divides the whole integration domain into various regions and then performs different Taylor expansions in each region. Only the sum of all regions’ contributions eventually yields the desired asymptotic series. In addition to that, even if such \(q^2/M_W^2\) expansion were performed correctly, the calculation presented in [2] implicitly assumes also the hierarchy \(q^2 \ll m_{\nu i}^2\). Therefore, the series expansion \(f_0(x_i) + (q^2/M_W^2) f_1(x_i) +\cdots \) does not reproduce the correct \(x_i \rightarrow 0\) limit at fixed values of \(q^2\), since this limit lies beyond the validity range of \(q^2 \ll m_{\nu i}^2\).

Recently, Ref. [23] presented a calculation of \(L\rightarrow \ell \ell ' \ell '\) in the PL, in which the one-loop diagrams are numerically evaluated with full dependence on \({\mathcal {P}}\). At variance with [2], they found branching ratios compatible with those in the ZML or smaller. However, the authors of Ref. [23] neglect the contribution from \(\gamma \)-penguins (as in Fig. 1c) and therefore their results are gauge dependent. Indeed, in processes with flavour changing neutral currents the gauge cancels entirely only in the sum of boxes, Z- and \(\gamma \)-penguins [24, 25]. Even if Ref. [1] retained only the logarithmic enhanced term \(x_i \log x_i\) arising only from the Z penguins and the boxes, the omission of \(\gamma \)-penguins is not legitimate anymore as soon as one departs from this approximation. So we are still left uncertain whether the branching ratios in [23] are smaller as a consequence of calculating in the Feynman gauge or if there is a deeper physical meaning.

In this letter we present the decay widths of \(L \rightarrow \ell \ell ' \ell '\) in the PL, fully analytic in \(M_W\), \(m_\nu \) and external momenta and masses. We compute them by making a systematic asymptotic expansion in \({\mathcal {P}}/M_W\) and \(m_\nu /{\mathcal {P}}\) of all Feynman diagrams by means of the expansion by regions. We will show that the neutrino mass dependence \(|\sum _i x_i \log x_i|^2\) in the ZML is replaced in the PL by a much smaller enhancement \(|\sum _i x_i \log (m_L^2/M_W^2)|^2\). We will give an explanation of this exchange of mass scales in the logarithm by analysing the effective operators mediating the decay once the Z and the W bosons are integrated out.

Table 1 Branching ratio for the CLFV decays \(L \rightarrow \ell \ell '\ell '\) in the ZML and the PL for normal ordering (NO) and inverted ordering (IO) of neutrino masses. The ratio between the two is also reported. In the ZML we assume \(m_1 = 0\) (NO) or \(m_3=0\) (IO)

2 Details of the calculation

Let us consider the SM extended with neutrino masses of either a Dirac or Majorana nature. The flavour eigenstates of the left-handed neutrino fields \(\nu _{\ell L}\) entering in the weak interactions become linear combinations of the three mass eigenstates \(\nu _i\) with masses \(m_{\nu i}\):

$$\begin{aligned} \nu _{\ell L} = \sum _{i=1}^3 U_{\ell i} \nu _{i L}, \quad \ell = e,\mu ,\tau , \end{aligned}$$
(1)

where \(\nu _{i L}\) is the left-handed component of \(\nu _i\) and U is the PMNS matrix. The decay of a heavy lepton \(L=\mu ,\tau \) into three lighter charged leptons \(\ell ,\ell '=\mu ,e\),

$$\begin{aligned} L^{\pm } \rightarrow \ell ^{\pm } \ell '^+ \ell '^-, \end{aligned}$$
(2)

with masses \(m_L\), \(m_\ell \) and \(m_{\ell '}\), respectively, proceeds then via three classes of one-loop diagrams shown in Fig. 1: the boxes, the Z and \(\gamma \) penguins. We neglect diagrams with the exchange of a Higgs boson, as they are further suppressed by two extra powers of \(1/M_W^2\) due to the Yukawa interaction. In the case of \(L \rightarrow \ell \ell '\ell '\) there are additional box diagrams with the incoming L line connected to the outgoing \(\ell '\) line, and the outgoing \(\ell \) line to the incoming \(\ell '^{-}\) line. They can be neglected since they have two instances of CLFV and thus two of the GIM-breaking factors. The partial width given in [1] was obtained in the ZML. In this approximation, the one-loop integrals depend only on the ratio \(x_i\). To leading order in \(x_i\), the amplitudes of the three classes of diagrams are:

$$\begin{aligned} i{\mathcal {A}}_\gamma&=-\frac{\alpha G_F}{ \sqrt{2} \pi } (\bar{\ell } \gamma ^\mu P_L L)(\bar{\ell '} \gamma _{\mu } \ell ') \sum _iU_{\ell i} U^*_{L i} \,x_i , \end{aligned}$$
(3)
$$\begin{aligned} i{\mathcal {A}}_Z&=\frac{\alpha G_F}{\sqrt{2} \pi } \sum _iU_{\ell i} U^*_{L i} \, x_i (3 + \log x_i) \nonumber \\&\quad \times \Bigg [(\bar{\ell } \gamma ^\mu P_L L)(\bar{\ell '} \gamma _{\mu } \ell ') -\frac{1}{2\sin ^2 \theta _W}(\bar{\ell } \gamma ^\mu P_L L) (\bar{\ell '} \gamma _{ \mu } P_L\ell ')\Bigg ], \end{aligned}$$
(4)
$$\begin{aligned} i{\mathcal {A}}_\mathrm {Box}&=\frac{\alpha G_F}{ 2 \sqrt{2} \pi \sin ^2 \theta _W} (\bar{\ell } \gamma ^\mu P_L L)(\bar{\ell '} \gamma _{\mu } P_L \ell ') \nonumber \\&\quad \times \sum _iU_{\ell i} U^*_{L i} \,x_i (1+\log x_i), \end{aligned}$$
(5)

where \(G_F\) and \(\alpha \) are the Fermi and fine structure constants, respectively, and \(\sin ^2 \theta _W\) is the sine of the Weinberg angle. Retaining only the terms enhanced by \(\log x_i \sim 50\), which appear in the boxes and Z penguins, one obtains the prediction for the rate in the ZML [1]:

$$\begin{aligned} \frac{\Gamma (L \rightarrow \ell \ell \ell )}{\Gamma _0} =\frac{3 \alpha ^2}{16\pi ^2}\left| \sum _{i=2}^3U_{\ell i} U^*_{L i} \frac{\Delta m_{i1}^2}{M_W^2}\log \frac{\Delta m_{i1}^2}{M_W^2}\right| ^2 , \end{aligned}$$
(6)

and \(\Gamma (L \rightarrow \ell \ell ' \ell ', \ell \ne \ell ') = \frac{2}{3} \Gamma (L \rightarrow \ell \ell \ell ) \), where \(\Gamma _0 = G_F^2 m_L^5/(192 \pi ^3)\). Equation (6) is obtained by taking the limit \(m_1 \rightarrow 0\) and assumes normal neutrino mass hierarchy, i.e. \(m_1<m_2 < m_3\). For inverted mass hierarchy, the subscript ‘1’ must be substituted with ‘3’ and \(i=1,2\). Equation (6) also neglects subleading \(m_{\ell ,\ell '}/m_L\) corrections from phase space integration. The values of the branching ratios in the ZML with normal and inverted mass hierarchy are reported in Table 1. Current PDG values are employed for the lepton masses, neutrino mass splittings and neutrino mixing angles [14].

Let us now describe our calculation performed in the PL. We generated the complete set of diagrams in the Feynman gauge, and their relative counter-terms, using FeynArts [26] with a modified version of the SM file to account for Dirac neutrino masses and lepton flavour mixing. The amplitudes were reduced to one-loop tensor integrals using Form [27], via the FormCalc package [28], keeping the complete dependence on \(M_W\), \({\mathcal {P}}\) and \(m_{\nu i}\). The setup was independently checked by a second implementation based on FeynCalc [29].

Nowadays, lengthy expressions for the tensor integrals could be obtained in principle in an analytical form with full dependence on \(m_{\nu i}, M_W, m_L, m_{\ell ,\ell '}\) and the invariants \(s_{ij} =(p_i+p_j)^2\), with \(p_{1-3}\) the momenta of the three outgoing leptons, however their use is prohibitively cumbersome. It is therefore more helpful to compute them as series in the small parameters \({\mathcal {P}}^2/M_W^2\) and \(m_{\nu i}^2/{\mathcal {P}}^2\). To this end, we employed the method of expansion by regions (for an introduction see e.g. [22]). For all one-loop diagrams, we divided the integration domain into different regions and, for each region, we performed a Taylor expansion with respect to the parameters that are considered small there. Afterwards, by integrating every expanded integrand over the whole domain, and by summing the contributions from all the regions, we obtained the desired asymptotic expansion of the original one-loop diagram. The advantage of this method, compared for instance to an expansion of the full result after integration, is that the integrals arising in each region can be handled much more easily than the initial one, as typically they depend on just one or two mass scales.

We performed first an expansion assuming \({\mathcal {P}} \sim m_{\nu _i} \ll M_W\), without distinguishing at this point the two scales \(m_{\nu i}\) and \({\mathcal {P}}\). In a second step, the integrals arising from the first stage are further expanded in the limit \(m_{\nu i} \ll {\mathcal {P}}\). The total amplitude is then obtained by retaining from this expansion only the leading dependence on \(m_{\nu i}\), while higher order terms further suppressed by \({\mathcal {P}}^2/M_W^2\) or \(m_{\nu i }^2/{\mathcal {P}}^2\), or terms independent of \(m_{\nu i}\), are discarded. We performed several numerical checks at different stages of the calculation as a sanity check. To this end we took advantage of Mathematica’s arbitrary-precision numbers and Package-X’s analytic expressions of one-loop integrals [30], available in any kinematic configuration. We verified that our approximated expressions for the tensor integrals became increasingly accurate both by including higher order terms in the expansion as well as by taking the limit \(M_W \rightarrow \infty \) and \(m_{\nu i} \rightarrow 0\), at fixed values of \({\mathcal {P}}\).

3 Results

The partial widths are given by integrating the squared amplitude over the three-particle phase space of \(L\rightarrow \ell \ell '\ell '\). These massive phase space integrals depend on two variables \(s_{ij}\), plus two or three masses of the external particles. By employing the expansion by regions one more time, we computed the phase space integrals as series in \(m_{\ell ^{(')}}/m_L\), retaining only the leading terms in the final expressions for the rate. We obtain for normal neutrino mass hierarchy:

$$\begin{aligned}&\frac{\Gamma (L \rightarrow \ell \ell \ell )}{\Gamma _0} =\frac{3 \alpha ^2}{16 \pi ^2}\left| \sum _{i=2}^3 U_{\ell i} U_{L i}^* \frac{\Delta m_{i1}^2}{M_W^2} \right| ^2 \nonumber \\&\quad \times \Bigg [\log ^2 x_L+2 \log x_L -\frac{1}{6}\log x_\ell +\frac{19}{18} +\frac{17}{18}\pi ^2 \nonumber \\&\quad \qquad -\frac{1}{\sin ^2 \theta _W} \left( \log x_L + \frac{11}{12} \right) +\frac{3}{8 \sin ^4 \theta _W}\Bigg ], \end{aligned}$$
(7)

where \(x_L = m_L^2/M_W^2\) and \(x_\ell = m_\ell ^2/M_W^2\). For \(L \rightarrow \ell \ell '\ell '\) (\(\ell \ne \ell '\)) we have:

$$\begin{aligned}&\frac{\Gamma (L \rightarrow \ell \ell '\ell ')}{\Gamma _0} =\frac{3 \alpha ^2}{16 \pi ^2}\left| \sum _{i=2}^3 U_{\ell i} U_{L i}^* \frac{\Delta m_{i1}^2}{M_W^2} \right| ^2 \nonumber \\&\quad \times \Bigg [\frac{2}{3} \log ^2 x_L+\frac{25}{18} \log x_L -\frac{1}{6} \log x_{\ell '}+\frac{55}{108}+\frac{2}{3}\pi ^2 \nonumber \\&\quad \quad -\frac{1}{\sin ^2 \theta _W} \left( \frac{\log x_L}{2} + \frac{11}{24}+\frac{\pi ^2}{18}\right) \nonumber \\&\quad \qquad +\frac{1}{\sin ^4 \theta _W}\left( \frac{3}{16}+\frac{\pi ^2}{36} \right) \Bigg ]. \end{aligned}$$
(8)

Equations (7) and (8) depend only on the neutrino mass splittings \(\Delta m_{ij}^2\) and not on the value of the lightest neutrino’s mass. The expressions with inverted neutrino hierarchy are obtained similarly as for the ZML. Higher order terms not included in Eqs. (7) and (8) are suppressed by \(m_L^2/M_W^2\) or \(m_{\nu i}^2/m_L^2\). These corrections would arise by further expanding the squared amplitude. In addition, there are also subleading \(m_\ell /m_L\) terms from the phase space integration.

At variance with the results presented in [2], Eqs. (7) and (8) are power suppressed by \(\vert \sum _i U_{\ell i} U_{L i}^* x_i\vert ^2\) and yield values for the branching ratios of the order of \(10^{-55}\), see Table 1. Moreover, compared to Eq. (6) in the ZML, they do not have a logarithmic enhancement \(\log ^2 x_i \sim 2500\). On the contrary in its place we get only \(\log ^2 x_\mu \sim 176\) or \(\log ^2 x_\tau \sim 58\), which are of comparable size with respect to other terms appearing in Eqs. (7) and  (8). For this reason, the branching ratios in the PL turn out to be about one order of magnitude smaller than those in the ZML.

Note also that the presence of the singular terms \(\log x_{\ell }\) or \(\log x_{\ell '}\) is not in contradiction with the Kinoshita–Lee–Nauenberg theorem [31, 32] and the cancellation of mass singularities for inclusive observables. In fact Eqs. (7) and  (8) are valid strictly in the PL, i.e. when \(m_{\nu i} \ll m_{\ell ^{(')}}\). The limiting case of vanishing charged-lepton masses and non-zero neutrino masses violates the assumptions of our derivation and therefore is not a meaningful limit of our expressions.

In the course of this letter, we have treated the neutrinos’ masses as being Dirac in nature. This is appropriate towards the goal of evaluating the claim of [2]. The further evaluation of the case of Majorana masses requires additional particle content and implementation of a mechanism such as the well known seesaw mechanism [33,34,35,36]. As such, this evaluation depends on the New Physics model being examined and lies beyond the scope of this SM calculation.

Fig. 2
figure 2

Example of two diagrams mediating \(L \rightarrow \ell \ell ' \ell '\) in a low energy effective field theory description

Finally, we can understand the mechanism that converts the \( x_i \log x_i\) in the ZML into a \(x_i \log x_L\) in the PL by looking at the underlying effective theory arising after integrating out the Z and the W bosons. For simplicity, let us concentrate only on the operators associated with such logarithmic enhancement in the box 1d. Both in the ZML and in the PL, we can shrink the two W propagators to a point-like interaction and match the amplitude onto the the following dimension-six and dimension-eight operators:

$$\begin{aligned} O_6^{L\ell '}&=(\bar{\nu }_i \gamma ^\mu P_L L) \, (\bar{\ell }' \gamma _{\mu } P_ L \nu _j), \nonumber \\ O_6^{\ell \ell '}&=(\bar{\ell } \gamma ^\mu P_L \nu _i) \, (\bar{\nu }_j \gamma _{\mu } P_L \ell '), \nonumber \\ O_8&= m_{\nu i}^2 \, (\bar{\ell } \gamma ^\mu P_L L) \, (\bar{\ell '} \gamma _\mu P_L \ell '). \end{aligned}$$
(9)

The first two operators correspond to the usual Fermi interaction mediating \(\mu \) and \(\tau \) leptonic decays. They contribute to \(L \rightarrow \ell \ell '\ell '\) via the one-loop diagram in Fig. 2a. The third operator in (9) is necessary to renormalize the effective theory, i.e. to cancel the UV divergence from the diagram 2a. These operators’ Wilson coefficients are:

$$\begin{aligned}&C_6^{L\ell '} = \frac{4 G_F}{\sqrt{2}} U_{\ell ' j} U_{L i}^*, \qquad C_6^{\ell \ell '} = \frac{4 G_F}{\sqrt{2}} U_{\ell ' j}^* U_{\ell i},\nonumber \\&C_8 (\mu ) = \frac{G_F^2}{2 \pi ^2 } \log \left( \frac{M_W^2}{\mu ^2}\right) U_{\ell i} U_{L i}^*. \end{aligned}$$
(10)

We can imagine performing the matching between the SM and the effective theory at a scale \(\mu =M_W\), and evolving the coefficients to a lower scale via the renormalization group. The coefficient \(C_8\) explicitly depends on the renormalization scale \(\mu \) and this dependence reveals the difference between the PL and the ZML. In the ZML, the evolution of \(C_8\) can proceed down to a scale \(\mu \sim m_{\nu i}\), at which point we can integrate out the neutrinos and remove the operators \(O_6^{L\ell '}\) and \(O_6^{\ell \ell '}\) which contain the neutrino field. We are then left with an effective theory with only \(O_8\), whose Wilson coefficient is frozen at \(C_8(m_{\nu i})\), i.e. it contains a \(\log (M_W^2/m_{\nu i}^2)\) (compare with Eq. (5)). On the contrary, in the PL, \(C_8\) can run only until the scale \(\mu \sim m_L\) is reached. In this case, all operators in (9) are still active at the scale \(m_L\), however \(C_8\) produces only a milder \(\log (M^2_W/m_L^2)\) enhancement. Similar considerations can be applied as well to the Z penguin 1a. Therefore, the ZML overestimates the values for the branching ratios as it allows an unphysical evolution of these operators between \(M_W\) and \(m_{\nu i}\), while in reality the running stops at the physical intermediate scale \(m_L\) where the process happens.

4 Conclusions

Several experimental collaborations reported that the branching fractions of \(L\rightarrow \ell \ell '\ell '\) can be as large as \(10^{-14}\), following the observation in [2] that the GIM cancellation for these decays is not so severe and takes the form of \(|\sum _i U_{\ell i} U^*_{Li} \log x_i|^2\). In this letter we showed that this conclusion is wrong.

We calculated and presented for the first time the branching ratios in the PL by performing a series expansion of all one-loop diagrams in the small parameters \({\mathcal {P}}/M_W\) and \(m_{\nu i}/{\mathcal {P}}\). Our fully analytic expressions prove that the GIM suppression in these decays is power-like \(|\sum _i U_{\ell i} U^*_{Li} x_i |^2\), similar to that found previously in [1] for the ZML, so that the claim from [2] must be rejected. We predicted the branching ratios in the Standard Model including neutrino masses to be of the order of \(10^{-55}\), even smaller than those obtained in the ZML, as the latter prediction contains an unphysical logarithmic enhancement.

In the end, we remark that since the GIM suppression is solely governed by the underlying effective description of the process, i.e. the hierarchy of the internal mass scales and the convergence properties of Feynman integrals, external momentum effects could not have affected the GIM cancellation to such a large extent, as claimed in Ref. [2], compared to the finding in the ZML.