1 Introduction

Lepton flavor violating (LFV) processes are forbidden in the standard model (SM) [1,2,3] with massless neutrinos. However, the experimental evidence of neutrino oscillations [4,5,6] claims for an extended model with neutrino mass terms. For massive neutrinos, the mass matrix will be nondiagonal in the interaction (weak) basis, as occurs in the quark sector [7, 8], and the mixing of three light neutrinos could be described through the \(3\times 3\) unitary Pontecorvo–Maki–Nakagawa–Sakata (PMNS) matrix [9,10,11]. In such a scenario, charged LFV transitions could arise, for instance, from one loop diagrams involving a couple of \(W\ell \nu _\ell \) vertices with different flavor neutrinos each. However, it turns out natural having a strong suppression for this class of processes owing to a GIM-like mechanism [12], just as it has been reported for the \(\mu ^-\rightarrow e^- \gamma \) decay, with a prediction at an unobservably low rate: \(BR(\mu ^-\rightarrow e^-\gamma )\sim \mathscr {O}(10^{-55})\) [13,14,15], which is far away from the capacity of any current or foreseen experimental facility.

By way of contrast, the prediction for the \(\tau ^-\rightarrow \mu ^- \ell ^+ \ell ^-\) (\(\ell =\mu , e\)) decays given by Ref. [17] claims that the GIM cancellation for these processes is much milder and a value of \(BR(\tau ^- \rightarrow \mu ^- \ell ^+\ell ^-)\ge 10^{-14}\) is reported. An updated evaluation using the amplitude derived in Ref. [17], employing the latest global fit results for neutrino mixing [18,19,20,21] yields a branching fraction \(\sim 4\times 10^{-16}\) for the three muon channel. Both values are still far away from the PDG upper bounds, \(1.5\,\times \,10^{-8}\) (for \(\ell =e\)) and \(2.1\,\times \,10^{-8}\) (\(\ell =\mu \)) at \(90\%\) confidence level.Footnote 1 Similarly, we verified that using the values reported in Refs. [18,19,20,21] for the neutrino mixing parameters, Pham’s result [17] would predict a \(\mu ^-\rightarrow e^-e^+e^-\) branching ratio of \(\sim 2\,\times \,10^{-21}\), larger than Petcov’s prediction [15] (\(\sim 10^{-53}\) evaluated with updated neutrino masses and mixing parameters, see Table 1) by at least some thirty orders of magnitude. Again, the current upper limit on this decay channel (\(1\,\times \,10^{-12}\) at \(90\%\) C.L. [18]) is still far from testing Pham’s result [17]. This author claims that this unexpectedly large estimation is due to the presence of a logarithmic divergent term depending on the neutrino mass, which comes from a one-loop diagram that involves two neutrino propagators (diag. (d) in our Fig. 1).

Certainly, considering the effects or processes that arise from quantum corrections could involve divergent loop integrals. However, in any renormalizable theory, the possible divergences must vanish order by order (in the loop or effective field theory expansion) to be able to define (finite) observables. In fact, in a Quantum Field Theory the divergences can be classified into two types: ultraviolet (UV) and infrared divergences (IR). The former (UV) appear in the high-energy regime and they can be healed redefining the theory parameters, whereas the latter (IR) occur in the low-energy regime and can be classified in soft and collinear divergences, which cancel however in properly defined (IR-safe) observables [25, 26]. We show that the seeming logarithmic divergent behavior of the LFV amplitude reported in Ref. [17] is not present, as the vanishing momentum transfer approximation considered in that paper lies outside the physical region. Consequently, the rates of \(L^{-}\rightarrow \ell ^- \ell ^{\prime -} \ell ^{\prime +}\) decays in the SM extended with massive neutrinos are extremely suppressed, in agreement with Ref. [15]. It is worth noting that, as neutrino oscillations, the LFV amplitudes must vanish in the limit of degenerate neutrinos. Moreover, according to the KLN theorem [25, 26], the amplitude for massless neutrinos can go to zero, but it is impossible that it presents an IR divergence. This requirement is satisfied by the result of Ref. [15], but it is not the case in Ref. [17] which behaves as \(\sum _j U_{Lj}U_{\ell j}^*\log (m_\nu /m_W)\) for very small neutrino masses. Our result, as it will be shown below, satisfies the expected agreement with the previous consistency remarks.

In Sects. 2 and 3 we discuss in detail our computation of these processes and compare it to those in Refs. [15] and [17], showing explicitly why the approximation in the latter is unreliable, and reproducing the results of [15] when masses and momenta of the external particles are neglected at the amplitude level. In Sect. 4 we analyze the (quite good) numerical accuracy of this latter approximation. Finally, we state our conclusions in Sect. 5. Several appendices complete technical details of our calculation.

Fig. 1
figure 1

Feynman diagrams for the \(L^{-}\rightarrow \ell ^- \ell ^{\prime -} \ell ^{\prime +}\) decays in the presence of lepton mixing (i.e., non-vanishing neutrino masses). In ‘renormalizable’ \(R_\xi \) gauges, similar diagrams need to be added, which are obtained replacing the W gauge bosons by the respective would-be Goldstone bosons. Notice that diagram d only involves the Z gauge boson, whereas the a, b and c diagrams can also be mediated by the photon. Additionally, when \(\ell =\ell ^{\prime }\) similar contributions (exchanging \(p \leftrightarrow p_1\)) to the amplitudes of diagrams ae must be subtracted in order to antisymmetrize the amplitude. On the other hand, when \(\ell \ne \ell ^\prime \), since the vertices of the neutral bosons \(\gamma \) and Z with a pair of fermions are flavor-conserving, only a similar e box diagram must be added interchanging \(\ell (p)\leftrightarrow \ell ^{\prime }(p_1)\)

2 Z-Penguin contribution emission from internal neutrino line

The \(L^{-}\rightarrow \ell ^- \ell ^{\prime -} \ell ^{\prime +}\) decays can be induced through the diagrams depicted in Fig. 1. Since the main purpose of this work is to falsify the existence of the logarithmic divergent term claimed in Ref. [17], we first concentrate on the amplitude of the diagram (d). We have, however, verified the corresponding expressions for the loop integrals in Ref. [15] for the particular process \(\mu ^-\rightarrow e^- e^-e^+\), when masses and momenta of external leptons are neglected in the computations. Particularly, in Ref. [15] it is shown that the corresponding branching ratio is completely dominated by those diagrams with two neutrino propagators, i.e. (d) and (e) in Fig. 1, which contribute comparably.

In our analysis, we keep employing the convention used by Ref. [17], in order to denote the masses and momenta (see Fig. 1) of the external leptons, that is M and P (m and p) stand for the mass and momentum of the \(L^-\) (\(\ell ^-\)) lepton, respectively. In this way, the amplitude of the diagram (d) can be written as

$$\begin{aligned} \mathscr {M}_{d}\sim & {} \frac{i}{m_Z^2}l_{L\ell }^{\lambda }\times l_{\ell '\ell '\lambda }, \end{aligned}$$
(1)

where \(l_{\ell '\ell '\lambda }=-ig/(2c_W)\bar{u}_{p_{1}}\gamma _{\lambda }(g_v^{\ell '}-g_a^{\ell '} \gamma _5) v_{p_2}\)Footnote 2 is independent of the loop integration, whereas the effective \(ZL\ell \) transition is given as follows:

$$\begin{aligned} l_{L\ell }^{\lambda }=\left( \frac{-ig}{4c_W} \right) \left( \frac{-ig}{2\sqrt{2}} \right) ^2\sum _{j=1}^3 U^*_{\ell j}U_{Lj}\bar{u}_p\varGamma ^\lambda _{j} u_P, \end{aligned}$$
(2)

where \(U_{im}\) are entries of the PMNS mixing matrix. In the Feynman-’t Hooft gauge, we have

$$\begin{aligned} \varGamma ^\lambda _{j}= & {} \int \frac{d^4 k}{(2\pi )^4}\frac{N^{\lambda }}{\left[ (p+k)^2-m_j^2 \right] \left[ (P+k)^2-m_j^2 \right] \left[ k^2-m_W^2 \right] }, \end{aligned}$$
(3)

with

(4)

After making the loop integration using dimensional regularization in order to deal with the (logarithmic) UV divergences, the Lorentz structure for the \(\varGamma ^\lambda _{j}\) factor can be written as follows,

$$\begin{aligned} \varGamma ^\lambda _{j}= & {} F_a\gamma ^{\lambda }(1-\gamma _5)+F_b \gamma ^{\lambda }(1+\gamma _5)\nonumber \\&+F_c(P+p)^{\lambda }(1+\gamma _5)+F_d(P+p)^{\lambda }(1-\gamma _5)\nonumber \\&+F_e q^{\lambda }(1+\gamma _5)+ F_f q^{\lambda }(1-\gamma _5), \end{aligned}$$
(5)

where in general \(F_k=F_k(q^2,m_j^2)\) \((k=a,\,b...,\,f)\) are functions given in terms of the momentum transfer \(q^2=(P-p)^2\), and the neutrino mass squared (of course \(F_k\) functions will also depend on the mass of the W gauge boson and external masses, but these have well-defined values).

At this point, it is worth to note that in the approximation where the momenta of the external particles are neglected in Eq. (3), such as it is done in Ref. [15] for the \(\mu \rightarrow 3e\) decay, the computation is simplified considerably, as the only possible contribution is given by the \(F^0_a\) function, where we are using a superscript 0 in order to distinguish this approximation. In this simple case, the \(F^0_{a}\) function will not depend on \(q^2\) and is given in terms of the Feynman parameters as follows

$$\begin{aligned} F^{0}_{a}\left( m_j^2\right) =\frac{1}{2\pi ^2}\int _0^1dx\int _0^{1-x}\left[ 2 + \log \left( D^0_j/\mu ^2\right) \right] dy, \end{aligned}$$
(6)

where \(D^0_j(m_j^2)=(1-x) m_j^2+x m_W^2\). Whereas in terms of PaVe functions it is given by

$$\begin{aligned} F^0_{a}\left( m_j^2\right)= & {} -\frac{1}{8\pi ^2 \left( m_j^2-m_W^2\right) {}^2}\left[ 2 m_j^2 \left( m_j^2-2 m_W^2\right) \right. \nonumber \\&\times B_0\left( 0,m_j^2,m_j^2\right) + 2 m_W^4B_0\left( 0,m_W^2,m_W^2\right) \nonumber \\&+\left. 4 m_j^2 m_W^2-3 m_j^4-m_W^4 \right] . \end{aligned}$$
(7)

Now, one analytical expression for the \(F_a^0\) function can be obtained in a straightforward way either integrating over the Feynman parameters in Eq. (6) or using the definition of the \(B_0(0,m^2,m^2)\) scalar function in Eq. (7). In such a way that after making an expansion around \(m_j^2=0\) we obtained

$$\begin{aligned} F^0_a= & {} \frac{1}{2\pi ^2}\left[ \frac{m_j^2}{m_W^2}\log \left( \frac{m_W^2}{m_j^2} \right) -\frac{m_j^2}{2m_W^2}+\frac{1}{2}\log \left( \frac{m_W^2}{\mu ^2} \right) \right. \nonumber \\&+\left. \frac{1}{4}+\vartheta \left( \frac{m_j^2}{m_W^2}\right) ^2 \right] . \end{aligned}$$
(8)

From Eq. (8) it becomes clear that, in this approximation, the amplitude will be proportional to the neutrino mass squared, where the dominant contribution, due to the big gap between the neutrino and W boson mass scales, comes from the first term as it involves a relative factor \(\log \left( \frac{m_W^2}{m_j^2} \right) \) compared to the second one,Footnote 3 whereas the terms independent of neutrino masses will vanish by the GIM-like mechanism.

Therefore, the structure of the matrix element for the contribution of the diagram (d) in Fig. 1 in the approximation where masses and momenta of the external particles are neglected is given by

$$\begin{aligned} \mathscr {M}^0_{d}= & {} -i\frac{G_F^2 m_W^2\beta _{F^0_a}}{4}\,\bar{u}_p\gamma _{\lambda }(1- \gamma _5) u_P \times \bar{u}_{p_{1}}\gamma ^{\lambda }(1- \gamma _5) v_{p_2}\nonumber \\&+i G_F^2 m_W^2 s_W^2\beta _{F^0_a}\,\bar{u}_p\gamma _{\lambda }(1- \gamma _5) u(P) \times \bar{u}_{p_{1}}\gamma ^{\lambda } v_{p_2}\,, \end{aligned}$$
(9)

where we have defined

$$\begin{aligned} \beta _{F^0_a}=\sum _j U_{Lj}U^*_{\ell j}F^0_a\left( m_j^2\right) . \end{aligned}$$
(10)

We verified that Eq. (9) reproduces the result reported in Ref. [15] considering only the first term in Eq. (8) and the simple case of two families.

Returning to the general case (non-zero masses and momenta of the external particles), we also have obtained the \(F_k\) functions using both Feynman parametrization (we will denote the corresponding expressions by \(F_{F_k}\)) and the Passarino–Veltman (PaVe) technique (denoted by \(F_{PV_k}\)) [27, 28], employing FeynCalc [29]. In particular, we agree with the expressions previously reported in Ref. [17] in terms of the Feynman parameters,Footnote 4 namely the \(F_{F_k}\) functions can be written as

$$\begin{aligned} F_{F_k}\left( q^2, m_j^2\right) =\frac{1}{2\pi ^2}\int _0^1 dx\int _0^{1-x}f_k\left( q^2, m_j^2\right) dy, \end{aligned}$$
(11)

where

$$\begin{aligned} f_{a}= & {} 2 + \log \left( D_j(q^2)/\mu ^2\right) \nonumber \\&+\frac{(q^2-m^2)x(y-1)+M^2x(x+y)+q^2y(y-1)}{D_j}, \nonumber \\ \end{aligned}$$
(12)
$$\begin{aligned} f_b= & {} \frac{m M x}{D_j}, \end{aligned}$$
(13)
$$\begin{aligned} f_c= & {} -\frac{M x (x+y)}{D_j}, \end{aligned}$$
(14)
$$\begin{aligned} f_d= & {} -\frac{ m x (1-y)}{D_j}, \end{aligned}$$
(15)
$$\begin{aligned} f_e= & {} \frac{M x(2-3y-x)-2 M y (y-1)}{D_j}, \end{aligned}$$
(16)
$$\begin{aligned} f_f= & {} \frac{x m (y-1)+2 m y (y-1)}{D_j}, \end{aligned}$$
(17)

and \(D_j\) is defined as

$$\begin{aligned} D_j\left( q^2, m_j^2\right)= & {} -(x-1) m_j^2-m^2xy+x m_W^2\nonumber \\&+M^2 x(x+y-1)-q^2 y(1-x-y). \nonumber \\ \end{aligned}$$
(18)

We have omitted in \(f_a\) the term associated with the UV divergence since it is independent of \(m_j\) and vanishes owing to the GIM-like mechanism.

On the other hand, the \(F_{k}\) functions in terms of the PaVe scalar functions are given as follows

$$\begin{aligned} F_{PV_k}\left( q^2, m_j^2\right) =\frac{1}{2\pi ^2}\frac{N_{F_k}}{D_{F_k}}, \end{aligned}$$
(19)

with

$$\begin{aligned} D_{F_a}= & {} 2 D_{F_b}=-2\lambda \left( m^2,M^2,q^2\right) , \end{aligned}$$
(20)
$$\begin{aligned} D_{F_c}= & {} D_{F_e}= \frac{M}{2}D_{F_a}^2 \quad D_{F_d}\,=\,D_{F_f}\,=\,\frac{m}{2}D_{F_a}^2\, , \end{aligned}$$
(21)
$$\begin{aligned} N_{F_k}= & {} \xi _{k_1}B_0\left( m^2,m_j^2,m_W^2\right) +\xi _{k_2}B_0\left( M^2,m_j^2,m_W^2\right) \nonumber \\&+\xi _{k_3}B_0\left( q^2,m_j^2,m_j^2\right) +\xi _{k_4}B_0\left( 0,m_j^2,m_W^2\right) \nonumber \\&+\xi _{k_5}C_0\left( m^2,M^2,q^2,m_j^2,m_W^2,m_j^2\right) +\xi _{k_0}, \end{aligned}$$
(22)

where \(\lambda \) is the Kallen function \(\lambda (x,y,z)=x^2+y^2+z^2-2(xy+xz+yz)\), and the \(\xi _{k}\) factors can be found in the “Appendix A”.Footnote 5

Unlike the approximation made in Ref. [15], the presence of masses and momenta of the external particles in the computation hinders the way for the derivation of analytical expressions for the integrals in Eqs. (11) or (19).Footnote 6 Nevertheless, we have done a numerical cross-check between both expressions, where we have employed the Looptools package [30, 31] for the evaluation of the PaVe functions and a numerical Mathematica [32] routine for the evaluation of the parametric integrals (see Fig. 2). We have found an excellent agreement between these two expressions for values of \(q^2<4m_j^2\), which are, however, outside of the physical domain for the considered decays, since \(q^2_\mathrm{{min}}=4m_{\ell '}^2\gg m_j^2\). In this way, owing to the simplified integrals, we verified that a better precision is found in terms of PaVe functions than using Feynman parameters. This feature is illustrated, as an example, for the particular case of the \(Z\tau \mu \) transition in Fig. 2 for the (dominant, as we will show) \(F_a\) factor.

Fig. 2
figure 2

Numerical evaluation of the \(F_a\) function for the effective \(Z\tau \mu \) vertex as a function of the neutrino mass, taking the minimal value of \(q^2 = 4m_\mu ^2\) for the particular \(\tau ^- \rightarrow \mu ^-\mu ^-\mu ^+\) channel. Black dashed line stands for the numerical evaluation of the \(F_a\) function in terms of the Feynman parameters depicted by \(F_{F_a}\) (Eq. (11)), whereas the red line corresponds to the evaluation in terms of the PaVe functions represented by \(F_{PV_a}\) (Eq. (19)). We have found some numerical instabilities for the evaluation of the \(F_{F_a}\) function in the region 0.01 GeV \(<m_j< 0.1\) GeV. On the other hand, better precision is achieved in the evaluation of the \(F_{PV_a}\) function with the help of the Looptools package. In order to perform a comparison with the approximation done in Ref. [15], we also show the complete \(F_a^0\) given by the Eqs. (6) or (7) (purple dotdashed line)

At this point, we want to stress that we disagree with the approximation done in Ref. [17] in order to estimate the relevant dependence on the neutrino mass of the \(F_k\) functions. We highlight that we are studying a process where the momentum transfer \(q^2\) must be non-vanishing and in principle is much larger than the neutrino squared mass, \(m_j^2\), which comes from the loop computation. Therefore, using an expansion around \(q^2=0\) in order to simplify the integration over the Feynman parameters keeping the terms proportional to \(m_j^2\) in the denominators of Eqs. (1217), as it is done in Ref. [17], modifies substantially the behavior of the original functions in the interesting physical region for the neutrino masses and, as a consequence, it gives rise to an incorrect infrared logarithmically divergent behavior of the \(F_k\) functions when \(m_j\) goes to zero, without any possible cure. In particular, the dependence on the momentum transfer, \(q^2\), plays a crucial role in the behavior of the \(F_k\) functions. In this respect, we point out the presence of a small imaginary part in the \(F_a\) function, which emerges for the physical values \(4m_j^2<q^2\).

As we mentioned before, the \(q^2\) minimum in the \(L^{-}\rightarrow \ell ^- \ell ^{\prime -} \ell ^{\prime +}\) decay is given by \(4m_{\ell '}^2\), which is much larger than the neutrinos masses. This, together with the difficulties in obtaining analytical expressions directly for the \(F_k\) functions suggests employing some numerical approximation to deal with the problem. Because of this, we approximate the \(F_k\) functions in the physical region for the neutrinos masses by fitting the curves for the real and imaginary parts of the \(F_k\) functions evaluated in terms of the PaVe functionFootnote 7 We have found a reasonably good fit of the form

$$\begin{aligned} F_k= & {} \frac{1}{2\pi ^2 u}\left( Q_{k}+\frac{m_j^2}{ m_W^2}R_{k}\right) , \end{aligned}$$
(23)

where \(u=1\) for \(k=a,b\) and \(u=M\) for \(k=c,d,e,f\) and the respective values for the \(Q_k=Q_{R_k}+i Q_{R_I}\) and \(R_k=R_{R_k}+i R_{R_I} \) factors of all considered channels are given in “Appendix D”.

From Eq. (23), it turns clear that the \(Q_k\) factors will not contribute owing to the GIM-like mechanism, whereas the relevant contributions are given by the \(R_k\) factors. Then, according to our numerical results, we find that the relevant factors of the \(F_b\), \(F_c\) and \(F_d\) functions are suppressed with respect to the \(F_a\) factor. On the other hand, despite the respective factors of \(F_e\) and \(F_f\) functions are larger than those of the \(F_a\) function, when the momentum transfer becomes smaller and smaller their helicity suppression makes them negligible. Therefore, we will concentrate on the contribution of the \(F_a\) function.

Furthermore, in order to check our results, we have made an expansion for the PaVe functions involved in Eq. (22), following the same strategy that Cheng and Li for the \(\mu \rightarrow e\gamma \) decay [13], that is: expanding the loop integrals around \(m_j^2=0\) (more details on our expansions are given in “Appendix E”). It must be noted that, since neutrino masses are the smallest energy scale in the problem, this is the expansion that is most efficient for the considered decays. Using the Package-X program [35], we could rewrite the \(F_{PV_a}\) contribution as follows:

$$\begin{aligned} F_{PV_a}\left( q^2,m_j^2\right) =\frac{1}{2\pi ^2}\left[ Q_a+\frac{m_j^2}{m_W^2}R_a +\vartheta \left( \frac{m_j^4}{m_W^4}\right) \right] , \end{aligned}$$
(24)

where

$$\begin{aligned} Q_a= & {} -\lambda \left( m^2,M^2,q^2\right) ^{-1} \left[ \phantom {+\left. f_{Q_{a_4}}\log \left( \frac{m_W^2}{q^2} \right) +f_{Q_{a_5}}\right] } f_{Q_{a_1}}C_0\left( m^2, M^2, q^2, 0, m_W^2,0\right) \right. \nonumber \\&+\left. f_{Q_{a_2}}\log \left( \frac{m_W^2}{m_W^2-m^2} \right) + f_{Q_{a_3}}\log \left( \frac{m_W^2}{m_W^2-M^2} \right) \right. \nonumber \\&+\left. f_{Q_{a_4}}\log \left( \frac{m_W^2}{q^2} \right) +f_{Q_{a_5}}\right] -\frac{1}{2} \varDelta , \end{aligned}$$
(25)
$$\begin{aligned} R_a= & {} -m_W^2\lambda \left( m^2,M^2,q^2\right) ^{-1} \left[ \phantom {+ \left. f_{R_{a_4}}\log \left( \frac{m_W^2}{q^2} \right) +f_{R_{a_5}}\right] } f_{R_{a_1}}C_0\left( m^2, M^2, q^2, 0,m_W^2,0\right) \right. \nonumber \\&+\left. f_{R_{a_2}}\log \left( \frac{m_W^2}{m_W^2-m^2} \right) + f_{R_{a_3}}\log \left( \frac{m_W^2}{m_W^2-M^2} \right) \right. \nonumber \\&+ \left. f_{R_{a_4}}\log \left( \frac{m_W^2}{q^2} \right) +f_{R_{a_5}}\right] , \end{aligned}$$
(26)

in which \(\varDelta =\frac{1}{\textstyle {\epsilon }}-\gamma _E+\log (4\pi )\), and the \(f_Q\) and \(f_R\) factors can be found in the “Appendix E”. We verified that our numerical fits for the \(Z\tau \mu \) and \(Z\tau e\) vertex are in a very good agreement with Eq. (26), whereas a deviation is found for the \(Z\mu e\) vertex, as can be seen in Table 9. We consider the results obtained from Eq. (26) for the effective vertices as our reference ones.

In this way, we can approximate the amplitude for the diagram (d) according to Eq. (9) replacing \(F_a^0\) by

$$\begin{aligned} F_a\approx \frac{1}{2\pi ^2}\frac{m_j^2}{m_W^2}R_a. \end{aligned}$$
(27)

The penguin contributions to the branching ratios of the \(L^{-}\rightarrow \ell ^- \ell ^{\prime -} \ell ^{\prime +}\) decays are smaller by around one (two) orders of magnitude for the \(\mu \) (\(\tau \)) decays when one keeps external momenta and masses than when they are neglected. We will see in Sect. 4 a similar difference in the full result.

3 Contributions of the box diagrams

Now, in order to make a complete comparison with the approximation done in Ref. [15] we have also obtained the amplitude for the box diagram (e) in Fig. 1. Note that unlike the penguin diagram (d), which involves two neutrino propagators of the same mass state, the box diagram (e) can involve two neutrino propagators with different mass states. Thus, in full generality, the amplitude can be written as follows:

$$\begin{aligned} \mathscr {M}_{e}=\left( \frac{-ig}{2\sqrt{2}}\right) ^4\sum _{i,j} U_{Lj}U^*_{lj} U_{\ell ^\prime i}U^*_{\ell ^\prime i}T_{\sigma \sigma '}I^{\sigma \sigma '}, \end{aligned}$$
(28)

where we defined

$$\begin{aligned} T_{\sigma \sigma '}=4\,\bar{u}_p\gamma _\mu \gamma _\sigma \gamma _\nu (1-\gamma _5)u_P\times \bar{u}_{p_1}\gamma ^\nu \gamma _{\sigma '}\gamma ^\mu (1-\gamma _5)v_{p_2}, \end{aligned}$$
(29)

and the relevant loop integral is given by (see Fig. 1e)

$$\begin{aligned} I^{\sigma \sigma '}=\int \frac{d^4k}{(2\pi )^4}\frac{(P+k)^\sigma (k +p_1)^{\sigma '}}{D_B}, \end{aligned}$$
(30)

with

$$\begin{aligned} D_B= & {} \left( k^2-m_W^2\right) \left[ (p_1+p_2+k)^2-m_W^2\right] \left[ (P+k)^2-m_j^2\right] \nonumber \\&\times \left[ (k+p_1)^2-m_i^2\right] . \end{aligned}$$
(31)

Since we have written the Eq. (30) in terms of the momenta P, \(p_1\) and \(p_2\), the integral must take the form

$$\begin{aligned} I^{\sigma \sigma '}= & {} i\left( g^{\sigma \sigma '}H_a+P^{\sigma }P^{\sigma '}H_b+P^{\sigma }p_1^{\sigma '}H_c +P^{\sigma }p_2^{\sigma '}H_d\right. \nonumber \\&+\left. p_1^{\sigma }P^{\sigma '}H_e+p_1^{\sigma }p_1^{\sigma '}H_f +p_1^{\sigma }p_2^{\sigma '}H_g+p_2^{\sigma }P^{\sigma '}H_h\right. \nonumber \\&+ \left. p_2^{\sigma }p_1^{\sigma '}H_i+p_2^{\sigma }p_2^{\sigma '}H_j \right) . \end{aligned}$$
(32)

The \(H_k\) factors depend upon the kinematic variables \(s_{12}=(p_1+p_2)^2=q^2\) and \(s_{13}=(p_1+p)^2\), in addition of \(m_i\) and \(m_j\).

Anew, in the approximation where momenta of the external particles are neglected in Eq. (30), the only contribution is given by the \(H^0_a\) function, which will not depend either on \(s_{12}\) or \(s_{13}\). In such a case, we obtained the following simplified expression

$$\begin{aligned} H_{a}^0(m_j^2, m_i^2)=\frac{1}{16\pi ^2}\int _0^1 dx\int _0^{1-x}dy\int _0^{1-x-y}\frac{-1}{2M_{F_0}^2}dz, \end{aligned}$$
(33)

where

$$\begin{aligned} M_{F_{0}}^2= & {} m_W^2(x+y)-m_j^2(x+y-1)+(m_i^2-m_j^2)z. \end{aligned}$$
(34)

Whereas, in terms of PaVe functions, \(H^0_a\) reads

$$\begin{aligned}&H^0_{a}\left( m_j^2, m_i^2\right) \nonumber \\&\quad =\frac{1}{16\pi ^2}\left( \frac{m_j^4}{4 \left( m_j^2-m_i^2\right) \left( m_j^2-m_W^2\right) {}^2}B_0\left( 0,m_j^2,m_j^2\right) \right. \nonumber \\&\qquad +\left. \frac{m_i^4}{4 \left( m_i^2-m_j^2\right) \left( m_i^2 -m_W^2\right) {}^2}B_0\left( 0,m_i^2,m_i^2\right) \right. \nonumber \\&\qquad +\left. \frac{2 m_i^2 m_j^2 m_W^2-m_W^4 \left( m_i^2+m_j^2\right) }{4 \left( m_i^2-m_W^2\right) {}^2 \left( m_j^2-m_W^2\right) ^2}B_0\left( 0,m_W^2,m_W^2\right) \right. \nonumber \\&\qquad +\left. \frac{m_W^2}{4 \left( m_i^2-m_W^2\right) \left( m_W^2-m_j^2\right) }\right) . \end{aligned}$$
(35)

In the same way as for the \(F_a^0\) form factor, an analytical expression for \(H_a^0\) can be obtained easily from either Eqs. (33) or  (35). This time, making a double Taylor expansion, first around \(m_i^2=0\) and then around \(m_j^2=0\), we obtained that

$$\begin{aligned} H^0_a\left( m_j^2, m_i^2\right)= & {} \frac{1}{64\pi ^2 m_W^4}\left[ \left( m_i^2+m_j^2 \right) \left( \log \left( \frac{m_W^2}{m_j^2}\right) -1\right) \right. \nonumber \\&+\left. \frac{m_i^2m_j^2}{m_W^2}\left( 2\log \left( \frac{m_W^2}{m_j^2}\right) -1\right) \right. \nonumber \\&+\left. -m_W^2+\vartheta \left( \frac{m_i^4}{m_W^2}\right) \vartheta \left( \frac{m_j^4}{m_W^2}\right) \right] . \end{aligned}$$
(36)

Using that \(T_{\sigma \sigma '}g^{\sigma \sigma '}=16\bar{u}_p\gamma _{\lambda }(1- \gamma _5) u_P \times \bar{u}_{p_{1}}\gamma ^{\lambda }(1- \gamma _5) v_{p_2}\), the amplitude -in this approximation- is given by

$$\begin{aligned} \mathscr {M}_{e}^0=i8G_F^2 m_W^4\beta _{H^0_a}\,\bar{u}_p\gamma _{\lambda }(1- \gamma _5) u_P \times \bar{u}_{p_{1}}\gamma ^{\lambda }(1- \gamma _5) v_{p_2}, \end{aligned}$$
(37)

with

$$\begin{aligned} \beta _{H^0_a}=\sum _{j,i} U_{Lj}U^*_{\ell j} U_{\ell ' i}U^*_{\ell ' i} H^0_a\left( m_i^2, m_j^2\right) . \end{aligned}$$
(38)

Again, we verified that taking into account the first term in Eq. (36) and considering only two families, Eq. (37) reproduces the expression reported in Ref. [15] for the amplitude of the box diagram 1 (e) in the \(\mu \rightarrow 3e\) decay. Furthermore, our results are consistent with the previous expressions reported in Ref. [23] for the box contribution associated with the effective \(K^+\rightarrow \pi ^+\nu _\ell \bar{\nu }_\ell \) decay in the quark sector, where the approximation of taking masses and momenta of the external particles equal to zero is excellent, owing to the presence of the heavy top quark inside the loop.

In the general case, we also obtained the \(H_k\) (\(k=a,b,...,j\)) functions in terms of both Feynman parameters integrals, \(H_{F_k}\), and PaVe functions, \(H_{PV_k}\). This time, the \(H_k\) functions will depend on the squared masses of two different neutrinos, \(m_j^2\) and \(m_i^2\), and on two independent phase space variables \(s_{12}\) and \(s_{13}\). Using Feynman parametrization these functions read

$$\begin{aligned}&H_{F_k}(s_{12},s_{13},m_i^2, m_j^2)\nonumber \\&\quad =\frac{1}{16\pi ^2}\int _0^1 dx\int _0^{1-x}dy\int _0^{1-x-y}h_kdz\,, \end{aligned}$$
(39)

where

$$\begin{aligned} h_a= & {} -\frac{1}{2M_F^2}, \end{aligned}$$
(40)
$$\begin{aligned} h_b= & {} \frac{z(z-1)}{M_F^4}, \end{aligned}$$
(41)
$$\begin{aligned} h_c= & {} -\frac{(z-1)(x+z)}{M_F^4}, \end{aligned}$$
(42)
$$\begin{aligned} h_d= & {} \frac{y(z-1)}{M_F^4}, \end{aligned}$$
(43)
$$\begin{aligned} h_e= & {} -\frac{z(x+z-1)}{M_F^4}, \end{aligned}$$
(44)
$$\begin{aligned} h_f= & {} \frac{(x+z-1)(x+z)}{M_F^4}, \end{aligned}$$
(45)
$$\begin{aligned} h_g= & {} -\frac{y(x+z-1)}{M_F^4}, \end{aligned}$$
(46)
$$\begin{aligned} h_h= & {} \frac{yz}{M_F^4}, \end{aligned}$$
(47)
$$\begin{aligned} h_i= & {} -\frac{y(x+z)}{M_F^4}, \end{aligned}$$
(48)
$$\begin{aligned} h_j= & {} \frac{y^2}{M_F^4}. \end{aligned}$$
(49)

In the previous expressions, the denominator function is given by

$$\begin{aligned} M_F^2= & {} -m_j^2 (x+y-1)+m_{\ell '}^2 (x+y-1) (x+y)\nonumber \\&+ m_W^2(x+y)\nonumber \\&-s_{12} x y+z^2 \left( 2 m_{\ell '}^2+m^2+M^2-s_{12}-s_{13}\right) \nonumber \\&+z \left[ m_i^2-m_j^2+(x+y) \left( 3 m_{\ell '}^2-s_{12}-s_{13}\right) -2 m_{\ell '}^2\right. \nonumber \\&+\left. m^2 (x-1)+M^2 (y-1)+s_{12}+s_{13}\right] . \end{aligned}$$
(50)

Expressions are rather lengthy in terms of the PaVe functions so that here we only present the expression for the dominant \(H_a\) function, which can be written as

$$\begin{aligned} H_{PV_a}(s_{12}, s_{13}, m_j^2, m_i^2)=\frac{1}{16\pi ^2}\frac{N_{H_a}}{D_{H_a}}, \end{aligned}$$
(51)

with

$$\begin{aligned} D_{H_a}= & {} 4 \big (m^4 m_{\ell '}^2-m^2 \big [M^2 \big (2 m_{\ell '}^2-s_{12}\big )+s_{12} \big (m_{\ell '}^2+s_{13}\big )\big ]\nonumber \\&+M^4 m_{\ell '}^2-M^2 s_{12} \big (m_{\ell '}^2+s_{13}\big )\nonumber \\&+s_{12} \big (-2 s_{13} m_{\ell '}^2+m_{\ell '}^4+s_{13} \big (s_{12}+s_{13}\big )\big )\big ), \end{aligned}$$
(52)

and

$$\begin{aligned} N_{H_a}= & {} \chi _{k_1}C_0\bigg (m^2,M^2,s_{12},m_W^2,m_i^2,m_W^2\bigg )\nonumber \\&+\chi _{k_2}C_0\bigg (m_{\ell '}^2,m_{\ell '}^2,s_{12},m_W^2,m_j^2,m_W^2\bigg )\nonumber \\&+\chi _{k_3}C_0\bigg (M^2,m_{\ell '}^2, m^2 + M^2 \nonumber \\&+ 2m_{\ell '}^2 - s_{12}- s_{13}, m_i^2,m_W^2, m_j^2\bigg )\nonumber \\&+\chi _{k_4}C_0\bigg (m^2, m_{\ell '}^2, m^2 + M^2 \nonumber \\&+ 2m_{\ell '}^2 - s_{12} - s_{13}, m_i^2, m_W^2, m_j^2\bigg )\nonumber \\&+\chi _{k_5}D_0\bigg (m^2, M^2, m_{\ell '}^2, m_{\ell '}^2, s_{12}, m^2 \nonumber \\&+ M^2+ 2m_{\ell '}^2 - s_{12}\nonumber \\&- s_{13}, m_W^2, m_i^2, m_W^2, m_j^2\bigg ). \end{aligned}$$
(53)

where \(\chi _{k}\) factors are reported in “Appendix A”.

Fig. 3
figure 3

Numerical evaluation of the \(H_a\) function versus the neutrino mass. We are considering that \(\varDelta m_{ij}^2=10^{-3}\) eV\(^2\) and the values of \(s_{12}=10^{8}\) eV\(^2\) and \(s_{13}=10^{9}\) eV\(^2\) associated with a representative point in the physical phase space for the particular \(\tau ^-\rightarrow \mu ^-\mu ^-\mu ^+\) channel. In analogous way to the Fig. 2, black dashed line stands for the numerical evaluation of the \(H_a\) function in terms of the Feynman parameters depicted by \(H_{F_a}\) (Eq. (39)), whereas the red line corresponds to the evaluation in terms of the PaVe functions represented by \(H_{PV_a}\) (Eq. (51)). Numerical instabilities for the evaluation of the \(H_{F_a}\) function around 0.001 GeV \(<m_j<1\) GeV are found. A better precision is achieved for the evaluation of the \(F_{PV_a}\) function with the help of the Looptools package. In order to perform a comparison with the approximation done in Ref. [15], we also show the complete \(H_a^0\) given by the Eqs. (33) or (35) (purple dotdashed line)

As far as the general case is concerned, we can see that although there are additional contributions associated with the \(H_k\) functions, with \(k=b\), c, d, \(\ldots \) j; they are expected to be suppressed, as they correspond to higher-dimensional operators, with respect to the \(H_a\) function associated with a \((V-A)\times (V-A)\) operator. Therefore, we will concentrate on the \(H_a\) function in order to estimate the box diagram contribution. We also have done a numerical cross-check between the expressions for the \(H_a\) function given in terms of the Feynman parameters Eq. (39) and the PaVe functions Eq. (51), as can be seen in Fig. 3. In this case, it turns very complicated and far away of the purpose of this work to obtain an analytical expression for the \(H_a\) function in Eq. (53) making an expansion for the respective scalar PaVe functions, owing to the number of propagators involved and the dependence on two different neutrino masses. However, we can expect a good approximation through our numerical results, as it happens with the penguin contribution.

Thus, we estimate the relevant dependence on the neutrino mass for the \(H_a\) function fitting the curve for the real and imaginary parts of the \(H_a\) function evaluated in terms of the PaVe functions considering fixed values for the \(m_i\), \(s_{12}\), and \(s_{13}\) parameters.Footnote 8 We obtained a good fit of the form

$$\begin{aligned} H_a= & {} \frac{1}{16\pi ^2}\left( Q_{H_a}+\frac{m_j^2}{ m_W^4}R_{H_a}\right) , \end{aligned}$$
(54)

where \(R_{H_a}\approx 1.5+i0.007 \), for all different \(\tau \rightarrow \ell ^-\ell ^{\prime -}\ell ^{\prime +}\) channels, whereas \(R_{H_a} \approx 1.5\), for the \(\mu ^-\rightarrow e^-e^-e^+\) channel. These numbers were obtained considering that \(\varDelta m^2_{ij}=10^{-3}\) eV\(^2\), and representative values for \(s_{12}\) and \(s_{13}\) within the corresponding phase space.

4 Numerical results

In order to evaluate the branching fractions for the \(L^{-}\rightarrow \ell ^- \ell ^{\prime -} \ell ^{\prime +}\) decays we consider the results of the previous sections and the state of the art best-fit values of the three neutrino oscillation parameters [18,19,20,21]. Without loss of generality, we assume the CP-conserving scenario, and we use the values reported in Table 1.Footnote 9 The kinematics for the \(L^{-}\rightarrow \ell ^- \ell ^{\prime -} \ell ^{\prime +}\) decays can be found in “Appendix C”.

Table 1 Values used for the evaluation of the branching fractions. We also assume a value of \(m_1^2=(0.06)^2\) \(\hbox {eV}^2\)

We will first make a partial evaluation neglecting the penguin contributions (only box diagrams are considered), which yields the values in Table 2.

Table 2 Branching ratios for the \(L^{-}\rightarrow \ell ^- \ell ^{\prime -} \ell ^{\prime +} \) decays (neglecting the penguin contributions), which are obtained using the current knowledge of the PMNS matrix. Our results are obtained taking into account only the contribution from the dominant \(H_a\) function. The last column values correspond to the approximation where external masses and momenta are neglected [15]. Our results are smaller than those by three orders of magnitude, approximately
Table 3 Branching ratios including all contributions (interferences are not neglected), which are obtained using the current knowledge of the PMNS matrix. Our results are obtained taking into account only the contribution from the dominant \(H_a\) function. The last column values correspond to the approximation where external masses and momenta are neglected [15]. Our results are smaller than those by around one (two) orders of magnitude for the \(\mu \) (\(\tau \)) decays

Our final results, where the dominant penguin and box contributions are considered, are collected in Table 3, where they are compared to those obtained using Petcov’s results [15] with updated input. Our predictions are even smaller than Petcov’s updated results, as a consequence of keeping external masses and momenta in our computations (Tables 4, 5, 6, 7, 8).

These extremely suppressed branching ratios for lepton flavor violating \(L^{-}\rightarrow \ell ^- \ell ^{\prime -} \ell ^{\prime +}\) decays due to massive light neutrinos are found at similar rates in the case of LFV Z [33] and Higgs boson decays [34].

5 Conclusions

We have revisited the \(L^-\rightarrow \ell ^-\ell ^{\prime -}\ell ^{\prime +}\) decays in the SM with massive neutrinos. We obtained expressions in terms of both Feynman parameters and scalar Passarino-Veltman functions for the relevant loop integrals of the (dominant) diagrams that involve two neutrino propagators considering non-vanishing masses and momenta of the external particles. Opposed to the previous calculation reported in Ref. [17], we found that all the different amplitudes for these processes are strongly suppressed (as they are proportional to the neutrino mass squared). In the particular case of the penguin contribution with two neutrino propagators, we highlight that it is crucial saving the dependence on the momentum transfer in the Feynman integrals in order to evaluate the amplitude in the physical region for the neutrino masses. This fact avoids the incorrect logarithmic divergent behavior in the amplitude claimed in Ref. [17]. As far as the box contribution is concerned, we found that the dominant term comes from \(H_a\) function that is associated with a (V-A)\(\times \)(V-A) operator, and it is in good agreement with the approximation done in Ref. [15].

Current and forthcoming experiments were approaching the limits predicted by Ref. [17] on the SM prediction for the lepton flavor violating \(\tau ^-\rightarrow \mu ^- \ell ^+ \ell ^-\) (\(\ell =\mu , e\)) decays due to non-zero neutrino masses. This prediction was at odds with Ref. [15] corresponding computation for the \(\mu ^-\rightarrow e^- e^+ e^-\) decays predicting an extremely suppressed, unmeasurable branching ratio (as in \(L^-\rightarrow \ell ^- \gamma \) processes). The most important result of our analysis is the confirmation (in agreement with Ref. [15]) that any future observation of \(L^{-}\rightarrow \ell ^- \ell ^{\prime -} \ell ^{\prime +}\) decays would imply the existence of New Physics.