1 Introduction

One of the most interesting aspects of low scale seesaw models [1,2,3,4,5,6,7] with moderately heavy right handed (RH) neutrinos is that they can accommodate easily and successfully the low energy neutrino data, and at the same time they may provide sizable rates for processes with Lepton Flavor Violation (LFV) in the charged lepton sector. The origin of these potential large LFV rates is the allowed large neutrino Yukawa coupling matrices in these models, \(Y_\nu \simeq \mathcal{O}(1)\), which under the assumption of being non-diagonal in flavor and considering loops involving the RH neutrinos in the internal legs may generate radiatively such LFV processes [8,9,10,11,12,13,14,15,16,17,18,19,20,21,22,23,24,25,26,27].

Here, we consider the Inverse Seesaw model (ISS) [1,2,3,4] as a specific realization of these low scale seesaw models, and work with three pairs of RH neutrinos with opposite lepton numbers, which for simplicity are assumed to be quasi-degenerate. The RH mass scale \(M_R\) introduced in the ISS by the mass term involving the RH neutrinos is assumed here, also for simplicity, to be flavor diagonal and degenerate in the three diagonal entries. In the present context, we consider this new scale \(M_R\) to be above the electroweak (EW) scale, say at the energy interval \(\mathcal{O}(0.1\)–10) TeV, that is accessible at the LHC.

Table 1 Present experimental bounds on Lepton Flavor Violating Z boson decays. Here BR\((Z\rightarrow \ell _k \ell _m)\equiv \,\)BR\((Z\rightarrow \ell _k \bar{\ell }_m)+\)BR\((Z\rightarrow \bar{\ell }_k \ell _m)\)

Regarding the specific LFV processes, we focus here on the particular case of the LFV Z boson decays (LFVZD) to charged leptons with different flavor, \(Z \rightarrow \ell _k {\bar{\ell }}_m\), which have also interesting rates in low energy scale seesaw models with heavy neutrinos [3, 4, 7,8,9,10,11,12, 20, 25], and in particular in the ISS model, as studied in [20, 25] working in the physical basis or in [21] by computing the relevant Wilson coefficients. These decays, as well as the LFV Higgs decays, are being intensely searched for nowadays at the LHC [28,29,30,31,32] and, the absence of any experimental evidence of these Z decays already sets very stringent bounds on the corresponding decay rates. We summarize in Table 1 the present upper bounds on the various LFVZD channels from both the LEP data and the LHC data. On the other hand, the expectations for improving the sensitivities to these LFVZD rates in the future experiments are quite promising. In particular, the future linear colliders claim an expected sensitivity of \(10^{-9}\) [35, 36], and in the Future Circular \(e^+ e^-\) Colliders (such as FCC-ee (TLEP) [37]), where it is estimated that up to \(10^{13}\) Z bosons would be produced, the sensitivities could be improved even further.

The purpose of the present work is to compute the LFVZD rates in the ISS with a different technique: the mass insertion approximation (MIA). The main motivation to use the MIA is that, in contrast to the alternative full one-loop computation [11, 20, 25], it provides very simple analytical results and these are written explicitly in terms of the main input parameters of the ISS, concretely, the neutrino right handed mass \(M_R\) and the neutrino Yukawa coupling matrix \(Y_\nu \). Thus, working directly in the electroweak interaction basis, instead of the physical mass basis, the MIA leads to the simplest results which in turn can be used to further analyze the interesting decoupling behavior of the heavy right handed neutrinos in these LFVZD. We follow here the same method for the MIA as in our previous works [27, 38] where we applied it to the case of LFV Higgs decays. Other similar methods using the mass insertion technique to compute observables in flavor physics have also been explored in [39, 40].

Our final aim here is to compute the one-loop effective vertex, \(Z \ell _k \ell _m\) associated to the proper large \(M_R\) expansion of the involved form factors, which will show as a series in powers of \(v^2/M_R^2\), with \(v= 174\) GeV characterizing the EW scale. We believe that the simple formulas provided here for this \(Z \ell _k \ell _m\) effective vertex can be very useful to test rapidly the compatibility of these models with LFV data. As an illustration of this utility, we will explore here with the obtained \(Z \ell _k \ell _m\) effective vertex the maximum allowed LFVZD rates by present data in a specific low scale seesaw models, the ISS model. We will discuss here that these predicted rates are indeed within the reach of the future experiments.

The paper is organized as follows: in Sect. 2 we summarize the main features of the ISS model in terms of the EW interaction basis, and we present the computation of \(\Gamma (Z \rightarrow \ell _k \bar{\ell }_m)\) to one-loop within the MIA in all covariant \(R_\xi \) gauges and in the unitary gauge. Our proof of the gauge invariance of the on-shell form factor is also included in that section. Section 3 contains the computation of the one-loop effective vertex for LFVZD and the comparison of the MIA with the full results. Section 4 is devoted to explore the maximum allowed LFVZD rates using our MIA-effective vertex. The main conclusions are summarized in Sect. 5. The technicalities of the present computation, including the conventions for the one-loop integrals, the analytic expressions of the form factors for each diagram of the full and MIA computations in the Feynman–’t Hooft gauge, the expansions of the one-loop functions and the effective vertex at zero external momenta, are collected in the Appendices A, B, C, D and E respectively.

2 \(\varvec{\Gamma (Z\rightarrow \ell _k\bar{\ell }_m)}\) to one-loop within the MIA

Our computation of the partial decay width for the LFVZD in the MIA is performed in the EW basis. Therefore the starting point is the ISS Lagrangian in the EW basis, i.e., in terms of the right and the left handed neutrinos. We follow the same notation and conventions for this Lagrangian as in [27]:

$$\begin{aligned} \mathcal {L}_\mathrm {ISS} = - Y^{ij}_\nu \overline{L_{i}} \widetilde{\Phi } \nu _{Rj} - M_R^{ij} \overline{\nu _{Ri}^c} X_j - \frac{1}{2} \mu _{X}^{ij} \overline{X_{i}^c} X_{j} + h.c.,\nonumber \\ \end{aligned}$$
(1)

where L is the SM lepton doublet, \(\widetilde{\Phi }=i\sigma _2\Phi ^*\) with \(\Phi \) the SM Higgs doublet and ij are indices in flavor space that run from 1 to 3. Correspondingly, \(Y_\nu \), \(\mu _{X}\) and \(M_R\) are \(3\times 3\) matrices. The C-conjugate fermion fields are defined here as \(f_L^c=(f_L)^c=(f^c)_R\) and \(f_R^c=(f_R)^c=(f^c)_L\).

The mass matrix of the ISS, in the EW interaction basis \((\nu _L^c,\;\nu _R,\;X)\) is:

$$\begin{aligned} M_{\mathrm {ISS}}=\left( \begin{array}{c c c} 0 &{} m_D &{} 0 \\ m_D^T &{} 0 &{} M_R \\ 0 &{} M_R^T &{} \mu _X \end{array}\right) , \end{aligned}$$
(2)

with \(m_D=v Y_\nu \), and \(v = 174\,\mathrm {GeV}\). For simplicity, we choose here \(M_R\) diagonal in flavor space and with degenerate diagonal entriesFootnote 1. In Appendix B we summarize the relevant couplings in the neutrino mass basis. Notice that in the ISS model \(\mu _X\) is assumed to be small, related to the smallness of light neutrino masses, and therefore its contributions to our LFV process are negligible [27].

The relation between the neutrino electroweak interaction basis \((\nu _L^c, \nu _R, X)\) and the neutrino mass eigenstate basis, \(n_i~(i=1,\ldots ,9)\) is given by

$$\begin{aligned} \left( \begin{array}{c} \nu _L^c \\ \nu _R \\ X \end{array}\right) = U_\nu P_R \left( \begin{array}{c} n_1\\ \vdots \\ n_9\end{array}\right) , \quad \left( \begin{array}{c} \nu _L \\ \nu _R^c \\ X^c \end{array}\right) = U_\nu ^* P_L \left( \begin{array}{c} n_1\\ \vdots \\ n_9\end{array}\right) , \end{aligned}$$
(3)

where \(U_\nu \) is the rotation matrix leading to the physical neutrino masses \(m_{n_i}\), given by:

$$\begin{aligned} U_\nu ^T M_\mathrm{ISS} U_\nu = \mathrm{diag}(m_{n_1},\dots ,m_{n_9}). \end{aligned}$$
(4)

For the charged lepton sector we use the physical mass eigenstate basis in the whole paper.

Next, we write the relevant amplitude for these \(Z(p_1) \rightarrow \ell _k(-p_2) \bar{\ell }_m (p_3)\) decays in terms of the proper form factors with \(p_1=p_3-p_2\). In the present case of the ISS with right handed neutrinos and neglecting the lepton masses there is just one form factor involved [12], \(F_L\). This will be explicitly shown in our forthcoming computation. Thus we write,

$$\begin{aligned} i \mathcal{M} = i\epsilon ^{\lambda }_{Z}(p_1) \bar{u}_{\ell _k} (-p_2) (F_L \gamma _{\lambda } P_L) v_{\ell _m}(p_3). \end{aligned}$$
(5)

Then, the partial width is simply

$$\begin{aligned} \Gamma (Z \rightarrow \ell _k \bar{\ell }_m) = \frac{m_Z}{24\pi }|F_{L}|^{2}, \end{aligned}$$
(6)

where the lepton masses have been safely neglected in the phase space factor.

Now, as explained in [27], the MIA computation is organized as an ordered expansion in powers of the neutrino matrix \(Y_\nu \). The non-diagonal elements in flavor space of this matrix, \(Y_{\nu }^{ij}\) with \(i \ne j\), are the unique origin of LFV in this ISS context, and each \((vY_{\nu }^{ij})\) factor acts as a mass insertion changing lepton flavor. Thus, to a given order in this expansion, \(\mathcal{O}{(Y_\nu ^n)}\), these off-diagonal entries in the neutrino Yukawa coupling matrix, and via the loops with right handed neutrinos, generate non-vanishing contributions to the observable of our interest here, \(\Gamma (Z \rightarrow \ell _k \bar{\ell }_m)\). Specifically, the one-loop form factors receive contributions from leading order (LO) terms of \({\mathcal {O}}(Y_\nu Y_\nu ^\dagger )\) (\({\mathcal {O}}(Y_\nu ^2)\) or \({\mathcal {O}}(Y^2)\), in short) and from next to leading order (NLO) terms of \({\mathcal {O}}(Y_\nu Y_\nu ^\dagger Y_\nu Y_\nu ^\dagger )\) (\({\mathcal {O}}(Y_\nu ^4)\) or \({\mathcal {O}}(Y^4)\), in short). The systematics to compute these form factors in the MIA is fully explained in our previous work [27] which we follow closely here.

Our diagrammatic procedure consists of the systematic insertion of right handed neutrino (fat) propagators in all the possible places inside the loops which are built with the relevant interaction vertices and propagators summarized in Fig. 1. One then follows the counting of the various \(Y_\nu \) appearing in each one-loop Feynman diagram. In the present LFVZD these \(Y_\nu \) come from just two sources, the coupling of the \(\nu _R\) to the Goldstone bosons and the insertions \(\nu _L\)-\(\nu _R\) which go with \(m_D=v Y_\nu \). Generically, diagrams with one right handed neutrino propagator will contribute to the form factors at \(\mathcal{O}(Y_{\nu }^{2})\), whereas diagrams with two right handed neutrino propagators will contribute to the form factors at \(\mathcal{O}(Y_{\nu }^{4})\). The detailed computation of the fat right handed neutrino propagator leading to the expression in Fig. 1 can be found in our previous work [27]. The most relevant feature of this fat propagator is that it contains the resummation of all the insertions given by the \(M_R\) mass insertions (diagonal in flavor) and it neglects the \(\mu _X\) mass insertions which are not relevant for the LFV processes of our interest. Thus, dealing with this propagator is very convenient for the present computation of the LFVZD as it was in [27] for the case the LFV Higgs decays. It is important to note that there are no couplings between the gauge bosons W and Z with the right handed neutrinos because they are singlets of SU(2). Indeed, these \(\nu _R\) can only couple to the Higgs sector, as can be seen in Fig. 1.

Fig. 1
figure 1

Relevant Feynman rules and sign conventions for the MIA computation in a generic covariant gauge. The momentum convention is that all boson momenta are incoming. The solid thick line denotes the right handed neutrino fat propagator, as defined and computed in [27]. The cross denotes our unique LFV insertion given by \(m_{D_{ij}}=v Y_{{\nu }_{ij}}\)

In Figs. 2, 3, 4 we show the relevant one-loop diagrams in the MIA corresponding to the dominant contributions of the LO, \({\mathcal {O}}(Y^2)\), and the NLO, \({\mathcal {O}}(Y^4)\), respectively, in a generic covariant gauge. Notice that the different topologies in the MIA are of vertex corrections type and of leg corrections type, as in the full computation, and this suggests our use of a correlated notation for the labelling in the two sets of diagrams, the MIA and the full computation, summarized by diagrams with topology of type (1), type (2), etc. The final result for the full computation is collected in the Appendix B, for completeness.

The final analytical result for the MIA is the sum of all the contributions in Figs. 2, 3, 4. It gives the total form factor in the MIA to \({\mathcal {O}}(Y^2+Y^4)\) for arbitrary Z external momentum, \(p_1\), that can be summarised as follows:

$$\begin{aligned} F_{L}^{\mathrm{MIA}}= & {} F_{L}^{\mathrm{MIA\,\, (Y^2)}} +F_{L}^{\mathrm{MIA\,\, (Y^4)}}. \end{aligned}$$
(7)

At \(\mathcal{O}(Y_{\nu }^{2})\), the relevant topologies in a covariant gauge are from diagrams all containing 1 right handed neutrino propagator and one of these three combinations: (i) 1 vertex with \(\nu _R\) and 1 \(m_D\) insertion, (ii) 0 vertices with \(\nu _R\) and 2 \(m_D\) insertions, (iii) 2 vertices with \(\nu _R\) and 0 \(m_D\) insertions. Then, we get:

$$\begin{aligned} F_{L}^{\mathrm{MIA\,\, (Y^2)}}= & {} F_{L}^\mathrm{(1a)}+F_{L}^\mathrm{(1b)}+F_{L}^\mathrm{(1c)}+F_{L}^\mathrm{(1d)}+F_{L}^\mathrm{(2a)}\nonumber \\&+F_{L}^\mathrm{(2b)}+F_{L}^\mathrm{(3a)} +F_{L}^\mathrm{(4a)}+F_{L}^\mathrm{(4b)} \nonumber \\&+ F_{L}^\mathrm{(5a)}+F_{L}^\mathrm{(5b)}+F_{L}^\mathrm{(6a)}+F_{L}^\mathrm{(6b)}\nonumber \\&+F_{L}^\mathrm{(6c)}+F_{L}^\mathrm{(6d)} +F_{L}^\mathrm{(7a)}+ F_{L}^\mathrm{(8a)}+F_{L}^\mathrm{(8b)} \nonumber \\&+ F_{L}^\mathrm{(8c)}+F_{L}^\mathrm{(8d)} + F_{L}^\mathrm{(9a)}+ F_{L}^\mathrm{(10a)}\nonumber \\&+F_{L}^\mathrm{(10b)}+F_{L}^\mathrm{(10c)}+F_{L}^\mathrm{(10d)} . \end{aligned}$$
(8)

The explicit analytical results in the Feynman–’t Hooft gauge for all the relevant diagrams are collected in Appendix C. Notice that some diagrams are subleading since they are of \({\mathcal {O}}(m_\mathrm{lep}^2)\) and their contributions will be neglected from now on.

Fig. 2
figure 2

Relevant diagrams of \(\mathcal O(Y_\nu ^2)\) in a covariant gauge corresponding to vertex corrections

Fig. 3
figure 3

Relevant diagrams of \(\mathcal O(Y_\nu ^2)\) in a covariant gauge corresponding to external leg corrections

Fig. 4
figure 4

Relevant diagrams of \(\mathcal O(Y_\nu ^4)\) in a covariant gauge

At \(\mathcal{O}(Y_{\nu }^{4})\), the relevant topologies in a covariant gauge are from diagrams all containing 2 right handed neutrino propagators and one of these three combinations: (i) 2 vertices with \(\nu _R\) and 2 \(m_D\) insertions, (ii) 0 vertices with \(\nu _R\) and 4 \(m_D\) insertions, (iii) 1 vertex with \(\nu _R\) and 3 \(m_D\) insertions. Thus, we find that in the Feynman–’t Hooft gauge the most relevant diagrams are those of type (1), (6), (8) and (10) summarized in Fig. 4 whose respective contributions are given by:

$$\begin{aligned} F_{L}^{\mathrm{MIA \,\, (Y^4)}}= & {} F_{L}^\mathrm{(1e)}+F_{L}^\mathrm{(6e)}+F_{L}^\mathrm{(8e)}+F_{L}^\mathrm{(10e)}, \end{aligned}$$
(9)

and the explicit analytical results in the Feynman–’t Hooft gauge for all the relevant diagrams are collected in Appendix C. The rest of diagrams in Fig. 4 do not provide leading contributions in the large \(M_R\) expansion in the Feynman–’t Hooft gauge, but they will contribute in a generic covariant gauge, as it will be shown in the next subsection. Specifically, the diagrams (2c), (2d), (2e), (3b), (4c), (5c), (7b) and (9b), after performing the expansion in inverse powers of \(M_R\) give contributions to the form factor in the Feynman–’t Hooft gauge of \({ \mathcal O}(v^4/M_R^4)\), whereas they give contributions of \({ \mathcal O}(v^2/M_R^2)\) in a generic covariant gauge. Besides, diagrams with more than two right handed neutrino propagators will also provide subleading corrections in the heavy \(M_R\) case of our interest, since they will come with extra powers of \(M_R\) in the denominator. Then we neglect contributions of \(\mathcal{O}(Y_{\nu }^{6})\) and higher terms.

All the previous results are given in terms of the standard loop functions whose definitions and conventions are collected in Appendix A. The discussion of the results for other gauge choices like the unitary gauge and the general \(R_\xi \) covariant gauges is included in the next Sect. 2.1, where our proof of gauge invariance of the on-shell Z form factor for LFVZD will be presented.

The final analytical result is the sum of all these contributions in Eqs. (8) and (9), and this gives the total form factor in the MIA for arbitrary Z external momentum, \(p_1\), in the Feynman–’t Hooft gauge. Then, neglecting the fermion masses \(m_k\) and \(m_m\), we finally get:

$$\begin{aligned} F_{L}^{\mathrm{MIA\,\, (Y^2)}}&= \frac{1}{16 \pi ^{2}} \frac{g}{c_W} \left( Y^{}_{\nu } Y_{\nu }^{\dagger } \right) ^{km} \bigg \{ \left( \frac{1}{2}-s_{W}^{2}\right) B_1(M_R,m_W) \nonumber \\&\quad +(1-2s_{W}^{2})C_{00}(p_2,p_1,M_R,m_W,m_W) \nonumber \\&\quad +2m_{W}^{2}C_{0}(p_2,p_1,M_R,m_W,m_W) \nonumber \\&\quad +\left( 1-2s_{W}^{2}\right) m_{W}^{2}C_{2}(0,M_R,m_W) \nonumber \\&\quad +m_{W}^{2}\big ( -2D_{00}+p_{1}^{2}(D_0+D_1+D_{13}\nonumber \\&\quad -D_{33})\big )(p_{2},0,p_{1},m_{W},0,M_{R},0) \nonumber \\&\quad +m_{W}^{2}\big ( -2D_{00}+p_{1}^{2}(D_0+D_1+D_{12}\nonumber \\&\quad -D_{22})\big )(p_{2},p_{1},0,m_{W},0,M_{R},0) \nonumber \\&\quad +2c_{W}^{2}m_{W}^{2}(2D_{00}\nonumber \\&\quad -p_{1}^{2}D_{2})(0,p_{2},p_{1},0,M_{R},m_{W},m_{W}) \bigg \}, \end{aligned}$$
(10)

and:

$$\begin{aligned} F_{L}^{\mathrm{MIA\,\, (Y^4)}}&= \frac{1}{16 \pi ^{2}} \frac{g}{c_W} v^{2} \left( Y^{}_{\nu } Y_{\nu }^{\dagger } Y^{}_{\nu } Y_{\nu }^{\dagger } \right) ^{km}\nonumber \\&\quad \times \bigg \{ \frac{1}{2}C_{0}(p_{2},p_{1},m_W,M_{R},M_R) \nonumber \\&\quad + \left( \frac{1}{2}-s_{W}^{2} \right) \big ( 2D_{00}(0,p_{2},p_{1},M_R,M_{R},\nonumber \\&\quad m_W,m_W)+ C_2(M_R,M_{R},m_W) \big ) \bigg \}. \end{aligned}$$
(11)

We have also checked that the total MIA form factor in the Feynman–’t Hooft gauge, \(F_L^\mathrm{MIA}\), presented in Eqs. (7), (10) and (11), is finite. In fact, the only divergent diagrams in the MIA computation are (6a), (8a) and (10a), and we have proven that the divergences cancel out when adding these three diagrams.

We are mainly interested here in the form factor for the LFVZD, i.e. when the Z boson is on-shell. Then, the proper form factor is obtained by setting \(p_1^2=m_Z^2\) in the previous equations. We get:

$$\begin{aligned}&F_{L}^{\mathrm{MIA\,\, (Y^2)}}|_{p_1^2=m_Z^2}= \frac{1}{16 \pi ^{2}} \frac{g}{c_W} \left( Y^{}_{\nu } Y_{\nu }^{\dagger } \right) ^{km}\nonumber \\&\quad \times \bigg \{ \left( \frac{1}{2}-s_{W}^{2}\right) B_1(M_R,m_W) \nonumber \\&\qquad +(1-2s_{W}^{2}){\hat{C}}_{00}(p_2,p_1,M_R,m_W,m_W) \nonumber \\&\qquad +2m_{W}^{2}{\hat{C}}_{0}(p_2,p_1,M_R,m_W,m_W) \nonumber \\&\qquad +\left( 1-2s_{W}^{2}\right) m_{W}^{2}C_{2}(0,M_R,m_W) \nonumber \\&\qquad +m_{W}^{2}\big ( -2{\hat{D}}_{00}+m_{Z}^{2}({\hat{D}}_0+{\hat{D}}_1+{\hat{D}}_{13}\nonumber \\&\qquad -{\hat{D}}_{33})\big )(p_{2},0,p_{1},m_{W},0,M_{R},0) \nonumber \\&\qquad +m_{W}^{2}\big ( -2{\hat{D}}_{00}+m_{Z}^{2}({\hat{D}}_0+{\hat{D}}_1+{\hat{D}}_{12}\nonumber \\&\qquad -{\hat{D}}_{22})\big )(p_{2},p_{1},0,m_{W},0,M_{R},0) +2c_{W}^{2}m_{W}^{2}(2{\hat{D}}_{00} \nonumber \\&\qquad -m_{Z}^{2}{\hat{D}}_{2})(0,p_{2},p_{1},0,M_{R},m_{W},m_{W}) \bigg \}, \end{aligned}$$
(12)

and:

$$\begin{aligned}&F_{L}^{\mathrm{MIA\,\, (Y^4)}}|_{p_1^2=m_Z^2} = \frac{1}{16 \pi ^{2}} \frac{g}{c_W} v^{2} \left( Y^{}_{\nu } Y_{\nu }^{\dagger } Y^{}_{\nu } Y_{\nu }^{\dagger } \right) ^{km}\nonumber \\&\quad \times \bigg \{ \frac{1}{2} {\hat{C}}_{0}(p_{2},p_{1},m_W,M_{R},M_R) \nonumber \\&\quad + \left( \frac{1}{2}-s_{W}^{2} \right) \left( 2{\hat{D}}_{00}(0,p_{2},p_{1},M_R,M_{R},m_W,m_W)\right. \nonumber \\&\quad \left. + C_2(M_R,M_{R},m_W) \right) \bigg \}, \end{aligned}$$
(13)

where the loop functions with a ‘hat’ means that they are evaluated at \(p_1^2=m_Z^2\) (and, \(p_2^2= p_3^2 = 0\), since we are neglecting the lepton masses). The definitions of all these loop functions above and their interesting limits for the present paper are collected in Appendices A and D.

2.1 Check of gauge invariance of the on-shell form factor for LFVZD

An interesting check of our results in Eqs. (12) and (13) for the form factor \(F_L\) of the Z decay in the Feynman–’t Hooft gauge is to verify the equivalence with the results in the unitary gauge (UG) and with an arbitrary covariant \(R_\xi \) gauge. In order to perform a systematic computation, we start with the UG’s calculation. We consider the contributions of \(\mathcal O(Y_\nu ^2)\) and \(\mathcal O(Y_\nu ^4)\) corresponding only to diagrams of type (2), (3), (7) and (9) of Figs. 2, 3 and 4 because there are not Goldstone bosons in this gauge. We then split the propagator \(P_W^\mathrm{UG}\) of the W gauge boson into two parts, \(P_W^{a}\) and \(P_W^{b}\):

$$\begin{aligned} P_W^\mathrm{UG}=P_W^{a}+P_W^{b}=-\frac{i g_{\mu \nu }}{p^{2}-m_{W}^{2}} +\frac{i p_{\mu }p_{\nu }}{m_{W}^{2}(p^{2}-m_{W}^{2})} \, . \end{aligned}$$
(14)

In this way, the first part \(P_W^{a}\) matches with the propagator of the W gauge boson in the Feynman–’t Hooft gauge. Thus, we classify the different contributions arising from each dominant diagram. Those contributions at \(\mathcal O(Y_\nu ^2)\) coming from \(P_W^{a}\) are the same contributions as in the Feynman–’t Hooft gauge. In addition to them, there are new contributions corresponding to \(P_W^{b}\) in diagrams (2a), (2b), (3a), (7a) and (9a) of Figs. 2 and 3. On the other hand, the dominant contributions of \(\mathcal O(Y_\nu ^4)\) correspond to \(P_W^{b}\) in diagrams (2c), (2d), (2e), (3b), (7b) and (9b) of Fig. 4. In the following, we show the result of the form factor \(F_L\) in the UG at \(\mathcal O(Y_\nu ^2+Y_\nu ^4)\) for arbitrary Z external momentum \(p_1\):

$$\begin{aligned} F_{L}^{\mathrm{UG}}=F_{L}^{\mathrm{UG\,\, (Y^2)}}+F_{L}^{\mathrm{UG\,\, (Y^4)}}, \end{aligned}$$
(15)

where,

$$\begin{aligned}&F_{L}^{\mathrm{UG\,\, (Y^2)}} \nonumber \\&\quad = \frac{1}{16 \pi ^{2}} \frac{g}{c_W} \left( Y^{}_{\nu } Y_{\nu }^{\dagger } \right) ^{km} \bigg \{ \left( \frac{1}{2}-s_{W}^{2}\right) B_1(M_R,m_W) \nonumber \\&\qquad +\left( 2c_{W}^{2}-\frac{p_{1}^{2}}{m_{Z}^{2}}\right) C_{00}(p_2,p_1,M_R,m_W,m_W) \nonumber \\&\qquad +2c_{W}^{2}p_{1}^{2}C_{0}(p_2,p_1,M_R,m_W,m_W) \nonumber \\&\qquad +\left( 1-2s_{W}^{2}\right) m_{W}^{2}C_{2}(0,M_R,m_W) \nonumber \\&\qquad +m_{W}^{2}( -2D_{00}+p_{1}^{2}(D_0+D_1+D_{13}\nonumber \\&\qquad -D_{33}))(p_{2},0,p_{1},m_{W},0,M_{R},0) \nonumber \\&\qquad +m_{W}^{2}( -2D_{00}+p_{1}^{2}(D_0+D_1+D_{12}\nonumber \\&\qquad -D_{22}))(p_{2},p_{1},0,m_{W},0,M_{R},0) \nonumber \\&\qquad +2c_{W}^{2}m_{W}^{2}(2D_{00}-p_{1}^{2}D_{2})(0,p_{2},p_{1},0,M_{R},m_{W},m_{W}) \bigg \} \, , \end{aligned}$$
(16)

and,

$$\begin{aligned}&F_{L}^{\mathrm{UG\,\, (Y^4)}}\nonumber \\&\quad = \frac{1}{16 \pi ^{2}} \frac{g}{c_W} v^{2} \left( Y^{}_{\nu } Y_{\nu }^{\dagger } Y^{}_{\nu } Y_{\nu }^{\dagger } \right) ^{km} \bigg \{ \frac{1}{2}C_{0}(p_{2},p_{1},m_W,M_{R},M_R) \nonumber \\&\qquad \left. + \left( 2c_{W}^{2}-\frac{p_1^{2}}{m_Z^{2}} \right) D_{00}(0,p_{2},p_{1},M_R,M_{R},m_W,m_W) \right. \nonumber \\&\qquad + \left( \frac{1}{2}-s_{W}^{2} \right) C_2(M_R,M_{R},m_W) \bigg \} \, . \end{aligned}$$
(17)

These general expressions are not equal to Eqs. (10) and (11), but for the particular case of the Z decays (setting \(p_1^2=m_Z^2\) in the previous equations) and using the relations \(s_W^2+c_W^2=1\) and \(m_W^2=c_W^2m_Z^2\), we arrive to Eqs. (12) and (13) exactly. Thus, we verify the equivalence of the form factors for the Z decay in the Feynman–’t Hooft gauge and the UG when the Z is on-shell.

Finally, we present the computation in an arbitrary covariant \(R_\xi \) gauge. The relevant gauge-fixing parameter \(\xi \equiv \xi _W\) corresponds to \(SU(2)_L\) gauge group. In this case, we may split the propagator into three parts, the two previous ones, \(P_W^{a}\) and \(P_W^{b}\), and a new part, \(P_W^{c}\), that contains the \(\xi \) dependence:

$$\begin{aligned} P_W^\mathrm{R_\xi }= & {} P_W^{a}+P_W^{b}+P_W^{c}\nonumber \\= & {} -\frac{i g_{\mu \nu }}{p^{2}-m_{W}^{2}} +\frac{i p_{\mu }p_{\nu }}{m_{W}^{2}(p^{2}-m_{W}^{2})} \nonumber \\&-\frac{i p_{\mu }p_{\nu }}{m_{W}^{2}(p^{2}-\xi m_{W}^{2})} \, . \end{aligned}$$
(18)

The contribution to the form factor from \(P_W^{c}\) can be easily obtained from that of \(P_W^{b}\) by simply changing the global sign and by replacing \(m_W \rightarrow \sqrt{\xi } m_W\) in the corresponding argument of masses of the involved one-loop functions.

On the other hand, the Goldstone boson propagator is now given by:

$$\begin{aligned} P_G^\mathrm{R_\xi }=\frac{i}{p^{2}-\xi m_{W}^{2}} \, , \end{aligned}$$
(19)

thus, in comparison with the previous computation of the Feynman–’t Hooft gauge, we now have to replace \(m_W \rightarrow \sqrt{\xi } m_W\) in the masses of the one-loop functions corresponding to the Goldstone bosons.

Taking into account all the above commented properties, we find the form factor \(F_L\) in an arbitrary covariant \(R_\xi \) gauge at \(\mathcal O(Y_\nu ^2+Y_\nu ^4)\), for arbitrary Z external momentum \(p_1\), from the computation of all diagrams in Figs. 2, 3, 4. This is given by:

$$\begin{aligned} F_{L}^{\mathrm{R_\xi }}=F_{L}^{\mathrm{R_\xi \,\, (Y^2)}}+F_{L}^{\mathrm{R_\xi \,\, (Y^4)}}, \end{aligned}$$
(20)

where,

$$\begin{aligned} F_{L}^{\mathrm{R_\xi \,\, (Y^2)}}&= \frac{1}{16 \pi ^{2}} \frac{g}{c_W} \left( Y^{}_{\nu } Y_{\nu }^{\dagger } \right) ^{km} \bigg \{ \left( 2c_{W}^{2}B_0\right. \nonumber \\&\quad \left. +\left( \frac{1}{2}-s_{W}^{2}\right) B_1\right) (M_R,m_W) \nonumber \\&\quad \left. -\left( \frac{p_{1}^{2}}{m_{Z}^{2}}-2c_{W}^{2}\right) C_{00}(p_2,p_1,M_R,m_W,m_W) \right. \nonumber \\&\quad +2c_{W}^{2}p_{1}^{2}C_{0}(p_2,p_1,M_R,m_W,m_W) \nonumber \\&\quad +\left( 1-2s_{W}^{2}\right) m_{W}^{2}C_{2}(0,M_R,m_W) \nonumber \\&\quad +m_{W}^{2}( -2D_{00}+p_{1}^{2}(D_0+D_1+D_{13}\nonumber \\&\quad -D_{33}))(p_{2},0,p_{1},m_{W},0,M_{R},0) \nonumber \\&\quad +m_{W}^{2}( -2D_{00}+p_{1}^{2}(D_0+D_1+D_{12}\nonumber \\&\quad -D_{22}))(p_{2},p_{1},0,m_{W},0,M_{R},0) \nonumber \\&\quad \left. +2c_{W}^{2}m_{W}^{2}(2D_{00}-p_{1}^{2}D_{2})(0,p_{2},p_{1},0,M_{R},m_{W},m_{W}) \right. \nonumber \\&\quad -2c_{W}^{2}B_0(M_R,\sqrt{\xi }m_W) \nonumber \\&\quad +\left( 1-\frac{p_{1}^{2}}{m_{Z}^{2}}\right) C_{00}(p_2,p_1,M_R,\sqrt{\xi }m_W,\sqrt{\xi }m_W) \nonumber \\&\quad +\left( \frac{p_{1}^{2}}{m_{Z}^{2}}-1\right) C_{00}(p_2,p_1,M_R,\sqrt{\xi }m_W,m_W) \nonumber \\&\quad +\left( \frac{p_{1}^{2}}{m_{Z}^{2}}-1\right) C_{00}(p_2,p_1,M_R,m_W,\sqrt{\xi }m_W) \nonumber \\&\quad +\left( \xi c_W^{2}m_W^{2}-c_W^{2}p_1^2\right. \nonumber \\&\quad +\left. s_W^{2}m_W^{2}\right) C_0(p_2,p_1,M_R,\sqrt{\xi }m_W,m_W) \nonumber \\&\quad +\left( \xi c_W^{2}m_W^{2}-c_W^{2}p_1^2\right. \nonumber \\&\quad +\left. s_W^{2}m_W^{2}\right) C_0(p_2,p_1,M_R,m_W,\sqrt{\xi }m_W) \bigg \} \, , \end{aligned}$$
(21)

and,

$$\begin{aligned}&F_{L}^{\mathrm{R_\xi \,\, (Y^4)}} \nonumber \\&\quad = \frac{1}{16 \pi ^{2}} \frac{g}{c_W} v^{2} \left( Y^{}_{\nu } Y_{\nu }^{\dagger } Y^{}_{\nu } Y_{\nu }^{\dagger } \right) ^{km} \bigg \{ \frac{1}{2}C_{0}(p_{2},p_{1},m_W,M_{R},M_R) \nonumber \\&\qquad +2c_W^{2}C_{0}(M_{R},M_R,m_W) \nonumber \\&\qquad -\left( \frac{p_1^2}{m_Z^{2}}-2c_W^{2} \right) D_{00}(0,p_{2},p_{1},M_R,M_{R},m_W,m_W) \nonumber \\&\qquad +\left( \frac{1}{2}-s_{W}^{2} \right) C_2(M_R,M_{R},m_W) \nonumber \\&\qquad \left. -2c_W^{2}C_0(M_R,M_{R},\sqrt{\xi }m_W) \right. \nonumber \\&\qquad +\left( 1-\frac{p_1^2}{m_Z^{2}}\right) D_{00}(0,p_{2},p_{1},M_R,M_{R},\sqrt{\xi }m_W,\sqrt{\xi }m_W) \nonumber \\&\qquad +\left( \left( \frac{p_1^2}{m_Z^{2}}-1 \right) D_{00} +\left( \xi c_W^{2}m_W^{2}-c_W^{2}p_1^2\right. \right. \nonumber \\&\qquad +\left. \left. s_W^{2}m_W^{2} \right) D_0 \right) (0,p_{2},p_{1},M_R,M_{R},\sqrt{\xi }m_W,m_W) \nonumber \\&\qquad +\left( \left( \frac{p_1^2}{m_Z^{2}}-1 \right) D_{00} +\left( \xi c_W^{2}m_W^{2}-c_W^{2}p_1^2\right. \right. \nonumber \\&\qquad \left. \left. +s_W^{2}m_W^{2} \right) D_0 \right) (0,p_{2},p_{1},M_R,M_{R},m_W,\sqrt{\xi }m_W) \bigg \}. \end{aligned}$$
(22)

On the other hand, from the integral definition of the one-loop functions (see Appendix A), we obtain the following useful relations:

$$\begin{aligned}&2c_{W}^{2}\left( B_0(M_R,m_W)-B_0(M_R,\sqrt{\xi }m_W)\right) \nonumber \\&\qquad -\left( 1 -\xi \right) c_{W}^{2}m_{W}^{2}C_0(p_2,p_1,M_R,\sqrt{\xi }m_W,m_W) \nonumber \\&\qquad -\left( 1 -\xi \right) c_{W}^{2}m_{W}^{2}C_0(p_2,p_1,M_R,m_W,\sqrt{\xi }m_W) \nonumber \\&\quad \approx \mathcal O(m_{lep}^{2}) \approx 0 \, , \end{aligned}$$
(23)

and

$$\begin{aligned}&2c_{W}^{2}\left( C_0(M_R,M_R,m_W)-C_0(M_R,M_R,\sqrt{\xi }m_W)\right) \nonumber \\&\qquad -\left( 1 -\xi \right) c_{W}^{2}m_{W}^{2}D_0(0,p_2,p_1,M_R,M_R,\sqrt{\xi }m_W,m_W) \nonumber \\&\qquad -\left( 1 -\xi \right) c_{W}^{2}m_{W}^{2}D_0(0,p_2,p_1,M_R,M_R,m_W,\sqrt{\xi }m_W)\nonumber \\&\quad \approx \mathcal O(m_{lep}^{2}) \approx 0 \, , \end{aligned}$$
(24)

which can be used to further simplify the previous results of \(\mathcal O(Y_\nu ^2)\) and \(\mathcal O(Y_\nu ^4)\) respectively. It is worth noticing that the general result for an arbitrary \(p_1^2\) is not gauge invariant, since the \(\xi \) dependence is not fully cancelled in Eqs. (21) and (22).

Finally, to get the form factors in an arbitrary \(R_\xi \) gauge in the case where the Z gauge boson is on-shell, we set \(p_1^2= m_Z^ 2\) in Eqs. (21) and (22), and use the above relations in Eqs. (23) and (24). Simplifying the final expression by means of the identities \(s_W^2+c_W^2=1\) and \(m_W^2=c_W^2m_Z^2\), we find out that the expected cancellations among the \(\xi \) dependent terms take place and the final result for \(F_{L}^{\mathrm{R_\xi }}|_{p_1^2=m_Z^2}\) turns out to be \(\xi \) independent, leading to the same result for all \(\xi \) choices and coinciding with the results of the previous section for the Feynman–’t Hooft gauge, \(F_{L}^{\mathrm{MIA\,\, (Y^2)}}|_{p_1^2=m_Z^2}\) and \(F_{L}^{\mathrm{MIA\,\, (Y^4)}}|_{p_1^2=m_Z^2}\) in Eqs. (12) and 13, respectively. Therefore, having found the same result for all \(R_\xi \) gauges as well as for the unitary gauge, we conclude that our result for the on-shell form factor describing the LFVZD, \(F_L|_{p_1^2=m_Z^2}\), is gauge invariant. This is as expected, since this is a physical quantity defining an observable, the partial LFVZD width.

Another interesting discussion arises when considering the zero external momenta approximation, and testing if this approximation is or is not appropriate to estimate the LFVZD rates. We have also explored this question in detail in this work, and our conclusion is that it is not appropriate, because it is not a gauge invariant quantity. Since this issue is not needed for the central results in this paper, we present this discussion separately, and leave it to Appendix E. There we present the results of the form factors in the case of zero Z external momentum, \(p_1^2 = 0\), and include our proof that the result in that case is not gauge invariant. This is in contrast, with the on-shell case presented in this section, being fully gauge invariant.

3 Computation of the one-loop effective vertex for LFVZD

Here we present the computation of the one-loop effective vertex that is the proper one for the description of the LFVZD. This leads us to set first the Z external leg to be on-shell, and second to explore the proper analytic expansion of the MIA form factor \(F_L^\mathrm{MIA}\) that is valid at large \(M_R\). The effective vertex that we look for then summarizes the one-loop effects of the heavy right handed neutrinos, and it is obtained from the result of this large \(M_R\) expansion, generically, as:

$$\begin{aligned} {\hat{V}}^\mathrm{eff}_{Z\ell _k\ell _m}=V^\mathrm{eff}_{Z\ell _k\ell _m}|_{p_1^2=m_Z^2} = F_L^\mathrm{MIA}|_{p_1^2=m_Z^2} (\mathbf{M_R}), \end{aligned}$$
(25)

where the mass \(\mathbf{M_R}\) in boldface means that the function \(F_L^\mathrm{MIA}|_{p_1^2=m_Z^2}\) has been expanded at large \(M_R\). Specifically, this expansion should valid for \(M_R\gg v\), with \(v=174\) GeV being the characteristic scale providing all the electroweak masses involved. In practice, this implies \(M_R\) being much heavier than all the other particle masses involved in the loop contributing diagrams, therefore, larger than \(m_Z\), \(m_W\), \(m_\mathrm{lep}\), etc.

The result of this large \(M_R\) expansion must provide a local function in spacetime, hence, leading in momentum space, to an expansion (up to logarithms) given in inverse powers of \(M_R^2\). In fact, our explicit computation presented in this work shows that the first term in this expansion is of \({\mathcal {O}}(v^2/M_R^2)\), the second term is of \({\mathcal {O}}(v^4/M_R^4)\) and so on, leading to an expansion, valid at large \(M_R \gg v\), of the generic form:

$$\begin{aligned} {\hat{V}}^\mathrm{eff}_{Z\ell _k\ell _m} \simeq c_1 \frac{v^2}{M_R^2} + c_2 \frac{v^4}{M_R^4}+ \cdots , \end{aligned}$$
(26)

where the coefficients \(c_{1,2,\dots }\) contain contributions that are generically either constants with \(M_R\) or logarithmic contributions like \(\log (m_W^2/M_R^2)\), which are originated from the heavy neutrino loops with W gauge bosons and/or Goldstone bosons. This functional dependence with \(M_R\) typically signals the decoupling behavior of the heavy neutrinos in the one-loop generated LFV form factors, leading to one-loop radiative effects in the LFVZD decays that vanish in the asymptotically infinite right handed mass limit.

In order to get this expansion of the on-shell form factors in inverse powers of \(M_R^2\), we start with the previous results of the Feynman–’t Hooft gauge, \(F_{L}^{\mathrm{MIA\,\, (Y^2+Y^4)}}\), in Eqs. (12) and (13) and insert into these equations the corresponding expansions for the loop functions that we have also computed and whose results are collected in Appendix D. This leads us to very simple results for the on-shell form factors, which in turn define the wanted on-shell effective vertex, such that:

$$\begin{aligned} i \mathcal{M} \simeq i \epsilon ^{\lambda }_{Z}(p_1) \bar{u}_{\ell _k} (-p_2) {\hat{V}}^\mathrm{eff}_{Z\ell _k\ell _m} \gamma _{\lambda } P_L v_{\ell _m}(p_3) , \end{aligned}$$
(27)

and

$$\begin{aligned} \Gamma (Z \rightarrow \ell _k \bar{\ell }_m) \simeq \frac{m_Z}{24\pi }|{\hat{V}}^\mathrm{eff}_{Z\ell _k\ell _m}|^{2}, \end{aligned}$$
(28)

with,

$$\begin{aligned}&{\hat{V}}^\mathrm{eff}_{Z\ell _k\ell _m}\nonumber \\&\quad = \frac{1}{16 \pi ^{2}} \frac{g}{c_W} \left( Y^{}_{\nu } Y_{\nu }^{\dagger } \right) ^{km} \bigg \{ \left( \frac{1}{2}-s_{W}^{2}\right) B_1(\mathbf{M_R},m_W) \nonumber \\&\qquad +\,(1-2s_{W}^{2}){\hat{C}}_{00}(p_2,p_1,\mathbf{M_R},m_W,m_W) \nonumber \\&\qquad +\,2m_{W}^{2}{\hat{C}}_{0}(p_2,p_1,\mathbf{M_R},m_W,m_W) \nonumber \\&\qquad +\,\left( 1-2s_{W}^{2}\right) m_{W}^{2}C_{2}(0,\mathbf{M_R},m_W) \nonumber \\&\qquad +\,m_{W}^{2}\big ( -2{\hat{D}}_{00}+m_{Z}^{2}({\hat{D}}_0+{\hat{D}}_1+{\hat{D}}_{13}\nonumber \\&\qquad -\,{\hat{D}}_{33})\big )(p_{2},0,p_{1},m_{W},0,\mathbf{M_R},0) \nonumber \\&\qquad +\,m_{W}^{2}\big ( -2{\hat{D}}_{00}+m_{Z}^{2}({\hat{D}}_0+{\hat{D}}_1+{\hat{D}}_{12}\nonumber \\&\qquad -\,{\hat{D}}_{22})\big )(p_{2},p_{1},0,m_{W},0,\mathbf{M_R},0) \nonumber \\&\qquad +\,2c_{W}^{2}m_{W}^{2}(2{\hat{D}}_{00}-m_{Z}^{2}{\hat{D}}_{2})(0,p_{2},p_{1},0,\mathbf{M_R},m_{W},m_{W}) \bigg \} \nonumber \\&\qquad +\, \frac{1}{16 \pi ^{2}} \frac{g}{c_W} v^{2} \left( Y^{}_{\nu } Y_{\nu }^{\dagger } Y^{}_{\nu } Y_{\nu }^{\dagger } \right) ^{km} \nonumber \\&\qquad \times \bigg \{ \frac{1}{2} {\hat{C}}_{0}(p_{2},p_{1},m_W,\mathbf{M_R},\mathbf{M_R}) \nonumber \\&\qquad + \left( \frac{1}{2}-s_{W}^{2} \right) \left( 2{\hat{D}}_{00}(0,p_{2},p_{1},\mathbf{M_R},\mathbf{M_R},m_W,m_W)\right. \nonumber \\&\qquad \left. +\, C_2(\mathbf{M_R},\mathbf{M_R},m_W) \right) \bigg \}. \end{aligned}$$
(29)
Fig. 5
figure 5

Predictions for the partial width \(\Gamma (Z \rightarrow \tau \bar{\mu })\) and branching ratio BR\((Z \rightarrow \tau \bar{\mu })\) as a function of \(M_R\). The dashed lines are the predictions from the MIA to \(\mathcal{O}(Y_\nu ^2)\). The solid lines are the predictions from the full one-loop computation of the mass basis. Here the examples TM4 (left panel) and TM5 (right panel) with \(f=0.1, 0.5, 1\), as explained in the text, are chosen. In the bottom of these plots the ratio \(R=\Gamma _\mathrm{MIA}/ \Gamma _\mathrm{full}\) is also shown

Here we have used again the notation in boldface for \(M_R\), to mean that all these functions have been expanded at large \(M_R \gg v\), and we have kept just the first terms in these expansions. Specifically, we select all the needed terms in the involved loop functions that lead to contributions in \(F_L^\mathrm{MIA}\) of \({\mathcal {O}}(v^2/M_R^2)\), which are the first order terms in this large \(M_R\) expansion of the form factor. For shortness, we leave the technical details of the loop functions expansions for the Appendix D, and present here just the final result for the effective vertex. By plugging the results of Appendix D into Eq. (29), we finally get:

$$\begin{aligned} {\hat{V}}^\mathrm{eff}_{Z\ell _k\ell _m}&= \frac{g}{16 \pi ^2 c_W} \left[ \frac{m_W^2}{M_R^2} \left( f(c_W^2) + g(c_W^2) \log \left( \frac{m_W^2}{M_R^2} \right) \right) \left( Y^{}_{\nu } Y_{\nu }^{\dagger } \right) ^{km} -\frac{v^2}{2M_R^2} \left( Y^{}_{\nu } Y_{\nu }^{\dagger }Y^{}_{\nu } Y_{\nu }^{\dagger } \right) ^{km} \right] \nonumber \\&= \frac{g}{16 \pi ^2 c_W} \left[ \frac{m_W^2}{M_R^2} \left( 4.1+ i\, 2.1+1.4 \log \left( \frac{m_W^2}{M_R^2} \right) \right) \left( Y^{}_{\nu } Y_{\nu }^{\dagger } \right) ^{km} - \frac{v^2}{2M_R^2} \left( Y^{}_{\nu } Y_{\nu }^{\dagger }Y^{}_{\nu } Y_{\nu }^{\dagger } \right) ^{km} \right] , \end{aligned}$$
(30)

where, for practical purposes, the resulting coefficients \(f(c_W^2)\) and \(g(c_W^2)\), which are functions of the squared W and Z mass ratio, \(m_W^2/m_Z^2= \cos ^2 \theta _W \equiv c_W^2\), have been evaluated numerically in the second line, for \(c_W^2=0.77\). Their complete analytical expressions are given by:

$$\begin{aligned} f(c_W^2)&= 8(c_W^2+2)c_W^4 \arctan ^2\Big [(4c_W^2-1)^{-\frac{1}{2}}\Big ] +\frac{1}{6}\Big (-11 c_W^2 +12 c_W^4+4\pi ^2 (c_W^2+1)^2-26 \Big ) \nonumber \\&\quad +\frac{1}{6}\Big (c_W^{-2}+18-28c_W^2-24c_W^4\Big ) (4c_W^2 -1)^{\frac{1}{2}}\arctan \Big [(4c_W^2 -1)^{-\frac{1}{2}}\Big ]+ \frac{1}{72}(10 -5c_W^{-2}) \nonumber \\&\quad -\log (c_W^2) \Big (3+2 c_W^2+(c_W^2+1)^2 \log (c_W^2)\Big ) -2 (c_W^2+1)^2 {\text {Li}}_2(c_W^2+1) \nonumber \\&\quad -i \pi \Big ( 3 +2 c_W^2+2 (c_W^2+1)^2 \log (c_W^2) \Big ),\end{aligned}$$
(31)
$$\begin{aligned} g(c_W^2)&= \frac{1}{12} (c_W^{-2} +16). \end{aligned}$$
(32)
Fig. 6
figure 6

Predictions for the partial width \(\Gamma (Z \rightarrow \tau \bar{\mu })\) and branching ratio BR\((Z \rightarrow \tau \bar{\mu })\) as a function of \(M_R\). The dashed lines are the predictions from the MIA to \(\mathcal{O}(Y_\nu ^2+Y_\nu ^4)\). The solid lines are the predictions from the full-one loop computation of the mass basis. Here the examples TM4 (left panel) and TM5 (right panel) with f=0.1,0.5,1, as explained in the text, are chosen. In the bottom of these plots the ratio \(R=\Gamma _\mathrm{MIA}/ \Gamma _\mathrm{full}\) is also shown

Notice that in this on-shell Z boson effective vertex there is an imaginary contribution. We have checked that it comes from diagrams (2a) and (2b) in Fig. 2, and is due to the possible crossing through the physical threshold of producing two light neutrinos (mainly \(\nu _L\)) from the on-shell Z boson. We have also checked that our analytical result for this imaginary part is in agreement with the analytical result of the limit of heavy singlets in reference [7], i.e. for \(m_{n_i}\gg m_W\).

In Figs. 5 and 6 we present the numerical results for our predictions of the partial widths and the corresponding branching ratios for the LFVZD. For illustrative purposes we have chosen two examples of input neutrino Yukawa coupling matrices, following [19, 22, 23, 25]. Concretely, in these figures we use:

$$\begin{aligned} Y_{\nu }^\mathrm{TM4}=f\left( \begin{array}{ccc} 0.1&{}0&{}0\\ 0&{}1&{}0\\ 0&{}1&{}0.014 \end{array}\right) ,&\quad Y_{\nu }^\mathrm{TM5}=f\left( \begin{array}{ccc} 0&{}1&{}-1\\ 0.9&{}1&{}1\\ 1&{}1&{}1 \end{array}\right) . \end{aligned}$$
(33)

These particular textures were selected as illustrative examples belonging to a type of scenarios (named TM scenarios in [25]) in which the LFV is always extremely suppressed in the \(\mu e\) sector, which is well-known to be highly constrained, but it can lead to large LFV in the \(\tau \mu \) sector, which is less severely constrained. These scenarios are known to produce interesting phenomenological implications. For instance, in collider physics they can lead to the production of exotic \(\tau \)-\(\mu \)-jet-jet events at LHC [23]. Notice that the Yukawa coupling matrices in these examples are usually given in terms of a scaling factor f that characterizes the global strength of the coupling. We have also tried other examples of textures leading instead to large LFV in the \(\tau e\) sector (the so-called TE scenarios in [25]) and the results are quite similar to the ones presented here for the \(\tau \mu \) sector. Since our aim in this section is mainly to provide useful and accurate formulas for the LFV effective vertex, we believe that the choice of these two textures should be sufficient for the check of the \(\hat{V}^\mathrm{eff}_{Z\ell _k\ell _m}\) accuracy. It should also be noted that in checking our numerical results from the MIA with the previous full one-loop numerical results of [25], which is our choice here, we are also simultaneously checking the agreement with all the other numerical results of the physical basis in [7, 12, 20] since they are all consistent.

In Figs. 5 and 6 we have compared the MIA results with the full results in order to learn on the goodness of our approximate formulas for the on-shell effective vertex. We show the results both to \(\mathcal{O}(Y_\nu ^2)\), Fig. 5, and to \(\mathcal{O}(Y_\nu ^2+Y_\nu ^4)\), Fig. 6, for comparison. The first thing worth noticing is that with the LO result, i.e. taking just the \(\mathcal{O}(Y_\nu ^2)\) solution for the effective vertex, the agreement between the MIA and the full result is not so good as taking also the NLO terms of \(\mathcal{O}(Y_\nu ^4)\). It is also clear from these plots that our simple formula in Eq. (30) for the MIA effective vertex to \(\mathcal{O}(Y_\nu ^2+Y_\nu ^4)\) provides very accurate results, leading to LFV rates which are very close to the full results, even for large Yukawa couplings, with global strength f of order 1. It is only for relatively low values of \(M_R\), say well below 1 TeV, and very large Yukawa couplings, say with \(f \ge \mathcal{O}(1)\), where the initial assumption of \(m_D\ll M_R\) does not hold anymore, that we get a significant deviation from the full results. For all the other input parameters the agreement is excellent. Therefore the MIA approximation works pretty well in the present case of LFVZD.

Finally, to end this section we find interesting to compare our result for \({\hat{V}}^\mathrm{eff}_{Z\ell _k\ell _m}\) in Eq. (30) with our previous result in [27] for the corresponding on-shell Higgs effective vertex \({\hat{V}}^\mathrm{eff}_{H\ell _k\ell _m}\) which is the proper one for the LFV Higgs decays \(H \rightarrow \ell _k \bar{\ell }_m\). This Higgs effective vertex computed in [27] was obtained in exactly the same context of ISS with heavy right handed neutrinos and following the same MIA and large \(M_R\) techniques as in the present paper, therefore this comparison gives us a valuable information. In the Higgs case, the amplitude and partial decay width can be written as:

$$\begin{aligned}&i \mathcal{M} \simeq -i \bar{u}_{\ell _k} {\hat{V}}_{H \ell _{k}\ell _{m}}^\mathrm{eff} P_L v_{\ell _m} \, , \end{aligned}$$
(34)
$$\begin{aligned}&\Gamma (H \rightarrow \ell _{k}\bar{\ell }_{m})\simeq \frac{m_H}{16\pi } \big \vert {\hat{V}}_{H \ell _{k}\ell _{m}}^\mathrm{eff}\big \vert ^{2} \, , \end{aligned}$$
(35)

and the on-shell vertex, i.e. for \(p_1^2=m_H^2\), found is [27]:

$$\begin{aligned} {\hat{V}}_{H \ell _{k} \ell _{m}}^\mathrm{eff}= & {} \frac{g}{64 \pi ^{2}} \frac{m_{\ell _k}}{m_{W}} \left[ \frac{m_{H}^{2}}{M_{R}^{2}} \left( r\Big (\frac{m_{W}^{2}}{m_{H}^{2}}\Big )\right. \right. \nonumber \\&\left. +\log \left( \frac{m_{W}^{2}}{M_{R}^{2}}\right) \right) \left( Y^{}_{\nu } Y_{\nu }^{\dagger }\right) ^{km} \nonumber \\&\left. - \frac{3v^{2}}{M_{R}^{2}} \left( Y^{}_{\nu } Y_{\nu }^{\dagger } Y^{}_{\nu } Y_{\nu }^{\dagger } \right) ^{km} \right] , \end{aligned}$$
(36)

with a numerical value for the coefficient given by \(r(m_W^2/m_H^2)\simeq 0.3\). We clearly see that the two effective vertices \({\hat{V}}_{Z \ell _{k} \ell _{m}}^\mathrm{eff}\) and \({\hat{V}}_{H \ell _{k} \ell _{m}}^\mathrm{eff}\) have the same functional form as functions of the input parameters \(M_R\) and \(Y_\nu \). Both show a decoupling behavior with the heavy neutrino masses as \(\sim 1/M_R^2\), both have a term with \(\log (m_W^2/M_R^2)\), and both have contributions from the LO, \(\mathcal{O}(Y_\nu ^2)\), and from the NLO, \(\mathcal{O} (Y_\nu ^4)\). Besides, we have checked that both contributions LO and NLO in the two decays, H and Z, are needed to get a good numerical agreement of the MIA with the full one-loop result. The main difference, therefore, is just the numerical values of the coefficients in front of these terms. First, in the Higgs case the on-shell effective vertex is real. There is not an imaginary part because, contrary to the Z decays, there are not diagrams with two light neutrino lines (mainly \(\nu _L\)) from the Higgs boson. Instead, the two neutrino lines connected to a Higgs particle are one light (mainly \(\nu _L\)) and the other one heavy (mainly \(\nu _R\)) that cannot be produced on-shell in the Higgs decay, under our assumption of heavy \(M_R \gg v\). Second, notice that the Z effective vertex, \({\hat{V}}^\mathrm{eff}_{Z\ell _k \ell _m}\), is universal in flavor, namely, its size does not depend on the mass of the charged leptons involved, \(m_{\ell _{k,m}}\). This is also in contrast with the Higgs case, where \({\hat{V}}^\mathrm{eff}_{H\ell _k\ell _m}\) shows a linear dependence with the heaviest charged lepton mass, \(m_{\ell _{k}}\), indicating a larger LFV effect for heavier charged leptons.

Fig. 7
figure 7

Predictions for \(Z\rightarrow \mu \bar{e}\) (top), \(Z \rightarrow \tau \bar{e}\) (left) and \(Z \rightarrow \tau \bar{\mu }\) (right) using the effective vertex computed with the MIA (dashed lines) and the full results in the mass basis (solid lines) for \(Y_\nu ^\mathrm{GF}\) in Eq. (40) and \(f=0.1, 0.5, 1, 2\). The chosen example GF is explained in the text. Shadowed areas are disallowed by global fit results (purple) or for giving non-perturbative Yukawa couplings (yellow)

4 Numerical estimates with the MIA-effective vertex of maximum allowed LFVZD rates

In order to show the applicability and simplicity of the MIA results, in this section we use the effective vertex in Eq. (30) to compute the maximum LFVZD rates in the ISS model that are allowed by present experimental constraints. For that purpose, we use the constraints derived in [41], where the non-unitary matrix N describing the mixing between the light neutrino mass eigenstates and the SM charged leptons via W interactions was parametrized in terms of a small Hermitian matrix \(\eta \) defined by [41]:

$$\begin{aligned} N = (1-\eta ) U_\mathrm{PMNS}. \end{aligned}$$
(37)

This \(\eta \) matrix then encodes the deviations from the unitary \(U_\mathrm{PMNS}\) induced by the mixing with the extra heavy neutrinos.

By performing a global fit analysis, upper boundsFootnote 2 on the \(\eta \) matrix were set to

$$\begin{aligned} \eta _{3\sigma }^\mathrm{max}=\left( \begin{array}{ccc} 1.62\times 10^{-3}&{}1.51\times 10^{-5}&{}1.57\times 10^{-3}\\ 1.51\times 10^{-5}&{}3.92\times 10^{-4}&{}9.24\times 10^{-4} \\ 1.57\times 10^{-3}&{}9.24\times 10^{-4}&{}3.67\times 10^{-3} \end{array}\right) . \end{aligned}$$
(38)

In our case of interest with degenerate \(M_R\) and \(v Y_\nu \ll M_R\), the \(\eta \) matrix can be written, following  [41], approximately as:

$$\begin{aligned} \eta =\frac{v^2}{2M_R^2}\, Y_\nu ^{} Y_\nu ^\dagger , \end{aligned}$$
(39)

what allows us to define a scenario that easily implements these bounds. Following [27], we define this scenario by considering the following neutrino Yukawa matrix:

$$\begin{aligned} Y_{\nu }^{\mathrm{GF}}=f\left( \begin{array}{ccc} 0.33&{} 0.83&{} 0.6\\ -0.5&{} 0.13&{} 0.1\\ -0.87&{}1&{}1 \end{array}\right) . \end{aligned}$$
(40)

This \(Y_\nu \) leads to a \(Y_\nu Y_\nu ^\dagger \) with the same pattern as in Eq. (38) and saturates the \(\eta _{3\sigma }^\mathrm{max}\) bounds for \(f/M_R= (3/10)\, \mathrm{TeV}^{-1}\). Consequently, it provides a simple way for concluding on maximum allowed rates within this model. Notice that one can always take \(Y_\nu \) and \(M_R\) as independent input parameters as long as \(\mu _X\) accommodates light neutrino oscillation data by means of the \(\mu _X\)-parametrization introduced in [19].

We show in Fig. 7 the results for the three LFVZD channels in the GF scenario defined in Eq. (40). Solid lines are the exact one-loop results, computed with the expressions in Appendix B after diagonalizing to the mass basis, while dashed lines have been obtained using the effective vertex in Eq. (30) in terms of the parameters in the EW basis. Shadowed areas represent the regions disallowed by some of the constraints: in the purple area, covering the upper and left parts of the figures, the upper bounds in Eq. (38) are not fulfilled; in the yellow area, in the upper right corners, the Yukawa coupling matrix becomes non-perturbative. Our criteria for perturbativity is imposing \(|Y_{ij}|^2/4\pi <1\) for all the entries, what implies \(f<\sqrt{4\pi }\) for this scenario.

The first thing we conclude from these plots is that the computed effective vertex works extremely well in the allowed white region. Applying the constraints from both global fits and perturbativity imposes an upper bound on \(v Y_\nu /M_R\), which further supports our criteria of not computing higher order terms in Eq. (7). For masses below the TeV scale, when \(M_R\) is close to the EW scale, the assumption \(v/M_R\ll 1\) breaks down and the effective vertex stops being a good approximation. Nevertheless, from these plots we see that the MIA results work very well in the allowed region also for lighter \(M_R\). Consequently, we can conclude that our effective vertex is a very powerful tool to easily estimate the LFVZD rates in the region of \(M_R\gtrsim 300\) GeV that is allowed by present constraints.

Second, we see that the shape of the excluded purple area, or the complementary allowed white area, is different in the \(\mu e\) sector with respect to the \(\tau e\) and \(\tau \mu \) ones, specially in the low f and low \(M_R\) regime. The origin of this difference comes from the strong bound on \(\eta _{e\mu }\), coming from the upper bound on \(\mu \rightarrow e\gamma \) by MEG [42], which suppresses the \({\mathcal {O}}(Y_\nu ^2)\) contributions that are the most relevant ones at this low f regime.

Finally, we can use Fig. 7 to conclude on the maximum allowed rates for the LFVZD. As it happens for the Higgs case [27], these large rates are found in the crossing between the global fit and perturbativity bounds, which happens at heavy masses around 10 TeV. Taking the benchmark sensitivities of \(10^{-9}\) for the future linear colliders and assuming a modest improvement in the sensitivities of \(10^{-11}\) at FCC-ee, we see that these rates could be accessible at both experiments for the three LFVZD channels. This is in contrast to the H decays, where the \(H\rightarrow \mu e\) channel is further suppressed due to the small lepton masses. The difference between the Z and H decays comes from the the flavor universality in the LFVZD, as we discussed before.

Interestingly, these future experiments could access not only the high \(M_R\) regime, but also lighter values, meaning that they could be complementary to direct searches at the LHC (for a recent summary, see for instance [43]). Indeed, the FCC-ee could be able to explore the full allowed mass range from the EW scale up to masses above the TeV scale in the \(\tau e\) and \(\tau \mu \) sectors.

Summarizing our findings, we conclude that our effective vertex provides a simple and useful tool for estimating the LFVZD rates in the allowed region, which can be as large as,

$$\begin{aligned}&\mathrm{BR}(Z\rightarrow \mu e) \lesssim ~ 10^{-9}~\big (10^{-13}\big ) \nonumber \\&\quad \mathrm{for}~M_R\sim 10~\mathrm{TeV}~(500~\mathrm{GeV}), \end{aligned}$$
(41)
$$\begin{aligned}&\mathrm{BR}(Z\rightarrow \tau e ) \lesssim ~ 10^{-7}~\big (10^{-10}\big ) \nonumber \\&\quad \mathrm{for}~M_R\sim 10~\mathrm{TeV}~(500~\mathrm{GeV}),\end{aligned}$$
(42)
$$\begin{aligned}&\mathrm{BR}(Z\rightarrow \tau \mu ) \lesssim ~ 10^{-8}~\big (10^{-11}\big ) \nonumber \\&\quad \mathrm{for}~M_R\sim 10~\mathrm{TeV}~(500~\mathrm{GeV}), \end{aligned}$$
(43)

implying that future lepton colliders could probe this kind of low scale seesaw models looking for LFVZD.

5 Conclusions

In this work we have studied the lepton flavor violating decays of the Z boson into two leptons with different flavor. We have computed in full detail the one-loop contributions from the heavy right handed neutrinos to these decays within the inverse seesaw and by using the mass insertion approximation, which works with the electroweak neutrino basis, instead of the usual full one-loop computation that works with the neutrino mass basis. Our analytical results of the involved form factors from the mass insertion approximation are presented explicitly in terms of the relevant inverse seesaw parameters: the right handed neutrino mass, \(M_R\), and the neutrino Yukawa coupling matrix, \(Y_\nu \). The formulas presented here are simple and useful. They contain the LO contributions of \(\mathcal{O}(Y_\nu ^2)\) and the NLO contributions of \(\mathcal{O}(Y_\nu ^4)\), both being relevant for the kind of scenarios that we are interested in with large neutrino Yukawa couplings, \(Y_\nu \sim \mathcal{O}(1)\). We have then presented our computation of the one-loop effective vertex \(Z\ell _k\ell _m\) which is derived from the large \(M_R\) expansion, valid for \(M_R \gg v\), of the form factors and by keeping the first order in this expansion which turns out to be of \(\mathcal{O}(v^2/M_R^2)\). This demonstrates explicitly the decoupling behavior of the heavy right handed neutrinos.

As a very important test of our analytical results, the work has been completed with an explicit demonstration of the gauge invariance of our results for the on-shell effective one-loop vertex.

In the last part we have applied this effective vertex for an easy and accurate estimate of the maximum allowed lepton flavor violating Z decay rates by present data in these low scale seesaw models. The rates found are indeed promising, since they are at the reach of future lepton colliders.