1 Introduction

The success of the Standard Model of particle physics (SM) in describing the experimental data suggests that New Physics (NP) is either heavy or weakly coupled, or that it preferentially interacts with those sectors of the SM that are least constrained by experiment (e.g. the third generation of matter fields, in the case of Minimal Flavour Violation [1]).

Processes that are suppressed or forbidden in the SM, such as lepton flavour violation (LFV) in the charged sector, are smoking gun signals of NP and constitute valuable probes for Beyond Standard Model scenarios. Charged LFV has not yet been observed but is to be expected, since the discovery of neutrino masses and neutrino oscillations represents direct evidence for lepton flavour changing transitions. Searches for charged LFV can provide complementary tests of neutrino mass models, however, charged LFV is also predicted by popular BSM scenarios such as supersymmetry, which are motivated independently of neutrino masses [2].

The current upper limits on the rates of rare muon processes \(\textrm{Br}(\mu \rightarrow e\gamma )< 4\times 10^{-13}\) [3], \(\textrm{Br}(\mu \rightarrow 3e)<10^{-12}\) [4] and \(\textrm{Br}(\mu A\rightarrow e A)< 7\times 10^{-13}\) (\(\mu \rightarrow e\) conversion in nuclei) [5] provide the most stringent constraints in models that allow for \(\mu \rightarrow e\) transitions. The next generation of \(\mu \rightarrow e\) experiments promise an improvement in sensitivity of up to four order of magnitude [6,7,8]. The bounds on LFV processes involving \(\tau \) leptons are less constraining, with \(\textrm{Br}(\tau \rightarrow \ell \gamma ), \textrm{Br}(\tau \rightarrow 3\ell )\lesssim \textrm{few}\times 10^{-8}, \ell = e,\mu \) [9, 10]. Here the experimental sensitivities are expected to improve by a factor \(\sim 10\) in the near future [11].

If both \(\tau \leftrightarrow e\) and \(\tau \leftrightarrow \mu \) couplings are present, there is no symmetry forbidding \(\mu \leftrightarrow e\) processes mediated by the product of \(\mu \rightarrow \tau \) and \(\tau \rightarrow e\) interactions. Considering that the future sensitivities of \(\mu \leftrightarrow e\) and \(\tau \leftrightarrow \ell \) searches approximately satisfy the inequality

$$\begin{aligned} \textrm{Br}(\mu \rightarrow e)\lesssim \textrm{Br}(\tau \rightarrow \mu )\textrm{Br}(\tau \rightarrow e), \end{aligned}$$
(1)

\(\mu \rightarrow e\) observables can probe products of \(\mu \leftrightarrow \tau \) and \(\tau \leftrightarrow e\) couplings which are beyond the reach of direct \(\tau \rightarrow \ell \) searches. If the NP is heavy, model independent \((\mu \rightarrow \tau )\times (\tau \rightarrow e)\) contributions can be calculated in the Standard Model Effective Field Theory (SMEFT) [12]. Since these correspond to the combination of two dimension 6 operators, they only arise at dimension 8. In models, the combination of \(\mu \rightarrow \tau \) and \(\tau \rightarrow e\) can be larger than dimension 8 effects, making it more accessible to \(\mu \rightarrow e\) experiments.

The aim of this article is to explore the sensitivity of \(\mu \rightarrow e\) processes to \(\mu \rightarrow \tau \times \tau \rightarrow e\) transitions in a UV-complete model with light new physics, which we cannot parametrise with the SMEFT. A particularly simple way to extend the SM is to enlarge the SM gauge group by a new spontaneously broken abelian \(U(1)'\) group. This leads to new interactions mediated by a neutral massive vector boson, commonly referred to as a \(Z'\) gauge boson. \(U(1)'\) groups may be remnants of larger non-abelian groups, such as SO(10) or \(E_6\), which feature in certain Grand Unified Theories (GUTs) [13] or they can be considered as standalone extensions, as in the case of \(U(1)_{L_e-L_\mu }\), \(U(1)_{L_\mu -L_\tau }\), \(U(1)_{L_e-L_\tau }\) [14] or \(U(1)_{B-L}\) [15], which are anomaly-free global symmetries of the SM.

LFV \(Z'\) couplings are introduced in such models if the mass and gauge eigenstates of the leptons are misaligned. The stringent bounds from \(\mu \rightarrow e\) transitions [16,17,18] can be avoided in \(U(1)'\) models that allow only for \(\tau \leftrightarrow \ell \), flavour changes, \(\ell =e,\mu \) [19,20,21], which are less constrained.

We propose a \(Z'\) model where \(\mu \rightarrow e\) transitions are mediated by the product of \(\mu \leftrightarrow \tau \) and \(\tau \leftrightarrow e\) couplings only. In this way we suppress \(\mu \rightarrow e\) rates enough to respect the current upper limits, while being in reach of future \(\mu \rightarrow e\) experiments. We gauge \(L_e-L_\mu \),Footnote 1 aiming for a feebly-coupled \(Z'\) with a mass below the electroweak scale, and extend the scalar sector by an extra doublet and a new singlet, both of which are charged under the new \(U(1)'\). At some unknown high energy, the vacuum expectation value (VEV) of the singlet, \(v_S\), breaks \(U(1)_{{L_e}-L_\mu }\), gives the \(Z'\) boson a mass and generates Majorana masses for the sterile neutrinos that are charged under \(U(1)_{{L_e}-L_\mu }\), setting the stage for a type I seesaw neutrino mass model. LFV is introduced via the Yukawa interactions with the new scalars, and the \(Z'\) boson receives flavour changing couplings when the \(U(1)'\) charged doublet acquires a non-vanishing vacuum expectation value (VEV).

A large region of the model’s parameter space is in reach of future \(Z'\) and LFV searches and will be tested in the upcoming years.

This article is organized as follows: In Sect. 2 we present the particle content of the model and define our notation. In Sect. 3 the phenomenology is studied. We briefly review the constraints on non-SM gauge interactions of electrons and muons and give the rates of \(\tau \rightarrow \ell \) and \(\mu \rightarrow e\) processes.

2 The model

We extend the Standard Model gauge group \(SU(3)_c\times SU(2)_L\times U(1)_Y\) by the abelian anomaly-free \(U(1)_{L_1-L_2}\equiv U(1)'\) and consider the particle content summarised in Table 1.

Table 1 Field content of the model. In parentheses we give the representations under the SM gauge group (\(SU(3)_c\), \(SU(2)_L\), \(U(1)_Y\)), in the last column the charges under \(U(1)'\)

Then the most generic renormalisable Lagrangian is

$$\begin{aligned} {\mathcal {L}}={\mathcal {L}}_{\textrm{kin}}-{\mathcal {L}}_{\textrm{Yuk}}-V(H,\phi ,S) \end{aligned}$$
(2)

where

(3)
(4)
(5)

Disregarding \(SU(3)_c\), we use the following conventions for the covariant derivative

$$\begin{aligned} D_\mu =\partial _\mu + ig_1YB_\mu +ig_2 \frac{\tau ^a}{2} W_\mu ^a+ig' Q'B_\mu ' , \end{aligned}$$

where \(g_1, g_2 \) and \(g'\) are the gauge couplings of \(U(1)_Y\), \(SU(2)_L\) and \(U(1)'\), respectively, while \(\tau ^a\) are the Pauli matrices. The label \(\psi \) in Eq. (3) runs over all fermions of the model. We identify the gauge eigenstates 1, 2, 3 by their diagonal Yukawa couplings with the \(U(1)'\) neutral doublet H.

\(U(1)'\) gauge invariance allows for off-diagonal Yukawa interactions of type \(2\leftrightarrow 3\) and \(1\leftrightarrow 3\) with the doublet \(\phi \), but forbids interactions of type \(1\leftrightarrow 2\) among the charged leptons. The off-diagonal Yukawas \(y_{31}, y_{23}\) are the only parameters of the charged lepton sector that introduce LFV. As we discuss in Sect. 2.1, the mass eigenstates \(e,\mu ,\tau \) are nearly aligned with the interaction eigenstates 1, 2, 3, and as a result \(\mu \rightarrow e\) transitions are controlled by the product \(y_{31}\times y_{23}\). Due to the approximate alignment of gauge eigenstates and mass eigenstates, we will use the term lepton flavour in both bases and refer to \(U(1)_{L_1-L_2}\) as \(U(1)_{L_e-L_\mu }\).

The scalar potential parameters can be such that all scalars acquire VEVs, \(\langle {H}\rangle =v_H/\sqrt{2}\), \(\langle {\phi }\rangle =v_\phi /\sqrt{2}\), \(\langle {S}\rangle =v_S/\sqrt{2}\). \(v_H\) and \(v_\phi \) spontaneously break the electroweak gauge symmetry and thus must satisfy

$$\begin{aligned} \sqrt{v_H^2+v_\phi ^2}=v \qquad \textrm{with} ~ v=246 ~ \textrm{GeV}, \end{aligned}$$

whereas either \(v_S\) or \(v_\phi \) can be the VEV that breaks \(U(1)'\) and provides the dominant contribution to the \(Z'\) mass. Since we are gauging the lepton flavor difference \(L_e-L_\mu \), resulting in a \(Z'\) boson that couples to electrons and muons, several experiments can constrain the gauge coupling \(g'\). Values larger than \(g'\sim \textrm{few}\times 10^{-4}\) are excluded in a vast region of the \(Z'\) mass vs. \(g'\) coupling plane. Since, as discussed in Sect. 3, we are aiming at a \(Z'\) mass in the GeV range, we will focus on the limit \(v_S\gg v\).

2.1 Lepton mass basis

Upon spontaneous symmetry breaking, the Yukawa interactions contribute as follows to the mass matrix of the charged leptons:

$$\begin{aligned}&\sum _{i,j\in \lbrace 1,2, 3\rbrace } {\bar{L}}_i {\mathcal {M}}_{ij} e_j + \mathrm {h.c.}=\nonumber \\&\quad = \begin{pmatrix} {\bar{L}}_1&{\bar{L}}_2&{\bar{L}}_3 \end{pmatrix} \begin{pmatrix} \frac{v_H}{\sqrt{2}} y_{11} &{} 0 &{} 0 \\ 0 &{} \frac{v_H}{\sqrt{2}} y_{22} &{} \frac{v_\phi }{\sqrt{2}} y_{2 3} \\ \frac{v_\phi }{\sqrt{2}} y_{3 1} &{} 0 &{} \frac{v_H}{\sqrt{2}} y_{33} \end{pmatrix} \begin{pmatrix} e_1\\ e_2\\ e_3 \end{pmatrix} + \mathrm {h.c.} \end{aligned}$$

Without loss of generality, we can take the Yukawa couplings to be real, as it is possible to absorb the complex phases in the field definitions. As a result, the symmetric matrices \({{\mathcal {M}}}{{\mathcal {M}}}^T\) and \({\mathcal {M}}^T{\mathcal {M}}\) can be diagonalised by the orthogonal matrices \(O^L\) and \(O^R\), respectively,

$$\begin{aligned}&\left( O^L\right) ^T{\mathcal {M}}{\mathcal {M}}^{ T} O^L={\mathcal {M}}_{\textrm{diag}}^2,\\&\left( O^R\right) ^T{\mathcal {M}}^{T} {\mathcal {M}} O^R={\mathcal {M}}_{\textrm{diag}}^2\, , \end{aligned}$$

where \({\mathcal {M}}_{\textrm{diag}}^2=\textrm{diag}\begin{pmatrix} m_e^2&m^2_\mu&m^2_\tau \end{pmatrix}\) has non-negative diagonal entries corresponding to the squared charged lepton masses.

Assuming the flavour off-diagonal Yukawas to be smaller than the flavour conserving ones, the angles that rotate the gauge eigenbasis into the mass eigenbasis can be treated perturbatively, and the orthogonal matrices \(O^{L,R}\) can be written as \(O^{L,R}=R^{L,R}_{13} R^{L,R}_{23}\), where

$$\begin{aligned} R_{13}^{L,R}=\begin{pmatrix} 1 &{} 0 &{} \theta _{13}^{L,R}\\ 0 &{} 1 &{} 0\\ -\theta _{13}^{L,R} &{} 0 &{} 1 \end{pmatrix}, \quad R^{L,R}_{23}=\begin{pmatrix} 1 &{} 0 &{} 0\\ 0 &{} 1 &{} -\theta _{23}^{L,R}\\ 0 &{} \theta _{23}^{L,R} &{} 1 \end{pmatrix}. \end{aligned}$$

At leading order in the ratio of flavour off-diagonal Yukawa couplings and flavour diagonal Yukawa couplings, the rotation angles read

$$\begin{aligned}&\theta _{13}^L\simeq \frac{v_\phi }{v_H}\frac{y_{3 1}y_{11}}{y_{33}^2}\qquad \theta _{23}^L\simeq -\frac{v_\phi }{v_H}\frac{y_{23}}{y_{33}}\nonumber \\&\theta _{13}^R\simeq \frac{v_\phi }{v_H}\frac{y_{3 1}}{y_{33}}\qquad \theta _{23}^R\simeq -\frac{v_\phi }{v_H}\frac{y_{23}y_{22}}{y_{33}^2} \end{aligned}$$
(6)

and the charged lepton masses are given by

$$\begin{aligned} \textrm{diag}\begin{pmatrix} m_e&m_\mu&m_\tau \end{pmatrix} \simeq \frac{v_H}{\sqrt{2}}\ \textrm{diag}\begin{pmatrix} y_{11}&y_{22}&y_{33} \end{pmatrix}\,, \end{aligned}$$
(7)

where \(v_H\) is the VEV of the \(U(1)'\) neutral Higgs doublet. Note that the electron and muon masses receive contributions from the off-diagonal couplings of the form \(\sim v_H y_{11} \theta ^2,\ v_H y_{22}\theta ^2\), hence it is sufficient that the angles \(\theta \) are small to prevent large corrections to Eq. (7).

2.2 Gauge sector

The mass matrix of the neutral gauge bosons can be obtained from the kinetic terms of the scalars in unitary gauge:

$$\begin{aligned}&\left( D_\alpha H^\dagger \right) \left( D^\alpha H\right) {+} \left( D_\alpha \phi ^\dagger \right) \left( D^\alpha \phi \right) {+} \left( D_\alpha S^\dagger \right) \left( D^\alpha S\right) \nonumber \\&\quad \supset \frac{1}{8}\! \begin{pmatrix} B_\alpha \\ W_\alpha ^3 \\ B_\alpha ' \end{pmatrix}^{\!\!T}\!\! \begin{pmatrix} g_1^2 v^2 &{} {-}g_1 g_2 v^2 &{} {-}2 g' g_1 v_\phi ^2\\ {-}g_1 g_2 v^2 &{} g_2^2 v^2 &{} 2 g' g_2 v_\phi ^2\\ {-}2 g' g_1 v_\phi ^2 &{} 2 g' g_2 v_\phi ^2 &{} 4 g^{\prime 2} \left( v_S^2 {+} v_\phi ^2\right) \end{pmatrix}\!\! \begin{pmatrix} B^\alpha \\ W^{3\alpha } \\ B'^\alpha \end{pmatrix} \end{aligned}$$

with \(v^2 \equiv v_H^2 + v_\phi ^2\). If \(M_{Z'}\lesssim 10~\)GeV, \(g'\) must be \(\lesssim 10^{-4}\) to avoid the constraints on electron fifth-force (see Sect. 3.1), leading to suppressed mixing, \(\propto g'\), between the \(B'\) boson and the SM gauge bosons. In the limit \(v_S\gg v\), we approximate the mass matrix as follows:

$$\begin{aligned} \frac{1}{8} \begin{pmatrix} B_\alpha&W_\alpha ^3&B_\alpha ' \end{pmatrix} \begin{pmatrix} g_1^2 v^2 &{} -g_1 g_2 v^2 &{} 0\\ -g_1 g_2 v^2 &{} g_2^2 v^2 &{} 0\\ 0 &{} 0 &{} 4 g^{\prime 2} v_S^2 \end{pmatrix} \begin{pmatrix} B^\alpha \\ W^{3\alpha } \\ B'^\alpha \end{pmatrix}. \end{aligned}$$

The mass eigenstate \(Z'\), with \(M_{Z'}=g'v_s\), is aligned with \(B'\), while the photon A and the Z boson (with \(M_Z=v/2\sqrt{g_1^2+g_2^2}\)) are related to the interaction basis via a rotation by the Weinberg angle \(\theta _W\), with \(\tan \theta _W\equiv g_1/g_2\), just as in the SM.

$$\begin{aligned} \begin{pmatrix} A_\alpha \\ Z_\alpha \\ Z'_\alpha \end{pmatrix}=\begin{pmatrix} \cos \theta _W &{} \sin \theta _W &{} 0\\ -\sin \theta _W &{} \cos \theta _W &{} 0\\ 0 &{} 0 &{} 1 \end{pmatrix}\begin{pmatrix} B_\alpha \\ W^{3}_\alpha \\ B'_\alpha \end{pmatrix} \end{aligned}$$

Note that kinetic mixing of the abelian fields, \(\epsilon B^{\mu \nu } B'_{\mu \nu }\), is not forbidden by any symmetry and introduces a new coupling \(\epsilon \) that leads to \(g'\) independent interactions of the \(Z'\) boson with the SM fermions. If the \(U(1)'\) is the remnant of a larger spontaneously broken non-abelian gauge group, kinetic mixing is absent at tree-level. However, kinetic mixing can always be generated via loops involving fermions that are charged under both abelian groups. These lead to the finite and calculable contribution \(\epsilon _{1-\mathrm loop}\sim g' g_1/(16 \pi ^2)\log (m_\mu /m_e)\). In order to obtain a more predictive model, we consider vanishing tree-level \(\epsilon \) but loop-induced kinetic mixing.

The lepton mass eigenstates are related to the gauge eigenstates via \(L_i=O^L_{ i\alpha } L_\alpha \), \(e_i=O^R_{i\alpha } \alpha \) with \(\alpha \in \{e,\mu ,\tau \}\) and \(i\in \{1,2,3\} \). Whereas the flavour universal photon and Z couplings are unaffected by these unitary transformations, the lepton flavour non-universal \(Z'\) couplings are sensitive to the misalignment of the lepton gauge and mass eigenbases and receive flavour changing contributions. The \(Z'\) interactions with the charged leptons take the form

$$\begin{aligned} {\mathcal {L}}^{\textrm{LFV}}_{Z'}&\,= \,{-}g'Z'_\rho \big [ \theta ^L_{13}({\overline{L}}_e \gamma ^\rho L_\tau {+}\tau {\leftrightarrow } e)\nonumber \\&\qquad {+}\theta ^L_{23}({\overline{L}}_\mu \gamma ^\rho L_\tau +\tau {\leftrightarrow } \mu ) {+}\theta ^L_{13}\theta ^L_{23}({\overline{L}}_e \gamma ^\rho L_\mu {+}\mu \leftrightarrow e)\nonumber \\&\qquad {+}\theta ^R_{13}({\overline{e}} \gamma ^\rho \tau {+}\tau {\leftrightarrow } e) {+}\theta ^R_{23}({\overline{\mu }} \gamma ^\rho \tau +\tau {\leftrightarrow } \mu )\nonumber \\&\qquad +\theta ^R_{13}\theta ^R_{23}({\overline{e}} \gamma ^\rho \mu +\mu \leftrightarrow e)\big ] \end{aligned}$$
(8)

where \(\theta ^{L,R}\) are the rotation angles given in Eq. (6). The \(\mu \rightarrow e\) couplings are proportional to the product of the off-diagonal \(y_{31}\times y_{23}\) Yukawas that parametrise the \(\tau \leftrightarrow e\) and \(\tau \leftrightarrow \mu \) flavour changes. Note that the products \(\theta _{13}^L \theta _{23}^L\) and \(\theta _{13}^R \theta _{23}^R\) in the tree-level \(\mu \rightarrow e\) \(Z'\) interactions are suppressed by the SM electron and muon Yukawas respectively. Only the combination \(\theta _{13}^R \theta _{23}^L\) is sizeable.

2.3 Scalar sector

We supplement the SM Higgs sector by two new scalars, an \(SU(2)_L\) doublet \(\phi \) and a singlet S, both of which are charged under \(U(1)'\). Assuming the singlet to be heavier than the doublet, we integrate it out, reducing the scalar sector to that of a regular two Higgs Doublet Model (2HDM) with the doublets H and \(\phi \). Since the VEV of the singlet breaks \(U(1)'\), the scalar potential now features all \(SU(2)_L\otimes U(1)_Y\) gauge invariant potential terms, including \(U(1)'\) breaking interactions, which are generated via singlet VEV insertions:

$$\begin{aligned} {\mathcal {V}}^{\textrm{eff}}&= M_{HH}^2 H^\dagger H + M_{\phi \phi }^2 \phi ^\dagger \phi -M_{H\phi }^2\left( H^\dagger \phi +\phi ^\dagger H\right) \nonumber \\&\quad +\frac{1}{2}\Lambda _{H}\left( H^\dagger H\right) ^2 + \frac{1}{2}\Lambda _{\phi }\left( \phi ^\dagger \phi \right) ^2 \nonumber \\&\quad + \Lambda _{H\phi } \left( H^\dagger H\right) \left( \phi ^\dagger \phi \right) + {\tilde{\Lambda }}_{H\phi } \left( H^\dagger \phi \right) \left( \phi ^\dagger H\right) \nonumber \\&\quad + \frac{1}{2}\Lambda _a \left( \left( H^\dagger \phi \right) ^2 + \left( \phi ^\dagger H\right) ^2\right) \nonumber \\&\quad + \Lambda _b \left( H^\dagger H\right) \left( \left( H^\dagger \phi \right) +\left( \phi ^\dagger H\right) \right) \nonumber \\&\quad + \Lambda _c \left( \phi ^\dagger \phi \right) \left( \left( H^\dagger \phi \right) +\left( \phi ^\dagger H\right) \right) . \end{aligned}$$
(9)

It is particularly convenient to rotate into the so-called Higgs basis, where only one of the doublets acquires a vacuum expectation value:

$$\begin{aligned} \begin{pmatrix} H_1\\ H_2 \end{pmatrix}=\begin{pmatrix} \cos \beta &{} \sin \beta \\ -\sin \beta &{} \cos \beta \end{pmatrix}\begin{pmatrix} H\\ \phi \end{pmatrix}\qquad \textrm{with}\ \tan \beta \equiv v_\phi /v_H \end{aligned}$$

\(\langle {H_1}\rangle =v/\sqrt{2}\), with \(v=\sqrt{v_\phi ^2+v_H^2}\), and \(\langle {H_2}\rangle =0\). We relabel the potential parameters as follows

$$\begin{aligned} V(H_1,H_2)&=m^2_{11}H^\dagger _{1}H_1+m^2_{22}H^\dagger _{2}H_2-m^2_{12}(H^\dagger _1H_2+\mathrm {h.c})\nonumber \\&\quad +\frac{\lambda _1}{2}(H^\dagger _1 H_1)^2+\frac{\lambda _2}{2}(H^\dagger _2 H_2)^2\nonumber \\&\quad +\lambda _3 (H^\dagger _1 H_1)(H^\dagger _2 H_2)+\lambda _4(H^\dagger _1 H_2)(H^\dagger _2 H_1)\nonumber \\&\quad +\left( \frac{\lambda _5}{2}(H^\dagger _1 H_2)^2 +\lambda _6(H^\dagger _1 H_1)(H_1H^\dagger _2)\right. \nonumber \\&\left. \quad \quad +\lambda _7(H^\dagger _2 H_2)(H_1H^\dagger _2)+\mathrm {h.c}\right) , \end{aligned}$$
(10)

and expand the doublets in terms of the canonically normalized scalar fields

$$\begin{aligned} H_1=\begin{pmatrix} G^{+} \\ \frac{1}{\sqrt{2}}(v+H^0_1+iG^0) \end{pmatrix}, \quad H_2=\begin{pmatrix} H^{+} \\ \frac{1}{\sqrt{2}}(H^0_2+iA) \end{pmatrix}. \end{aligned}$$

Once the Goldstone bosons G are eaten by the electroweak gauge bosons, the spectrum contains one charged scalar \(H^+\), two CP even scalars \(H^0_1,H^0_2\) and one CP odd neutral scalar A. If the potential parameters are real, the physical states are CP eigenstates and only the two CP even neutral scalars, \(H_1^0\) and \(H_2^0\), can mix. The mass matrix of \(H_1^0\) and \(H_2^0\) is diagonalised by the angle \(\beta -\alpha \). We identify two scalar mass-eigenstates

$$\begin{aligned} h= \sin (\beta -\alpha ) H^0_1 + \cos (\beta -\alpha )H^0_2\\ \rho =\cos (\beta -\alpha )H^0_1-\sin (\beta -\alpha ) H^0_2 \end{aligned}$$

with the masses \(m_h\) and \(m_{\rho }\), respectively. The angle \(\beta -\alpha \) can be written in terms of \(\lambda _6\), which is defined in Eq. (10), as well as the scalar masses \(m_h\) and \(m_\rho \) [22, 23]:

$$\begin{aligned} \cos (\beta -\alpha )\sin (\beta -\alpha )\equiv c_{\beta \alpha }s_{\beta \alpha }=-\frac{\lambda _6v^2}{(m^2_{\rho }-m_h^2)} \end{aligned}$$
(11)

The decoupling limit of the 2HDM is obtained when the mass term of the VEV-less doublet \(H_2\) satisfies the condition \(m_{22}\equiv M\gg v\). The doublet mass terms in the Higgs basis, \(m_{11}\), \(m_{12}\), \(m_{22}\) (see Eq. (10)), are related to the mass matrix in the \(H,\phi \) basis (see Eq. (9)) in the following way

$$\begin{aligned} m^2_{11}= & {} c_\beta ^2\left( M^2_{HH}+M^2_{\phi \phi } t_\beta ^2-M^2_{H\phi }\frac{2 t_\beta }{\sqrt{1+t_\beta ^2}} \right) \end{aligned}$$
(12)
$$\begin{aligned} m^2_{22}= & {} c_\beta ^2\left( M^2_{\phi \phi }+M^2_{H H} t_\beta ^2+M^2_{H\phi }\frac{2 t_\beta }{\sqrt{1+t_\beta ^2}} \right) \end{aligned}$$
(13)
$$\begin{aligned} m^2_{12}= & {} \frac{1}{2}\left( M^2_{HH}-M^2_{\phi \phi }\right) s_{2\beta }+M^2_{H\phi }c_{2\beta } . \end{aligned}$$
(14)

The potential minimum conditions imply

$$\begin{aligned} m^2_{11}=-\frac{1}{2}\lambda _1 v^2\qquad m^2_{12}=\frac{1}{2}\lambda _6 v^2. \end{aligned}$$
(15)

Consequently, if we assume \(\lambda _1\) and \(\lambda _6\) to be perturbative couplings, \(m_{11}\) and \(m_{12}\) are sub-electroweak masses. The decoupling condition \(m_{22}\gg v\) is satisfied if \(M_{HH},M_{\phi \phi },\) \(M_{H\phi }\gg v\), however, in this case the minimum potential conditions require a fine-tuned cancellation in Eqs. (1213) and (14). If, on the other hand, \(\beta \ll 1\), the mass configuration \(M_{H}\sim v\) and \(M_{\phi \phi },M_{H\phi }\gg v\) leads to natural minimum potential conditions in the decoupling limit. To avoid fine-tuned spontaneous symmetry breaking, we consider \(\beta \ll 1\).

Once \(H_2\) is decoupled, we identify the light CP even scalar h as the 125-GeV Higgs boson, while the orthogonal state \(\rho \) has a mass \(m_{\rho }\sim m_{22}\sim M\). In the decoupling limit, the relation [22]

$$\begin{aligned} m_{\rho }^2-m^2_{A}=(\lambda _5+\lambda _1) v^2-m_h^2, \end{aligned}$$

implies that the pseudoscalar A and the scalar \(\rho \) are approximately degenerate, with a mass-splitting of order \(v^2\).

In the lepton mass basis, the doublet \(H_1\), which is aligned with the electroweak vacuum, has diagonal Yukawa interactions with the leptons, while the \(H_2\) couplings are in general flavour-changing. Written in terms of the Yukawa couplings defined in Eq. (4), the lepton Yukawa sector in the Higgs basis reads

$$\begin{aligned} {\mathcal {L}}_{\textrm{Yuk}}=\frac{\sqrt{2} \delta _{ij} m_{i}}{v}{\bar{L}}_i e_i H_1 + (O^T_{L} Y_2 O_R)_{ij}{\bar{L}}_i e_j H_2 +\mathrm {h.c}, \end{aligned}$$

where the sum over \(i,j\in \lbrace e,\mu ,\tau \rbrace \) is understood and

$$\begin{aligned} Y_2=\begin{pmatrix} -y_{11} \sin \beta &{} 0 &{} 0 \\ 0 &{} -y_{22} \sin \beta &{} y_{23}\cos \beta \\ y_{31}\cos \beta &{} 0 &{} -y_{33}\sin \beta \end{pmatrix}. \end{aligned}$$

Defining \({\mathcal {Y}}\equiv O^T_{L} Y_2 O_R \), the couplings of the neutral scalar mass eigenstates to the leptons take the form

$$\begin{aligned} {\mathcal {L}}^{\textrm{neut}}_{\textrm{Yuk}}&= \frac{h}{\sqrt{2}}{\bar{e}}_i\left[ s_{\beta \alpha }\frac{\sqrt{2} \delta _{ij} m_{i}}{v}+c_{\beta \alpha }({\mathcal {Y}}_{ij}P_R+{\mathcal {Y}}^\dagger _{ij}P_L) \right] e_j\nonumber \\&\quad +\frac{\rho }{\sqrt{2}}{\bar{e}}_i\left[ c_{\beta \alpha }\frac{\sqrt{2} \delta _{ij} m_{i}}{v}-s_{\beta \alpha }({\mathcal {Y}}_{ij}P_R+{\mathcal {Y}}^\dagger _{ij}P_L) \right] e_j\nonumber \\&\quad +\frac{iA}{\sqrt{2}}{\bar{e}}_i\left[ {\mathcal {Y}}_{ij}P_R-{\mathcal {Y}}^\dagger _{ij}P_L \right] e_j. \end{aligned}$$
(16)

Since the rate of \(h\rightarrow \tau ^+\tau ^-\) measured at the LHC [24, 25] is compatible with the Standard Model prediction, we require \(s_{\beta \alpha }\sim 1\). As a result, the flavour-changing couplings of the Higgs boson h are suppressed by

$$\begin{aligned} c_{\beta \alpha }\simeq -\frac{\lambda _6v^2}{(m^2_{\rho }-m_h^2)}\ll 1\ \rightarrow \ c_{\beta \alpha }\simeq -\frac{\lambda _6 v^2}{M^2}. \end{aligned}$$

3 Phenomenology

In this section we discuss the phenomenological signatures of our model. In Sect. 3.1 we briefly review the experimental constraints on \(L_e-L_\mu \) gauge interactions, before focussing on the LFV phenomenology in Sect. 3.2. In Sect. 3.3 we address neutrino mixing and the generation of neutrino masses.

3.1 \(L_e-L_\mu \) gauge interactions

Several experiments search for \(Z'\) bosons that interact with SM particles. In the absence of tree-level kinetic mixing, these searches directly probe the size of the gauge coupling \(g'\). Since in our model the \(Z'\) interactions arise from gauging the lepton flavour difference \(L_e-L_\mu \), \(g'\) is primarily constrained by bounds on \(Z'ee\) and \(Z'\nu \nu \) [26, 27].

In the \(Z'\) mass range of \(10\ \textrm{MeV} \lesssim M_{Z'}\lesssim 1 \) GeV, the strongest constraints on the \(U(1)'\) gauge coupling come from electron beam dump experiments [30, 31], neutrino oscillation experiments [26] and neutrino scattering experiments [32, 33], while cosmological and astrophysical limits are more relevant for lighter \(Z'\) bosons [34,35,36]. Muonium spectroscopy can also probe for \(Z'\) bosons in the sub-MeV mass range [37]. For larger masses, \(M_{Z'}\gtrsim 1 \) GeV, colliders are the most sensitive probes of a fifth force. Below 10 GeV, BaBar [28, 29] set an upper limit of \(g'\lesssim 10^{-4}\) by searching for \(Z'\) production in combination with a single photon, \(e^+e^-\rightarrow \gamma Z'\rightarrow \gamma \ell ^+\ell ^-\). Belle II is expected to push this limit down to \(g'\sim 10^{-5}\) [38, 39]. \(Z'\) bosons with masses beyond \(M_{Z'}\sim 10 \) GeV can be produced in the decay of an excited meson produced at the LHC, however the process occurs via loop induced kinetic mixing and the bound on the \(U(1)'\) gauge coupling is only \(g'\lesssim 10^{-1}-10^{-2}\) [40, 41].

3.2 Lepton flavour violation

3.2.1 \(\tau \rightarrow \mu \) and \(\tau \rightarrow e\) transitions

Via the lepton flavour changing \(Z'\) couplings in Eq. (8), \(\tau \) leptons can decay into final states such as 3e, \(3\mu \), \(\mu 2e\) or \(e 2\mu \). In Fig. 2, we show the diagram for the tree-level \(Z'\) mediated decay of \(\tau \) into \(3\ell \), where \(\ell =e,\mu \). For a \(Z'\) lighter than the mass difference \(m_\tau -m_\ell \), \(\tau \) leptons can decay into an on-shell \(Z'\) and a charged lepton \(\ell \). If in addition \(M_{Z'}>2m_\ell \), the \(Z'\) boson can decay into a lepton pair \(\ell ^+\ell ^-\). If this happens before the \(Z'\) escapes the detector, a three-lepton final state may be measured as a consequence of the decay chain \(\tau \rightarrow Z' \ell \rightarrow \ell ^+ \ell ^-\ell \). In the narrow-width approximation, \(\mathrm {\Gamma }/M_{Z'}\lesssim g'^2\ll 1\), the Breit-Wigner distribution can be approximated by a \(\delta -\)function, resulting in a factorized rate

$$\begin{aligned} \mathrm {\Gamma }(\tau \rightarrow 3\ell )\simeq \mathrm {\Gamma }(\tau \rightarrow Z'\ell )\times \textrm{Br}(Z'\rightarrow \ell ^{+} \ell ^{-}). \end{aligned}$$

If the muon and electron masses can be neglected with respect to the \(Z'\) mass, we can take \(\textrm{Br}(Z'\rightarrow \ell ^{+} \ell ^{-})\sim 1/3\) and Footnote 2

$$\begin{aligned} \mathrm {\Gamma }(\tau \rightarrow Z' \ell ){=}\!\!\!\sum _{X=L,R}\!\!\!\frac{\bigg (g'\theta ^X_{i3}\bigg )^2}{64\pi }\frac{m^3_\tau }{M_{Z'}^2} \!\left( 1{-}\frac{M_{Z'}^{2}}{m^2_\tau }\!\right) ^{\!\!2}\!\left( 1{+}2\frac{M^2_{Z'}}{m^2_\tau }\!\right) , \end{aligned}$$
Fig. 1
figure 1

Figure taken from Ref. [27] showing the current bounds on the space of \(M_{Z'}\) vs. \(g'\) in the case of gauged \(L_e-L_\mu \). As described in the text, we consider three benchmark scenarios, one of which involves a \(Z'\) boson with a mass \(M_{Z'}\sim 10~\)GeV and a coupling \(g'\sim 10^{-4}\) and is in reach of Belle-II, another benchmark with \(M_{Z'}\sim 15~\)GeV, which avoids the bounds from BaBar [28, 29], allowing for a gauge coupling of \(g'\sim 2.5\times 10^{-3}\) and finally the case of a long-lived \(Z'\) with a mass \(\text {MeV}\lesssim M_{Z'}<m_\mu -m_e\) and a gauge coupling of \(g'\sim 10^{-8}\) which decays outside the Belle-II detector

where \(i=1,2\) for \(\ell =e,\mu \) respectively. (The mixing angles \(\theta \) are given in terms of the Lagrangian parameters in Eq. (6)). In the case of on-shell \(Z'\) production, the rate of \(\mathrm {\Gamma }(\tau \rightarrow 3\ell )\) scales with \(g'^2\) rather than with the \(g'^4\) that one would naively expect from the tree-level \(Z'\) exchange shown in Fig. 2. Consequently, the bounds on the flavour off-diagonal Yukawas from \(\textrm{Br}(\tau \rightarrow 3e)<2.7\times 10^{-8}\), \(\textrm{Br}(\tau \rightarrow 3\mu )<2.1\times 10^{-8}\) are stringent enough to suppress all other LFV signals. Indeed, taking \(M_{Z'}=1~\)GeV and \(g'=10^{-4}\), the upper limit \(\textrm{Br}(\tau \rightarrow 3\mu )<2.1\times 10^{-8} \) implies \( y_{23}<3\times 10^{-7} (\sin \beta )^{-1}\). As will become apparent in the following section, \(\mu \rightarrow e\) processes cannot further constrain the model in this case.

If the \(Z'\) is sufficiently long-lived to escape the detector, upper-limits on the off-diagonal Yukawas can be inferred from the lepton-flavour violating decays \(\tau \rightarrow \ell +\textrm{invisible}\). The \(Z'\) decay length exceeds \(\sim 1\) m when

$$\begin{aligned} (g'M_{Z'})^2\lesssim 2\times 10^{-15}\ \textrm{GeV}^2. \end{aligned}$$
(17)

In the range \(0.1\ \textrm{GeV}\lesssim M_{Z'}\lesssim 1.6\ \textrm{GeV}\), the extremal \(g'\) values compatible with Eq. (17) are \(g'(0.1\ \textrm{GeV})\sim 5\times 10^{-7}\) and \(g'(1.6\ \textrm{GeV})\sim 3\times 10^{-8}\). Note that these values are not excluded, see Fig. 1. Considering \(M_{Z'}=0.1\) GeV, the upper limit \(\textrm{Br}(\tau \rightarrow \ell +\mathrm{inv.})/\textrm{Br}(\tau \rightarrow \ell \nu \nu )\lesssim 10^{-3}\) [43] implies

$$\begin{aligned} \theta _{i3}^X\lesssim 3\times 10^{-2}, \quad i=1,2,\quad X=L,R. \end{aligned}$$
(18)

For \(0.01\ \textrm{GeV}\lesssim M_{Z'}\lesssim 0.1\) GeV, the allowed couplings for \(Z'\) bosons that escape the detector are \(g'\lesssim 10^{-8}\) and the constraint on the mixing angle arising from \(\tau \rightarrow \ell +\mathrm{inv.}\) is \(\theta _{i3}\lesssim 0.1\). Smaller \(Z'\) masses are tightly constrained by BBN [44].

If \(M_{Z'}>m_\tau \), the \(\tau \rightarrow 3\ell \) decay is mediated at tree-level by an off-shell \(Z'\) boson. Even for a \(Z'\) mass of \(M_{Z'}\sim 5\) GeV, treating the \(Z'\) exchange as a contact interaction between four leptons is sufficient for an estimation of the decay rate to a \(\sim 10\%\) accuracy. Integrating out the \(Z'\) boson at the scale of \(M_{Z'}\), the four-lepton operators

Fig. 2
figure 2

\(Z'\) mediated \(\tau \rightarrow 3\ell \) decay, with \(\ell =e,\mu \). If \(2m_\ell<M_{Z'}<m_\tau -m_\ell \), the decay can happen via the on-shell production of the \(Z'\) and its subsequent decay into a lepton pair

$$\begin{aligned} {\mathcal {O}}_{V,XY}=({\bar{\ell }} \gamma ^\alpha P_X \tau )({\bar{\ell }} \gamma _\alpha P_Y\ell ) \end{aligned}$$
(19)

are generated with the following coefficients

$$\begin{aligned} C^{e \tau e e}_{V,XY}=-\frac{g'^2\theta ^X_{13}}{M^2_{Z'}}\qquad C^{\mu \tau \mu \mu }_{V,XY}=\frac{g'^2\theta ^X_{23}}{M^2_{Z'}}. \end{aligned}$$

The decay rate of \(\tau \rightarrow 3\ell \) is then given by

$$\begin{aligned} \textrm{Br}\left( \tau \rightarrow 3\ell \right)&=\frac{m_\tau ^5 }{1536 \,\pi ^3\, \mathrm {\Gamma }_\tau } \left( 2 \left| C^{\ell \tau \ell \ell }_{V,LL}\right| ^2+\left| C^{\ell \tau \ell \ell }_{V,LR}\right| ^2\nonumber \right. \\&\left. \quad \quad \quad \quad \quad \quad \quad \quad \quad +L\leftrightarrow R \right) , \end{aligned}$$
(20)

where \(\mathrm {\Gamma }_\tau \) is the total decay width of the \(\tau \). Here we are neglecting the QED running from \(M_{Z'}\) to the \(\tau \) mass.

Contributions from the scalar sector arise from diagrams similar to the one depicted in Fig. 2, but with a neutral scalar exchange. However, the flavour-diagonal current couples to scalars via the light lepton Yukawas, and LFV Higgs decays are more sensitive to \(\tau \leftrightarrow \ell \) flavour changing Yukawa couplings than the Yukawa suppressed \(\tau \rightarrow 3\ell \) decay [45]. The width for the LFV Higgs decay is given by [46]

$$\begin{aligned} \mathrm {\Gamma }(h\rightarrow \tau \ell )= \frac{\vert {{\mathcal {Y}}_{\ell \tau }} \vert ^2+\vert {{\mathcal {Y}}_{\tau \ell }} \vert ^2}{16\pi }c_{\beta \alpha }^2 m_h \end{aligned}$$

with \({\mathcal {Y}}\) as defined in Eq. (16). The allowed region in the \(y_{31}-y_{23}\) plane is plotted in Fig. 3, which shows that \(\textrm{Br}(\tau \rightarrow 3e)<2.7\times 10^{-8}\), \(\textrm{Br}(\tau \rightarrow 3\mu )<2.1\times 10^{-8}\) and \(\textrm{Br}(h\rightarrow \tau \mu )<1.5\times 10^{-3}, \textrm{Br}(h\rightarrow \tau e)<2.2\times 10^{-3}\) [45] are all compatible with perturbative Yukawa couplings. These bounds are not able to constrain \(y_{31}\) and \(y_{23}\) to values smaller than \(y_{33}/t_\beta \sim 1/10\), as required in Eq. (6).

Fig. 3
figure 3

Considering \(g'=10^{-4}\), \(M_{Z'}=10~\)GeV, \(t_\beta =\frac{1}{10}\) and \(M=10~\)TeV, we plot the values of the off-diagonal couplings that saturate the current upper limits on the \(\tau \) LFV branching ratios. Since the limit of small mixing (see Eq. (6)) assumes much smaller Yukawas, we conclude that lepton flavour violation in the \(\tau \) sector cannot constrain the off-diagonal couplings \(y_{31}\) and \(y_{23}\) in this region of parameter space

3.2.2 \(\mu \rightarrow e\) transitions

We now turn to the \(\mu \rightarrow e\) phenomenology. In presence of the flavour mixing angles of Eq. (6), the \(Z'\) boson can couple at tree-level to \(\mu \rightarrow e\) currents, however these couplings only arise at second order in the mixing angles and are suppressed by the electron and muon Yukawas: the left-handed and right-handed vector currents come with the products of mixing angles \(\theta ^L_{13}\theta ^L_{23}\) and \(\theta ^R_{13}\theta ^R_{23}\) respectively, where

$$\begin{aligned} \theta _{23}^L&= \tan \beta \frac{y_{23}}{y_{33}}\qquad { \theta ^R_{23}=\theta _{23}^L\times \frac{y_{22}}{y_{33}}}\nonumber \\ \theta _{13}^R&= \tan \beta \frac{y_{3 1}}{y_{33}}\qquad {\theta ^L_{13}=\theta _{13}^R\times \frac{y_{11}}{y_{33}}}. \end{aligned}$$
(21)

If the \(Z'\) is light enough to permit the decay \(\mu \rightarrow e Z'\), the experimentally allowed values for the gauge coupling are \(g'\lesssim 10^{-8}\). Although in the mass range \(2m_e\lesssim M_{Z'}\lesssim m_{\mu }-m_e\) the \(Z'\) could subsequently decay into an electron positron pair, the decay is not fast enough to happen inside the SINDRUM detector, such that the bound \(\textrm{Br}(\mu \rightarrow 3e)<10^{-12}\) [4] does not apply to the decay chain \(\mu \rightarrow e (Z'\rightarrow {\bar{e}} e)\). On the other hand, the non-observation of \(\mu \rightarrow e+\mathrm{inv.}\) by the TWIST collaboration sets an upper limit \(\textrm{Br}(\mu \rightarrow e Z')<8.1\times 10^{-6}\) [47] in this region of the parameter space. The \(Z'\) primarly couples to a right-handed \(\mu \rightarrow e\) current, while the left-handed couplings are suppressed by the electron Yukawa. The TWIST upper bound \(\textrm{Br}(\mu \rightarrow e Z')<8.1\times 10^{-6}\) on the \(\mu \rightarrow e Z'\) rate

$$\begin{aligned} \mathrm {\Gamma }(\mu {\rightarrow } Z' e){=}\frac{\left( g'\theta ^R_{13}\theta ^L_{23}\frac{y_{22}}{y_{33}}\right) ^2}{64\pi } \frac{m^3_\mu }{M_{Z'}^2}\!\left( 1{-}\frac{M_{Z'}^{2}}{m^2_\mu }\!\right) ^{\!\!2}\!\left( 1{+}2\frac{M^2_{Z'}}{m^2_\mu }\!\right) \!, \end{aligned}$$

implies \(\theta ^R_{13}\theta ^L_{23}\lesssim 10^{-3}\), taking \(g'\sim 10^{-8}, M_{Z'}\sim 1\) MeV. This is beyond the reach of \(\tau \rightarrow e+\text {inv.} \) and \( \tau \rightarrow \mu +\text {inv.}\) combined.

Fig. 4
figure 4

a Tree-level \(Z'\) exchange contributing to \(\mu \rightarrow 3e\). b \(Z'\) penguin diagram contributing to \(\mu \rightarrow e\gamma \)

For \(M_{Z'}>m_\mu \), the \(\mu \rightarrow 3e\) decay, upon which the bound \(\textrm{Br}(\mu \rightarrow 3e)<10^{-12}\) applies, receives contributions from the diagram of Fig. 4a. The effective four-fermion interactions that result from integrating out the \(Z'\) boson, are given by

$$\begin{aligned} C^{e\mu e e}_{V,XY}=-\frac{g'^2\theta ^X_{13}\theta ^X_{23}}{M^2_{Z'}}. \end{aligned}$$
(22)

Contributions of the dipole, which comes with the coupling product \(\theta ^R_{13}\theta ^L_{23}\) and avoids the suppression by the SM electron and muon Yukawas, can also be relevant. For instance, the penguin diagram of Fig. 4b contributes to the \(\mu \rightarrow e\) photon dipole

$$\begin{aligned} {\mathcal {O}}^{e\mu }_{D,X}=m_\mu ({\bar{e}}\sigma _{\alpha \beta }P_X \mu )F^{\alpha \beta }, \end{aligned}$$
(23)

with a coefficient

$$\begin{aligned} (C^{e\mu }_{D,L})_{peng}=-\frac{3 e}{48\pi ^2 M^2_{Z'}}\left( \frac{m_\tau }{m_\mu }\right) g'^2\theta ^R_{13}\theta ^L_{23}. \end{aligned}$$
(24)

In addition to the penguin, loop diagrams with scalars can give sizeable contributions to the dipole. In the diagrams of Fig. 5a, the mass insertion flips the chirality in the virtual \(\tau \) line. These diagrams contribute to the dipole coefficient as follows [48]

$$\begin{aligned}&(C^{e\mu }_{D,L})_\mathrm {1-loop} \nonumber \\&\quad =-\frac{e}{64\pi ^2}\left( \frac{m_\tau }{m_\mu }\right) \bigg [\frac{{\mathcal {Y}}^*_{\mu \tau }{\mathcal {Y}}^*_{e\tau }}{m^2_h}c^2_{\beta \alpha }\left( 2\log \left( \frac{m_\tau }{m_h}\right) +\frac{3}{2}\right) \nonumber \\&\qquad \qquad \qquad \qquad \qquad +\frac{{\mathcal {Y}}^*_{\mu \tau }{\mathcal {Y}}^*_{e\tau }}{m^2_\rho }s^2_{\beta \alpha } \left( 2\log \left( \frac{m_\tau }{m_\rho }\right) +\frac{3}{2}\right) \nonumber \\&\qquad \qquad \qquad \qquad \qquad -\frac{{\mathcal {Y}}^*_{\mu \tau }{\mathcal {Y}}^*_{e\tau }}{m^2_A} \left( 2\log \left( \frac{m_\tau }{m_A}\right) +\frac{3}{2}\right) \bigg ], \end{aligned}$$

with \({\mathcal {Y}}\) as defined in Eq. (16). \(C_{D,R}\) is obtained from \(C_{D,L}\) by replacing \({\mathcal {Y}}\leftrightarrow {\mathcal {Y}}^\dagger \). When the loop is closed by the light Higgs h, each flavour changing vertex is proportional to \(c_{\beta \alpha }\), resulting in a contribution suppressed by four powers of the heavy Higgs mass, \(c^2_{\beta \alpha }\sim 1/M^4\). Moreover, in the decoupling limit, the contributions of the heavy scalar \(\rho \) and the pseudoscalar A cancel in the above equation, since their masses are degenerate at leading order in \(1/M^2\). The total amplitude ends up being a sub-dominant \(1/M^4\) effect.

Fig. 5
figure 5

Loop diagrams with neutral scalars contributing to the rate of \(\mu \rightarrow e \gamma \). \(\Phi =h,\rho , A\)

The misalignment of the lepton mass eigenbasis and the lepton gauge eigenbasis also affects the scalar couplings. Whereas the \(Z'\) boson acquires \(\mu \leftrightarrow e\) couplings proportional to \(\propto \theta ^L_{13} \theta ^L_{23}\) or \(\propto \theta ^R_{13} \theta ^R_{23}\), the scalars acquire \(\mu \rightarrow e\) couplings proportional to \(\theta ^R_{13} \theta ^L_{23}\).

It is well known that the Barr–Zee diagrams of Fig. 5b,c give the leading contribution to \(\mu \rightarrow e \gamma \) in 2HDM models with \(\mu \rightarrow e\) Yukawa couplings [49]. The branching ratio of \(\mu \rightarrow e \gamma \) is given by

$$\begin{aligned} \textrm{Br}(\mu \rightarrow e \gamma )=\frac{m_\mu ^5}{4\pi \mathrm {\Gamma }_{\mu }}\left( \vert {C^{e\mu }_{D,L}} \vert ^2+\vert {C^{e\mu }_{D,R}} \vert ^2\right) , \end{aligned}$$
(25)

where \(\mathrm {\Gamma }_{\mu }\simeq G_F^2m_\mu ^5/(192\pi ^3)\) is the total decay rate of the muon, and, to a good approximation,

$$\begin{aligned} C^{e\mu }_{D,X}=(C^{e\mu }_{D,X})_\textrm{peng}+(C^{e\mu }_{D,X})_\mathrm {t-loop}+(C^{e\mu }_{D,X})_\mathrm {W-loop} \end{aligned}$$
(26)

where \((C^{e\mu }_{D,X})_\mathrm {t-loop}\) and \((C^{e\mu }_{D,X})_\mathrm {W-loop}\) correspond to the two-loop dipole contributions of Fig. 5b, c, respectively, and are defined in Appendix A. Note that these diagrams scale like \(1/M^2\) (see also Fig. 7).

For the \(\mu \rightarrow 3e\) rate we consider the tree-level diagram of Fig. 4 and the dipole contributions arising from attaching an electron current to the photon. We find

$$\begin{aligned}&\textrm{Br}\left( \mu {\rightarrow } 3e\right) \nonumber \\&\quad {=}{-}\frac{e^2\, m_\mu ^5 }{192\, \pi ^3\,\mathrm {\Gamma }_\mu }\left( |C_{D,L}^{e\mu }|^2{+}|C_{D,R}^{e\mu }|^2\right) \left( 4 \log \left( \frac{m_e^2}{m_\mu ^2}\right) {+}11\right) \nonumber \\&\qquad {+}\frac{m_\mu ^5 }{1536 \,\pi ^3\, \mathrm {\Gamma }_\mu } \left( 2 \left| C^{e\mu ee}_{V,LL}\right| ^2+\left| C^{e\mu ee}_{V,LR}\right| ^2 +L\leftrightarrow R\right) \\&\qquad +\!\frac{e m_\mu ^5}{192 \pi ^3 \mathrm {\Gamma }_\mu } \!\Big (\textrm{Re}\left[ C_{D,R}^{e\mu *} \!\left( 2\, C^{e\mu ee}_{V,LL}\!\!+\!\! C^{e\mu ee}_{V,LR}\right) \right] \!\!+\!\!L\leftrightarrow R\Big ), \end{aligned}$$

with \(C^{e\mu ee}_{V,X}\) as defined in Eq. (22).

LFV interactions with quarks via \(Z'\) exchange arise from loop induced kinetic mixing or via penguin diagrams with an external quark current. The kinetic mixing is both loop-suppressed and suppressed by the Yukawa couplings featuring in the \(Z'\mu e \) vertex, leading to a sub-dominant contribution. The relevant scalar contributions are the tree-level exchanges with the u, d and s quarks in the nuclei, the Barr-Zee dipoles that already featured in Eq. (26) and are discussed in Appendix 1, as well as the diagrams in Fig. 6. The latter contribute to the gluon operator \({\mathcal {O}}_{GG,X}=({\bar{e}}P_X \mu )G^{\alpha \beta }G_{\alpha \beta }\), which enters the \(\mu \rightarrow e\) conversion rate. In order to compute these contributions, we integrate out the heavy scalar doublet \(H_2\) at the mass scale M and match onto the SMEFT operators \({\mathcal {O}}_{Ledq}^{e\mu ii}\equiv ({{\bar{L}}}_e \mu )({{\bar{d}}}_i q_i)\), \({\mathcal {O}}_{Lequ}^{(1)e \mu ii}\equiv ({{\bar{L}}}_e \mu )({{\bar{q}}}_i u_i)\), and \({\mathcal {O}}_{eH}^{e\mu }\equiv (H_1^\dagger H_1)({{\bar{L}}}_e \mu H_1) \), which leads to the off-diagonal Yukawa couplings of the SM Higgs boson h, given in Eq. (16). We include the QCD running of the scalar operators from the scale M down to the electroweak scale. At the electroweak scale, the SMEFT operators are matched at tree-level onto the Low Energy Effective Field Theory (LEFT) scalar contact interactions and the gluon operator \({\mathcal {O}}_{GG,X}=({\bar{e}}P_X \mu )G^{\alpha \beta }G_{\alpha \beta }\) [50], which receives contributions from the diagrams with t-loops. Finally, we compute the \(\mu \rightarrow e\) conversion rate using Eq. (2.22) of [51], which is written in terms of LEFT operators evaluated at the electroweak scale and includes the running of the Wilson coefficients down to the experiment.

Fig. 6
figure 6

Contributions via heavy quark loops to the gluon operator \({\mathcal {O}}_{GG,X}=({\bar{e}}P_X \mu )G^{\alpha \beta }G_{\alpha \beta }\) that enters the \(\mu \rightarrow e\) conversion rate. \(q=c,b,t\) and \(\Phi =h,\rho , A\)

As shown in Fig. 7, the MEG bound, \(\textrm{Br}(\mu \rightarrow e\gamma )<4.2\times 10^{-13}\), is better at constraining the product of Yukawa couplings \(y_{23}\times y_{31}\) than the current upper limits \(\textrm{Br}(\mu \rightarrow 3e)<10^{-12}\) and \(\textrm{Br}(\mu A\rightarrow eA)<7\times 10^{-13}\). On the other hand, future \(\mu \rightarrow e\) conversion and \(\mu \rightarrow 3e\) experiments expect an impressive improvement in the branching ratio sensitivities. The COMET and Mu2e collaborations aim at \(\textrm{Cr}(\mu \textrm{Al}\rightarrow e \textrm{Al})\sim 10^{-16}\) [8, 52] and Mu3e at \(\textrm{Br}(\mu \rightarrow 3e)\sim 10^{-16}\) [7], both of which can surpass the expected sensitivity of MEG-II with \(\textrm{Br}(\mu \rightarrow e \gamma )\sim 6\times 10^{-14}\) [6] (Fig. 8).

Fig. 7
figure 7

Constraints on the Yukawa couplings \(y_{31}\) and \(y_{23}\) from experimental upper bounds on \(\mu \rightarrow e\) transitions. The orange region corresponds to the allowed parameter space. In the plot we consider \(g'=10^{-4}\), \(M_{Z'}=10\) GeV, \(M=10\) TeV (solid lines) or \(M=5\) TeV (dashed lines), \(\lambda _6=1\) and \(\tan \beta =0.1\). The dashed curves show how the lepton flavour violation induced by the scalar sector changes with M. These contributions arise at mass dimension 6 within the SMEFT

Fig. 8
figure 8

Constraints on the Yukawa couplings \(y_{31}\) and \(y_{23}\), derived from future experimental sensitivities to \(\mu \rightarrow e\) transitions. The light orange region corresponds to the parameter space that will be probed by Mu3e, whereas the purple region is the allowed parameter space if no \(\mu \rightarrow 3e\) signal is observed by Mu3e. For this plot we consider: \(g'=10^{-4}\), \(M_{Z'}=10\) GeV, \(M=10\) TeV (solid lines) or \(M=5\) TeV (dashed lines), \(\lambda _6=1\) and \(\tan \beta =0.1\). Again, the suppression of the scalar contributions by (at least) \(1/M^2\) can be observed by comparing the solid and the dashed curves

A \(Z'\) boson with \(M_{Z'}=10\) GeV and \(g'=10^{-4}\), as considered in Figs. 7 and 8, will be probed at Belle-II in the upcoming years. In this scenario \(\mu \rightarrow e\) rates are the result of contributions both from the \(Z'\) and from scalar LFV interactions. The scalar contributions dominate over the \(Z'\) contributions in this region of parameter space.

Fig. 9
figure 9

Future sensitivities of searches for \(\mu \rightarrow e\) processes to the Yukawa couplings \(y_{31}\) and \(y_{23}\) for \(g'=2.5\times 10^{-3}\), \(M_{Z'}=15\) GeV and \(\tan \beta =0.1\). Since in this case the contributions of the scalar singlet cannot be neglected with respect to those of the scalar doublet, we only show the vector contributions (which are independent of M and \(\lambda _6\))

By considering a marginally larger mass, \(M_{Z'}=15\) GeV, the BaBar constraint \(g\lesssim 10^{-4}\) can be avoided and the most stringent upper limit on \(g'\) becomes \(g'\lesssim 10^{-1}-10^{-2}\) from LHCb [40, 41]. Taking \(g'\sim 2.5\times 10^{-3}\) and \(M_{Z'}\sim 15\) GeV, the singlet has a mass \(M_S\simeq v_s\simeq M_{Z'}/g'\sim 10\ \textrm{TeV}\), which is of the same order as the heavy scalar masses of the 2HDM sector, and thus cannot be integrated out. In Fig. 9 we show the \(\mu \rightarrow e\) sensitivities in the \(y_{31}-y_{23}\) plane considering only vector contributions and \(g'=2.5\times 10^{-3}\), \(M_{Z'}=15\ \textrm{GeV}\), while the inclusion of the scalar diagrams would require a more careful analysis of the singlet-doublet mixing.

In summary, we observe regions of parameter space in which \(\mu \rightarrow e=\mu \rightarrow \tau \times \tau \rightarrow e\) can constrain the model more than direct searches for \(\tau \rightarrow \mu \) and \(\tau \rightarrow e\):

  • When the decay \(\mu \rightarrow e Z'\) is kinematically allowed, \(\mu \rightarrow e+\ \mathrm{inv.}\) searches can probe smaller off-diagonal Yukawa couplings than \(\tau \rightarrow \ell +\ \mathrm{inv.}\).

  • For \(M_{Z}\gtrsim m_\tau \), \(\tau \) LFV transitions are unable to constrain the model below \(y_{31},y_{23}\sim {\mathcal {O}}(1)\), while \(\mu \rightarrow e\) leads to the constraints shown in Figs. 7 and 8.

3.3 Neutrino masses

In this section we show that our model is compatible with neutrino mass differences and oscillation data.

After spontaneous symmetry breaking, the neutrino mass Lagrangian takes the form

$$\begin{aligned} {\mathcal {M}}_{\nu }&= (M_D)_{ij}{\bar{L}}_i N_j+\frac{1}{2}\overline{N^c_i}(M_N)_{ij} N_j+\mathrm {h.c.}, \end{aligned}$$

where

$$\begin{aligned} M_D=\begin{pmatrix} m^\nu _{11} &{} &{} m^\nu _{13}\\ &{} m^\nu _{22}&{} \\ &{} m^\nu _{32} &{} m^\nu _{33} \end{pmatrix}\,\qquad M_N=\begin{pmatrix} &{} M^N_{12} &{} M^N_{13}\\ \cdot &{} &{} \\ \cdot &{} &{} M^N_{33} \end{pmatrix}. \end{aligned}$$

When the Majorana masses \(M_N\) are much larger than the Dirac masses \(M_D\), the mass matrix of the light active neutrinos is obtained via the celebrated seesaw formula [53]

$$\begin{aligned} M_\nu =-M_D M^{-1}_R M^T_D. \end{aligned}$$
(27)

The PMNS matrix U diagonalises the neutrino mass matrix \(U^T M_\nu U=\textrm{diag}\begin{pmatrix} m_1&m_2&m_3 \end{pmatrix}\) and is canonically parameterised as

$$\begin{aligned} U&=\begin{pmatrix} c_{12}c_{13} &{} s_{12}c_{13} &{} s_{13}e^{-i\delta }\\ -s_{12}c_{23}-c_{12}s_{23}s_{13}e^{i\delta } &{} c_{12}c_{23}-s_{12}s_{23}s_{13}e^{i\delta } &{} s_{23}c_{13}\\ s_{12}s_{23}-c_{12}c_{23}s_{13}e^{i\delta } &{} -c_{12}s_{23}-s_{12}c_{23}s_{13}e^{i\delta } &{} c_{23}c_{13} \end{pmatrix}\nonumber \\&\quad \times \textrm{diag}\begin{pmatrix}1&e^{i\alpha _{12}}&e^{i\alpha _{31}}\end{pmatrix}. \end{aligned}$$

Assuming the so-called Normal Ordering (NO) \(m_1<m_2<m_3\), a recent global fit [54] to neutrino oscillation data gives the following values for the mass squared differences and mixing angles:

$$\begin{aligned} m^2_{2}-m^2_{1}=&\left[ 6.94-8.14\right] \times 10^{-5}\ e\textrm{V}^2\nonumber \\ \vert {m^2_{3}-m^2_{1}} \vert =&\left[ 2.47-2.63\right] \times 10^{-3}\ e\textrm{V}^2 \nonumber \\ \sin ^2\theta _{12}=&\left[ 2.71-3.69\right] \times 10^{-1}\nonumber \\ \sin ^2\theta _{23}=&\left[ 4.34-6.1\right] \times 10^{-1} \nonumber \\ \sin ^2\theta _{13}=&\left[ 2.000 - 2.405\right] \times 10^{-2}\nonumber \\ \delta =&\left[ 0.71 - 1.99\right] \times \pi \end{aligned}$$
(28)

Here the lower and upper values define the \(\pm 3\sigma \) range. The texture of the neutrino mass matrix in our model is compatible with the observed mass differences and mixing angles. To simplify the expressions, we consider \(m^\nu _{32}\rightarrow 0\), resulting in a neutrino mass matrix that reads

$$\begin{aligned} M_\nu {=}&\begin{pmatrix} -\frac{(m^\nu _{13})^2}{M^N_{33}} &{} -\frac{m^\nu _{22}m^\nu _{11}}{M^N_{12}}\left( 1-\frac{m^\nu _{13}M^N_{13}}{m^\nu _{11}M^N_{33}}\right) &{} -\frac{m^\nu _{13}m^\nu _{33}}{M^N_{33}} \\ \cdot &{} -\frac{(m^\nu _{22})^2(M^N_{13})^2}{M^N_{33}(M^N_{12})^2} &{} -\frac{(m^\nu _{22})(m^\nu _{33})(M^N_{13})}{M^N_{33}(M^N_{12})^2} \\ \cdot &{} \cdot &{} -\frac{(m^\nu _{33})^2}{M^N_{33}} \end{pmatrix} \\ \equiv&\begin{pmatrix} \frac{a^2}{Z} &{} X &{} a \\ \cdot &{} Y &{} b \\ \cdot &{} \cdot &{} Z \end{pmatrix}. \nonumber \end{aligned}$$

We find that with the choice of parameters

$$\begin{aligned} a&\sim 2\times 10^{-3}~\textrm{eV}, \; b\sim -2.75 \times 10^{-2}~\textrm{eV}, \\ X&\sim 9 \times 10^{-3}~\textrm{eV},\; Y\sim -2 \times 10^{-2}~\textrm{eV},\\ Z&\sim -2 \times 10^{-2}~\textrm{eV}, \end{aligned}$$

we are within the parameter ranges listed in Eq. (28), and predict the neutrino masses

$$\begin{aligned} m_1\sim&2 \times 10^{-3}~\textrm{eV},\\ m_2\sim&9 \times 10^{-3}~\textrm{eV},\\ m_3\sim&5 \times 10^{-2}~\textrm{eV}. \end{aligned}$$

Assuming that the sterile neutrinos have masses in the TeV range, which can be probed at the LHC [55, 56], the above values require Dirac Yukawas that are \(y^\nu \sim {\mathcal {O}}(10^{-7})\).

The interactions generating neutrino masses in our model cannot lead to detectable charged lepton flavour violating signals.

4 Conclusions

In this article we proposed a simple model that shows how \(\mu \rightarrow e\) processes can probe \(\tau \rightarrow e \) and \(\tau \rightarrow \mu \) couplings beyond the reach of direct searches for lepton flavour violation in the \(\tau \) sector. We extended the Standard Model gauge group by the anomaly-free abelian group \(U(1)'\equiv U(1)_{L_e-L_\mu }\) and added two scalars that are charged under this \(U(1)'\) group, one \(SU(2)_L\) doublet, \(\phi \), and one singlet, S, to the SM particle content.

As a result of the spontaneous breaking of the \(U(1)'\) gauge group, the associated \(Z'\) boson acquires a mass. Since the new scalar doublet is singly charged under the \(U(1)'\) gauge group, \(\mu \leftrightarrow \tau \) and \(\tau \leftrightarrow e\) Yukawa couplings are allowed, while \(\mu \leftrightarrow e\) couplings are forbidden. Nonetheless, \(\mu \rightarrow e\) can be mediated by the product of \(\mu \rightarrow \tau \times \tau \rightarrow e\) interactions. After electroweak symmetry breaking, the \(Z'\) couplings receive flavour off-diagonal components due to the misalignment of the gauge eigenbasis and the mass eigenbasis of the leptons.

If \(M_{Z'}\lesssim m_\mu -m_e\) and the \(Z'\) boson decays outside the detector, searches for \(\mu \rightarrow e +\text {inv.}\) can compete with constraints on \(\tau \rightarrow e\) and \(\tau \rightarrow \mu \) couplings from \(\tau \rightarrow e +\text {inv.}\) and \(\tau \rightarrow \mu +\text {inv.}\) searches.

Also in the case of \(M_{Z'}\gtrsim m_\tau \), \(\tau \rightarrow \ell \) searches do not appreciably constrain the model and the \(\mu \rightarrow e\) processes lead to the most stringent limits, despite being proportional to the product of Yukawa couplings \(y_{23}\times y_{31}\). The model predicts rates that are in reach of the upcoming \(\mu \rightarrow e\) experiments and our \(Z'\) boson can be searched for at Belle-II.

Finally, if we add sterile neutrinos that are charged under \(U(1)'\) to the spectrum, the singlet VEV can contribute to their Majorana masses. We show that, via a type-I seesaw mechanism, we can accommodate for the observed neutrino masses and mixing angles.

Our model shows explicitly that allowing for \(\tau \rightarrow \mu \) and \(\tau \rightarrow e\) processes close to the current experimental bounds can lead to observable effects in \(\mu \rightarrow e\) searches.