1 Introduction

This exercise was born from a puzzle: experiments that search for \(\mu \leftrightarrow e\) flavour change constrain a long list of QCD\(\times \)QED-invariant four-fermion operators, some of which turn out to be of dimension eight when SU(2) invariance is imposed. But it is common, when describing New Physics from above \(m_W\) with Effective Field Theory(EFT) [1, 2], to use the SU(2)-invariant basis of dimension 6 operators given by Buchmuller and Wyler [3] and pruned in [4]. To explore when it is justified to neglect the additional four-fermion operators below \(m_W\), we were looking for a model where they might give relevant contributions. The first model we tried was the 2 Higgs Doublet Model (2HDM). It turns out that the additional four-fermion operators (not in the Buchmuller–Wyler list) must be included below \(m_W\) to correctly reproduce the \(\mathcal{O}(1/M^2)\) terms in the \(\mu \rightarrow e \gamma \) amplitude of the 2HDM.

The exercise takes place in a Type III 2HDM in the decoupling limit, where the one-loop and two-loop “Barr–Zee” contributions to the \(\mu \rightarrow e \gamma \) amplitude are known [5]. The aim is to extract the numerically dominant contributions and to identify where they arise in an EFT description.

Section 2 reviews the 2HDM of Type III in the decoupling limit and the calculation in this model of the \(\mu \rightarrow e \gamma \) amplitude by Chang et al. [5] (CHK). Type III 2HDMs include charged-lepton flavour-changing couplings; for simplicity, only a \(\mu \leftrightarrow e\) flavour-changing interaction is allowed. The decoupling limit is taken by requiring the mass scale M of the second doublet to be \(\gtrsim \)10 v, to ensure that EFT can give a reasonable approximation to the \(\mu \rightarrow e \gamma \) amplitude in this model. In the final subsections, the \(\mathcal{O}(1/M^2)\) and \(\mathcal{O}(1/M^4)\) parts of the \(\mu \rightarrow e \gamma \) amplitude are compared for the various classes of diagrams.

Section 3 sets up an EFT formalism, based on dimension-six operators and one-loop RGEs, which correctly reproduces all the \(\mathcal{O}([\alpha \log ]^n/M^2)\) terms of the CHK calculation in the full model. How to obtain the remaining terms of the CHK result is outlined in Appendix B.

Section 4 summarises the less trivial aspects of the calculation and the location in the EFT of the Barr–Zee diagrams. It should be readable independently of the more technical sections, Sects. 2 and 3. There are two-and-a-half issues with the results obtained with dimension-six operators and one-loop RGEs: the four-fermion operator basis below \(m_W\) only needs to be invariant under QCD and QED, so it should be enlarged with respect to the list [4] of SU(2)-invariant dimension-six operators. Second, two-loop effects are important, because the one-loop contribution is suppressed by the square of the muon Yukawa coupling. Finally, dimension-eight terms can be enhanced, both by logs and by unknown couplings in the 2HDM (encapsulated in \(\tan \beta \)).

This exercise overlaps with several studies. Pruna and Signer [6] studied \(\mu \rightarrow e \gamma \) in an EFT consisting of the SM extended by a complete set of SU(2)-invariant operators, but they focused on the electroweak running above \(m_W\), rather than the matching at \(m_W\), so they did not extend the operator basis below \(m_W\). In the context of B physics, Alonso et al. [7] and Aebischer et al. [8] calculated the coefficients of the enlarged operator basis below \(m_W\), given a selection of SU(2)-invariant operators above \(m_W\). This exercise only agrees approximatively with [7], as discussed in Sect. 4.

2 \(\mu \rightarrow e \gamma \) in the Type III 2HDM

2.1 Review of the 2HDM Type III in the decoupling limit

The 2HDM is a minimal extension of the Standard Model, including one extra Higgs doublet with new unknown interactions to the Standard Model fermions and Higgs—for a review, see [9, 10]. In the 2HDM considered here, the extra doublet is taken to be heavy—this is the decoupling limit, and it should be describable with EFT. To allow for LFV, consider a “Type III” model, where there is no discrete symmetry that distinguishes the Higgs, so there is no “symmetry basis” in which to write the Lagrangian (so also no unambiguous definition of \(\tan \beta \)). For simplicity, the Higgs potential is taken to be CP invariant.

The Lagrangian can be written in the “Higgs basis”, defined such that \(\langle H_1 \rangle \ne 0\), and \(\langle H_2\rangle =0\), so the doublets are writtenFootnote 1

$$\begin{aligned} H_1= & {} \left( \begin{array}{c} G^+ \\ {1\over \sqrt{2}}\left( v+H_1^0+iG^0\right) \end{array} \right) ,\quad H_2=\left( \begin{array}{c} H^+ \\ {1\over \sqrt{2}}\left( H_2^0+iA\right) \end{array} \right) ,\nonumber \\ \end{aligned}$$
(1)

where the Gs are Goldstones. This shows that the mass eigenstates A and \(H^\pm \) are the CP-odd and charged components of the vev-less \(H_2\), but there can be some mixing between \(H_1\) and \(H_2\) in the CP-even scalars hH. The convention \(\langle H_1 \rangle = v/\sqrt{2}\) implies \(v= 246\) GeV,

$$\begin{aligned} \frac{4 G_F}{\sqrt{2}} = \frac{2}{v^2} ,\quad \sqrt{2} G_F = \frac{1}{v^2}. \end{aligned}$$
(2)

In this “Higgs basis”, in the notation of [11], the potential parameters are written in upper case, so the potential is

$$\begin{aligned} \mathcal {V}= & {} M_{11}^2H_1^\dagger H_1+M^2 H_2^\dagger H_2 -[M_{12}^2 H_1^\dagger H_2+\mathrm{h.c.}]\nonumber \\&\mathrm{(in~ Higgs~ basis)} \nonumber \\&+{\textstyle {1 \over 2}}\Lambda _1(H_1^\dagger H_1)^2 +{\textstyle {1 \over 2}}\Lambda _2(H_2^\dagger H_2)^2 +\Lambda _3(H_1^\dagger H_1)(H_2^\dagger H_2)\nonumber \\&+\Lambda _4(H_1^\dagger H_2)(H_2^\dagger H_1) \nonumber \\&+\left\{ {\textstyle {1 \over 2}}\Lambda _5(H_1^\dagger H_2)^2 +\big [\Lambda _6(H_1^\dagger H_1)\right. \nonumber \\&\left. +\,\Lambda _7(H_2^\dagger H_2)\big ] H_1^\dagger H_2+\mathrm{h.c.}\right\} . \end{aligned}$$
(3)

The angle \(\beta - \alpha \) can be defined from the Higgs potential of a Type III model, unlike \(\beta \) and \(\alpha \). It rotates between the mass basis of hH and the Higgs basis:

$$\begin{aligned}&h= H_1^0~s_{\beta -\alpha }+ H_2^0\,c_{\beta -\alpha },\nonumber \\&H= H_1^0~c_{\beta -\alpha }- H_2^0\,s_{\beta -\alpha }, \end{aligned}$$
(4)

so that what is here called \(\beta -\alpha \) is independent of the angle \(\beta \), which will later be defined from the Yukawas. If the potential is CP invariant, there is a simple relation for \(c_{\beta -\alpha }\) [11]:

$$\begin{aligned} \cos \,[(\beta -\alpha )] \sin \,[(\beta -\alpha )]= & {} {-\Lambda _6v^2\over m_{H}^2-m_{h}^2}. \end{aligned}$$
(5)

The masses of the scalars are related to the potential parameters in the Higgs basis as [11]

$$\begin{aligned}&m_{H^\pm }^2 = M^2 + \frac{v^2}{2} \Lambda _3 ,\nonumber \\&m_A^2 - m_{H^\pm }^2= -\frac{v^2}{2} (\Lambda _5 - \Lambda _4), \nonumber \\&m_H^2 + m_h^2- m_A^2 = + v^2 (\Lambda _1 + \Lambda _5), \nonumber \\&(m_H^2 - m_h^2)^2 = [m_A^2 + (\Lambda _5 - \Lambda _1)v^2]^2 + 4 \Lambda _6^2 v^4, \end{aligned}$$
(6)

and the couplings to \(W^+W^-\) are \( i g m_W C_{\phi WW} g^{\mu \nu }\) with

$$\begin{aligned} C_{h WW} = s_{ \beta - \alpha },\quad C_{H WW} = c_{\beta - \alpha } ,\quad C_{A WW} =0. \end{aligned}$$
(7)

In the decoupling limit [12], where \(\Lambda _i v^2 \ll M^2\), the exact relations (6) can be expanded in \(v^2/M^2\) to obtain

$$\begin{aligned} m_H^2 - m_{A}^2\simeq & {} +\Lambda _5 v^2 \quad (\mathrm{decoupling~limit}) ,\nonumber \\ m_h^2\simeq & {} + v^2 \Lambda _1 - \frac{v^4}{M^2} \left( \Lambda _6 + \frac{(\Lambda _5 - \Lambda _1)^2}{4} \right) \nonumber \\&(\mathrm{decoupling~limit}); \end{aligned}$$
(8)

and then approximating \(s_{\beta -\alpha }\simeq 1 \) in Eq. (5) gives

$$\begin{aligned} \Rightarrow \cos \,[(\beta -\alpha )]\simeq & {} {-\Lambda _6v^2\over M^2} \left( 1 + \frac{\Lambda _1v^2}{M^2} \right) + \cdots \nonumber \\&(\mathrm{decoupling~limit}), \end{aligned}$$
(9)

which confirms that, in the decoupling limit, h is mostly \(H_1\) and H is mostly \(H_2\).

The Yukawa couplings in the Higgs basis for the Higgses, and the mass eigenstate basis for the \(\{ u_R, d_R,e_R, d_L, e_L \}\), are

$$\begin{aligned} -\mathcal{L}_\mathrm{Y}= & {} {\Big (} \overline{Q}_{j} \widetilde{H}_1 K^*_{ij} Y^{U}_i U_{i} + \overline{Q}_i H_1 Y^{D}_{i} D_{i} +\overline{L}_i H_1 Y^{E}_{i} E_{i} {\Big ) } \nonumber \\&+\overline{Q}_{i} \widetilde{H}_2 [K^\dagger \rho ^{U}]_{ij} U_{j} \nonumber \\&+ \overline{Q}_i H_2 [\rho ^{D}]_{ij} D_{j} +\overline{L}_i H_2 [\rho ^{E}]_{ij} E_{j} +\mathrm{h.c.}, \end{aligned}$$
(10)

where QL are SU(2) doublets, EUD are singlets, SU(2) indices are implicit (\(\overline{L} H_1 = \bar{\nu } H^+_1 + \bar{e} H^0_1\)), \(\widetilde{H}_i = i \sigma _2 H_i^*\), the generation indices are explicit, and K is the CKM matrix. The Y matrices are flavour diagonal and equal to the SM Yukawas:

$$\begin{aligned}{}[Y^P]_{ij} = \sqrt{2} \frac{m^P_j}{ v} \delta _{ij} \end{aligned}$$
(11)

as a result of being in the fermion mass eigenstate bases.

The \(\rho ^P\) matrices can be flavour-changing. To obtain \(\mu \rightarrow e \gamma \) without other flavour-changing processes, I neglect all off-diagonal elements except \(\rho _{\mu e} \simeq \rho _{ e \mu } \ne 0\). To obtain a predictive model and make contact with other 2HDM literature, the diagonal elements follow the pattern of one of the types of 2HDM which have a discrete symmetry that ensures flavour conservation:

$$\begin{aligned} \begin{array}{c|ccc} \mathrm{Model ~type} &{}\rho ^U &{} \rho ^D &{} \rho ^E \\ \hline \mathrm{Type ~I} &{}Y^U \cot \beta &{} - Y^D\tan \beta &{} -Y^E \tan \beta \\ \mathrm{Type ~II} &{}-Y^U \tan \beta &{} - Y^D\tan \beta &{} -Y^E \tan \beta \\ \mathrm{Type~ X }&{}Y^U \cot \beta &{} Y^D\cot \beta &{} -Y^E \tan \beta \\ \mathrm{Type~ Y} &{}-Y^U \tan \beta &{} Y^D\cot \beta &{} -Y^E \tan \beta \\ \end{array} \end{aligned}$$
(12)

This requires a definition of \(\tan \beta \) which is common to all the fermions. It can for instance be defined from the \(\tau \) Yukawa: in the mass eigenstate basis for \(\tau _R, \tau _L\), define \(H_\tau \) to be the linear combination of Higgses to which couples the \(\tau \), and \(\beta \) as the angle in Higgs doublet space between \(H_1\) (the vev) and \(H_\tau \):

$$\begin{aligned} H_{\perp }= & {} {\widetilde{H}}_1 \sin \beta + {\widetilde{H}}_2 \cos \beta \nonumber \\ H_\tau= & {} {H}_1 \cos \beta - {H}_2 \sin \beta \end{aligned}$$
(13)

Since \(\tan \beta \) is defined from the lepton Yukawas, \(\rho ^E \propto \tan \beta \) in all Types of 2HDM listed above. This is unconventional, but should include the same predictions provided that \(\tan \beta \) is allowed to range from \(1/50 \rightarrow 50\).

From Eq. (10) the couplings to fermions are

$$\begin{aligned} -\mathcal {L}_Y= & {} \overline{d} \frac{ 1 }{\sqrt{2}}\left[ Y^D (P_R + P_L) s_{\beta -\alpha }+ (\rho ^DP_R+{\rho ^D}^\dagger P_L)c_{\beta -\alpha }\right] d h\nonumber \\&+ \,\overline{d} \frac{ 1 }{\sqrt{2}} \left[ Y^D(P_R + P_L) c_{\beta -\alpha }- (\rho ^DP_R+{\rho ^D}^\dagger P_L)s_{\beta -\alpha }\right] dH\nonumber \\&+\,\frac{i}{\sqrt{2}}\overline{d}(\rho ^DP_R-{\rho ^D}^\dagger P_L)d A\nonumber \\&\overline{e} \frac{ 1 }{\sqrt{2}} \left[ Y^E(P_R + P_L) s_{\beta -\alpha }+ (\rho ^EP_R+{\rho ^E}^\dagger P_L)c_{\beta -\alpha }\right] eh\nonumber \\&+ \,\overline{e} \frac{ 1 }{\sqrt{2}} \left[ Y^E(P_R + P_L) c_{\beta -\alpha }- (\rho ^EP_R+{\rho ^E}^\dagger P_L)s_{\beta -\alpha }\right] \nonumber \\&e H+\frac{i}{\sqrt{2}}\overline{e}(\rho ^EP_R-{\rho ^E}^\dagger P_L)e A\nonumber \\&+ \,\overline{u} \frac{ 1 }{\sqrt{2}} \left[ Y^U(P_R + P_L) s_{\beta -\alpha }+ ( \rho ^UP_R+{\rho ^U}^\dagger P_L)c_{\beta -\alpha }\right] uh\nonumber \\&+\,\overline{u} \frac{ 1 }{\sqrt{2}} \left[ Y^U (P_R + P_L) c_{\beta -\alpha }- (\rho ^UP_R+{ \rho ^U}^\dagger P_L)s_{\beta -\alpha }\right] uH\nonumber \\&-\frac{i}{\sqrt{2}}\overline{u}( \rho ^UP_R-{\rho ^U}^\dagger P_L)uA\nonumber \\&+\left\{ \overline{u}\left[ K\rho ^DP_R-{\rho ^U}^\dagger KP_L\right] dH^+ +\mathrm{h.c.}\right\} \nonumber \\&+\left\{ \overline{\nu }\left[ \rho ^EP_R \right] e H^+ +\mathrm{h.c.}\right\} . \end{aligned}$$
(14)

This gives Feynman rules of the form \(-iF^{\phi ,X}_{ij} P_X\), where

$$\begin{aligned}&F^{h,L}_ {ij}= \frac{Y^P_{ij}}{\sqrt{2}}s_{\beta -\alpha }+ \frac{[\rho ^{P\dagger }]_{ij}}{\sqrt{2}}c_{\beta -\alpha },\nonumber \\&F^{h,R}_ {ij}= \frac{Y^P_{ij}}{\sqrt{2}} s_{\beta -\alpha }+ \frac{[\rho ^{P }]_{ij}}{\sqrt{2}}c_{\beta -\alpha }\nonumber \\&F^{H,L}_ {ij}= \frac{Y^P_{ij}}{\sqrt{2}}c_{\beta -\alpha }-\frac{[\rho ^{P \dagger }]_{ij}}{\sqrt{2}}s_{\beta -\alpha },\nonumber \\&F^{H,R}_ {ij}= \frac{Y^P_{ij}}{\sqrt{2}} c_{\beta -\alpha }- \frac{[\rho ^{P }]_{ij}}{\sqrt{2}}s_{\beta -\alpha }\nonumber \\&F^{A,L}_ {u_iu_j}= i\frac{[\rho ^{U \dagger }]_{ij}}{\sqrt{2}} ,\nonumber \\&F^{A,R}_ {u_iu_j}= -i \frac{[\rho ^{U }]_{ij}}{\sqrt{2}} \nonumber \\&F^{A,L}_ {i j}= - i\frac{[\rho ^{P \dagger }]_{ij}}{\sqrt{2}} ,\nonumber \\&F^{A,R}_ {ij}= i \frac{[\rho ^{P }]_{ij}}{\sqrt{2}},\quad \mathrm{for}\quad P \in E,D, \end{aligned}$$
(15)

where the possible forms for \(\rho ^P\) are given in Eq. 12.

2.2 \(\mu \rightarrow e \gamma \) in the 2HDM

The decay \(\mu \rightarrow e \gamma \) can be parametrised by adding the dipole operator to the Standard Model Lagrangian. In the notation of Kuno and Okada [13]

$$\begin{aligned} \mathcal{L}_{meg}= & {} -\frac{4G_F}{\sqrt{2}} m_\mu \left( A_R \overline{\mu _R} \sigma ^{\alpha \beta } e_L F_{\alpha \beta } + A_L \overline{\mu _L} \sigma ^{\alpha \beta } e_R F_{\alpha \beta }\right) ,\nonumber \\ \end{aligned}$$
(16)

which gives

$$\begin{aligned} BR(\mu \rightarrow e \gamma )= & {} 384 \pi ^2 (|A_R|^2 + |A_L|^2) <5.7\times 10^{-13}\nonumber \\ \end{aligned}$$
(17)

where the upper bound is from the MEG experiment [14]. (The amplification factor of \(384\pi ^2 = \frac{3}{2}(16 \pi )^2\) is because \(\mu \rightarrow e \gamma \) is a 2-body decay, and is compared to the usual three-body muon decay.). \(|A_R|, |A_L|\) are dimensionless, and if \(|A_R| = |A_L|\), then \(|A_X| <8.6\times 10^{-9}\).

Fig. 1
figure 1

One- and two-loop diagrams contributing to \(\mu \rightarrow e \gamma \) in the 2HDM, in the presence of a flavour-changing Yukawa coupling \(F_{\mu e}\) (see Eq. (15))

The decay \(\mu \rightarrow e \gamma \) has been extensively studied in the 2HDM [3, 1527], particularily in connection [2937] with the recent LHC [28, 38] excess in \(h\rightarrow \tau ^\pm \mu ^\mp \). Chang, Hou and Keung [5] (CHK) calculate the contributions to \(\mu \rightarrow e \gamma \) of neutral Higgs bosons with flavour-changing couplings. Their calculation can be divided into four classes of diagrams, illustrated in Fig. 1: the one-loop diagrams, then three classes of two-loop diagrams, that is, those with a t-loop, with a b-loop and with a W-loop. I use their results, and later, in matching onto SU(2)-invariant operators, assume that the charged Higgs contribution ensures the SU(2) invariance of the results. The results of CHK are referred to as “full-model” results in the following.

In the appendix is given a translation dictionary between the notation of CHK and here. The amplitude given by CHK [5]Footnote 2 for the one-loop diagram of Fig. 1, with an internal \(\mu \), is

$$\begin{aligned}&-2\sqrt{2} G_F m_\mu A^{1~loop}_{L}\nonumber \\&\quad = -\frac{em_\mu }{32 \pi ^2} \left[ \sum _{\phi = h,H,A} \frac{ F^{\phi ,L}_{\mu \mu } F^{\phi ,L}_{e \mu } }{m_\phi ^2} \left( \ln \frac{m_\mu ^2}{m_\phi ^2} + \frac{3}{2} \right) \right] ^*,\nonumber \\&-2\sqrt{2} G_F m_\mu A^{1~loop}_{R} \nonumber \\&\quad =-\frac{e m_\mu }{32 \pi ^2} \left[ \sum _{\phi = h,H,A} \frac{ F^{\phi ,R}_{\mu \mu } F^{\phi ,R}_{e \mu } }{m_\phi ^2} \left( \ln \frac{m_\mu ^2}{m_\phi ^2} + \frac{3}{2} \right) \right] ^*. \end{aligned}$$
(18)

Notice that these amplitudes are suppressed by two leptonic Yukawas and an additional muon mass insertion to flip the chirality. In the decoupling limit, with \(\phi \in \{H,A\}\) and \(\mu \) in the loop, Eq. (18) for \(A_L\) gives

$$\begin{aligned}&-2\sqrt{2} G_F m_\mu A^{H,A,1~loop}_{L} \nonumber \\&\quad \simeq -\frac{e m_\mu }{32 \pi ^2M^2} [\rho ^E]_{\mu e} \left[ Y^E_{\mu \mu } \frac{ \Lambda _6 v^2}{2 M^2} - \frac{[\rho ^E]_{\mu \mu } }{2M^2} \Lambda _5v^2 \right] ^* \ln \frac{m_\mu ^2}{M^2}\nonumber \\ \end{aligned}$$
(19)

where \(m_H \simeq m_A \simeq M\) was used in the logarithm, and terms suppressed by more than \(M^{-4}\) were dropped. The result for \(A_L\) is obtained by replacing \([\rho ^E]_{ij} \rightarrow [\rho ^E]^*_{ij }\) in the above. The one-loop diagram with the light Higgs h and a \(\mu \) (also in the decoupling limit) gives

$$\begin{aligned}&-2\sqrt{2} G_F m_\mu A^{h,1~loop}_{L} \nonumber \\&\quad \simeq -\frac{e m_\mu }{32 \pi ^2m_h^2} [\rho ^E]_{\mu e} \left[ -Y^E_{\mu \mu } \frac{ \Lambda _6 v^2}{2 M^2} + 2 [\rho ^E]_{\mu \mu } \left( \frac{ \Lambda _6 v^2}{2 M^2}\right) ^2 \right] ^*\nonumber \\&\quad \qquad \times \ln \frac{m_\mu ^2}{m_h^2}. \end{aligned}$$
(20)

As noted long ago by Bjorken and Weinberg [39], there are two-loop diagrams which can be relevant for \(\mu \rightarrow e \gamma \). Some examples are illustrated to the right in Fig. 1; a more complete set of diagrams with broken electroweak symmetry can be found in [40]. The resulting two-loop contributions to \(\mu \rightarrow e \gamma \) can be numerically larger than the one-loop contributions, because the Higgs attaches only once to the lepton line, via the flavour-changing coupling, and otherwise couples to a Wb or t loop. The result for the top loop (neglecting the diagrams with internal Z exchange, see Fig. 1), is

$$\begin{aligned}&-2\sqrt{2} G_F m_\mu A_{L}^{t~loop}\nonumber \\&\quad =\frac{e\alpha }{16\pi ^3} \frac{1}{m_t} 3Q_t^2\nonumber \\&\qquad \times \left[ \sum _{\phi = h,H} F_{e \mu }^{\phi ,L} F^{\phi ,L}_{tt} f\left( \frac{m_t^2}{m_\phi ^2}\right) + F_{e \mu }^{A,L} F^{A,L}_{tt} g\left( \frac{m_t^2}{m_A^2}\right) \right] ^* \nonumber \\&\quad \simeq \frac{e\alpha m_t }{32\pi ^3M^2} [\rho ^{E }]_{\mu e} 3Q_t^2\nonumber \\&\qquad \times \left[ \left( [\rho ^{U}]_{tt} + Y^{U}_{tt}\frac{\Lambda _6 v^2}{2M^2} \right) \ln ^2 \frac{m_t^2}{M^2} - 2 Y^{U}_{tt} \Lambda _6 f\left( \frac{m_t^2}{m_h^2}\right) \right] \nonumber \\ \end{aligned}$$
(21)

where the log\(^2\) terms of the approximate equality are the sum of the heavy H and A contributions, and the third term is due to the light h. The approximate equality is in the decoupling limit, so one uses \(m_H \simeq m_A \simeq M\) in the log, neglects terms suppressed by \(v^2/M^2\), and uses the following, which holds for small z [5]:

$$\begin{aligned}&f(z) \simeq g(z) \simeq \frac{z}{2} \ln ^2 z ,\quad h(z) \simeq z \ln z ,\quad \nonumber \\&f(z) - g(z) \simeq z (\ln z +2). \end{aligned}$$
(22)

(The function h will appear in the W-loop contribution.) The functions fg and h are given in CHK, are slowly varying near \(z\sim 1\) and at \(z = m_W^2/m_h^2\simeq 0.4\) they are \(f \sim 0.7, g\sim 0.9\) and \(h\sim 0.5\). The formula for \(A^{t~loop}_R\) is obtained by replacing \( F_{ij}^{H,L} \rightarrow F_{ij}^{H,R}\) in the coefficient of f, and \(\rho ^{F \dagger } \rightarrow \rho ^{F}\) in the coefficient of g. The top-loop amplitude is \(\propto m_t\), as expected because a top mass insertion is required both to have an even number of \(\gamma \)-matrices in the loop, and to provide the Higgs leg of the dipole operator.

A b-quark loop should be described by the same formula as the top loop (again neglecting the Z-exchange diagrams), but the A-exchange contribution will subtract from H-exchange, because of the sign difference in the couplings of A to bs and ts (see Eq. (15)):

$$\begin{aligned}&-2 \sqrt{2} G_F m_\mu A_{L}^{b~loop} =\frac{e\alpha }{16 \pi ^3 } \frac{3Q_b^2}{m_b }\nonumber \\&\times \quad \left[ \sum _{\phi = h,H} F_{e \mu }^{\phi ,L} F^{\phi ,L}_{bb} f(\frac{m_b^2}{m_\phi ^2}) + F_{e \mu }^{A,L} F^{A,L}_{bb} g(\frac{m_b^2}{m_A^2}) \right] ^* \nonumber \\&\simeq \frac{e\alpha m_b }{64\pi ^3M^2} 3Q_b^2 \rho ^{E }_{ \mu e} \nonumber \\&\times \quad \left[ \frac{ v^2}{M^2} \left( Y^{D}_{bb} \Lambda _6 - \rho ^{D}_{bb} \Lambda _5 \right) \ln ^2 \frac{m_b^2}{M^2} - \frac{v^2}{m_h^2}\right. \nonumber \\&\quad \times \left. \left( Y^{D}_{bb} \Lambda _6 - \rho ^{D }_{bb} \frac{\Lambda ^2_6 v^2}{M^2}\right) \ln ^2 (\frac{m_b^2}{m_h^2}) \right] \end{aligned}$$
(23)

where the approximate equality is in the decoupling limit, and uses the approximations of Eq. (22). The last term is the light Higgs contribution. Notice that the \(\mathcal{O} (1/M^2)\) contributions of H and A cancelled against each other in the decoupling limit. The formula for \(A_R\) is obtained by replacing \( F_{ij}^{H,L} \rightarrow F_{ij}^{H,R}\) in the coefficient of f, and \(\rho ^{F \dagger } \rightarrow \rho ^{F}\) in the coefficient of g.

Finally, there is a two-loop contribution involving a W-loop; the third diagram of Fig. 1 is one of the many that contribute. In the CP-conserving 2HDM considered here, A does not couple to the W or the Goldstones, so it cannot appear in these diagrams. CHK compute separately the diagrams with either photon or Z exchange between the lepton and W (see Fig. 1). The Z-mediated amplitude is proportional to \(1 - 4 \sin ^2 \theta _W\) and estimated by CHK to be about 10 % of the photon-mediated amplitude, so only the \(\gamma \)-amplitude is considered here. The H and h contributions individually are

$$\begin{aligned}&-2 \sqrt{2} G_F m_\mu A_L^{W~loop} = \quad \pm \frac{e \alpha }{32 \pi ^3} \frac{\Lambda _6 v}{M^2} \frac{\rho ^{E}_{ \mu e}}{\sqrt{2}}\nonumber \\&\times \quad \left( 3f(z_\phi ) + 5 g(z_\phi ) + \frac{3}{4}[g(z_\phi ) +h(z_\phi )] \right. \nonumber \\&\quad \left. +\frac{f(z_\phi ) -g(z_\phi )}{2z_\phi }\right) \end{aligned}$$
(24)

where \(z_\phi = m_W^2/m_\phi ^2\), and - is for \(\phi =H\), +is for \(\phi =h\). The sum of the contributions, in the decoupling limit, is

$$\begin{aligned}&-2 \sqrt{2} G_F m_\mu A_L^{W~loop} \simeq - \frac{e \alpha }{64 \pi ^3} \frac{\Lambda _6 v}{M^2} \frac{\rho ^{E}_{ \mu e}}{\sqrt{2}}\nonumber \\&\times \quad \left[ \left( \frac{ m_W^2}{M^2} \ln \frac{ m_W^2}{M^2} \left( \frac{35}{4} \ln \frac{ m_W^2}{M^2} + \frac{3}{2} \right) + \ln \frac{ m_W^2}{M^2} +2 \right) \right. \nonumber \\&\quad \left. \left. - 2 \left( 3f(z_h) + 5 g(z_h)\right. + \frac{3}{4}\left[ g(z_h)+h(z_h)\right. \right. \right. \nonumber \\&\qquad \quad \left. \left. \left. + \frac{f(z_h) -g(z_h)}{2z_h } \right] \right) \right] \end{aligned}$$
(25)

where the limiting forms of Eq. (22) were used. The first line of the approximate equality is the contribution of H, and the last line is the contribution of h where \(z_h = m_W^2/m_h^2\), and the parentheses term evaluates to \(\sim 7\). The heavy H exchanged between the lepton line and a Goldstone loop generates an \(\mathcal{O}(\log /M^2)\) term, which is carefully discussed by CHK, because the \(1/M^2\) suppression arises from the decoupling limit of \(c_{\beta -\alpha }\), given in Eq. (9). CHK take the mixing angle \(c_{\beta -\alpha }\) as a free parameter, so one refers to this term as a “non-decoupling” contribution.

2.2.1 The relative importance of the \(\mathcal{O}(1/M^2)\) and \(\mathcal{O}(1/M^4)\) terms

Since EFT is an expansion in the operator dimension, it is interesting to see under what conditions the \(\mathcal{O}(1/M^2)\) terms give a good approximation to the full answer. These terms should arise in a relatively simple EFT using dimension-six operators; a more extended EFT would be required to reproduce the \(\mathcal{O}(1/M^4)\) terms. So this section compares the magnitude of the \(\mathcal{O}(1/M^2)\) and \(\mathcal{O}(1/M^4)\) terms in the four classes of diagrams (one-, b-, t- and W-loop classes), with little attention to signs and 2s.

For the one-loop diagrams, Eqs. (19,20) give the ratio of the \(\mathcal{O}(1/M^4)\) parts to the \(\mathcal{O}(1/M^2)\) part as

$$\begin{aligned}&\left( \frac{ v^2}{ M^2 } \right) \nonumber \\&\times \quad \frac{ 2 [\rho ^E]_{\mu \mu } \Lambda ^2_6 + \left[ Y^E_{\mu \mu } \Lambda _6 - [\rho ^E]_{\mu \mu } \Lambda _5 \right] ( 1 + \ln \frac{m_h^2}{M^2} /\ln \frac{m_\mu ^2}{m_h^2} ) }{ 2 Y^E_{\mu \mu } \Lambda _6 }\nonumber \\ \end{aligned}$$
(26)

where I used \(v^2/m_h^2 \simeq 4\). Recall that, in this paper, \(\tan \beta \) is defined referring to the leptons, see Eq. (13), so \({[\rho ^E]_{\mu \mu }}/{[Y^E]_{\mu \mu }} \equiv \tan \beta \). Therefore, the \(\mathcal{O}(1/M^2)\) parts are larger than the \(\mathcal{O}(1/M^4)\) contributions, provided thatFootnote 3

$$\begin{aligned}&\Lambda _6 \tan \beta \frac{v^2}{M^2} < 1,\quad \tan \beta \frac{\Lambda _5}{ 4 \Lambda _6} \frac{v^2}{M^2} < 1\nonumber \\&\quad \Rightarrow \mathcal{O}\left( \frac{1}{M^2}\right) > \mathcal{O}\left( \frac{1}{M^4}\right) \end{aligned}$$
(27)

If \(1/50 < \tan \beta < 50\), \(v^2/M^2 < 0.01\), and \(\Lambda _5 \sim \Lambda _6 <1\), then the \(\mathcal{O}(1/M^2)\) terms give the correct order of magnitude, but the \(\mathcal{O}(1/M^4)\) terms can be required to get two significant figures for large \(\tan \beta \).

For the top loop, the ratio of \(\mathcal{O}(1/M^4)\) over the \(\mathcal{O}(1/M^2)\) terms is

$$\begin{aligned} \left( \frac{ v^2}{M^2} \right) \frac{ Y^{U}_{tt} \Lambda _6 }{[\rho ^{U}]_{tt} - Y^{U}_{tt} \Lambda _6/\ln ^2 \frac{m_t^2}{M^2} } \end{aligned}$$
(28)

where \(f(\frac{m_t^2}{m_h^2}) \simeq 1\) was used. This shows that the \(\mathcal{O}(1/M^2)\) terms are always larger (neglecting the possible cancellation of \([\rho ^{U}]_{tt} \ln ^2 \frac{m_t^2}{M^2}\) against \( \Lambda _6\)), however, in 2HDMs where the top couples mostly to the SM Higgs such that \([\rho ^{U}]_{tt} \ln ^2 \frac{m_t^2}{M^2} \ll \Lambda _6\), the dimension-eight contribution is only suppressed by \( \frac{m_t^2}{M^2} \ln ^2 \frac{m_t^2}{M^2}\) which is \(\simeq .2\) for \(\frac{m_t^2}{M^2} = 0.01\). So the dimension-eight contribution can be numerically relevant in some areas of parameter space.

For the b-loop, the ratio of \(\mathcal{O}(1/M^4)\) over \(\mathcal{O}(1/M^2)\) terms is

$$\begin{aligned}&\left( \frac{ v^2}{M^2} \right) \nonumber \\&\times \quad \frac{ \left[ 4 \Lambda _6 \rho ^{D \dagger }_{bb} + \left( Y^{D}_{bb} - \rho ^{D}_{bb}\frac{\Lambda _5 }{\Lambda _6} \right) \left( 1 + \ln ^2 \frac{m_h^2}{M^2}/\ln ^2 (\frac{m_b^2}{m_h^2}\right) \right] }{4 Y^{D}_{bb} }\nonumber \\ \end{aligned}$$
(29)

So the \(\mathcal{O}(1/M^2)\) parts are larger than the \(\mathcal{O}(1/M^4)\) contributions, provided that

$$\begin{aligned}&\frac{\Lambda _5 }{4 \Lambda _6 } \frac{\rho ^D_{bb}}{Y^D_{bb}} \frac{v^2}{M^2} < 1,\quad \Lambda _6 \frac{\rho ^D_{bb}}{Y^D_{bb}} \frac{v^2}{M^2} < 1\quad \Rightarrow \mathcal{O}(\frac{1}{M^2}) > \mathcal{O}(\frac{1}{M^4})\nonumber \\ \end{aligned}$$
(30)

where \(\frac{\rho ^D_{bb}}{Y^D_{bb}} \sim 35\) in 2HDMs where \(\rho ^D_{bb} \simeq 1\). As in the case of the one-loop contribution, the logarithm multiplying the \( \mathcal{O}({1}/{M^4}) \) heavy Higgs exchange diagrams runs from \(M \rightarrow m_b\) rather than from \(m_h \rightarrow m_b\), which could give a factor 2.

For the W-loop, the ratio of \(\mathcal{O}(1/M^4)\) over \(\mathcal{O}(1/M^2)\) terms is

$$\begin{aligned} \frac{ \frac{ m_W^2}{M^2} \ln \frac{ m_W^2}{M^2} \left( \frac{35}{4} \ln \frac{ m_W^2}{M^2} - \frac{3}{2} \right) }{ \ln \frac{ m_W^2}{M^2} -12} \sim \frac{ m_W^2}{M^2} \ln ^2 \frac{ m_W^2}{M^2} \end{aligned}$$
(31)

where the estimate neglects cancellations and shows that the \(\mathcal{O}(1/M^4)\) contribution is only mildly suppressed, because \(z \ln ^2 z\) does not decrease rapidly.

So in summary, provided that

$$\begin{aligned} \Lambda _6 \sim \Lambda _5,\quad \frac{v^2}{M^2} \cot \beta <1,\quad \frac{v^2}{M^2} \tan \beta <1 \end{aligned}$$
(32)

in all four types of 2HDM, the \(\mathcal{O}(1/M^2)\) terms give the correct order of magnitude, but the \(\mathcal{O}(1/M^4)\) terms can be required to get two significant figures in some areas of parameter space.

2.2.2 The relative size of the four classes of diagram

The previous section suggested that it would be reasonable to use an EFT with dimension-six operators. So the next question is to determine the loop order to which the matching and running should be performed in the EFT, in order to reproduce the numerically dominant \(\mathcal{O}(1/M^2)\) terms. So below, the magnitudes of the \(\mathcal{O}(1/M^2)\) terms of the four classes of diagrams are compared.

The \(\mathcal{O}(1/M^2)\) part of the one-loop contribution arises from h exchange:

$$\begin{aligned} 2\sqrt{2} G_F m_\mu A^{h,1~loop}_{L}\sim & {} \frac{e m_\mu }{ \pi M^2} [\rho ^E]_{\mu e} Y^E_{\mu \mu } \Lambda _6 \end{aligned}$$
(33)

where \(v^2/m_h^2 \simeq 4\), and \(\frac{1}{16 \pi } \ln \frac{m_\mu ^2}{m_h^2} =0.998 \simeq 1\).

Normalising the \(\mathcal{O}(1/M^2)\) part of the two-loop diagrams to Eq. (33) gives

$$\begin{aligned}&{t~loop} \sim \frac{ \alpha m^2_t }{32\pi ^2 m^2_\mu } \frac{ 3Q_t^2}{\Lambda _6} \left[ [\rho ^{U}]_{tt} \ln ^2 \frac{m_t^2}{M^2} - \Lambda _6 ) \right] ,\end{aligned}$$
(34)
$$\begin{aligned}&{b~loop} \sim \frac{\alpha m^2_b }{16\pi ^2 m^2_\mu } 3Q_b^2 \ln ^2 (\frac{m_b^2}{m_h^2}) ,\end{aligned}$$
(35)
$$\begin{aligned}&{W~loop} \sim \frac{ \alpha }{64 \pi ^2} \frac{ m^2_t}{m^2_\mu } \left[ \ln \frac{ m_W^2}{M^2} -12 \right] \end{aligned}$$
(36)

where \(f(m_t^2/m_h^2) \simeq 1\) was used. Since \( \frac{ \alpha }{64 \pi ^2} \frac{ m^2_t}{m^2_\mu } \simeq 32\), this reproduces the well-known dominance of the top and W-loops. Interestingly, only the top loop can be enhanced (or suppressed) by the angle \( \beta \)—the relative magnitude of the other terms is mostly controlled by Standard Model parameters.

The two-loop top and b contributions contain log\(^2\) terms, which should arise at second order in the one-loop RGEs of dimension-six operators. However, there are significant terms of the top and W contributions without a log, which presumably arise in two-loop matching. The log term of the W amplitude should be generated by two-loop RGEs. So we see that a two-loop analysis would be required to reproduce the dominant \(\mathcal{O}(1/M^2)\) terms.

3 The EFT version

The aim of this section is to obtain, in EFT, the “leading” \(\mathcal{O}([\alpha \log ]^n/M^2)\) parts of the \(\mu \rightarrow e \gamma \) amplitude. Appendix B discusses where some other parts of the full-model calculation would arise. EFT is transparently reviewed in [1, 2], and the EFT construction here attempts to follow the recipe given there. For simplicity, the EFT has only three scales: the heavy Higgs scale M, the weak scale \(m_W\) (taken \(\simeq m_Z, m_h,m_t\)) and a low scale \(m_\mu \) (taken \(\simeq m_b\)). Between M and \(m_W\), the theory and operators are \(SU(3) \times SU(2) \times U(1)\) invariant; below \(m_W\), they are \(SU(3) \times U(1)\) invariant.

The EFT constructed here contains dimension-six operators, which should reproduce the \(\mathcal{O}(1/M^2)\) terms of the \(\mu \rightarrow e \gamma \) amplitude. In Eqs. (39)–(42) the dimension-six, SU(2)-invariant operators required above \(m_W\) are listed. Equation (47) gives some additional SU(3)\(\times \)U(1)-invariant, dimension-six operators which are required below \(m_W\) (but which would have been of dimension eight in the SU(2)-invariant formulation appropriate above \(m_W\)). As expected, the operator basis below \(m_W\) should include all four-fermion operators that are of dimension six in a QED\(\times \) QCD-invariant theory. If instead the basis is restricted to SU(2)-invariant “Buchmuller–Wyler” operators, one cannot obtain all the \(\mathcal{O}([\alpha \log ]^n/M^2)\) terms of the \(\mu \rightarrow e \gamma \) amplitude (only the top loop is included).

The EFT studied here is at “lowest order” in the loop expansion: tree-level matching of operator coefficients, and one-loop RGEs. This should reproduce the \(\mathcal{O}(\alpha ^n \log ^n)\) termsFootnote 4 in the amplitude. I do not calculate the two-loop matching, or the two-loop RGEs, which would allow one to reproduce the dimension-six contributions of the top and W.

3.1 Setting up the EFT calculation

The aim of this top-down EFT calculation is to reproduce the amplitude for \(\mu \rightarrow e \gamma \) in the 2HDM. So the first step is to match out the heavy Higgses H and A at the “New Physics scale” M. The \(H^\pm \) are neglected because the full-model calculation (to which the EFT is compared) includes only neutral Higgses. Presumably the \(H^\pm \) contribution ensures SU(2) invariance. The 2HDM from above M is matched onto the (unbroken) Standard Model, with its full particle content, and a selection of \(SU(3) \times SU(2) \times U(1)\)-invariant dimension-six operators. A list of possible operators was given by Buchmuller and Wyler [3], and slightly reduced in [4]. Here, only those operators which are required to reproduce the \(\mathcal{O}(1/M^2)\) part of the \(\mu \rightarrow e \gamma \) amplitude are selected. In matching out, the operator coefficients of the effective theory are assigned so as to reproduce the tree-level Green’s functions of the full theory, at zero external momentum.

The second step is to run the operator coefficients down to the scale \(m_W\). This running should be performed with electroweak RGEs, which are given in [6, 41]. However, since CHK separate photon and Z diagrams, and only the photon contributions were retained in the previous section, the running from \(M\rightarrow m_W\) is performed with the RGEs of QED. The operator coefficients evolve [42] with scale \(\mu \) as

$$\begin{aligned} \mu \frac{\partial }{\partial \mu } \vec {C} = \frac{\alpha _{em}}{4\pi }\vec {C} \varvec{\Gamma } \end{aligned}$$
(37)

where the operator coefficients have been organised into a row vector \( \vec {C} \), and \(\frac{\alpha _{em}}{4\pi } \varvec{\Gamma }\) is the anomalous dimension matrix. The algorithm to calculate \( \varvec{\Gamma }\) is given, for instance, in [43]. For a square \(\varvec{\Gamma }\), this equation can be perturbatively solved to give the components of the vector \(\vec {C}(\mu )\) at a lower scale \(\mu \):

$$\begin{aligned}&C_A(M)\nonumber \\&\qquad \times \left( \delta _{AB} - \frac{\alpha (\mu ) }{4\pi } \left[ \Gamma \right] _{AB} \ln \frac{M}{\mu } + \frac{\alpha ^2(\mu ) }{32\pi ^2} \left[ \Gamma \Gamma \right] _{AB} \ln ^2 \frac{M}{\mu } +\cdots \right) \nonumber \\&\quad = C_B (\mu ) \end{aligned}$$
(38)

At \(m_W\), the WZh and t should be matched out. The theory below \(m_W\) should be QED and QCD for all the SM fermions except the top, augmented by a complete set of \(QED \times QCD\)-invariant dimension-six operators. For simplicity, I consider only QED for the \(b,\mu \) and e, plus those dimension-six operators required in matching onto the tree-level Green’s functions of the SU(2)-invariant EFT from above \(m_W\).

Finally, the operator coefficients are run down to \(m_\mu \), where the dipole coefficient can be used to calculate the \(\mu \rightarrow e \gamma \) amplitude. In principle, the RGEs of QED and QCD should be used. However, QCD is neglected because it is not included in the full-model calculation of CHK.

3.2 Matching at M and one-loop running to \(m_W\)

The operator for \(\mu \rightarrow e \gamma \) is given in Eq. (16). It is convenient to rescale the coefficient as

$$\begin{aligned}&2\sqrt{2} G_F m_\mu A^{ij}_R (\overline{e}_i \sigma ^{\alpha \beta }P_R e_j ) F_{\alpha \beta }\nonumber \\&\quad = \frac{ Y^E_{\mu \mu }}{M^2} C^{ij}_{D,R} (\overline{L}_i H_1 \sigma ^{\alpha \beta }P_R E_j ) F_{\alpha \beta }\equiv \frac{ Y^E_{\mu \mu }}{M^2} C^{ij}_{D,R} { O}^{ij}_{D,R}\nonumber \\ \end{aligned}$$
(39)

so that the dimensionful operator coefficient is suppressed by the heavy Higgs scale M. Above \(m_W\), there is a hypercharge dipole and an SU(2) dipole; here only the linear combination corresponding to the photon dipole is used, because only the QED part of the SU(2) running is included. To obtain the \(\mathcal{O}([\alpha \log ]^n/M^2)\) parts of the \(\mu \rightarrow e \gamma \) amplitude, only three additional operators from the pruned Buchmuller–Wyler list are required between M and \(m_W\):

$$\begin{aligned}&{ O}^{e \mu tt}_{LE Q U}= (\overline{L}_e^A E_\mu )\epsilon _{AB} (\overline{Q}^B_t U_t ) ,\nonumber \\&{ O}^{ \mu e tt}_{LE Q U}= (\overline{L}_\mu ^A E_e )\epsilon _{AB} (\overline{Q}^B_t U_t ),\end{aligned}$$
(40)
$$\begin{aligned}&{ O}^{e \mu tt}_{T, LEQU} = (\overline{L}_e^A \sigma ^{\mu \nu } E_\mu )\epsilon _{AB} (\overline{Q}^B_t \sigma _{\mu \nu } U_t )\nonumber \\&{ O}^{\mu e tt}_{T, LEQU} = (\overline{L}_\mu ^A \sigma ^{\mu \nu } E_e )\epsilon _{AB} (\overline{Q}^B_t \sigma _{\mu \nu } U_t ) ,\end{aligned}$$
(41)
$$\begin{aligned}&{ O}^{e \mu }_{eH } = H_1^\dagger H_1 \overline{L}_e H_1 E_\mu ,\nonumber \\&{ O}^{\mu e}_{e H } = H_1^\dagger H_1 \overline{L}_\mu H_1 E_e, \end{aligned}$$
(42)

where \(\sigma \) is the anti-symmetric tensor \(\frac{i}{2}[\gamma ,\gamma ]\), and \(\epsilon \) provides an SU(2) contraction. These operators appear in the Lagrangian as

$$\begin{aligned} \delta \mathcal{L} = - \sum _\mathrm{operators}\frac{C^{ijmn}_{Z}}{M^2} { O}^{ijmn}_{Z} +h.c. \end{aligned}$$
(43)

so the coefficients C are dimensionless, and the four-fermion operators are normalised such that the Feynman rule is \(-iC/M^2\).

The last pair of operators, Eq. (42) will give the flavour-changing light Higgs interaction required for the one-loop, b-loop and W-loop amplitudes. As can be seen from the right diagram of Fig. 2, matching at the scale M of the tree-level Green’s functions of the full theory onto those of the SM + dimension-six operators gives coefficients

$$\begin{aligned} \frac{C_{eH}^{e\mu }}{M^2} = -\frac{[\rho ^E]_{e \mu } \Lambda _6}{M^2},\quad \frac{C_{eH}^{\mu e}}{M^2} = -\frac{[\rho ^E]_{ \mu e} \Lambda _6}{M^2}. \end{aligned}$$
(44)

The running of these coefficients between M and \(m_W\) is neglected, because it is not required to reproduce the CHK results.

Fig. 2
figure 2

The left diagram generates the dimension-six \((\overline{Q}_3 U_t) (\overline{L}_e E_\mu )\) operator, by matching out the heavy doublet Higgs \(H_2\) (dashed line). The right diagram generates the dimension-six \(H_1^\dagger H_1 (\overline{L}_e H_1 E_\mu )\) operator

The first four operators of the list above will generate the top loop contribution. Matching at M via the diagram given on the left in Fig. 2 gives coefficients for the scalar LEQU operators,

$$\begin{aligned} \frac{C_{LEQU}^{e\mu tt}}{M^2}= & {} -\frac{[\rho ^E]_{e \mu } [\rho ^{U \dagger }]_{tt}}{M^2},\quad \frac{C_{LEQU}^{\mu e tt}}{M^2} = -\frac{[\rho ^E]^*_{ \mu e} [\rho ^{U \dagger }]_{tt}}{M^2},\nonumber \\ \end{aligned}$$
(45)

where the negative sign is from the scalar propagator. Then, between the scales M and \(m_t\), the RGEs of QED mix the scalar operator \({ O}^{e \mu tt}_{LE Q U}\) to the tensor \({ O}^{e \mu tt}_{T, LEQU}\), and the tensor to the dipole, such that the coefficient of \({ O}^{e\mu }_{D,L}\) is

$$\begin{aligned} -m_\mu \frac{C_{D,L}^{e\mu }}{M^2}= & {} -\frac{C^{e \mu tt}_{LE Q U}}{M^2} \frac{e \alpha }{128\pi ^3} \left[ (2Q_t)(8N_c Q_t m_t) \right] \log ^2 \frac{M}{m_t}\nonumber \\= & {} \frac{e \alpha }{32\pi ^3M^2} 3 Q^2_t m_t [\rho ^E]_{e \mu } [\rho ^{U \dagger }]_{tt} \log ^2 \frac{m_t^2}{M^2}\nonumber \\ \end{aligned}$$
(46)

where in the brackets after the first equality is the product of the scalar\(\rightarrow \)tensor and tensor\(\rightarrow \)dipole elements of the anomalous dimension matrix \(\varvec{\Gamma }\). This agrees with the \(\mathcal{O}(1/M^2)\) part of Eq. (21) that is generated by heavy Higgs exchange.

3.3 Matching at \(m_W\) and one-loop running to \(m_\mu \)

At the weak scale \( \simeq m_W\), the hW and t should be matched out of the theory, onto a basis of dimension-six operators. These operators should respect the gauge symmetries below \(m_W\), which, in the absence of the hW and Z, are QCD and QED. So there is no reason to impose SU(2) on the operator basis below \(m_W\).

  1. 1.

    If the matching is performed at tree or one loop, then matching out the top leaves only the contribution to the dipole operator given in Eq. (46). The top-loop contribution with a light Higgs, which is \(\mathcal{O}(1/M^2)\), could be included by matching at two-loop.

  2. 2.

    In matching out the Higgs h, the one-loop, and b-loop \(\mathcal{O}(1/M^2)\)-contributions can be obtained by matching onto the scalar and tensor operators:

    $$\begin{aligned}&{ O}_{S}^{e\mu bb} = (\overline{e} P_R \mu ) (\overline{b} P_Rb),\quad { O}_{S}^{\mu e bb} = (\overline{\mu } P_R e) (\overline{b} P_R b), \nonumber \\&{ O}_{T}^{e\mu bb} = (\overline{e} \sigma P_R \mu ) (\overline{b } \sigma P_R b) ,\quad \nonumber \\&{ O}_{T}^{\mu e bb} = (\overline{\mu } \sigma P_R e) (\overline{b } \sigma P_R b) ,\nonumber \\&{ O}_{S}^{e \mu \mu \mu } = (\overline{e} P_R\mu ) (\overline{\mu } P_R \mu ) ,\quad { O}_{S}^{\mu e \mu \mu } = (\overline{\mu } P_R e) (\overline{\mu } P_R \mu ) ,\nonumber \\&{ O}_{T}^{e \mu \mu \mu } = (\overline{e} \sigma P_R\mu ) (\overline{\mu } \sigma P_R\mu ),\quad \nonumber \\&{ O}_{T}^{\mu e \mu \mu } = (\overline{\mu } \sigma P_R e) (\overline{\mu } \sigma P_R \mu ). \end{aligned}$$
    (47)

    If SU(2) were imposed, these operators would be of dimension eight (for instance, the first operator could be written as \(^{(8)}{ O}^{e \mu bb }_{LEQD} = (\overline{L}_e H E_\mu ) (\overline{Q}_3 H D_b)\)). However, they are of dimension six in the QCD\(\times \)QED-invariant EFT below \(m_W\), and they are required in the 2HDM to correctly reproduce the \(\mathcal{O}(\alpha ^n \log ^n/M^2)\) terms that dimension-six, one-loop EFT should obtain. The operators of Eq. (47) are not included in the EFT analysis of \(\mu \rightarrow e \gamma \) performed by Pruna and Signer [6], who make the restriction to dimension-six SU(2)-invariant operators.

    From the diagrams illustrated in Fig. 3, one obtains

    $$\begin{aligned}&\frac{C^{e \mu \mu \mu }_{S}}{M^2} = \frac{[Y^E]_{\mu \mu }[\rho ^E]_{ e\mu } \Lambda _6v^2/2}{ m_h^2 M^2} \nonumber \\&\frac{C^{ \mu e \mu \mu }_{S}}{M^2} = \frac{[Y^E]_{\mu \mu }[\rho ^E]_{ \mu e } \Lambda _6v^2/2}{ m_h^2 M^2} ,\end{aligned}$$
    (48)
    $$\begin{aligned}&\frac{C^{e \mu bb }_{S}}{M^2} = \frac{[Y^D]_{bb}[\rho ^E]_{ e\mu } \Lambda _6v^2/2}{ m_h^2 M^2} \nonumber \\&\frac{C^{ \mu e bb }_{S} }{M^2}= \frac{[Y^D]_{bb}[\rho ^E]_{ \mu e } \Lambda _6v^2/2}{ m_h^2 M^2}, \end{aligned}$$
    (49)

    so the coefficients are \(\mathcal{O}(1/M^2)\) because \(v^2/m_h^2 \simeq 4\).

    Then, between the scales \(m_W\) and \(m_\mu \), the RGEs of QED mix the scalar operator \({ O}^{e \mu bb}_{S}\) to the tensor \({ O}^{e \mu bb}_{T}\), which mixes to the dipole, exactly as in the previously discussed case of tops. So with the anomalous dimensions as in Eq. (46), the dipole coefficient

    $$\begin{aligned}&-m_\mu \frac{ C_{D,L}^{e\mu } }{M^2} =-\frac{C^{e \mu bb}_{S}}{M^2} \frac{e \alpha m_b }{32\pi ^3} 3 Q^2_b \log ^2 \frac{m_\mu ^2}{m_h^2} \nonumber \\&\quad = - \frac{e \alpha m_b }{64\pi ^3M^2} 3 Q^2_b \frac{[Y^D]_{bb}[\rho ^E]_{ e\mu } \Lambda _6v^2}{ m_h^2 } \log ^2 \frac{m_\mu ^2}{m_h^2}\nonumber \\ \end{aligned}$$
    (50)

    is generated, in agreement with the \(\mathcal{O}(1/M^2)\) part of the b-loop contribution to \(\mu \rightarrow e \gamma \), given in Eq. (23).

    In matching out the Higgs h, the flavour-changing operator \({ O}^{e \mu }_{eH}\) can also generate the dimension-six operator \({ O}^{ e \mu bb}_{LE D Q}= (\overline{L}_e E_\mu ) (\overline{D}_b Q_b )\) with coefficient \(\propto \rho ^E_{\mu e} \Lambda _6 v^2/(m_h^2 M^2)\). However, this operator is not useful for generating \(\mu \rightarrow e \gamma \) via one-loop RGEs, because there is no tensor operator for it to mix to, on the way to the dipole. The reason that there is no tensor is that \(\sigma \) and \(\sigma \gamma _5\) are related: \(\sigma _{\mu \nu } = \frac{i}{2} \varepsilon _{\mu \nu \alpha \beta } \sigma ^{\alpha \beta } \gamma _5\), which implies that \((\overline{e}\sigma ^{\alpha \beta } \gamma _5 \mu ) (\overline{b}\sigma _{\alpha \beta } \gamma _5 b) = (\overline{e} \sigma _{\mu \nu } \mu ) (\overline{b}\sigma ^{\mu \nu } b ) \) or \((\overline{e}\sigma ^{\alpha \beta } P_L \mu ) (\overline{b}\sigma _{\alpha \beta } P_R b) = 0\).

    The scalar operator with three muons, \({ O}^{e \mu \mu \mu }_{S}\), mixes directly to the dipole via a penguin diagram, so the coefficient of \({ O}^{e\mu }_{D,L}\) is

    $$\begin{aligned}&-m_\mu \frac{ C_{D,L}^{e\mu } }{M^2} = C^{ \mu e \mu \mu }_{S} \frac{\alpha }{16\pi ^2} \left[ \frac{-1}{e} \right] \log \frac{m_h}{m_\mu } \nonumber \\&\quad =\frac{e m_\mu }{64\pi ^2 M^2} \frac{[Y^E]_{\mu \mu }[\rho ^E]_{ \mu e } \Lambda _6v^2}{ m_h^2} \log \frac{m^2_\mu }{m_h^2} \end{aligned}$$
    (51)

    where in the brackets are the scalar\(\rightarrow \)dipole element of the anomalous dimension matrix \(\varvec{\Gamma }\). This agrees with the \(\mathcal{O}(1/M^2)\) part of the one-loop, light Higgs diagrams; see Eq. (20).

  3. 3.

    In matching out the W at tree or one-loop, none of the W-loop contribution to \(\mu \rightarrow e \gamma \) is included, because the light Higgs part arises in two-loop matching. In the full-model result of Eq. (25), there is also an \(\mathcal{O}(1/M^2)\) heavy Higgs part, but it is only enhanced by one log and presumably arises in the two-loop RGEs.

Fig. 3
figure 3

The right diagram generates the QCD\(\times \)QED-invariant, dimension-six operator \((\overline{e } P_R \mu )(\overline{\mu } P_R \mu )\), by matching out the light Higgs h (dotted line), which has a SU(2)-invariant dimension-six LFV vertex represented by the grey circle. The free Higgs legs attach to the vev. The left diagram generates a similar operator \((\overline{b} P_R b) (\overline{e} P_R \mu )\) involving b quarks

4 Discussion

The CP-conserving 2HDM studied here is a minimal extension of the Standard Model—it has only one LFV coupling \([\rho ^E]_{\mu e} \simeq [\rho ^E]_{ e \mu }\) (see Eq. 14), the magnitude of the other new flavoured couplings is controlled by the single parameter \( \tan \beta \), defined in Eq. (13), and the only new mass scale is the heavy doublet mass M, taken \( \gtrsim 10v\). The one- and two-loop contributions to \(\mu \rightarrow e \gamma \) of the neutral Higgses were calculated by Chang et al. [5], and their result, in the decoupling limit, is given in Eqs. (20), (19), (21), (23) and (25).

Section 3 tries to reproduce the amplitude for \(\mu \rightarrow e \gamma \), in a simple EFT with three scales: \(M, m_W, m_\mu \). At the scale M, the heavy Higgses were “matched out” onto dimension-six, SU(2)-invariant operators. Between M and \(m_W \), the operator coefficients should run according to electroweak RGEs, but I used those of QED (because I compare only to the photon diagrams of [5]). At \(m_W\), the WZh and t are matched out, so below \(m_W\) is an EFT containing all Standard Model fermions but the top, interacting via QCD, QED, and various four-fermion operators. Finally the operator coefficients run to \(m_\mu \) according to the RGEs of QED.

There were three issues in reproducing the amplitude for \(\mu \rightarrow e \gamma \) in EFT:

  1. 1.

    In order to obtain the \(\mathcal{O}(1/M^2)\) terms of the full-model calculation, the operator basis below the weak scale needed to include QED\(\times \)QCD-invariant four-fermion operators which are dimension six, but would have been dimension eight in the SU(2)-invariant basis appropriate above \(m_W\).Footnote 5

    This was the case for the one-loop contribution of the light Higgs to \(\mu \rightarrow e \gamma \): the light Higgs has a flavour-changing dimension six interaction \(H_1^\dagger H_1 \overline{L}_e H_1 E_\mu \). At \(m_h\), the light Higgs can be matched onto the operator \(v^2 (\overline{e} P_R \mu ) (\overline{\mu } P_R \mu )\) = \((\overline{L}_e H E_\mu ) (\overline{L}_\mu H E_\mu )\) with coefficient \(1/(M^2 m_h^2)\), as illustrated on the right in Fig. 3. Then, in QED running down to \(m_\mu \), this operator mixes to the dipole by a penguin diagram obtained by closing a muon loop, inserting a mass and attaching a photon. So the contribution to \(\mu \rightarrow e \gamma \) is \(\propto v^2/(M^2 m_h^2) \log (m_\mu ^2/v^2) \sim \log (m_\mu ^2/v^2) /M^2 \).

    However, although the coefficients of \((\overline{e} P_R \mu ) (\overline{\mu } P_R \mu )\) and \((\overline{e} P_R \mu ) (\overline{b} P_R b)\) are formally \(\mathcal{O}(1/M^2)\), they are also small, because proportional to light fermion Yukawa couplings. As discussed in Sect. 2.2.2, they do not give a numerically relevant contribution to \(\mu \rightarrow e \gamma \) in the decoupling limit of the 2HDM. So although these extra operators are required below \(m_W\) to obtain all the \(\mathcal{O}(1/M^2)\) terms, they are not necessary for getting a reasonable approximation to the answer.Footnote 6

  2. 2.

    There are two-loop contributions involving a top or W-loop, initially discussed by Bjorken and Weinberg [3], and illustrated in Fig. 1. As discussed in Sect. 2.2.2, these always dominate the one-loop contribution to \(\mu \rightarrow e \gamma \) in the decoupling limit. This disorder in the loop expansion occurs because the one-loop contribution is suppressed by the square of the muon Yukawa coupling, whereas the two-loop diagrams are proportional to the square of the gauge coupling or of the top Yukawa. In the EFT, two-loop matching and running would be required to reproduce the W-loop, and part of the top loop.

    This “disorder” is partly a feature of the dipole operator, and partly of the 2HDM. The dipole operator has a Higgs leg, and at one loop, that Higgs can attach to the fermion line, or to the boson line, if the boson of the loop has a dimension-three coupling to the Higgs (such as the \(\mu ^* H^*_u L E\) interaction in supersymmetry). However, at two loops, there are many more possibilities for attaching the external Higgs leg with an \(\mathcal{O}(1)\) coupling. In some models, it might be possible to have \(\mathcal{O}(1)\) coupling of the external Higgs leg to the fermion or boson of the one-loop diagram. However, in the case of the 2HDM, there are no new fermions so the fermion of the one-loop diagram is at best a \(\tau \). The boson is a Higgs without dimension-three interactions, so the one-loop diagram is suppressed by small Yukawa couplings. This issue could be addressed by running and matching at two-loop,Footnote 7 as is done for \(b\rightarrow s \gamma \) [43].

  3. 3.

    Section 2.2.1 checked that the \(\mathcal{O}(1/M^2)\) terms in the amplitude are larger than the \(\mathcal{O}(1/M^4)\) terms, so a parametrisation in terms of dimension-six operators should work. However, dimension-eight operators can be enhanced by logs and factors of \(\tan \beta \), so that it may be necessary to include them for numerical accuracy.

    It is convenient to neglect dimension-eight operators, and they are expected to be suppressed by \( \sim z =v^2/M^2\). However, if dimension-eight operators are log-enhanced in running, and dimension-six operators only contribute in matching, then dimension eight are only suppressed by \(z \log ^n(z)\). This was the case in the 2HDM for the two-loop contributions of the Ws and bs. Since the high scale is not so high in the 2HDM, (e.g. \(v/M \simeq 1/10\)), \(z =v^2/M^2 \simeq 0.01\), but \(z \log ^2(z) \simeq .2\).

    There is also the inevitable ignorance, in EFT, as regards the magnitude of operator coefficients in the full theory. This uncertainty is parametrised by \(\tan \beta \) in the 2HDM. To justify neglecting the dimension-eight operators, the restrictions \(\cot \beta , \tan \beta < v^2/M^2\) had to be imposed. However, since flavour physics is about the hierarchy of couplings, it may not be sensible to assume that all New Physics couplings are \(\mathcal{O}(1)\) at the New Physics scale.

Fig. 4
figure 4

Obtaining the Barr–Zee diagrams at second order in the one-loop RGEs: the scalar operator (grey circle of the left diagram) is mixed to the tensor (grey circle of the next diagram) via photon exchange between the leptons and quarks (only one of the four diagrams is drawn). Then the quark loop of the tensor operator is closed to mix it to the dipole

This exercise located the “Barr–Zee” diagrams (see the right two diagrams of Fig. 1) in an EFT description.

  1. 1.

    The top loop with heavy Higgses contributes at second order in the one-loop RGEs for dimension-six operators, as sketched in Fig. 4, so it is included in the EFT used here. (There is also a dimension-eight contribution.) The top loop with the light Higgs would appear in two-loop matching at \(m_W\).

  2. 2.

    The W-loop with heavy Higgs contributes in two-loop RGEs of dimension-six operators, and at second order in the one-loop RGEs for dimension-eight operators. The W-loop with the light Higgs would contribute in two-loop matching at \(m_W\). As a result, none of the W contribution is included in the EFT used here.

  3. 3.

    The b-loop contribution is small in the decoupling limit. The light Higgs contribution, which appears at second order in the one-loop RGEs for dimension-six operators below \(m_W\), is suppressed by the square of the SM Yukawa coupling of the b. The heavy Higgs contribution, which could be \(\tan \beta \) enhanced, however, only contributes at dimension eight, via second order terms in the one-loop RGEs. This is because the tensor operator \((\overline{\psi _1} \sigma P_L \psi _2) (\overline{\psi _3} \sigma P_R \psi _4)\) vanishes, so the SU(2)-invariant dimension-six scalar operator that can be constructed with leptons and down quarks, \( (\overline{Q} D)(\overline{E} L)\), cannot mix via RGEs to a tensor and then the dipole.

5 Summary

This paper used Effective Field Theory (EFT) to calculate the amplitude for \(\mu \rightarrow e \gamma \) in the decoupling limit of a 2 Higgs Doublet Model (2HDM) of Type III. A “leading order” EFT was used, with one-loop running, tree matching, dimension-six operators, and three scales (the heavy doublet mass M, the electroweak scale \(m_W\), and \(m_\mu \)). Comparing this EFT to a calculation performed in the 2HDM showed two things: first, that this EFT reproduces (as it should) the \(\mathcal{O}([\alpha \log ]^n/M^2)\) terms of the full-model calculation, provided that the operator basis below \(m_W\) is constructed with dimension-six \(SU(3)\times U(1)\) invariant operators (some of which could be dimension eight in an SU(2)-invariant construction). Secondly, to obtain the numerically dominant contributions to \(\mu \rightarrow e \gamma \) in the 2HDM, two-loop RGEs and two-loop matching are required, but the enlarged operator basis below the weak scale is not.

The 2HDM also illustrates that higher dimensional operators may be numerically relevant, because they can be enhanced by unknown large couplings of the high-scale model (\(e.g.\ \tan \beta \)), or by logarithms.