1 Introduction

Upon the discovery of the Higgs boson in 2012 [1, 2], the particle content as proposed within the Standard Model (SM) was confirmed to exist. Precise determinations of the properties of the Higgs boson, as well as in other sectors of physics, are a major component of the proposed program of next-generation lepton colliders. There are five proposals for so-called electron–positron “Higgs factories” [3]: the Circular Electron Positron Collider (CEPC) [4,5,6], the International Linear Collider (ILC) [7,8,9,10], the Future Circular Collider (FCC-ee) [11,12,13,14], the Compact Linear Collider (CLIC) [15,16,17] and the Cool Copper Collider \((\hbox {C}^3)\) [18,19,20]. The designation “Higgs factory” mainly refers to the stage of a future \(e^+e^-\) collider being run at a centre-of-mass energy of roughly \(\sqrt{s} =\) 240–250 GeV, where the integrated cross section of the Higgsstrahlung process peaks and so dominates over all Higgs production mechanisms. Other stages involve the operation as a “Z factory” at the Z pole for \(\sqrt{s} = m_Z\), close to the \(W^+W^-\) production threshold at \(\sqrt{s} = 2m_W\), and close to or at the \(t{\bar{t}}\) threshold for \(\sqrt{s} =\) 350–370 GeV, as well as potential upgrades for runs at even higher centre-of-mass energies [21].

The incompleteness of the SM has been corroborated upon the measurement of neutrino oscillations by the Super-Kamiokande [22] and SNO collaborations [23, 24], which evidence the need to introduce new physics in order to account for neutrino masses. A particularly straightforward way to do so is via postulating the existence of sterile neutrinos, several searches for which at future lepton colliders have been proposed. Most of the sensitivity studies focus on direct production of sterile neutrinos in electroweak processes and Higgs production [25,26,27,28,29,30,31,32,33], see also Refs. [34,35,36,37] for reviews on heavy neutral leptons at colliders. Similarly, electroweak triplet fermions at colliders have been studied in, e.g., Refs. [38, 39]. The anticipated high precision attainable at future lepton colliders also serves as a motivation to consider virtual corrections, see e.g. Ref. [40]. In Refs. [41, 42], the contribution of sterile neutrinos to the triple-Higgs coupling was studied.

In this work, we investigate the potential of electron–positron colliders to test the fermionic Seesaw scenarios [43,44,45,46,47,48], given the sub-percent precision to which these colliders are projected to measure the cross section of the Higgsstrahlung process \(e^+e^-\rightarrow Zh\). Achieving this precision is possible due to the so-called “recoil method” that refers to the selection of Higgsstrahlung events solely through measuring the four-momenta of the decay products of the Z boson which recoils against the Higgs boson. This method is in principle applicable for any Higgs decay mode and thus allows for a model-independent reconstruction of the Higgs-boson mass, see for instance Ref. [3]. The Higgsstrahlung process is also well-understood in the SM. Recently, two-loop electroweak corrections to the SM cross section were calculated in Refs. [49, 50]; see also Refs. [51,52,53,54,55,56,57].

We consider the process at two benchmark centre-of-mass energies, \(\sqrt{s} = {240}\,\hbox {GeV}\) and 365 GeV. As argued for in Ref. [58], the smaller cross section at larger s can be partly compensated for by a higher instantaneous luminosity which scales approximately linearly with s. Moreover, the additional boost of the Z and h bosons allows for a better separation of the respective jets and therefore a more precise measurement of \(\sigma _{Zh}\times \text {BR}(H\rightarrow X)\). Further advantages mentioned are the immediate access to top-pair production as well as the \(e^+e^-\rightarrow H\nu _e\overline{\nu _e}\) process via \(W^+W^-\) fusion which enables a precise determination of the Higgs-boson width, and the possibility to measure the Higgs-boson mass with a precision similar to the one at \(\sqrt{s} ={240}\,\hbox {GeV}\).

In the fermionic Seesaw models, the smallness of the absolute mass scale of the light neutrinos generically requires the neutrino Yukawa couplings to be tiny for TeV-scale sterile states, and consequently lepton-number conserving processes are suppressed [59,60,61] and beyond the reach of the proposed lepton colliders. In contrast, symmetry-protected Seesaw models [59, 62,63,64,65,66,67,68,69,70,71,72] offer the possibility to entertain relatively light singlets or triplets around the TeV scale without the need to assume tiny Yukawa couplings only. Essentially, the new interaction states are assigned units of lepton number such that it is (approximately) conserved, which implies that some elements of the Dirac and Majorana mass matrices can be sizeable, whereas the remaining ones must be comparatively suppressed. The smallness of active neutrino masses is then guaranteed via the proportionality to these small entries, and does not only rely on an overall suppression through the large mass scale. In addition, no fine-tuned cancellation between (a priori unrelated) elements of the mass matrices are needed.

The remaining paper is organised as follows. In Sect. 2 we introduce the relevant theoretical framework. We discuss Higgsstrahlung and other relevant processes which are sensitive to the same parameters in Sect. 3. Our results are summarised and conclusions are drawn in Sect. 4. In the Appendices, more technical details and useful formulae are collected.

2 Theory framework

Firstly, to set our conventions for notation, signs, and normalisation, we state the electroweak (EW) and leptonic parts of the SM Lagrangian:

(1)

with the gauge-covariant derivative

$$\begin{aligned} D_{\mu }= & {} \partial _{\mu } - ig_2 W_{\mu }^a T^a - ig_1 Y B_{\mu }, \nonumber \\ T^a= & {} {\left\{ \begin{array}{ll} \dfrac{\sigma ^a}{2} &{} \text {EW doublet}\\ 0 &{} \text {EW singlet}\\ \end{array}\right. }, \end{aligned}$$
(2)

in terms of the Pauli matrices \(\sigma ^a\) with \(a = 1,2,3\).

2.1 Effective field theory

2.1.1 SMEFT

In the Standard Model Effective Field Theory (SMEFT) we extend the SM Lagrangian with a set of dimension-five and -six operators \({\mathcal {O}}_i\):

$$\begin{aligned} {\mathcal {L}} = {\mathcal {L}}_{\text {SM}} + \sum _i C_i {\mathcal {O}}_i. \end{aligned}$$
(3)

In our convention the Wilson coefficients \(C_i\) are dimensional; for later use we also define the dimensionless variants \({\hat{C}}_i \equiv C_i \times {\textrm{TeV}}^2\) of the dimension-six coefficients.

The conventional basis of operators is the so-called Warsaw basis [73]. For the convenience of the reader, we reproduce the most relevant operators for the fermionic Seesaw models (additional operators relevant to Higgsstrahlung are also listed in Table 2):

(4)

with \({\widetilde{H}} = i\sigma _2 H^*\), and C the charge conjugation matrix satisfying \(C^\dagger \gamma ^\mu C = -(\gamma ^\mu )^T\).

In addition to directly contributing to physical processes, the SMEFT operators also modify the relations between observed quantities and SM parameters, as well as the relations between different parameters. The result of these effects is that the SM parameters will be numerically different in SMEFT compared to the SM. For example, the measured value of the Fermi constant, \(\hat{G}_F\), in muon decay will differ from its standard expression \(G_F = 1/(\sqrt{2} v_T^2)\), \(v_T\) being the Higgs VEV in SMEFT (see e.g. Ref. [74]), by

(5)

When computing the shift in a quantity such as a cross section from its SM value, one must therefore take care to include the contributions from these parameter shifts. Concretely, if \(\sigma _{\text {SM}}(g_i)\) is a cross section computed in the SM written as a function of the parameters \(g_i\), then

(6)

where \(\Delta \sigma _{\text {Direct}}\) captures direct contributions of new operators, and is the shift in the parameter \(g_i\). The values of the parameter shifts critically depend on one’s choice of input parameters; in this work we use \((\alpha , m_Z, G_F)\) as our electroweak inputs. A thorough review of such shifts may be found for instance in Ref. [75], and we provide additional exposition in Appendix A.

2.1.2 LEFT

To describe physics taking place at scales below the electroweak scale we utilise Low-Energy Effective Field Theory (LEFT). The relevant part of the Lagrangian for purely leptonic transitions reads in the Jenkins–Manohar–Stoffer (JMS) basis [76]

(7)

We define the covariant derivative in QED as in \(D_{\mu }=\partial _{\mu } + i Q e A_{\mu }\), following Ref. [76]. For semi-leptonic neutral-current transitions, the relevant part of the Lagrangian is

$$\begin{aligned} {\mathcal {L}}{} & {} \supset C_{eq}^{VLL} ({\overline{\ell }} \gamma ^\mu P_L \ell ) ({\overline{q}} \gamma _{\mu } P_L q)\nonumber \\{} & {} \quad + C_{eq}^{VRR} ({\overline{\ell }} \gamma ^\mu P_R \ell ) ({\overline{q}} \gamma _{\mu } P_R q) \nonumber \\{} & {} \quad +C_{eq}^{VLR} ({\overline{\ell }} \gamma ^\mu P_L \ell ) ({\overline{q}} \gamma _{\mu } P_R q)\nonumber \\{} & {} \quad + C_{qe}^{VLR} ({\overline{q}} \gamma ^\mu P_L q) ({\overline{\ell }} \gamma _{\mu } P_R \ell ). \end{aligned}$$
(8)

To obtain the LEFT Wilson coefficients for the Seesaw models introduced below, we utilise the software package DsixTools [77] to (i) compute the renormalisation group (RG) running of the SMEFT coefficients between the Seesaw scale and the electroweak scale, \(\mu = m_Z\), (ii) match the SMEFT and LEFT coefficients, and (iii) run the LEFT coefficients to the low scale \(\mu = {5}\,\hbox {GeV}\). As there are no sizeable contributions to quark-field operators in the Seesaw models, we assume that this procedure captures the main contributions from RG running in LEFT, and further effects at lower scales do not appreciably change the results. See Appendix B for more details.

2.2 Seesaw models

2.2.1 Type-I

Table 1 SMEFT Wilson coefficients obtained from matching the Type-I and Type-III Seesaw models at the scale \(\mu = M^X\) [78,79,80]. In order to properly account for the stringent bounds from the non-observation of lepton flavour violation, the electroweak dipole operators \(C_{eB}\) and \(C_{eW}\) (last two rows) are matched to one-loop order, while all other operators are matched at tree level

In the Type-I Seesaw model [43,44,45,46,47] the SM Lagrangian is extended by adding \(n_\nu \) right-handed sterile neutrinos \(\{ \nu _{Ri} \}_{i=1}^{n_\nu }\) (so that there are a total of \(3 + n_\nu \) neutrinos), accompanied by a new Yukawa interaction to generate Dirac neutrino masses, as well as Majorana mass terms for the \(\nu _R\):

(9)

where the conjugate fields \(\nu ^c\) are defined as \(\nu ^c \equiv \gamma ^0 C \nu ^*\) [81]. After electroweak symmetry breaking we are able to express the combined mass terms as the matrix equation

(10)

where we refer to \(m^\nu _{ij} \equiv Y^\nu _{ij} v/\sqrt{2}\) as the Dirac mass matrix, and \(M^\nu \) is the Majorana mass matrix. Matching this theory onto SMEFT at the scale \(\mu = M^\nu \) yields the effective operators collected on the left side of Table 1.

2.2.2 Type-III

In the Type-III Seesaw model [48] the SM Lagrangian is extended by adding \(n_\Sigma \) right-handed weak fermion triplets \(\{ \Sigma _{Ri}^a \}_{i=1}^{n_\Sigma }\) with vanishing hypercharge, a new Yukawa interaction to generate Dirac neutrino masses, and Majorana mass terms for the \(\Sigma _R\) [69, 82]:

(11)

where \(a = 1,2,3\) is elided from all but the Yukawa term. For a fixed triplet generation i, the eigenstates of electric charge are given by the combinations [69]

$$\begin{aligned} \Sigma _{Ri}^\pm \equiv \frac{\Sigma _{Ri}^1 \mp i\Sigma _{Ri}^2}{\sqrt{2}}, \quad \Sigma _{Ri}^0 \equiv \Sigma _{Ri}^3. \end{aligned}$$
(12)

In a manner completely analogous to the Type-I Seesaw model, after electroweak symmetry breaking we obtain the neutrino mass matrix

(13)

where we refer to \(m^\Sigma _{ij} \equiv Y^\Sigma _{ij} v/\sqrt{2}\) as the Dirac mass matrix, and \(M^\Sigma \) is the Majorana mass matrix. The states \(\Sigma _{Ri}^\pm \) instead mix into the charged leptons. Matching this theory onto SMEFT at the scale \(\mu = M^\Sigma \) yields the effective operators collected on the right side of Table 1.

2.3 Conserved lepton-number symmetry

In this work, we study symmetry-protected versions of the fermionic Seesaw models, wherein a lepton-number (LN) symmetry decouples the physics of neutrino masses from the phenomenology associated with the conservation of LN [59, 69,70,71,72]. Without loss of generality, we fix \(n_\nu = n_\Sigma = 2\), that is, we focus on the case of two heavy fermion singlet or triplet interaction states which is consistent with at least two massive active neutrinos, as is dictated by neutrino oscillation data. The heavy fermion states are assigned 1 and \(-1\) unit of LN, respectively. After electroweak symmetry breaking, the Dirac mass matrix is given by

(14)

and the Majorana mass matrix reads

(15)

where \(\epsilon \) and \(\mu _{1,2}\) are dimensionless parameters. We parametrise the mixing of the SM neutrino \(\nu _i\) with the fermion singlet \(\nu _{Ri}\), or the neutral component \(\Sigma _{Ri}^0\) of the fermion triplet in terms of the dimensionless ratios

$$\begin{aligned} \theta _i = \frac{m^X_{i1}}{M^X} = \frac{Y^{X_i}}{\sqrt{2}}\frac{v}{M^X}, \end{aligned}$$
(16)

which are equal to the active-sterile mixing angles in the small-mixing approximation, that is, if \({\mathcal {O}}\big ((v/M^X)^3\big )\) effects are neglected. For simplicity, we will refer to \(\theta _e\) also as the “electron(-flavour) mixing angle”, and to \(|\theta _e|\) as “electron(-flavour) mixing”, and equivalently for the other flavours. Light neutrino masses are then proportional to \(\epsilon \) and \(\mu _2\) which break LN:

$$\begin{aligned} m_v&= \frac{v^2}{2} \Big [\mu _2 Y^X(M^X)^{-1}Y^{X^T} \nonumber \\&\quad - \epsilon \left( Y^{X'}(M^X)^{-1}Y^{X^T} + Y^X(M^X)^{-1}Y^{X'^T}\right) \Big ]. \end{aligned}$$
(17)

The limit \(\mu _2\ne 0\) and \(\mu _1 = \epsilon = 0\) is referred to as inverse Seesaw [62, 83, 84], and \(\epsilon \ne 0\) and \(\mu _{1,2} = 0\) is commonly known as linear Seesaw [85, 86].Footnote 1

We adopt the LN-conserving limit \(\epsilon = \mu _{1,2} = 0\) with non-zero \(M^X\) in this work, which results in massless active neutrinos and a heavy Dirac neutrino of mass \(M^X\), and hence assume the textures

(18)

for both \(X = \nu \) and \(\Sigma \). In this way, we neglect the phenomenological implications of LN violation, and instead focus on LN-conserving effects.Footnote 2 Note that one may add a further singlet or triplet with vanishing LN such that

(19)

which supports three massive active neutrinos if one departs from the LN-conserving limit. Still, the additional state trivially decouples from the phenomenology.Footnote 3

3 Phenomenology

We choose the benchmark value \(M^X = {1}\,\hbox {TeV}\) for the masses of the new interaction states in our analysis. To our knowledge, this is consistent with all performed direct searches for heavy neutral leptons at colliders, see for instance Ref. [37] for a recent overview. In Ref. [88], for sterile neutrinos of a mass \(M^{\nu } \approx {1}\,\hbox {TeV}\) the constraint \(|\theta _e|\sim |\theta _{\mu }|\lesssim {\mathcal {O}}(1)\) was derived via a search for the signature of three charged leptons with any combination of electron and muon flavours. Reference [89] reports the constraint \(|\theta _{\mu }|^2\lesssim {\mathcal {O}}(0.1)\) for TeV-scale Majorana neutrinos, based on a search for same-sign dimuon final states, see also Ref. [61]. The bound \(M^{\Sigma _0} \ge {910}\,\hbox {GeV}\) was derived in a recent study [90] which focuses on leptonic final states and takes into account earlier ATLAS results.

In our phenomenological discussion we consider the following observables:

  • the relative shift \(\Delta \sigma /\sigma _0\) in the Higgsstrahlung cross section from its SM prediction,

  • the effective leptonic weak mixing angle \(\sin ^2(\theta ^{\text {lept}}_{w,\text {eff}})\) and the W-boson mass \(m_W\),

  • the ratios \(g^X_{\mu /e}\) and \(g^X_{\tau /\mu }\) of leptonic gauge couplings as probes of lepton flavour universality (LFU), and the ratios \(R(K_{\ell 3})\) and \(R(V_{us})\), and

  • the branching ratios of the LFV processes \(\mu \rightarrow e\gamma \), \(\mu \rightarrow 3e\), \(\tau \rightarrow e\gamma \), \(\tau \rightarrow 3e\), and the ratios of the \(\mu -e\) conversion rates over the muon capture rate in different target nuclei.

In Tables 4, 56 and 7, the theoretical expressions for these observables in the fermionic Seesaw models are listed as functions of the mixing angles \(\theta _e\), \(\theta _{\mu }\), \(\theta _\tau \), as defined in Eq. (16), for a matching scale \(\mu = M^X = {1}\,\hbox {TeV}\). While our expressions hold for complex Yukawa couplings, we do not consider CP violation in the analysis and treat the mixings as real numbers which are only constrained from existing bounds on LN-conserving processes. If effects from RG running in SMEFT above the matching scale are neglected, one may naïvely interpret the results also for a larger mass and appropriately rescale the couplings. Nonetheless, we also computed the relevant expressions for \(\mu = M^X = {10}\,\hbox {TeV}\), which is commented on in Sect. 3.5.

The following discussions are supported by plots in the \(\theta _e\)\(\theta _{\mu }\) plane, as well as plots in the \(\theta _e\)\(\theta _\tau \) plane for the LFU and LFV observables.Footnote 4 The third mixing angle is fixed to the benchmark values \(\theta _\tau = 10^{-2}\) and \(\theta _{\mu } =10^{-6}\); these choices are most transparently justified (at least for Type-III) by Fig. 6, which depicts the most competitive constraints for both models. We have explicitly checked that there are no appreciable changes in the resulting phenomenology if these values are tuned smaller or even zero, apart from the fact that the LFV bounds become weaker and eventually vanish. In each of these plots we produce exclusion regions which either reflect the current bounds at \(2\sigma \) for the electroweak and LFU observables, or the upper limits on the LFV processes at 90% C.L.

3.1 Higgsstrahlung

3.1.1 SM tree-level contribution

The tree-level differential cross section in the SM is well-known, and is given by [91, 92]

$$\begin{aligned} \frac{\hbox {d}\sigma _0}{\hbox {d}\,\cos \theta } = \frac{\sqrt{\lambda }}{32\pi s^2} \overline{\vert {{\mathcal {M}}_t}^2\vert }, \end{aligned}$$
(20)

where unpolarised beams are assumed, with

$$\begin{aligned} \overline{\vert {{\mathcal {M}}_t}\vert ^2} = \frac{s}{2} (g_L^2 + g_R^2) \left( \frac{g_{ZZh}}{s-m_Z^2}\right) ^2 \left( 1 + \frac{\lambda \sin ^2\theta }{8sm_Z^2} \right) \end{aligned}$$
(21)

the spin-averaged matrix element. Here the couplings are

$$\begin{aligned} g_{ZZh}&= \frac{g_2 m_Z}{\cos \theta _w} ,\qquad&g_L&= g_Z (-\tfrac{1}{2} + \sin ^2\theta _w) ,\qquad \nonumber \\ g_R&= g_Z \sin ^2\theta _w ,\qquad&g_Z&= \frac{g_2}{\cos \theta _w}, \end{aligned}$$
(22)

\(\theta \) is the angle between the incoming electron and outgoing Z boson, and

$$\begin{aligned} \lambda = (s - m_Z^2 - m_h^2)^2 - 4 m_Z^2 m_h^2 \end{aligned}$$
(23)

is the relevant Källèn function. The corresponding integrated cross section is

$$\begin{aligned} \sigma _0 = \frac{\sqrt{\lambda }}{32\pi s} (g_L^2 + g_R^2) \left( \frac{g_{ZZh}}{s-m_Z^2} \right) ^2 \left( 1 + \frac{\lambda }{12sm_Z^2}\right) . \end{aligned}$$
(24)

The dependence of \(\sigma _0\) on \(\sqrt{s}\) is depicted in Fig. 1. It peaks around a centre-of-mass energy of \(\sqrt{s} \approx {245}\,\hbox {GeV}\).

Fig. 1
figure 1

The tree-level cross section for the Higgsstrahlung process in the SM as a function of the centre-of-mass energy \(\sqrt{s}\)

3.1.2 Corrections in SMEFT

Including corrections from new physics, we write

(25)

where denotes the effect of parameter shifts in SMEFT to the tree-level cross section, as discussed in Sect. 2.1.1, and \(2\Re \overline{{\mathcal {M}}_t^* {\mathcal {M}}_c}\) is the interference term of the tree-level amplitude with corrections from new operators. The explicit result readsFootnote 5

(26)

where the parameter shifts and coefficients \(d_i\) are presented below, and the form factors \(F_i\) may be found in Ref. [92].

Fig. 2
figure 2

Diagrams with effective vertices contributing to Eq. (27)

The full resulting fractional shift for the cross section is

(27)

Here the parameter shifts in the \((\alpha , m_Z, G_F)\) input scheme are

(28)

where we adopt the shorthands \(s_w \equiv \sin \theta _w\) and \(c_w \equiv \cos \theta _w\), and is defined in Eq. (5). The integrated form factors \(f_i\) are

$$\begin{aligned} f_2= & {} 12m_Z^2 \frac{s(s + m_Z^2 - m_h^2)}{12sm_Z^2 + \lambda }, \end{aligned}$$
(29a)
$$\begin{aligned} f_3= & {} -12 em_Z^2 \frac{g_L + g_R}{g_L^2 + g_R^2} \frac{(s-m_Z^2)(s + m_Z^2 - m_h^2)}{12s m_Z^2 + \lambda }, \end{aligned}$$
(29b)
$$\begin{aligned} f_4= & {} \frac{2g_L}{g_L^2 + g_R^2} (s-m_Z^2) ,\quad \text {and}\quad \end{aligned}$$
(29c)
$$\begin{aligned} f_5= & {} \frac{2g_R}{g_L^2 + g_R^2} (s-m_Z^2), \end{aligned}$$
(29d)

and their corresponding coefficients \(d_i\) are

$$\begin{aligned} d_2= & {} 4(s_w^2 C_{HB} + s_w c_w C_{HWB} + c_w^2 C_{HW}), \end{aligned}$$
(30a)
$$\begin{aligned} d_3= & {} -4 s_w c_w C_{HB} - 2(c_w^2 - s_w^2) C_{HWB}\nonumber \\{} & {} + 4s_w c_w C_{HW}, \end{aligned}$$
(30b)
$$\begin{aligned} d_4= & {} -g_Z (C_{HL,11}^{(1)} + C_{HL,11}^{(3)}) ,\quad \text {and}\quad \end{aligned}$$
(30c)
$$\begin{aligned} d_5= & {} -g_Z C_{He,11}. \end{aligned}$$
(30d)

The diagrams giving rise to the \(d_i\)’s are depicted in Fig. 2. Note that \(d_1\) and \(f_1\) are absent here, as we have elected instead to absorb their contribution into . The dimension-six SMEFT operators which constitute these corrections are listed in Table 2. We comment that the \(d_i\)’s are zero at tree level in the Seesaw models, apart from \(d_4\) in Type-III.

Table 2 Dimension-6 SMEFT operators in the Warsaw basis [73] which enter the correction to the Higgsstrahlung cross section. Here View full size image

3.1.3 Discussion

An essential part of the program of most of the proposed next-generation lepton colliders is to be run as “Higgs factories” at a centre-of-mass energy of \(\sqrt{s} = {240}\,\hbox {GeV}\) (CEPC, FCC-ee) or 250 GeV (ILC), and to perform a scan of the \(t{\bar{t}}\) production threshold in the range \(\sqrt{s} = 350\)–380 GeV. Measurements of the Higgsstrahlung cross section are foreseen at both stages for all collider proposals considered herein, apart from CLIC which is envisioned to directly run at \(\sqrt{s} = {380}\,\hbox {GeV}\) in its initial stage. Therefore, we evaluate the relative shift \(\Delta \sigma /\sigma _0\) of the Higgsstrahlung cross section, computed in Eq. (27), at \(\sqrt{s} = {240}\,\hbox {GeV}\) and \({365}\,\hbox {GeV}\). Regarding the precision of the measurement, we assume the two benchmark values of 0.5% and 1.0% for \(\sqrt{s} = {240}\,\hbox {GeV}\), and 1.0% for \(\sqrt{s} = {365}\,\hbox {GeV}\). This is representative of the results of several conducted studies of the attainable precision, which are collected in Table 3.

Table 3 Forecast (statistical) precision of measurements of the Higgsstrahlung cross section at different proposed next-generation colliders. In the third column, the Z-decay final states taken into account in the respective analysis are given; \(\ell ^+\ell ^-\) always implies both \(Z\rightarrow e^+e^-\) and \(Z\rightarrow \mu ^+\mu ^-\). For the results from Ref. [93] a polarisation \((P_{e^-},P_{e^+}) = (-80\%,+30\%)\) is assumed, still, this changes the expected Higgsstrahlung event rate by maximally 50% compared to unpolarised beams [58, 94]. No polarisation is assumed for the ILC precision for \(\sqrt{s} = {350}\,\hbox {GeV}\) and hadronic Z-boson decays in Ref. [94], wherein an attainable precision of 1.76% for polarised beams and an integrated luminosity of \(0.35\,\text {ab}^{-1}\) is reported as well
Table 4 Shifts of the Higgsstrahlung cross section at different centre-of-mass energies in terms of the mixing angles, when \(M^X = {1}\,\hbox {TeV}\). For the sake of comparison, we have also computed them for the larger centre-of-mass energy \(\sqrt{s} = {500}\,\hbox {GeV}\)

The shifts in the cross section as functions of the mixing angles are listed for both Seesaw models in Table 4. To aid in the following discussion, we moreover find for \(\sqrt{s}={240}\,\hbox {GeV}\)

$$\begin{aligned} \frac{\Delta \sigma }{\sigma _0} \approx 0.90\,\hat{C}_{HL,11}^{(1)} + 0.77\,\hat{C}_{HL,11}^{(3)} - 0.13\,\hat{C}_{HL,22}^{(3)}, \end{aligned}$$
(31)

and for \(\sqrt{s} = {365}\,\hbox {GeV}\)

$$\begin{aligned} \frac{\Delta \sigma }{\sigma _0} \approx 2.09\,\hat{C}_{HL,11}^{(1)} + 1.96\,\hat{C}_{HL,11}^{(3)} - 0.13\,\hat{C}_{HL,22}^{(3)}, \end{aligned}$$
(32)

where the Wilson coefficients are evaluated at \(\mu = \sqrt{s}\), respectively, and as a reminder, we define the dimensionless Wilson coefficients by \(\hat{C} = C\times \hbox {TeV}^{2}\). These approximate expressions deviate from the exact results, presented in Table 4, by maximally 5% in either model.

As the couplings of the Z boson to charged leptons are not directly altered at tree level in the Type-I Seesaw model, \(\sigma (e^+ e^- \rightarrow Zh)\) is predominantly modified via the shift in the Fermi constant, Eq. (5), which enters through the shifts , , and in Eq. (28). While the contributions of electron and muon mixing are of fairly similar magnitudes, \(\Delta \sigma /\sigma _0\) turns out to be slightly more sensitive to the latter. This is due to a partial cancellation of the coefficient \(d_4\) in Eq. (30c) (which acquires a nonzero value from RG running) against the dominant contribution from the Fermi constant for electron mixing. As the corresponding form factor \(f_4\) scales with s, the resulting sensitivity to electron mixing shrinks even further at higher energies. Consequently, if sterile neutrinos are to be searched for via precision Higgs measurements, we do not expect running a next-generation lepton collider at higher centre-of-mass energies to reveal much for Type-I.

Contrariwise, in Type-III the couplings of the Z boson to left-handed charged leptons are modified at tree level which results in a sizeable contribution to \(d_4\). That is, in an EFT language, the model induces the effective four-point interaction \({\overline{e}}\gamma ^\mu P_LeZ_{\mu } h\) depicted on the right in Fig. 2 which interferes with the tree-level contribution to Higgsstrahlung in the SM. As the latter is suppressed by s due to the Z-boson propagator, this results in a very pronounced sensitivity of the ratio \(\Delta \sigma /\sigma _0\) to electron mixing which approximately scales with s. Consequently, if enough luminosity can be attained to compensate for smaller statistics, fermion triplets may well be searched for in Higgsstrahlung measurements at larger centre-of-mass energies. Note that the contributions from electron mixing could in principle be (partly) cancelled by large muon mixing, however, we will find that this scenario is tightly constrained by existing phenomenological bounds. From Table 4 one can immediately deduce that if only electron mixing is sizeable, a minimal shift of \(\Delta \sigma /\sigma _0 \ge 1\%\) for \(\sqrt{s} = {240}\,\hbox {GeV}\) requires \(|\theta _e|\gtrsim 0.019\), whereas \(|\theta _e|\gtrsim 0.013\) is sufficient if the relative cross-section shift can be as small as 0.5%, or if \(\sqrt{s} = {365}\,\hbox {GeV}\) is considered instead.

3.2 Electroweak sector

In this section, we introduce the shifts in the weak mixing angle and the mass of the W boson. The expressions obtained for the fermionic Seesaw models in terms of the mixing angles are listed in Table 5.

Table 5 SM predictions for and current measurements of the electroweak observables considered in this work, together with approximate expressions for their shifts in terms of the mixing angles. The W-boson mass listed in the PDG [98] corresponds to the mass parameter in a Breit–Wigner distribution with a mass-dependent width. The SM prediction for the effective leptonic weak mixing angle is taken from Table II in Ref. [96]. The model predictions are obtained from matching onto SMEFT at the Seesaw scale \(\mu = M^X = {1}\,\hbox {TeV}\) and running to the electroweak scale \(\mu = m_Z\)

3.2.1 Weak mixing angle

In SMEFT the weak mixing angle is modified in accordance with (see e.g. Ref. [74])

(33)

where the shift in the Fermi constant is defined in Eq. (5). There are numerous ways to extract the weak mixing angle from data; the most precise determination is that of the effective leptonic weak mixing angle \(s^2_{w,\text {eff}}\equiv \sin ^2(\theta ^\text {lept}_{w,\text {eff}})\) at LEP [97], achieved via measurements of the left-right asymmetry factor

$$\begin{aligned} {\mathcal {A}}_f = \frac{g_L^2 - g_R^2}{g_L^2 + g_R^2} = \frac{2(1-4s_w^2)}{1+(1-4s_w^2)^2}. \end{aligned}$$
(34)

Apart from the general shift in Eq. (33), we must also take into account the fact that a modification of the Z couplings to charged leptons will directly affect the extraction of \(s^2_{w,\text {eff}}\) from \({\mathcal {A}}_f\). Incorporating the “direct” shifts to these couplings,

(35)

we find

(36)

where the right-hand side is evaluated at the scale \(\mu = m_Z\). Contributions from \(C_{He}\) are not sourced at tree level in either fermionic Seesaw model and thus we neglect them in the approximate expression above.

3.2.2 W-boson mass

The shift incurred in SMEFT is [74]

(37)

which approximately evaluates to

$$\begin{aligned} m_W \approx m_{W,\text {SM}} - {1.05}\left( \hat{C}_{HL,11}^{(3)} + \hat{C}_{HL,22}^{(3)}\right) \,\hbox {GeV} \end{aligned}$$
(38)

at the scale \(\mu = m_Z\). Evidently, the predicted W-boson mass is mainly sensitive to \({\mathcal {O}}^{(3)}_{HL}\) as the major correction is induced via the modified extraction of the Fermi constant in the fermionic Seesaw models.

3.2.3 Discussion

Fig. 3
figure 3

Current constraints from electroweak observables at \(2\sigma \), in comparison with projected sensitivities of precision Higgsstrahlung measurements. The red-ruled regions indicate \(|\Delta \sigma /\sigma | < 0.5\%\) at \(\sqrt{s} = {240}\,\hbox {GeV}\). The dot-dashed and dotted red lines are the corresponding 1% contours at \(\sqrt{s} = {240}\,\hbox {GeV}\) and 365 GeV, respectively. The ivory regions for large \(\theta _{e,\mu }\) in the upper-right of the plots are excluded by measurements of both \(s_w\) and \(m_W\). For Type-I Seesaw, the orange dashed line marks where the current experimental world average for \(m_W\) is exactly accommodated, and in the orange-ruled region the CDF measurement [99] is explained at \(2\sigma \)

The constraints arising from the electroweak observables \(s_{w,\text {eff}}^{2}\) and \(m_{W}\) are illustrated in Fig. 3.

For Type-I Seesaw one immediately notices that observing a deviation in the Higgsstrahlung cross section is already in direct conflict with the determination of \(s_{w,\text {eff}}^{2}\), with a 0.5% shift at \(\sqrt{s} = {240}\,\hbox {GeV}\) suffering from a \(2.9\sigma \) tension, and a 1% shift excluded at \(\sim 6\sigma \). While this tension can be reduced by turning up \(\theta _{\tau }\) (see the expression in Table 5), LFU constraints discussed in Sect. 3.3 below preclude this from occurring. Thus, the Type-I Seesaw model is unlikely to be a viable minimal SM extension that can be probed in precision Higgs measurements, unless a significant reduction of the statistical uncertainty of these measurements can be attained.

In the Type-III Seesaw model, the effects of the triplets contributing to the extraction of the Fermi constant and from direct contributions to the leptonic gauge couplings largely cancel out, which equally applies to electron and muon flavour, see Eq. (36). Thus, \(s_{w,\text {eff}}^{2}\) acts as a rather weak constraint on the Type-III Seesaw model, and is in fact most relevant for tau-flavour mixing, implying \(|\theta _{\tau }| \lesssim 0.06\) at \(2\sigma \).

The existing tension between the SM prediction for the W-boson mass \(m_W\) and the larger experimental world average is however exacerbated in Type-III, leading to a much stronger constraint. In contrast, as \(C_{HL}^{(3)}\) is induced with equal magnitude, but opposite signs via matching at the Seesaw scale in the two models under consideration, the tension can be alleviated in Type-I; if uncertainties are ignored, the current world average is reproduced for \(\sqrt{|\theta _{e}|^{2} + |\theta _{\mu }|^{2}} \approx 0.051\), and the CDF measurement [99] for \(\sqrt{|\theta _{e}|^{2} + |\theta _{\mu }|^{2}} \approx 0.097\). A detectable shift in \(\sigma (e^{+} e^{-} \rightarrow Zh)\) could in principle only be induced in the latter case.

Testing the Type-III Seesaw model via Higgsstrahlung measurements is generally compatible with the current constraint arising from \(m_W\). Still, it disfavours detectable \(\Delta \sigma /\sigma _0\) induced via muon mixing, and clearly prefers contributions from electron mixing. The bound from \(m_W\) can be expected to become more competitive when the CDF measurement will be included in the experimental average in the future.

3.3 Lepton flavour universality

In SMEFT the \(W\ell \nu \) coupling is directly altered due to \({\mathcal {O}}_{HL}^{(3)}\):

$$\begin{aligned} {\mathcal {L}} \supset -i \frac{g_2}{\sqrt{2}} \left( \delta _{ij} + v_T^2 C_{HL,ij}^{(3)} \right) \overline{\ell _i}\gamma ^\mu P_L\nu _jW_{\mu }. \end{aligned}$$
(39)

This modifies (semi-)leptonic decays mediated by W bosons, which, among other effects, drives the predictions for LFU ratios away from 1. In the following we consider ratios constituted by the leptonic decays \(\pi \rightarrow \ell \nu \), \(K \rightarrow \ell \nu \) and \(\tau \rightarrow \ell \nu \bar{\nu }\). Furthermore, we utilise the extraction of the Cabibbo–Kobayashi–Maskawa (CKM)-matrix element \(V_{us}\) from the semi-leptonic decay \(K_{\ell 3}: K\rightarrow \pi \ell \nu \) and from nuclear beta decays, together with the assumption of unitarity of the CKM matrix.

3.3.1 Ratios of leptonic gauge couplings

Table 6 Current constraints on and model predictions for the LFU ratios taken into account in this work. For \(R(V_{us})\), the given experimental value refers to the case of \(N_f = 2 + 1 + 1\) dynamical quark flavours in the lattice simulations from which the relevant decay constants are extracted. The case of \(N_f = 2 + 1\) quark flavours gives rise to a less competitive bound [98]. The model predictions are obtained from matching onto SMEFT at the Seesaw scale \(\mu = M^X = {1}\,\hbox {TeV}\) and running to the electroweak scale \(\mu = m_Z\). In the right column, the upper sign refers to Type-I Seesaw, whereas the lower sign stands for Type-III Seesaw

We consider the ratios of leptonic gauge couplings collected in Table 6 which are probes of LFU. For the first three listed therein, [102, 103]

$$\begin{aligned} g^X_{\mu /e}&\equiv \left( \frac{g_{\mu }}{g_e}\right) ^X \approx 1 + v_T^2 C_{HL,22}^{(3)} - v_T^2 C_{HL,11}^{(3)} \nonumber \\&\approx 1 + 0.06\left( \hat{C}_{HL,22}^{(3)} - \hat{C}_{HL,11}^{(3)}\right) , \end{aligned}$$
(40)

where \((g_{\mu }/g_e)^X\) is the ratio of leptonic gauge couplings extracted from the ratio \(\Gamma (X\rightarrow \mu \nu )/\Gamma (X\rightarrow e\nu )\) for \(X=\pi ,K\), and \(\Gamma (\tau \rightarrow \mu \nu \bar{\nu })/\Gamma (\tau \rightarrow e\nu \bar{\nu })\) for \(X=\ell \).Footnote 6 Leptonic W-boson decays can also be used to derive constraints on the ratios of leptonic gauge couplings as in \(g^W_{\mu /e}\equiv (g_{\mu }/g_e)^W\) and \(g^W_{\tau /\mu }\equiv (g_\tau /g_{\mu })^W\); still, we do not list them in the table, as the corresponding bounds are merely weaker versions of other constraints. We similarly have

$$\begin{aligned} g^X_{\tau /\mu } \equiv \left( \frac{g_\tau }{g_{\mu }}\right) ^X \approx 1 + 0.06\left( \hat{C}_{HL,33}^{(3)} - \hat{C}_{HL,22}^{(3)}\right) , \end{aligned}$$
(41)

where \((g_\tau /g_{\mu })^X\) is the ratio of leptonic gauge couplings extracted from the ratio \(\Gamma (\tau \rightarrow e \nu \bar{\nu })/\Gamma (\mu \rightarrow e\nu \bar{\nu })\) for \(X=\ell \), and \(\Gamma (\tau \rightarrow \pi \nu )/\Gamma (\pi \rightarrow \mu \nu )\) for \(X=\pi \).

In our Seesaw models the predicted deviation of the considered LFU ratios from 1 is at leading order proportional to \(\pm (|\theta _{\mu }|^2 - |\theta _j|^2)\) where \(j = e\) or \(\tau \), and the sign depends on j as well as which Seesaw model is considered, see Table 6. The derived constraints therefore give rise to hyperbolic contours in the figures presented in this section. Clearly, if the data favours a ratio to be, say, smaller than 1, predicting it to be larger than 1 will lead to a tighter constraint. This then translates into one of the mixing angles being slightly more stringently bounded than another one, with the roles reversed if the other Seesaw model is considered. Consequently, if the contribution from a specific mixing angle accommodates the data well in one model, the other model will necessarily increase the tension with the SM.

3.3.2 Light quark mixing

In Table 6 we furthermore consider ratios of the CKM-matrix element \(V_{us}\) extracted from the semi-leptonic kaon decays \(K_{\mu 3}\) and \(K_{e3}\), the leptonic kaon decay \(K_{\mu 2}\), and nuclear beta decay – these are

$$\begin{aligned} R(K_{\ell 3}) \equiv \frac{V_{us}^{K_{\mu 3}}}{V_{us}^{K_{e3}}} \quad \text {and}\quad R(V_{us}) \equiv \frac{V_{us}^{K_{\mu 2}}}{V_{us}^\beta }. \end{aligned}$$
(42)

The dependence of \(R(K_{\ell 3})\) on new physics is identical to that of \(g^\pi _{\mu /e}\) and \(g^K_{\mu /e}\), of which both can similarly be viewed as CKM ratios, with the latter being commonly denoted by \(R(K_{\ell 2})\). In the following we illustrate why \(R(V_{us})\) is a special case, and refer the reader to Ref. [102], wherein the ratio was originally proposed, for further information.

The extracted value of \(V_{us}\) from the leptonic kaon decay \(K_{\mu 2}\) is

(43)

where \(V_{us}\) is a Lagrangian parameter, the \(C_{HL,22}^{(3)}\) is due to the new direct contribution, Eq. (39), and parameterises the shift in the Fermi constant. Similarly, beta decay results in

(44)

which, as per Ref. [102], is translated to \(V_{us}^\beta \) using CKM unitarity of the Lagrangian parameters:

(45)

From this we derive the ratio

(46)

As is explained in Ref. [102], a crucial feature of \(R(V_{us})\) is the enhanced sensitivity to new physics due to \((V_{ud}/V_{us})^2 \approx 20\). Experimental data favours the ratio to be smaller than 1 with a significance between \(1\sigma \) and \(2\sigma \), depending on the number of quark flavours assumed for the calculation of the relevant decay constant, see also Table 6.

3.3.3 Discussion

Fig. 4
figure 4

Current constraints arising from LFU ratios at \(2\sigma \) in comparison with projected sensitivities of precision Higgsstrahlung measurements. The red-ruled regions indicate \(|\Delta \sigma /\sigma | < 0.5\%\) at \(\sqrt{s} = {240}\,\hbox {GeV}\). The dot-dashed and dotted red lines are the corresponding 1% contours at \(\sqrt{s} = {240}\,\hbox {GeV}\) and 365 GeV, respectively

In Fig. 4, the constraints arising from LFU ratios are illustrated. Among those, \(R(V_{us})\) is very sensitive to muon mixing and highly relevant for the fermionic Seesaw models. For Type-I Seesaw, the ratio is enhanced and thus driven further away from the data. In fact, \(R(V_{us})\) is in conflict with any visible effect in Higgsstrahlung induced via electron or muon mixing, even more so than \(s_w\). The next-to-most competitive bounds on electron and muon mixing arise from \(g^K_{\mu /e}\) and \(g^\pi _{\mu /e}\), respectively.

In the case of Type-III Seesaw, the most important constraints stem from \(g^\pi _{\mu /e}\) and \(R(V_{us})\). Since requiring a discernible deviation in the Higgsstrahlung cross section mainly translates into a lower bound on the electron mixing angle in this model and muon mixing may thus be tuned arbitrarily small, the obtained upper limits on \(|\theta _e|\) are more relevant in this context. \(g^\pi _{\mu /e}\) demands \(|\theta _e|\lesssim {0.04}\) at \(2\sigma \), unless one allows for larger muon mixing and thus a cancellation, whereupon the maximal electron mixing angle allowed by LFU bounds can be rendered up to 50% larger. \(R(V_{us})\) implies \(|\theta _{\mu }|\lesssim {0.05}\) at \(2\sigma \), which holds largely independently of the value chosen for \(|\theta _e|\). Note that in a vein similar to \(m_W\), the bounds from \(g^K_{\mu /e}\) and \(R(V_{us})\) both constrain muon mixing efficiently enough so that no appreciable cancellations of the contributions to Higgsstrahlung from electron mixing can occur for Type-III Seesaw.

As can be seen in Fig. 4b, LFU data constrains tau mixing to \(|\theta _\tau | \lesssim 0.06\) for either Seesaw model, which arises from \(g^\ell _{\tau /\mu }\) in Type-I, and \(g^\pi _{\tau /\mu }\) for Type-III. These constraints can in principle be weakened if \(|\theta _{\mu }|\) is more sizeable; still, large changes are only observed for \({\mathcal {O}}({0.1})\) muon mixing, a scenario which is nonetheless excluded by other observables. While the contributions to \(R(V_{us})\) from electron and tau mixing may cancel, this does not open up parameter space in the Type-I Seesaw model, as tau mixing itself is too constrained.

Further LFU ratios not contained in Table 6 deviate from the SM prediction by (close to) \(2\sigma \) and thus present moderate anomalies in themselves, see Ref. [100]. Explicitly, \(g^K_{\tau /\mu }\) is measured to be smaller than 1, while \(g^\ell _{\tau /e}\) and \(g^W_{\tau /e}\) exceed the SM expectation. If the models under consideration ought to accommodate the data on \(g^K_{\tau /\mu }\), one would require large tau mixing in comparison with muon mixing for Type-I Seesaw, with the flavours swapped for Type-III Seesaw. The latter is unlikely to be realised for scenarios which are testable via Higgsstrahlung measurements, as large \(\Delta \sigma /\sigma _0\) are likely induced via electron mixing in this model, which then demands muon mixing to be very small due to the bounds arising from LFV, see the following section. Similarly, for Type-III Seesaw, the other two ratios necessitate tau mixing to substantially exceed electron mixing in magnitude, which is not a promising scenario either to be tested in the given context, in particular in light of the bound on BR(\(\tau \rightarrow 3e\)).

3.4 Lepton flavour violation

Table 7 Current and projected constraints on the LFV observables taken into account in this work, together with the pertinent model predictions. The current bounds hold at 90% C.L. The future reach listed for BR\((\mu \rightarrow 3e)\) refers to Phase II of the Mu3e experiment; an initial sensitivity of BR\((\mu \rightarrow 3e)\lesssim 2\times 10^{-15}\) is expected after Phase I. The upper (lower) value listed for the future reach of CR(\(\mu - e;\,\)Al) refers to COMET (Mu2e). The model predictions are obtained from matching onto SMEFT at the Seesaw scale \(\mu = M^X = {1}\,\hbox {TeV}\), running to the electroweak scale \(\mu = m_Z\), matching onto LEFT and running to the low scale \(\mu = {5}\,\hbox {GeV}\)

As is generically the case for models of neutrino mass generation, the Seesaw models predict sizeable rates for flavour-violating decays of charged leptons. These processes have not been observed to date and thus impart stringent bounds on the parameter space, which will likely get refined in the near future due to several ongoing or upcoming experiments, see Table 7. The scales relevant for these LFV decays are within the realm of LEFT and thus it is instrumental to discuss them in terms of the contributions to LEFT operators. Since we focus on the comparison with the sensitivities to the Higgsstrahlung process at colliders, we restrict ourselves to observables involving electron-flavoured transitions. A comprehensive investigation of LFV effects in the symmetry-protected Type-I Seesaw model can for instance be found in Ref. [118]. We will relegate the explicit matching conditions used in this section to Appendix B.

3.4.1 Radiative charged-lepton decays

The branching ratios for radiative flavour-violating charged-lepton decays read [119]

$$\begin{aligned} {\textrm{BR}}(\ell _i\rightarrow \ell _j\gamma ) = \frac{m_{\ell _i}^3}{4\pi \Gamma _{\ell _i}} \left( |C_{e\gamma ,ij}|^2 + |C_{e\gamma ,ji}|^2\right) , \end{aligned}$$
(47)

with the full decay width \(\Gamma _{\ell _i}\). We approximately find

$$\begin{aligned}{} & {} {\textrm{BR}}(\mu \rightarrow e\gamma ) \approx 7.117\times 10^{6}\Big |\hat{C}_{eB,12} - 0.55\,\hat{C}_{eW,12}\nonumber \\{} & {} \quad + \left( 1.77\,\hat{C}_{HL,12}^{(3)} - 0.48\,\hat{C}_{HL,12}^{(1)}\right) 10^{-6}\Big |^2, \end{aligned}$$
(48)
$$\begin{aligned}{} & {} {\textrm{BR}}(\tau \rightarrow e\gamma ) \approx 0.004\times 10^{6}\Big |\hat{C}_{eB,13} - 0.55\,\hat{C}_{eW,13}\nonumber \\{} & {} \quad + \left( 29.69\,\hat{C}_{HL,13}^{(3)} - 8.12\,\hat{C}_{HL,13}^{(1)}\right) 10^{-6}\Big |^2, \end{aligned}$$
(49)

where the SMEFT Wilson coefficients on the right are evaluated at the electroweak scale \(\mu = m_Z\). In both Seesaw models the one-loop matching contributions to the electromagnetic dipole operator \({\mathcal {O}}_{e\gamma }\) from the electroweak dipole operators \({\mathcal {O}}_{eB}\) and \({\mathcal {O}}_{eW}\) are of the same order of magnitude as the contributions from \({\mathcal {O}}_{HL}^{(1)}\) and \({\mathcal {O}}_{HL}^{(3)}\) which originate from RG running, see also Eq. (79) in Appendix B.

3.4.2 Trilepton decays

The branching ratio for trilepton decays with identical flavours in the final state is given by [120]Footnote 7

$$\begin{aligned}&{\textrm{BR}}(\ell _i\rightarrow \ell _j\ell _j \overline{\ell _j}) = \frac{m_{\ell _i}^5}{3(16\pi )^3 \Gamma _{\ell _i}} \Bigg [64\left| C_{ee,jijj}^{VLL}\right| ^2 \nonumber \\&\quad +64\left| C_{ee,jijj}^{VRR}\right| ^2 + 8 \left| C_{ee,jijj}^{VLR}\right| ^2 + 8 \left| C_{ee,jjji}^{VLR}\right| ^2 \nonumber \\&\quad +\frac{256 e^2}{m_{\ell _i}^2} \left( \ln \frac{m_{\ell _i}^2}{m_{\ell _j}^2} -\frac{11}{4}\right) \left( \left| C_{e\gamma }^{ij}\right| ^2 + \left| C_{e\gamma }^{ji}\right| ^2\right) \nonumber \\&\quad -\frac{64 e}{m_{\ell _i}} \mathrm {\Re }\Big [\left( 4\, C_{ee,jijj}^{VLL}+C_{ee,jijj}^{VLR}\right) C_{e\gamma }^{ji*} \nonumber \\&\quad + \left( 4\, C_{ee,jijj}^{VRR} + C_{ee,jjji}^{VLR}\right) C_{e\gamma }^{ij}\Big ]\Bigg ]. \end{aligned}$$
(50)

In the Type-III Seesaw model, these decays are dominated by the vector operators \({\mathcal {O}}^{VLL}_{ee}\) and \({\mathcal {O}}^{VLR}_{ee}\), with the flavour change occurring in the left-handed lepton bilinear. These operators receive large contributions from tree-level matching of the Type-III Seesaw model onto SMEFT, and then onto LEFT (see Appendix B). By neglecting all Wilson coefficients apart from \(C^{VLX}_{ee,jijj}\) with \(X=L,R\), we thus find

$$\begin{aligned} {\textrm{BR}}(\mu \rightarrow 3e) \approx \,&1.2\times 10^{-4} \nonumber \\&\Big [ 64\left| 0.27\Big (\hat{C}_{HL,12}^{(1)} + \hat{C}_{HL,12}^{(3)}\Big )\right| ^2 \nonumber \\&+ 8 \left| 0.49\Big (\hat{C}_{HL,12}^{(1)} + \hat{C}_{HL,12}^{(3)}\Big )\right| ^2 \Big ] \end{aligned}$$
(51)
$$\begin{aligned} {\textrm{BR}}(\tau \rightarrow 3e) \approx \,&0.2\times 10^{-4} \nonumber \\&\Big [ 64\left| 0.27 \Big (\hat{C}_{HL,13}^{(1)} + \hat{C}_{HL,13}^{(3)}\Big )\right| ^2\nonumber \\&+ 8 \left| 0.49\Big (\hat{C}_{HL,13}^{(1)} + \hat{C}_{HL,13}^{(3)}\Big )\right| ^2 \Big ], \end{aligned}$$
(52)

where the SMEFT Wilson coefficients on the right are evaluated at the electroweak scale \(\mu = m_Z\). In the Type-I Seesaw model, all Wilson coefficients entering the branching ratios for trilepton decays receive contributions from matching onto SMEFT only at loop level. In this case, the branching ratios are relatively more sensitive to the contributions from the electromagnetic dipole operator \({\mathcal {O}}_{e\gamma }\), and the above approximations are only accurate to about 20%.

3.4.3 \(\mu -e\) conversion in nuclei

As the scalar and gluon operators are suppressed in the fermionic Seesaw models, the \(\mu -e\) conversion rate takes the simple form [122, 123]

$$\begin{aligned} \omega _\textrm{conv}= & {} \left| -\frac{C_{e\gamma ,12}}{2 m_{\mu }} D + {\tilde{g}}_{LV}^{(p)} V^{(p)} + {\tilde{g}}_{LV}^{(n)} V^{(n)} \right| ^2\nonumber \\{} & {} + \left| - \frac{C_{e\gamma ,21}^*}{2m_{\mu }} D + {\tilde{g}}_{RV}^{(p)} V^{(p)} + {\tilde{g}}_{RV}^{(n)} V^{(n)} \right| ^2, \end{aligned}$$
(53)

where the overlap integrals D, \(V^{(p)}\) and \(V^{(n)}\), and muon capture rates \(\omega _\textrm{capt}\) can be found in Refs. [122, 124], and the effective coupling constants are

$$\begin{aligned} {\tilde{g}}_{LV}^{(p)}&=2\left( C_{eu,1211}^{VLL} +C_{eu,1211}^{VLR}\right) \nonumber \\&\quad +\left( C_{ed,1211}^{VLL} +C_{ed,1211}^{VLR}\right) , \end{aligned}$$
(54)
$$\begin{aligned} {\tilde{g}}_{RV}^{(p)}&= 2 \left( C_{eu,1211}^{VRR} + C_{ue,1112}^{VLR}\right) \nonumber \\&\quad + \left( C_{ed,1211}^{VRR} + C_{de,1112}^{VLR}\right) , \end{aligned}$$
(55)
$$\begin{aligned} {\tilde{g}}_{LV}^{(n)}&=\left( C_{eu,1211}^{VLL} +C_{eu,1211}^{VLR}\right) \nonumber \\&\quad +2\left( C_{ed,1211}^{VLL} +C_{ed,1211}^{VLR}\right) ,\quad \text {and}\quad \end{aligned}$$
(56)
$$\begin{aligned} {\tilde{g}}_{RV}^{(n)}&= \left( C_{eu,1211}^{VRR} + C_{ue,1112}^{VLR}\right) \nonumber \\&\quad +2 \left( C_{ed,1211}^{VRR} + C_{de,1112}^{VLR}\right) ; \end{aligned}$$
(57)

see Appendix B for approximate matching expressions.

We are interested in the conversion ratio \({\textrm{CR}}(\mu \rightarrow e)\), defined as the ratio of the \(\mu -e\) conversion rate \(\omega _\textrm{conv}\) over the muon capture rate \(\omega _\textrm{capt}\). For the Type-I and Type-III Seesaw models it approximates to

$$\begin{aligned}{} & {} {\textrm{CR}}(\mu \rightarrow e) \approx \left( \frac{m_{\mu }}{\hbox {GeV}} \right) ^5 \left\{ \begin{array}{c} 5.87 \\ 1.16 \\ 21.54 \end{array} \right\} \times 10^{5} \nonumber \\{} & {} \Bigg |\Big [2\left( \hat{C}_{eu,1211}^{VLL} +\hat{C}_{eu,1211}^{VLR}\right) \nonumber \\ {}{} & {} \quad +\left( \hat{C}_{ed,1211}^{VLL} +\hat{C}_{ed,1211}^{VLR}\right) \Big ] \left\{ \begin{array}{c} 0.0396 \\ 0.0974 \\ 0.0161 \end{array} \right\} \nonumber \\{} & {} \quad {} + \Big [\left( \hat{C}_{eu,1211}^{VLL} + \hat{C}_{eu,1211}^{VLR}\right) \nonumber \\{} & {} \quad {} + 2\left( \hat{C}_{ed,1211}^{VLL} + \hat{C}_{ed,1211}^{VLR}\right) \Big ] \left\{ \begin{array}{c} 0.0468 \\ 0.146 \\ 0.0173 \end{array} \right\} \Bigg |^2, \end{aligned}$$
(58)

where the Wilson coefficients on the right are evaluated at the low scale \(\mu = {5}\,\hbox {GeV}\), and where the upper, middle and lower entry in the brackets refers to a titanium (Ti), gold (Au) and aluminium (Al) target, respectively. We also include the predictions for the conversion ratios for lead (Pb) and sulfur (S) in Table 7, from which one can infer that the respective current bounds do not impose relevant constraints. As is reflected by the above approximation, \(\mu -e\) conversion is dominated by contributions from left-handed vector operators in both Seesaw models. This is evident in Type-III where these contributions are sourced at tree level, but also holds in Type-I. The electroweak dipole operator \({\mathcal {O}}_{e\gamma ,12}\) plays a subdominant role in both models.

3.4.4 Discussion

Fig. 5
figure 5

Current constraints from LFV observables depicted at 90% C.L., together with their prospective future reaches, in comparison with projected sensitivities of precision Higgsstrahlung measurements. The red-ruled regions indicate \(|\Delta \sigma /\sigma | < 0.5\%\) at \(\sqrt{s} = {240}\,\hbox {GeV}\). The dot-dashed and dotted red lines are the corresponding 1% contours at \(\sqrt{s} = {240}\,\hbox {GeV}\) and 365 GeV, respectively. We do not depict the bounds from tau-flavoured processes in the \(\theta _e\)\(\theta _{\mu }\) plots, and vice versa. If included, they would appear as vertical lines with positioning highly dependent on the choice of the third mixing angle, which limits their relevance

We find that for the Type-I Seesaw model, as per the absence of tree-level contributions to trilepton decays and \(\mu -e\) conversion, the most competitive bounds currently arise from the non-observation of \(\mu \rightarrow e\gamma \) and \(\tau \rightarrow e\gamma \), see Fig. 5. Therefore, a detectable shift in \(\sigma (e^+ e^- \rightarrow Zh)\) would enforce either \(|\theta _e|\gtrsim 0.1\) and \(|\theta _{\mu }|\lesssim 10^{-4}\), or vice versa. Still, in the \(\mu -e\) sector, we expect the limits from BR(\(\mu \rightarrow 3e\)) and CR(\(\mu -e\)) to become more stringent in the future, and further improve this bound by up to two orders of magnitude. The relatively loose current bound from CR(\(\mu -e;\,\text {Au}\)) is due to cancellations between the effective vector couplings to protons and neutrons. These generically occur for all target materials in both fermionic Seesaw models, but are insignificant for Type-III. In the case of Type-I, matching at a larger scale reduces the cancellations for CR(\(\mu -e;\,\text {Au}\)), cf. Sect. 3.5. See also for instance Ref. [125] for a pertinent discussion. Currently, the non-observation of \(\tau \rightarrow e\gamma \) enforces \(|\theta _\tau |\lesssim 0.1\) for \(|\theta _e|\gtrsim 0.1\), and \(|\theta _e|\lesssim 0.02\) for \(|\theta _\tau |\gtrsim 0.6\). These limits can be expected to become slightly more stringent in light of the future reaches of BR(\(\tau \rightarrow e\gamma \)) and BR(\(\tau \rightarrow 3e\)), still, the improvements are expected to be less than an order of magnitude.

In Type-III Seesaw, due to tree-level contributions to the respective pertinent operators, the bounds from BR(\(\mu \rightarrow 3e\)) and CR(\(\mu -e\)) are stricter than that of BR(\(\mu \rightarrow e\gamma \)), and \(\tau \rightarrow 3e\) is also more competitive than \(\tau \rightarrow e\gamma \). Note that the projected sensitivity to BR(\(\mu \rightarrow e\gamma \)) at MEG II cannot even be expected to supersede the current bound on BR(\(\mu \rightarrow 3e\)). Moreover, the existing bound on \(\mu -e\) conversion in gold effectively enforces the muon mixing angle to be smaller than \(|\theta _{\mu }|\lesssim 10^{-5}\) for \(|\theta _e|> 10^{-2}\) which is required for an observable deviation of the Higgsstrahlung cross section at a future lepton collider. Thus, the measurements of the cross section are only sensitive to a rather pronounced hierarchy \(|\theta _{\mu }/\theta _e|\lesssim 10^{-3}\). A hierarchy \(|\theta _e/\theta _{\mu }|\lesssim 10^{-5}\), corresponding to the region visible in the top-left of Fig. 5a, is in principle also compatible with the LFV bounds, but still disfavoured by the constraints from electroweak and LFU observables, see the relevant sections above.

Similarly, the non-observation of \(\tau \rightarrow 3e\) presently constrains the tau mixing angle to \(|\theta _\tau |\lesssim {0.03}\) if a non-SM signature in \(\sigma (e^+ e^- \rightarrow Zh)\) is to be attainable, that is, tau mixing should not substantially exceed electron mixing in magnitude. This limit on \(|\theta _\tau |\) would be strengthened by an order of magnitude if no decay \(\tau \rightarrow 3e\) is observed at Belle II, which will then also necessitate a hierarchy \(|\theta _\tau /\theta _e|\lesssim {0.1}\) if a chance of detecting \(\Delta \sigma /\sigma _0\) is to be retained. In a similar vein, if no signals in \(\mu \rightarrow 3e\) or \(\mu -e\) conversion in aluminium are observed in the future, the ratio of the relevant mixing angles will be constrained to be even as small as \(|\theta _{\mu }/\theta _e|\lesssim 10^{-4}\) or \( 10^{-5}\), respectively.

3.5 Larger Seesaw scale

Table 8 Contributions to the observables considered in the phenomenology study in this work, but obtained from matching onto SMEFT at a Seesaw scale \(\mu = M^X = {10}\,\hbox {TeV}\)

Lastly, we will comment on the scenario with a larger matching scale. In Table 8 we have collected our results for the shifts in the observables considered in our work for the Seesaw scale \(\mu = M^X = {10}\,\hbox {TeV}\).

In the case of the Type-III Seesaw model, we find that the results discussed in this section seem to be fairly robust with respect to raising the triplet mass to \({\mathcal {O}}({10}\,\hbox {TeV})\) at least. That is, the numerical coefficients entering the expressions for the considered observables typically change by less than 20%. The only notable differences lie in the sensitivity of \(\delta m_W\) and \(\Delta \sigma /\sigma _0\) to tau-flavour mixing at larger centre-of-mass energies, where the respective coefficients grow by a factor of 2–5. Still, it is a subleading effect, as these observables remain much more sensitive to electron- and muon-flavour mixing. Therefore, the results for the Type-III Seesaw model discussed in Sect. 3 so far, for which \(M^\Sigma = {1}\,\hbox {TeV}\) is assumed, will also approximately hold for (moderately) larger masses.

In contrast, the observable phenomenology of Type-I Seesaw morphs somewhat nontrivially upon raising the Seesaw scale to \(M^\nu = {\mathcal {O}}({10}\,\hbox {TeV})\). Most profoundly, the Higgsstrahlung shift \(\Delta \sigma /\sigma _0\) now experiences a crossing near \(\sqrt{s} = {500}\,\hbox {GeV}\), whereupon the dependence on the squared electron mixing angle \(|\theta _e|^2\) reverses from positive to negative. Additionally, the trilepton decay rates receive a relative numerical boost of 200%, and the \(\mu -e\) conversion rates are, in general, significantly altered. Specifically, \(\mu -e\) conversion in gold and lead increase substantially due to the fact that the effective left-handed vector couplings to neutrons increase by a factor larger than 2, while the proton couplings and the dipole operator remain largely unchanged and thus the cancellations are much less efficient. On the contrary, \(\mu -e\) conversion in aluminium and titanium experience a suppression. This implies in particular that the current bound arising from \(\mu -e\) conversion in gold is clearly stronger than the one from titanium, unlike the scenario with \(M^{\nu } = {1}\,\hbox {TeV}\).

4 Summary

Fig. 6
figure 6

Summary plots featuring the most competitive current constraints as well as future reaches considered in this work in comparison with projected sensitivities of precision Higgsstrahlung measurements. The red-ruled regions indicate \(|\Delta \sigma /\sigma | < 0.5\%\) at \(\sqrt{s} = {240}\,\hbox {GeV}\). The dot-dashed and dotted red lines are the corresponding 1% contours at \(\sqrt{s} = {240}\,\hbox {GeV}\) and 365 GeV, respectively

We have computed the correction to the tree-level cross section of the Higgsstrahlung process \(e^+e^-\rightarrow Zh\) in the LN-conserving limit of the Type-I and Type-III Seesaw models, and compared several benchmark sensitivities of next-generation lepton colliders to existing and prospective constraints from electroweak precision measurements, and LFU and LFV probes. Summary plots in the \(\theta _e\)\(\theta _{\mu }\) and \(\theta _e\)\(\theta _\tau \) planes are presented in Fig. 6.

As a major result, we found that existing data on the effective leptonic weak mixing angle and LFU observables preclude substantial corrections to the Higgsstrahlung cross section for Type-I Seesaw. The most likely signature of this model at a future lepton collider is therefore the absence of a detectable deviation from the SM prediction, at least if no further new physics modifying the electroweak and LFU sectors is introduced. For Type-III Seesaw, the current constraints (at \(2\sigma \)) leave genuinely viable parameter space that can be probed at an \(e^+e^-\) Higgs factory. Figure 7 provides a magnified view of this region. Concretely, for a centre-of-mass energy \(\sqrt{s} = {240}\,\hbox {GeV}\) the largest permitted shift in the Higgsstrahlung cross section is \(\sim \)5%; at \(\sqrt{s} = {365}\,\hbox {GeV}\) it is \(\sim \) 12%.

Fig. 7
figure 7

Plots zoomed in on the viable parameter regions in the Type-III Seesaw model. Only the most constraining observables are depicted. The red-ruled regions indicate \(|\Delta \sigma /\sigma | < 0.5\%\) at \(\sqrt{s} = {240}\,\hbox {GeV}\). The dot-dashed and dotted red lines are the corresponding 1% contours at \(\sqrt{s} = {240}\,\hbox {GeV}\) and 365 GeV, respectively

The viable region in Type-III is isolated by three main considerations. Firstly, the non-observation of LFV tightly constrains any scenario with sizeable mixing of heavy fermion singlets or triplets with two lepton flavours. These constraints are particularly strong for Type-III Seesaw, which induces tree-level contributions to trilepton decays and \(\mu -e\) conversion. Indeed, a detectable deviation in \(\sigma (e^+ e^- \rightarrow Zh)\) already necessitates a sizeable hierarchy between \(\theta _e\) and \(\theta _{\mu }\) which will become more pronounced if signals of LFV remain elusive in the future. (The situation is similar for Type-I Seesaw, where in the absence of contributions to LFV at tree level, the radiative decays \(\mu \rightarrow e\gamma \) and \(\tau \rightarrow e\gamma \) are more important.) Secondly (and thirdly), both the W-boson mass \(m_W\) and LFU data currently disfavour detectable corrections induced via muon mixing at the level of \(2\sigma \), but leave room for visible effects due to electron-flavoured couplings, which together with the LFV constraints enforces a hierarchy \(|\theta _{\mu }/\theta _e| \lesssim 10^{-3}\).

Focusing on the viable region in Fig. 7, the constraints arising from the LFU ratio \(g^\pi _{\mu /e}\) and \(m_W\) are similarly competitive and provide the most stringent constraint on electron mixing in the Type-III Seesaw model, with \(|\theta _e|\lesssim 0.04\) at \(2\sigma \). Note also that in the region of parameter space where \(\Delta \sigma /\sigma _0\) is detectable, tau-flavour mixing is more strictly constrained by the current bound on BR(\(\tau \rightarrow 3e\)) than by measurements of the weak mixing angle or pion decays. As is expected from Sect. 3.4, the most competitive upper limit on muon mixing in the given context currently arises from the non-observation of \(\mu -e\) conversion in gold, and will be further constrained by Mu3e as well as the searches for \(\mu -e\) conversion in aluminium at COMET and Mu2e. If an observation of these transitions remains elusive in the future, the currently viable parameter space will retreat to \(|\theta _{\mu }| \lesssim 10^{-7.5}\). Similarly, if \(\tau \rightarrow 3e\) is not observed at Belle II, tau mixing would need to be smaller than \(\theta _\tau = 10^{-2}\).

Since fermion triplets induce a tree-level contribution to \(e^+e^-\rightarrow Zh\) which is not mediated via the s channel as in the SM, deviations from the cross section induced via electron-flavour mixing grow approximately with s. In that sense, if the drop in statistics can be compensated by higher luminosity, the Type-III Seesaw model motivates precision measurements of the Higgsstrahlung process at higher centre-of-mass energies as well, whereas this is not indicated for Type-I Seesaw.

Overall, we have corroborated the expectation that a rich interplay of neutrino, Higgs, electroweak and flavour physics is to be expected for Seesaw models at low energies, and demonstrated the benefit of measuring the Higgsstrahlung cross section at multiple centre-of-mass energies for the Type-III Seesaw model. One may extend the research conducted in this work along two major avenues. Firstly, although the list of processes which we consider in the analysis captures a wide range of phenomenology of the fermionic Seesaw models, it is not exhaustive. In particular, taking into account observables sensitive to angular distributions for Higgs physics [126] as well as a comprehensive global fit in the electroweak sector should help to further differentiate between the low-energy signatures of Seesaw models. Secondly, since we relied on the assumption of an exactly conserved LN symmetry on the Lagrangian level, existing data on lepton mixing and the mass hierarchies in the neutrino sector was per definition not incorporated. While we expect the implications of explicit breakings of lepton number for the induced low-energy phenomenology to be small in general, any viable model of neutrino mass generation eventually needs to be tested against them. Lastly, we leave similar studies for different models of neutrino mass generation for future work.