1 Introduction

The existence of light neutrino masses and the oscillation phenomena can be accounted for in a minimal extension of the SM Lagrangian with sterile right-handed neutrinos \(N_R\), which allow for a lepton number violating Majorana mass term, as in the type I seesaw [1,2,3,4,5]. This Majorana mass scale is a parameter, not related to electroweak symmetry breaking, and in the naive (high-scale) seesaw it is taken to be large, so that it suppresses the induced mass for the mostly active light neutrinos, thus leading to very heavy massive states in addition to the light ones. However, the lightness of the known neutrinos could also be explained by symmetry principles, which is the argument of the linear and inverse seesaw variants [6, 7] to lower the mass scale of the heavy neutrinos, allowing us to probe their phenomenology at laboratory energies. Indeed, even if the heavy neutral leptons (HNLs) or heavy neutrinos are accessible, their interactions with the SM particles in these scenarios are also suppressed by their small mixing \(U_{\ell N}\) with the active neutrinos \(\nu _{\ell L}\), strongly constrained by experiments [8], and thus would have undetectably weak interactions.

However, a variety of new physics may be hidden at energies well above the EW scale: its possible effects on the SM degrees of freedom are systematically studied with the use of the SM effective field theory (SMEFT). This new physics can also impact the behavior of the heavy neutrinos if they are light enough to be included in the low-energy spectrum and, if present, would probably dominate over the interactions due to their mixing with the active neutrino states. Thus the HNLs interactions with the SM particles can be seen as the remnant of a new UV physics, described by an effective field theory including them also as part of its building blocks. This is the standard model effective field theory framework extended with right-handed neutrinos \(\nu \)SMEFT, with operators known up to mass dimension \(d=9\) [9,10,11,12,13,14,15,16].Footnote 1

This EFT including the \(N_R\) as accessible states has received increasing attention since it offers an efficient tool to parameterize the effects of UV physics and learn from prospective studies how to discover the HNLs by their new interactions, or to constrain the Wilson coefficients of the distinct operators consistent with the SM symmetry at a given mass dimension d and energy scale \(\Lambda \). Since its first introduction by the authors of [12] as an explicit EFT to study neutrino interactions, much progress has been made in the \(\nu \)SMEFT framework both from the phenomenology and from the theory side [17,18,19,20,21,22,23,24,25,26,27,28,29,30,31,32,33,34,35,36,37,38,39,40,41,42,43,44,45,46,47,48,49,50,51,52,53,54,55,56].

Many of these recent efforts have pointed to the phenomenology of HNLs within the \(\nu \)SMEFT framework at the LHC and future lepton colliders. Here we focus on the study of sensitivity projections for the HNL N with effective interactions at the LHeC, an \(e^{-}p\) collider to be built at the LHC tunnel [57,58,59]. For this study we consider a simplified scenario with only one heavy neutrino, neglecting its mixing with the active states and tackling an encompassable parameter space involving the HNL mass and its effective couplings. We consider the N decay to muons and jets, usually called the lepton-trijet final state: \(p ~ e^{-} \rightarrow \mu ^{\pm } + 3 \textrm{j}\). The process with final muons (\(p ~ e^{-} \rightarrow \mu ^{-} + 3 \textrm{j}\)) conserves lepton number but violates flavor (LFV), while the process with final anti-muons (\(p ~ e^{-} \rightarrow \mu ^{+} + 3 \textrm{j}\)) also violates lepton number by two units (LNV).

Previous studies of the lepton-trijet signal given by the type I seesaw mixing interactions at the LHeC can be found in [60,61,62], originally tackled in [63,64,65,66,67]. In Ref. [18] our group studied for the first time the potential of the LHeC to discover Majorana neutrinos in the \(\nu \)SMEFT context, and in Ref. [24] we explored the possibility to disentangle the contributions of effective operators with different Dirac-Lorentz structure to the LNV lepton-trijet process with the aid of angular distributions and polarization effects. Here we improve those results with up-to-date and realistic simulation and analysis at the reconstructed level. A recent prospective study for the Electron-Ion Collider (EIC) at Brookhaven can be found in [68]. Also a diversity of related LHeC sensitivity studies can be found in the recent literature: a search for HNLs with additional leptoquark interactions giving displaced fat-jets can be found in [69], prospects for charged leptons flavor violating signals are given in [70], as well as fat-jet searches of very heavy massive neutrinos in [71].

The paper is organized as follows: in Sect. 2 we describe the \(\nu \)SMEFT formalism and overview the phenomenology associated with the different effective interactions. In Sect. 2.1 we inspect the current existing bounds on the effective couplings to be considered for the benchmark scenarios probed in this study. In Sect. 3 we discuss our search strategy for the LHeC, characterizing the N production mechanism kinematics in ep colliders (3.1). We also revisit the N decay channels branching ratios and total width (3.2), discuss the SM processes considered as background at the reconstructed level, and discuss the multivariate analysis performed to separate the signals. The sensitivity prospects for the heavy N in \(\nu \)SMEFT at the LHeC are shown in Fig. 8. We close with a summary in Sect. 4.

2 Effective interactions formalism

We consider the SM Lagrangian to be extended with only one right-handed neutrino \(N_R\) with a Majorana mass term (\(\sim M_N\)).Footnote 2 The renormalizable \(d=4\) Lagrangian extension reads

(1)

After diagonalization, one gets a massive state N as an observable degree of freedom, together with the three known light neutrino states (with masses \(m_{\nu }\sim 0.1\) eV), which are all of Majorana nature. The flavor neutrino eigenstates contain some part of the heavy N due to the mixing \(U_{\ell N} = Y_\ell ~v/\sqrt{2} M_N\):

$$\begin{aligned} \nu _{\ell L}=\sum _{i=1}^{3}U_{\ell i}\nu _i+U_{\ell N}N. \end{aligned}$$

In turn, the heavy state N is mostly composed of the right-handed state \(N \simeq N_R\) with negligible mixing with the active \(\ell \) flavor states \(\nu _{\ell L}\), constrained by the naive seesaw relation \(U_{\ell N}\lesssim \sqrt{ \frac{m_{\nu }}{M_{N}}}\), and thus with negligible interaction through the SM electroweak currents when its mass is above the GeV scale.

Table 1 Basis of \(d=5\) and \(d=6\) operators with a right-handed neutrino N [12, 14]. Here \(l_i\), \(u _i\), \(d _i\) and \(L_i\), \(Q _i\) denote the right-handed singlets and the left-handed SU(2) doublets, respectively. The field \(\phi \) is the scalar doublet, \(B_{\mu \nu }\) and \(W_{\mu \nu }^I\) are the \(U(1)_{Y}\) and \(SU(2)_{L}\) field strengths. Also \(\sigma ^{\mu \nu }=\frac{i}{2}[\gamma ^{\mu }, \gamma ^{\nu }]\) and \(\epsilon =i\sigma ^{2}\) is the anti symmetric symbol in two dimensions. We follow the notation in [12] and quote the names in [53]. See Appendix A for the explicit Lagrangian terms

In our simplified setup, we will not include the renormalizable Lagrangian in (1), and thus neglect the mixings \(U_{\ell N}\). In this way, the new physics effects on the heavy state N (possibly due to the presence of new mediators in the UV scale \(\Lambda \)) are parameterized by a set of effective operators \(\mathcal {O}_\mathcal {J}\) constructed with the SM and the \(N_R\) fields and satisfying the \(SU(2)_L \otimes U(1)_Y\) gauge symmetry [12, 14, 72]. The total Lagrangian we consider is organized as follows:

$$\begin{aligned} \mathcal {L}=\mathcal {L}_{SM}+\sum _{d=5}^{\infty }\frac{1}{\Lambda ^{d-4}}\sum _{\mathcal {J}} \alpha _{\mathcal {J}} \mathcal {O}_{\mathcal {J}}^{d} \end{aligned}$$
(2)

where d is the mass dimension of the operator \(\mathcal {O}_{\mathcal {J}}^{d}\), \(\alpha _{\mathcal {J}}\) are the effective (Wilson) couplings and the sum in \(\mathcal {J}\) goes over all independent interactions at a given dimension d.

There are only three dimension 5 effective operators in (2). The well-known Weinberg operator \(\mathcal {O}_{W}=(\bar{L}\tilde{\phi })(\phi ^{\dagger }L^{c})\) [73] involving only left-handed neutrino fields, which violates lepton number and contributes to the light neutrino masses. The Anisimov–Graesser operator \(\mathcal {O}_{N\phi }=(\bar{N_R}N^{c}_{R})(\phi ^{\dagger } \phi )\) [9, 10] which also contributes to the Majorana mass term for \(N_R\) (when it is considered, in the renormalizable \(d=4\) Lagrangian in (1)) and gives Higgs-N-N interactions. And finally the dipole operator \(\mathcal {O}^{(5)}_{NB}= (\bar{N}_{R}\sigma _{\mu \nu }N^{c}_{R}) B^{\mu \nu }\) inducing magnetic moments for the heavy neutrinos, which is identically zero if we include just one sterile neutrino \(N_{R}\) in the theory, and was first studied in Ref. [13]. Their phenomenology both regarding Higgs and heavy N physics in colliders together and in comparison to the simple seesaw mixing interactions has been studied in [10, 23, 30, 31, 38, 46]. We refer the reader to [46] for a detailed description of the interplay between the type I seesaw Lagrangian and the dimension 5 operators in the generation of light and heavy neutrino masses and the heavy neutrino decays, in the case where three right-handed neutrinos are added to the SM field content. Here we will not consider the \(d=5\) operators, as in the simplified scenario with only one right-handed neutrino state added we will focus on, they make no contributions to the studied processes at electron-proton colliders when discarding the heavy-active neutrino mixings \(U_{\ell N}\), nor to the N decay. Thus we will only consider the contributions of the \(d=6\) operators, following the treatment presented in [12, 14], and shown in Table 1. Our implementation of the effective Lagrangian in FeynRules 2.3 has been discussed in [48]. Full expressions for each explicit Lagrangian term can be found in Appendix A, and we briefly discuss them in the following.

The Higgs dressed mixing operator \(\mathcal {O}_{LN\phi }\) is a Yukawa interaction for the N dressed with a Higgs doublet pair, and thus contributes with extra terms at tree-level to light neutrino masses \(m_{\nu }\) and mixings [51]. It also leads to new \(N \nu \)-Higgs interactions, as read from Eq. (A1). The neutral (NC) and charged (CC) bosonic currents \(\mathcal {O}_{NN\phi }, \mathcal {O}_{N l \phi }\) are written in Eqs. (A2) and (A3). The charged current couples the W boson with right-handed chiral leptons: a \((V+A)\) structure as opposed to the \((V-A)\) SM charged weak interaction.

The explicit expressions of the \(d=6\) dipole momenta operators \(\mathcal {O}_{NB}, \mathcal {O}_{NW}\) are written in Eqs. (A4) and (A5). They induce dipole interactions of the N with the \(Z, \gamma \) and W bosons (\(d_Z, d_{\gamma }\) and \(d_W\)) [13, 47, 53]. These operators are generated at one-loop level in the complete UV theory [12, 74], and thus their contributions are suppressed by a loop factor \(1/ (16 \pi ^2)\), which is taken into account in our numerical calculations.

We classify the four-fermion interactions in terms of possible UV mediators connecting the fermion lines in the Lagrangian terms given by each operator.Footnote 3 There are three types of vector-mediated neutral currents involving two N fields, one vector-mediated charged current and, as shown in Eqs. (A10) to (A13), the last four-fermion operators induce Lagrangian terms that can be obtained from an UV completion where the fermion lines are mediated by neutral or charged scalars.

2.1 Constraints on effective couplings

In the last years many works on right-handed effective neutrino interactions (\(\nu \)SMEFT) including \(d=6\) operators have derived bounds for the different effective couplings values \(\alpha _{\mathcal {J}}\), or alternatively on the new physics scale \(\Lambda \), given by existing experimental direct or indirect searches of BSM phenomena. Most of these constraints are applicable for \(m_N\) masses below the range we consider in this work: \(m_N = 100- 500\) GeV, but it is worth making some comments for a review and comparison of the different approaches.

A very recent and illustrative review of constraints on the dim=6 \(\nu \)SMEFT operator coefficients can be found in [53]. The authors neglect the seesaw mixings between the active and heavy N neutrinos, in the same fashion we do here, and obtain bounds considering each effective operator acting separately one at a time, thus avoiding cancellations. This approach generally leads to conservative constraints, as it does not allow for N production and detection through different operators. They show plots in the \(m_N- C/\Lambda ^2\) plane for each operator coupling and masses \(m_N< 100\) GeV.Footnote 4

When more than one operator acting at a time is considered, bounds can be obtained from a variety of processes involving combinations of couplings (typically with different production and decay channels for the N). We refer the reader to Ref. [51] for some estimates obtained from collider searches: we must also be careful when considering their bounds values, as the authors do not neglect the mixings between heavy and active neutrinos in their calculations, which leads to a somewhat different phenomenology.

We discuss in the following the bounds that could be applicable for N masses in the electroweak scale (\(\simeq v=246\) GeV): \(10^2< m_N < 10^3\) GeV and give the values we take for our numerical calculations as estimates compatible with our simplified scenario. A detailed calculation of the bounds that could be considered for every effective operator from the plethora of existing and future experiments is beyond the scope of the present work.

a. Standard vector boson operators: Higgs, neutral and charged currents. The operator \(\mathcal {O}_{LN\phi }\) (A1) induces extra contributions at tree-level to light neutrino masses and their mixing with the N, and also an invisible Higgs decay channel into a light and a heavy neutrino. The bounds from \(h \rightarrow \nu N\), valid for \(m_N<m_h\), translate to \(\alpha ^{(i)}_{LN\phi } \lesssim 0.3\) for \(\Lambda = 1\) TeV, for every flavor i when the operator is the only one active [53]. Bounds on the coefficient of the neutral bosonic current operator \(\mathcal {O}_{NN\phi }\) (A2) can be obtained from invisible Z decays and mono-photon searches at LEP, but these apply only for \(m_N< m_Z\) [51]. Also, the tensorial current (one-loop level generated) operators \(\mathcal {O}_{NB}, \mathcal {O}_{NW}\) (A4), (A5) can be constrained exploiting the known bounds on the N dipole couplings to the bosons WZ and \( \gamma \) [75] obtained at LEP and the LHC; these give bounds above the values we consider (and obtain) here. In the case of the electron flavor, the coupling \(\alpha ^{(i=1)}_{NW}\) is bounded by its contribution to the unobserved neutrinoless double beta decay (\(0\nu \beta \beta \)-decay), as will be discussed below.

We now turn to discuss the bounds that can constrain the coefficient of the operator \(\mathcal {O}_{N l \phi }\) which contributes to the charged bosonic current in Eq. (A3)

$$\begin{aligned} \frac{\alpha _{N l \phi }}{\Lambda ^2} \frac{g v^2}{2 \sqrt{2}} \; ({\overline{N}}_R \gamma _{\mu } l_{R})\; W^{+ \mu } \end{aligned}$$
(3)

with a similar structure to the CC of the SM, for each charged lepton flavor \(l_i= e, \mu , \tau \) [12].

The most restrictive bound on the \(\mathcal {O}_{N l \phi }\) operator coupling \(\alpha ^{(1)}_{N l \phi }\) for the electron flavor comes from the non-observation of neutrinoless double beta decay (\(0\nu \beta \beta \)-decay). The stringent limit on the lifetime \(\tau _{{0\nu }_{\beta \beta }} \ge 1.1 \times 10^{26}\) years obtained by the KamLAND-Zen Collaboration [76] gives us a mass-dependent bound \(\alpha _{0\nu \beta \beta }(m_N) \le 3.2\times 10^{-2}\left( \frac{m_N}{100 ~ ~\text {GeV}} \right) ^{1/2}\) for \(\Lambda =1\) TeV on the coupling of every effective operator contributing to the vertex udNe (the details of the derivation can be followed from Refs. [18, 20], see also [54]).Footnote 5 We will thus fix the coupling \(\alpha ^{(1)}_{N l \phi }(m_N) = \alpha _{0\nu \beta \beta }(m_N)\) throughout all the numerical calculations in this work.Footnote 6

In the case of the muon and tau families, limits on this coupling can be obtained from the well-known existing bounds on the seesaw mixings \(U_{\ell N}\). One can consider the relation

$$\begin{aligned} U_{\ell N}\simeq \frac{ v^2}{2}\frac{\alpha ^{(i=\ell )}_{N l \phi }}{\Lambda ^2} \end{aligned}$$
(4)

to derive bounds for the bosonic charged current effective coupling from the mixings with flavor \(\ell \equiv l_i\).

The most restrictive bounds for \(m_N\) above the electroweak scale for the muon flavor come from the charged Lepton Flavor Violating (cLFV) processes induced by the quantum effects of the heavy neutrinos. The most stringent one can be derived from the MEG limit on the muon radiative decay branching fraction \(Br(\mu ^{+}\rightarrow e^{+}\gamma )\le 4.2 \; 10^{-13}\) [80]. Following the treatment in Ref. [81], we can translate their bounds on the product of mixings asFootnote 7

$$\begin{aligned} |\eta _{e \mu }|= \frac{1}{2} U_{e N} U_{\mu N}= \frac{1}{2} \left( \frac{v^2}{2} \frac{\alpha ^{(1)}_{N l \phi }}{\Lambda ^2}\right) \left( \frac{v^2}{2} \frac{\alpha ^{(2)}_{N l \phi }}{\Lambda ^2}\right) . \end{aligned}$$

Given that we already consider \(\alpha ^{(1)}_{N l \phi }= \alpha _{0\nu \beta \beta }(m_N)\), this tiny value for the electron family coupling almost saturates the bound, leaving us with a possible limit on the muon flavor coupling which ranges from \(\alpha ^{(2)}_{N l \phi }(m_N=100 ~ \text {GeV})\le 2.39\) to \(\alpha ^{(2)}_{N l \phi }(m_N=500 ~ \text {GeV})\le 0.36\) for \(\Lambda =1 ~ \text {TeV}\), which are above the considered values for the effective couplings in the current numerical analysis.

For the tau flavor, the possible bounds that can be considered from \(\tau \rightarrow e \gamma \) decays are even looser. Taking the limit from the most stringent scenario in [81] for \(|\eta _{e \tau }|\lesssim 10^{-5}\) and taking again \(\alpha ^{(1)}_{N l \phi }= \alpha _{0\nu \beta \beta }(m_N)\), we obtain a bound ranging from \(\alpha ^{(3)}_{N l \phi }(m_N=100 ~\text {GeV}) \lesssim 0.68\) to \(\alpha ^{(3)}_{N l \phi }(m_N=500 ~\text {GeV}) \lesssim 0.30\) for \(\Lambda =1 ~ \text {TeV}\).

The exercise of translating the bounds on the type I seesaw mixings to the effective coupling \(\alpha ^{(i=\ell )}_{N l \phi }\) using the relation in (4) can also be done with the recent results from the LHC experiments. For the masses considered in this work, the bounds on \(|U_{\mu N}|^2\) from Ref. [82] on the same-sign dimuons signal are the strongest and give us couplings ranging roughly from \(\alpha ^{(2)}_{N l \phi }(m_N=100 ~ \text {GeV})\lesssim 2.1\) to \(\alpha ^{(2)}_{N l \phi }(m_N=1 ~ \text {TeV}) \lesssim 14.8\) for \(\Lambda =1 ~ \text {TeV}\), which again are above the values considered in this work.

b. Four-fermion operators: charged and neutral currents. The existing constraints on the mixings \(U_{\ell N}\) obtained from N decay-in-flight and peak searches in meson decays can be used as well [53] to constrain the four-fermion charged current operators \(\mathcal {O}_{duNl},\mathcal {O}_{QuNL}, \mathcal {O}_{LNQd}, \mathcal {O}_{QNLd}\) in Table 1 with a method similar to the one used in [52] to rescale the existing bounds on each \(U_{\ell N}\), given that the new operators may only generate a subset of the processes (allowed by the SM and the seesaw mixings) considered for each experiment.

The four-fermion charged currents involving quarks could also induce mono-lepton processes in colliders (\(pp \rightarrow N \ell \)) in which the final state consists of a single observed lepton and missing energy. The bounds from LHC mono-lepton searches recast in Ref. [29] lead to flat bounds \(\alpha ^{(1)} < 0.1\) for the electron flavor operators. These bounds rely on the assumption that the N are long-lived or decay invisibly in the detectors, which is not the case if one considers every effective interaction for the N decay width (see Sect. 3.2), so we do not take them directly into account for this study, since we consider the harder bounds coming from \(0\nu \beta \beta -\)decay. As explained above, the bound \(\alpha _{0\nu \beta \beta }(m_N)=3.2\times 10^{-2}\left( \frac{m_N}{100 ~ ~\text {GeV}} \right) ^{1/2}\) for \(\Lambda =1\) TeV also applies to the couplings \(\alpha ^{(1, 1)}_{duNl}, ~\alpha ^{(1, 1)}_{QuNL}, ~\alpha ^{(1, 1)}_{LNQd}, ~\alpha ^{(1, 1)}_{QNLd}\).

The four-leptons interactions from \(\mathcal {O}_{eNN}, \mathcal {O}_{LNLl}\) are bounded considering mono-photon searches at LEP, with a single photon recoiling against invisible particles (\(e^+ e^- \rightarrow N N \gamma \)). For lower \(m_N\), these operators can be bounded using tau and muon decays [29, 53].

The neutral current vectorial four-fermion operators involving two N fields and light quarks \(\mathcal {O}_{fNN}, f = u, d\) and \(\mathcal {O}_{QNN}\) can induce quark scatterings in which the only visible signal is a single jet, produced mainly by a gluon emitted by any of the quarks, when the heavy N escape undetected and give a missing energy signal. In Ref. [29] these bounds are obtained by recasting LHC mono-jet searches, and would give us flat bounds \(\alpha < 0.4\) for \(\Lambda =1\) TeV. Two comments are in order: as before, these bounds rely on the N being long-lived or escaping undetected. Also, these operators are not involved in the N decay or the processes considered for the LHeC phenomenology, so we do not take them into account in this study.

In our numerical setup, every \(d=6\) operator in Table 1 for every flavor is turned on at the same time. We follow this democratic approach, since it leads to more realistic results, given that in most cases specific BSM UV models will generate not only one operator, but contribute to several operators when the correct matching for the model is calculated, due to operator mixing [32, 53, 54].

We set the values of the effective couplings of the operators contributing to \(0\nu \beta \beta -\)decay for the first family to the value \(\alpha _{0\nu \beta \beta }(m_N)=3.2\times 10^{-2}\left( \frac{m_N}{100 ~ ~\text {GeV}} \right) ^{1/2}\). These operators contribute to the total decay width \(\Gamma _N\), and to the N production vertex in the considered processes. As we aim to obtain sensitivity estimates for heavy Majorana neutrinos N in possible searches at the future LHeC, we do not impose constraints on the other effective operators. We set the other effective couplings to the same value \(\alpha \) but only consider benchmark points with \(\alpha \le 0.3\) for masses in the range \(m_N = 100- 500\) GeV to be conservative. The loop factor in the couplings of the \(\mathcal {O}_{NB}\) and \(\mathcal {O}_{NW}\) operators is considered in the numerical calculations: we fix \(\alpha _{NB}=\alpha _{NW}=\alpha \), but include the loop factor in the interaction vertices, see (A4) and (A5).

3 Collider analysis

In this study we analyze the sensitivity projections for the HNL N with effective interactions at the LHeC, considering its decay to muons and jets. The process with final muons (\(p ~ e^{-} \rightarrow \mu ^{-} + 3 \textrm{j}\)) conserves lepton number but violates flavor (LFV), while the process with final anti-muons (\(p ~ e^{-} \rightarrow \mu ^{+} + 3 \textrm{j}\)) also violates lepton number by two units (LNV). We will analyze the two channels separately.

The LHeC is proposed to be an \(e^{-}p\) collider built at the LHC tunnel, using an electron beam with 60 GeV energy in the the existing 7 TeV proton beam, giving a center-of mass energy close to 1.3 TeV. It is expected to achieve an integrated luminosity \(\mathcal {L}=100 ~{\textrm{fb}}^{-1}\) per operation year, and \(1~ {\textrm{ab}}^{-1}\) in total [57,58,59]. Here we consider an integrated luminosity of \(100 ~ {\textrm{fb}}^{-1}\) to calculate our physical numbers of events.

The numerical tools used for the calculations are the following. We use our implementation of the \(d=6\) Lagrangian introduced in [48] in the FeynRules 2.3 software [83] and generate UFO files [84] as output. The cross sections for the processes \(p ~ e^{-} \rightarrow \mu ^{\pm } + 3 \textrm{j}\) are calculated using MadGraph5_aMC@NLO 3.4.1 [85, 86] generating LHE events at parton level. These are read by the embedded version of PYTHIA 6 [87], which is suited to handle proton-electron collisions, unlike the latest versions, as done in [61, 62]. Then the events are interphased to Delphes 3.5.0 [88] with the default delphes-card-LHeC card for a fast detector simulation. Jets are defined in Delphes using FastJet [89], with the anti\(-k_T\) algorithm.

The analysis of the generated events at the reconstructed level is made with the expert mode in MadAnalysis5 1.8.58 [90], and the multivariate statistical analysis with the Root TMVA package [91].

For concreteness, we simplify the parameter space setting all the \(d=6\) effective couplings \(\alpha _{\mathcal {J}}\) in Eq. (2) and Table 1 to the same numerical value \(\alpha \), and explore a grid of signal benchmark scenarios (see Table 3).Footnote 8 The total decay widths of the heavy Majorana neutrino N are given to MadGraph from our analytical calculations updated from [20] for each \(m_N\) and \(\alpha \) benchmark point (see Sect. 3.2).Footnote 9 We also fix the new physics scale \(\Lambda =1\) TeV and show our results for different N mass values \(m_N = 100, 125, 150, 200, 300, 500\) GeV. In our numerical simulations we have considered hard scattering energies with \(\sqrt{\hat{s}}< \Lambda \) in order to ensure the validity of the effective Lagrangian approach, by imposing the total transverse energy (TET) deposit to be below the new physics scale \(\Lambda \) in every event.

3.1 Signals characterization

The LNV and LFV processes we want to study share the same production mechanism at electron-proton colliders, namely \(p ~ e^{-} \rightarrow N \textrm{j}\) with the subsequent decays \(N \rightarrow \mu ^{\pm } \textrm{jj}\), pictured in Fig. 1. The operators that explicitly contribute to the production mode considered \(q e^{-} \rightarrow N \textrm{j}\) are the same as the ones that contribute to the \(N\rightarrow \mu ^{\pm } \textrm{jj}\) decay (vertices I and II in Fig. 1, respectively). These are the vectorial \(\mathcal {O}_{Nl\phi }\), which provides an NlW vertex which combined with the SM \(W q q'\) contributes to the production and decay of the heavy N, and the four-fermion operators with a single N and two quarks: the vectorial \(\mathcal {O}_{duNl}\) and the scalar \(\mathcal {O}_{QuNl}\), \(\mathcal {O}_{LNQd}\) and \(\mathcal {O}_{QNLd}\). The explicit analytical expressions for the cross-section \(\sigma (p ~ e^{-} \rightarrow N \textrm{j} \rightarrow \mu ^{-} \textrm{jj})\) can be found in [24].

To characterize the kinematics of the production of the heavy N in the LHeC we generate \(10^5\) \(p ~ e^- \rightarrow N \textrm{j}\) parton-level events in MadGraph5_aMC@NLO, for the chosen \(m_N\) benchmark points, with effective couplings fixed to \(\alpha =0.2\). In Fig. 2 we show the Lab-frame physical events distributions of the produced N boost velocity \(\beta _N\), boost factor \(\gamma _N\), energies \(E_N\), and the distances between the N and the beam-jet \(\Delta R (N,\textrm{j})\), as well as the pseudo-rapidities \(\eta \) of the heavy N and the jet (notice the positive \(\hat{z}\) axis points in the incident proton direction, corresponding to \(\theta =0\)).

Fig. 1
figure 1

Majorana neutrino N production and semi-leptonic decay at an eP collider

The large asymmetry in the beam energies at electron-proton colliders tends to boost the final particles in the proton beam direction, identified with large positive pseudo-rapidity \(\eta \) values, affecting the angular correlations in the Lab-frame. The center of mass energy at the LHeC collider is projected to be \(\sqrt{S}= (4 E_p E_e)^{1/2} \sim 1.3\) TeV. This energy allows to produce boosted heavy neutrinos in the Lab-frame, with threshold energies corresponding to the production of the N at rest in the CM-frame. The boost velocity of the threshold CM-frame in the Lab can be obtained as a function of the N mass \(m_N\) and the electron beam energy as

$$\begin{aligned} \beta _{Th}= \frac{m_N^2-4 E_e^2}{m_N^2 +4 E_e^2}, \end{aligned}$$

corresponding to a minimal value of the Bjorken variable \(x_{{Th}}= m_N^2/ 4 E_p E_e\). For higher x values, with \(m_N^2/ 4 E_p E_e< x < 1\), the N and the jet are produced back-to-back in the CM-frame.

Fig. 2
figure 2

Kinematic characterization of \(p ~ e^- \rightarrow N \textrm{j}\) parton-level events at the LHeC

The Lab-frame decay length of the N is given by \(d_N^{~Lab}= \beta \gamma c / \Gamma _N\), with \(\gamma = E_N/m_N\). Given the values of the N decay widths (see Fig. 3), we find all our benchmark signal scenarios produce prompt N decays, with typical mean decay lengths ranging from \(d \sim 10^{-14}\) m for \(m_N=500\) GeV to \(d \sim 10^{-11}\) m for \(m_N=100\) GeV.

The \(m_N=100\) GeV sample has some events with negative \(\eta _N\) pseudo-rapidity values, as the boost velocity of the threshold CM-frame is negative. Thus in some events the heavy N is going backwards in the Lab, and also, in some events the beam jet is turned backwards.

As the N has opposite azimuthal angle to the beam jet, and can be very boosted in the Lab-frame, we expect to probe the LHeC ability to disentangle this signal with a strategy focusing on the separation of the N decay products from the beam jet. When simulating the full production and decay processes, together with the detector simulation and considering possible backgrounds, we find a simple cut-and-count strategy does not lead to statistically significant signal to background separation, suggesting a deeper analysis. This can be understood by considering that while for higher \(m_N\) values the kinematic distributions of the signal tend to show a better separation from the backgrounds, their cross-sections diminish, making difficult to separate their respective contributions (see Fig. 4).

3.2 N decay

We update here the calculation of the N total decay width and branching ratios from our works in [19, 20]. In the \(m_N\) mass range we are considering, the relevant decay channels of the heavy Majorana neutrino are those shown in Fig. 3 (left), plotted here considering the contribution of every effective operator in Table 1 contributing to the N decay, with all couplings fixed to the same value \(\alpha =0.2\) except for the ones contributing to the \(0\nu \beta \beta \)-decay as explained in Sect. 2.1.

Fig. 3
figure 3

Branching ratios of relevant channels (left) and total N decay width (right) considering all the effective operators in Table 1, \(\Lambda =1\) TeV and couplings \(\alpha =0.2\) (see text)

The most relevant channels for \(m_N< m_W\) are those induced by four-fermion operators, which include the decays to charged (anti-)leptons and up and down type quarks \(N \rightarrow lud \), as well as the decays to light neutrinos plus quark or lepton pairs.Footnote 10 For even lower mass \(m_N<30\) GeV, the decay \(N \rightarrow \nu \gamma \) dominates the width. While the channels to W bosons start to dominate at higher mass values, the decays to Z bosons are always sub-dominant (below \(10^{-5}\) for the \(\alpha < 0.3\) \(\Lambda =1\) TeV considered), and we do not show them here. The total N decay width calculated with the full expressions updated from [19, 20] is compared in Fig. 3 (right) with an \(m_N^5\) dependence.

In our numerical simulations throughout this work we give the value of the total N decay-width calculated considering all the effective operators couplings equal to the same value \(\alpha \) for each mass \(\Gamma _{N}(m_N, \alpha )\) as input for the Monte Carlo (MC) events generation in MadGraph for every signal benchmark point in Table 3.Footnote 11

3.3 Backgrounds

Although both studied signals are forbidden by flavor and lepton number conservation in the SM, and thus the signals have no SM irreducible backgrounds, one has to consider SM backgrounds which involve possible lepton charge misidentification, and final states with extra unobserved light neutrinos. We take into account the backgrounds in Table 2, for the analysis of both signal channels, following the discussions in Refs. [61] and [62].

Backgrounds (B1), (B2), and (B5) in Table 2 arise from di-vector boson production together with a jet and and electron, when the vector bosons (\(V=W\) or \(V=Z\)) decay into a pair of jets and a di-muon pair or (anti-)muon and a light neutrino. If the electron is soft, these processes can be confused with signals with extra radiated soft electrons, which cannot be rejected without decreasing the signal efficiency.

The other important backgrounds come from di-vector boson production with a jet and a light neutrino, as in backgrounds labeled (B3), (B4) and (B6) in Table 2. Here, when one of the vectors is a W decaying leptonically, the final state only differs from the signals in additional light neutrinos which escape undetected.

Table 2 Background processes considered for \(p ~ e^{-} \rightarrow \mu ^{\pm } + 3 \textrm{j}\)

In fact, these last kind of processes could also be faked by the N effective interactions, producing heavy neutrinos N instead of light neutrinos, as in \(p ~ e^{-} \rightarrow \textrm{j} N (V V) \rightarrow \textrm{j} N (\textrm{jj} \mu ^{+} {\nu _{\mu }})\) that could, in principle, escape undetected or also be virtual, contributing to the background amplitudes in diagrams as an internal line. We have checked that both contributions are negligible. In the first case, for the heavy \(m_N\) mass values we are considering the N must decay promptly in the detector volume and should be noticed, changing the final state. Second, the diagrams with virtual N include two effective vertices (both with scalar and vector operators) which are suppressed by the neutrinoless double beta decay bound in the case of initial quarks of the first family. The second family initial quarks contribution is also suppressed by their PDF as sea-quarks inside the proton. Both effects force the effective contributions to the backgrounds to be negligible.

One could in principle consider also backgrounds arising from single W production with radiated jets, but we have checked they can be reduced, given that the events with radiated jets can be clearly distinguished from the signals and the backgrounds considered, as the radiated jets have very low \(p_T\). The background arising from single Z production decaying to taus, and then muons, which can give a \(\textrm{jj} \mu ^{\pm }\) plus missing energy final state, has a much smaller cross section, and can be eliminated because these soft muons can be distinguished from the signal. Three vector boson production is not considered, due to its much smaller cross section.

3.4 Pre-selection cuts and generated datasets

We generate \(10^{5}\) MC events for each LFV and LNV signal and the backgrounds in Table 2 in MadGraph. We adopt the following basic acceptance cuts on the generated final leptons \(\ell \) and jets \(\textrm{j}\): \(p_{T}(\ell )>2\) GeV, \(p_{T}(\textrm{j})> 5\) GeV, \(|\eta _{(\ell , \textrm{j})}| \le 4.5\), and no cuts on the possible final photons. We keep the default isolation criteria between any jets and leptons (\(\Delta R_{\textrm{jj}, \ell \ell , \ell \textrm{j}} > 0.4\)).

In Fig. 4 we show the parton-level cross section values we obtain for both the LFV and LNV processes, generating signal events datasets with different values of the masses \(m_N\) and effective couplings \(\alpha \), together with the different background values. For \(m_N\) near 120 GeV, we find an enhancement of the signal cross section, partly due to the contribution of the \(\mathcal {O}^{(i)}_{Nl\phi }\) operator to the N decay when the W is on-shell, which opens when \(m_N> m_W\) and becomes a subleading decay channel when its decay to a Higgs boson and a light neutrino \(N \rightarrow h \nu \) is open for \(m_N> m_h\), as can be seen in Fig. 3.

Fig. 4
figure 4

Parton-level cross sections for the LFV (left) and LNV (right) processes, and the six backgrounds considered (see text and Table 2)

After parton shower and hadronization in Pythia and fast detector simulation with Delphes, we require (for signal reconstruction) a cut on the total transverse energy variable TET\(<900\) GeV to insure the validity of the effective Lagrangian approach (with \(\Lambda =1\) TeV). Also, we ask for only one final muon (or anti-muon), depending on the signal, and at least 3 jets, without cuts on their transverse momenta \(p_T(\textrm{j})\). The MC background sample after this pre-selection cuts has \(3.7\times 10^5\) events for the LFV and \(3.6\times 10^5\) for the LNV channels. The pre-selection cuts preserve more than 90% of the generated events in every signal benchmark sample, for both channels.

These are the pure-signals and pure-backgrounds datasets used to train the multivariate analysis (similarly generated but independent samples are mixed and used as input for the classification application).

3.5 Multivariate analysis

We use many high-level observables obtained from the information of the final reconstructed objects as input for the TMVA analysis to classify signal and background events, using a boosted decision tree (BDT) algorithm.Footnote 12 The BDT was trained with samples of \(3\times 10^4\) MC signal and background events for each benchmark point, randomly chosen from the samples passing the pre-selection cuts described above. The rest of the events in each sample are used for testing. Then the method is applied to classify independently generated samples containing a mixture of \(10^5\) MC events for every signal benchmark and each background process in Table 2.

Fig. 5
figure 5

TMVA input variables distributions for the signal point \(m_N= 100\) GeV and \(\alpha =0.2\), and backgrounds

The discriminating power of the BDT relies on the fact that the signal and the background are characterized by different features that can be entangled. The most relevant distributions are shown in Fig. 5 for \(m_N=100\) GeV and Fig. 6 for \(m_N=500\) GeV, for the LNV channel. These are the transverse momenta and pseudo-rapidities of the final muon (or anti-muon) jets and the missing energy, and also the invariant mass of each jet, of jet pairs, and the invariant masses of pairs of jets and the muons \(M(\textrm{j}_{a}, \textrm{j}_{b}, \mu ^{\pm })\).

Fig. 6
figure 6

TMVA input variables distributions for the signal point \(m_N= 500\) GeV and \(\alpha =0.2\), and backgrounds

We expect the jet beam to be the one with higher \(p_T\), identified as \(\textrm{j}_1\) in our analysis, and thus, the invariant mass \(M(\textrm{j}_{2}, \textrm{j}_{3}, \mu ^{\pm })\) is expected to reconstruct the value of \(m_N\) for each signal benchmark point. This is the case for the lower \(m_N\) values, but as can be seen comparing Figs. 5 and 6, this criterion fails when the N is heavy (and energetic) enough to produce the hardest jet on its decay. As expected, the invariant masses of jet pairs in the background events peak mostly around the vector boson masses, but this also happens in the signal events when the bosonic charged current operator \(\mathcal {O}_{Nl\phi }\) in (A3) drives the N decay.

It can be seen from the plots in Figs. 5 and 6 that the distributions do not favor a cut-and-count strategy to discriminate the signals. Although for the higher mass benchmark sample (\(m_N=500\) GeV) the invariant mass distributions in the last row of Fig. 6 seem to allow for a significant separation, it must be kept in mind the short number of events obtained for this signal benchmark, given the low cross-section value (see also Fig. 4).

In order to obtain the optimized values for the BDT cut for each benchmark sample, we input the expected number of physical background and signal events after applying the pre-selection cuts. As an example, the BDT variable normalized distributions for the LNV (\(pe^{-}\rightarrow \mu ^{+} + 3 \textrm{j}\)) signal datasets with \(m_N=100\) GeV, \(m_N=500\) GeV and \(\alpha =0.2\) and the combined backgrounds are shown in Fig. 7, together with the optimal BDT cut efficiency curves. The plots in Figs. 5, 6 and 7 show the results for the LNV channel \(p e^{-} \rightarrow \mu ^{+} +3 \textrm{j}\). The efficiencies and distributions for the LFV case are similar and we do not show them here. The final number of physical events classified as signal and background for both the LFV and LNV processes are given in Table 3.

Fig. 7
figure 7

BDT normalized distributions and Cut efficiencies for signal and background samples for the LNV process \(p e^{-} \rightarrow \mu ^{+} +3 \textrm{j}\) at LHeC. Top: \(m_N=100\) GeV, \(\alpha = 0.2\). For \(N_s=146\) and \(N_b=212\), BDT cut \(\ge 0.0247\) gives significance \({N_s}/{\sqrt{N_s+N_b}}=11.8419\). Bottom: \(m_N=500\) GeV, \(\alpha = 0.2\). For \(N_{s}=16\) and \(N_b =212\), BDT cut \(\ge 0.1186\) gives significance \({N_s}/{\sqrt{N_s +N_b}}=3.8117\)

We use multivariate analysis to classify events as signal or background for the benchmark signal generated datasets. The numbers of physical events at the LHeC after applying both the pre-selection and BDT classification cuts are shown in Table 3, for an integrated luminosity of \(\mathcal {L}=100 ~ {\textrm{fb}}^{-1}\).

Table 3 Number of events of LFV signal (left) and LNV signal (right) and backgrounds after TMVA classification at the LHeC for an integrated luminosity \(\mathcal {L}=100 ~{\textrm{fb}}^{-1}\)

3.6 Results

The projected sensitivity limits on the effective couplings \(\alpha \) for the heavy Majorana neutrino masses \(m_N\) in the \(100{-}500\) GeV range obtained from both studied processes at the LHeC are shown in Fig. 8.

Fig. 8
figure 8

\(5\sigma \)-Discovery and 95% CL limits, for the muon-trijet (left) and anti-muon-trijet (right) channels at the LHeC (taking \(\Lambda =1\) TeV)

The 95% CLs exclusion limits in the \(m_N- \alpha \) plane are calculated following the PDG review on Statistics [92] and Appendix B in [75]. For each signal point in Table 3 we calculate the upper number of signal events \(s^{up}\) consistent at 95% CLs with the observation of the expected number of background events, by supposing that the data collected in the experiment exactly matches the integer part of the number of events for the background prediction. The shaded areas (lower mass, higher couplings) correspond to the parameter regions where the interpolated expected number of signal events exceeds the upper allowed value \(s^{up}\), and thus the limits are imposed directly on the parameter space values. As the number of events classified as background after the BDT cut changes from one signal benchmark point to another, we show the curves corresponding to the greatest (and lowest) upper number of signal events \(s^{up}\) for each channel. The region between this two curves is displayed in a lighter color in the plots in Fig. 8.

We also show the \(5\sigma \)- discovery contours, obtained using the well-known formula for the signal statistical significance [93]:

$$\begin{aligned} Z \sigma = \sqrt{2 \left[ (N_s+N_b) ~\text {ln} \left( 1+ \frac{N_s}{N_b}\right) -N_s\right] }=5 \sigma . \end{aligned}$$

The lower mass and higher coupling (\(m_N<150\) GeV, \(\alpha >0.15\)) regions in the parameter space could be separated with \(5\sigma \) significance from the expected backgrounds for both the LFV and LNV channels, meaning the LHeC would be able to discover this lepton-trijet signals after collecting \(100 ~{\textrm{fb}}^{-1}\) of data.

The obtained sensitivity prospects show that the LHeC could be able to probe the scenario of a heavy N with a mass near and above the electroweak scale, and constrain the effective couplings (mostly those of the muon family) to a region of the parameter space as tight as the bounds that are currently considered for the \(\mathcal {O}(10)\) GeV scale masses (see Refs. [29, 51, 53] for comparable \(\nu \)SMEFT phenomenological studies). Also, we find the results for discovery and exclusion regions are similar for both the muon-trijet (LFV) and the anti-muon-trijet (LNV) signals studied.

It is important to stress that the bounds in Fig. 8 are obtained for a realistic benchmark scenario configuration where we fix all the couplings of the \(d=6\) \(\nu \)SMEFT operators in Table 1 for every flavor to the same numerical value \(\alpha \), excepting those of the operators contributing to the unobserved neutrinoless double beta decay, corresponding to the first fermions family (\(i=1\)), which are set to the value of the corresponding bound \(\alpha _{0\nu \beta \beta }(m_N) = 3.2\times 10^{-2}\left( \frac{m_N}{100 ~ ~\text {GeV}} \right) ^{1/2}\) for \(\Lambda =1\) TeV. The loop factor in the couplings of the \(\mathcal {O}_{NB}\) and \(\mathcal {O}_{NW}\) dipole operators is included in the numerical calculations: we fix \(\alpha _{NB}=\alpha _{NW}=\alpha \), but take into account the loop factor in the interaction vertices, as presented in the Lagrangian terms (A4) and (A5).

Also, the \(\nu \)SMEFT approach considered here only includes one heavy Majorana neutrino N as observable degree of freedom, and neglects the renormalizable type I seesaw mixing terms it could have with the active neutrinos. In this sense, the obtained sensitivity prospects for the LHeC appear to be looser than those obtained in Refs. [61, 62], but they are just non-comparable.

The interested reader can find sensitivity prospects for the near-future experiments concerning the dimension-6 \(\nu \)SMEFT interactions in [42] for a long-lived N at the LHC exploiting possible displaced vertices searches, and in [49] for prompt and displaced N decays at future Higgs factories, in both cases for lighter N benchmark scenarios with \(m_N \lesssim 60 ~\text {GeV}\). Also, a variety of testable signals in planned experiments are discussed in [51]. We have explored the discovery potential for the heavy N at the ILC in [48]. However, we are leaving for future work a systematic comparison of the sensitivity reaches of near-future planned experiments.

4 Summary

In this paper we study the prospects of the future LHeC electron-proton collider to discover or constrain the \(\nu \)SMEFT interactions, performing the first dedicated and realistic analysis of the well-known lepton-trijet signals, both for the lepton flavor violating \(p ~ e^{-} \rightarrow \mu ^{-} + 3 \textrm{j}\) (LFV) and the lepton number violating \(p ~ e^{-} \rightarrow \mu ^{+} + 3 \textrm{j}\) (LNV) channels. Despite both processes being irreducible SM background-free, we take into account the possible backgrounds due to charge misidentification and final states with extra unobserved light neutrinos, performing a dedicated simulation and analysis at the reconstructed level.

The effective field theory extending the standard model with sterile right-handed neutrinos \(\nu \)SMEFT is an adequate tool for parameterizing new high-scale weakly coupled physics in a model independent manner, and allows for a systematic study of the HNLs phenomenology in current and future experiments. We consider heavy Majorana neutrinos coupled to ordinary matter by dimension 6 effective operators, focusing on a simplified scenario with only one right-handed neutrino added, and consider every operator contributing to the N production and decay channels.

A thorough signal kinematic characterization for the LHeC environment would suggest the use of the separation of the N decay products from the beam jet for a search strategy, but we find the detailed simulation of the complete N production and decay to the lepton-trijet final states cannot be significantly discriminated from the backgrounds on a cut-and-count approach at the reconstructed level. We thus focus on testing the performance of a multivariate analysis with a boosted decision tree (BDT) algorithm.

The obtained 95% CLs exclusion limits in the \(m_N- \alpha \) plane are presented in Fig. 8, showing that the LHeC could constrain the effective couplings (mostly those of the muon family) to a region of the parameter space as tight as the bounds that are currently considered for the sub-electroweak scale masses (see Refs. [29, 51, 53]).

Our results demonstrate that the LHeC is also an excellent facility for discovering heavy Majorana neutrinos around the electroweak scale, even in the limit of the \(\nu \)SMEFT where one discards their mixing with the active neutrino states. The LNV and LFV lepton-trijet signatures could be “golden channels” for HNLs searches. A discovery of heavy neutrinos would have far-reaching consequences, but also constraining the possible new physics involved in neutrino mass generation can be a path to resolving the origin of the observed neutrino masses, which is one of the most challenging open questions in particle physics.