1 Introduction

The standard model (SM) cannot explain the present matter–antimatter asymmetry in our universe [1,2,3]. A possibility to tackle this problem is to consider additional sources of baryon number violation, as predicted in several beyond the SM (BSM) contexts, such as Grand Unified Theories [4]. On the other side, no baryon number (B) violating (BNV) process has been observed so far, despite the numerous experimental searches on BNV decays of nucleons—which provide the most stringent constraints—hadrons, heavy quarks and leptons, and Z boson [5].

Without assuming any specific model, an effective field theory (EFT) approach can be adopted to describe BNV processes. The first attempt in this direction goes back to the late 1970s [6,7,8,9], followed by a few more recent studies [10,11,12]. All these analyses are performed in the so-called SM effective field theory (SMEFT) context, characterised by the construction of non-renormalisable operators, invariant under the SM gauge symmetries, and built up in terms of SM fermions, gauge bosons and the \(SU(2)_L\)-doublet scalar boson (“Higgs” for short) [13,14,15]. The cut-off of the theory suppressing these operators will be referred to as \(\Lambda _{B}\). At the lowest order in the expansion in \(1/\Lambda _{B}\), four BNV independent structures of canonical dimension \(d=6\) were identified [6,7,8,9],

$$\begin{aligned} \mathcal {O}_{1}&=\bar{d}^{C}_{R\alpha }u_{R\beta } \ \bar{Q}^{C}_{L\gamma i}L_{Lj} \ \epsilon _{ij}\ \epsilon _{\alpha \beta \gamma },\nonumber \\ \mathcal {O}_{2}&=\bar{Q}^{C}_{Li\alpha }Q_{Lj\beta } \ \bar{u}^{C}_{R\gamma }e_{R } \ \epsilon _{ij}\ \epsilon _{\alpha \beta \gamma },\nonumber \\ \mathcal {O}_{3}&=\bar{Q}^{C}_{Li\alpha }Q_{Lj\beta } \ \bar{Q}^{C}_{L\gamma k}L_{Ll} \ \epsilon _{il}\,\epsilon _{kj}\ \epsilon _{\alpha \beta \gamma },\nonumber \\ \mathcal {O}_{4}&=\bar{d}^{C}_{R\alpha }u_{R\beta } \ \bar{u}^{C}_{R \gamma }e_{R} \ \epsilon _{\alpha \beta \gamma },\nonumber \end{aligned}$$
(1)

where \(Q_L\equiv (u_L, d_L)^T\), \(u_R\), \(d_R\), \(L_L\equiv (\nu _L, e_L)^T\), and \(e_R\) are the SM fermions, \(\epsilon _{\alpha \beta \gamma }\) and \(\epsilon _{ij}\) are the antisymmetric tensors for the colour and electroweak (EW) contractions. If right-handed (RH) neutrinos, \(N_R\), are considered in addition, this set is extended by two operators:

$$\begin{aligned} \mathcal {O}_{5}&=\bar{Q}^{C}_{Li\alpha }Q_{Lj\beta } \ \bar{d}^{C}_{R\gamma }N_{R } \ \epsilon _{ij} \epsilon _{\alpha \beta \gamma },\nonumber \\ \mathcal {O}_{6}&=\bar{u}^{C}_{R\alpha }d_{R\beta } \ \bar{d}^{C}_{R\gamma }N_{R} \ \epsilon _{\alpha \beta \gamma }.\nonumber \end{aligned}$$
(2)

The operators listed in the previous equations refer only to one generation of fermions. Moving to the three generation case does not require the introduction of additional structures, but only to insert explicitly flavour indices on the fermion fields.

The operators in Eqs. (1) and (2) preserve \(B-L\) with \(\Delta B=+1=\Delta L\), and then a baryon can only decay into an anti-lepton and a meson. The constraints on the proton lifetime [16,17,18] translate into a lower bound on the cut-off \(\Lambda _{B}\) of about \(10^{15}\mathrm {\,GeV}\), independently of the specific flavour contraction that can be considered for each operator. On the contrary, when a flavour symmetry is considered, such as the so-called Minimal Flavour Violation ansatz in its global [19,20,21,22,23,24,25,26,27,28] or gauged [24, 29,30,31,32,33,34] versions, the scale \(\Lambda _{B}\) can be lowered, but still it will be much larger than the electroweak scale \(v\approx 246\mathrm {\,GeV}\).

The basic ingredient of the SMEFT construction is the treatment of the Higgs field as an exact EW doublet. Although this hypothesis is currently supported by collider searches (see for example Ref. [35]), the present uncertainties leave open the possibility for alternative descriptions of the EW symmetry breaking (EWSB) mechanism, potentially free from the Hierarchy problem. Still in the context of effective approaches, a description that allows for deviations from the exact EW doublet representation for the Higgs field is the so-called Higgs effective field theory (HEFT) Lagrangian that generalises the SMEFT one. The HEFT Lagrangian is the most general description of gauge and Higgs couplings, respecting the paradigm of Lorentz and \(SU(3)_c\times SU(2)_L\times U(1)_Y\) gauge invariance: it is a very useful tool to describe an extended class of “Higgs” models, from the SM and the SMEFT scenarios, to Goldstone Boson Higgs models [36,37,38,39,40,41,42] and dilaton-like constructions [43,44,45,46,47].

The aim of this paper is to construct the BNV operator basis in the HEFT context, completing in this way previous studies on the HEFT framework.

In the next section, the HEFT setup is summarised and the BNV basis is presented. The comparison between the HEFT basis and the corresponding one in the SMEFT setup is discussed in Sect. 3. The counting of the distinct flavour contractions, considering a generic number of fermions, is performed in Sect. 4, based on the Hilbert series technique. The latter is a mathematical method from invariant theory to count the number of independent structures invariant under a certain symmetry group (for recent phenomenology applications see Refs. [48,49,50,51,52,53]).

2 The BNV HEFT Lagrangian

The crucial difference between the SMEFT and the HEFT is the relationship between the physical Higgs field h(x) and the SM Goldstone bosons (GBs) \(\overrightarrow{\pi }(x)\): in the SMEFT, the four fields belong to the \(SU(2)_L\) doublet \(\Phi (x)\),

$$\begin{aligned} \Phi (x)=\mathbf {U}(x) \begin{pmatrix} 0 \\ \frac{v+h(x)}{\sqrt{2}} \end{pmatrix}, \end{aligned}$$
(3)

where

$$\begin{aligned} \mathbf {U}(x)\equiv e^{i\overrightarrow{\sigma }\cdot \overrightarrow{\pi }(x)/v} \end{aligned}$$
(4)

is the GB matrix. In the HEFT, instead, the physical Higgs and the GB matrix are treated as independent objects [54,55,56,57,58,59,60,61,62]. This fact, together with the adimensionality of the GB matrix, leads to a much larger number of operators in the HEFT with respect to the SMEFT, at the same order in the expansion. As a consequence, HEFT exhibits the following distinguishing features [62,63,64,65,66,67,68]:

  • several correlations typical of the SMEFT, such as those between triple and quartic gauge couplings, are lost in the HEFT;

  • Higgs couplings are completely free in the HEFT, while they can be correlated to pure gauge couplings in the SMEFT;

  • some couplings that are expected to be strongly suppressed in the SMEFT, are instead predicted with higher strength in the HEFT and are potentially visible in the present LHC run.

In Ref. [62], the complete HEFT Lagrangian, invariant under baryon and lepton numbers, has been presented at first order in the expansion on the new physics scale [69], making explicit the custodial symmetry nature of the operators. The building blocks used for these structures are the SM gauge bosons, the GB matrix \(\mathbf {U}(x)\), the physical Higgs field h(x) and the SM fermions arranged in doublets of the global \(SU(2)_L\) or \(SU(2)_R\) symmetries. Arranging the RH fermionsFootnote 1 in doublets of \(SU(2)_R\),

$$\begin{aligned} Q_{R}= \begin{pmatrix} u_{R} \\ d_{R} \end{pmatrix} \qquad L_{R}= \begin{pmatrix} N_{R} \\ e_{R} \end{pmatrix}, \end{aligned}$$
(5)

allows one to distinguish the custodial symmetry preserving operators from those that instead violate it. Furthermore, this notation is consistent with the HEFT formalism, where the GBs matrix \(\mathbf {U}(x)\) transforms as a bi-doublet of the global \(SU(2)_L\times SU(2)_R\) symmetry,

$$\begin{aligned} \mathbf {U}(x)\rightarrow L\, \mathbf {U}(x)\, R^\dag , \end{aligned}$$
(6)

being L, R the unitary transformation associated to \(SU(2)_{L,R}\), respectively. Indeed, the Yukawa couplings are given by

$$\begin{aligned} \bar{Q}_{L}\mathbf {U}(x)Y_{Q}Q_{R},\qquad \bar{L}_{L}\mathbf {U}(x)Y_{L}L_{R}, \end{aligned}$$
(7)

where the Yukawa matrices are written in a compact notation as \(6\times 6\) block-diagonal matrices in the flavour space, \(Y_{Q}=\mathrm{diag}(Y_{u},Y_{d})\) and \(Y_{L}=\mathrm{diag}(Y_{\nu },Y_{e})\). Furthermore, it is useful to introduce the scalar chiral field \(\mathbf {T}(x)\),

$$\begin{aligned} \mathbf {T}(x)\equiv \mathbf {U}(x)\sigma _3 \mathbf {U}(x)^\dag ,\qquad \mathbf {T}(x)\rightarrow L \mathbf {T}(x) L^\dag , \end{aligned}$$
(8)

which breaks \(SU(2)_R\), while preserving \(SU(2)_L\), and therefore behaves as a spurion for the custodial symmetry.

In the HEFT Lagrangian, the dependence on the physical Higgs is conventionally described through adimensional generic functions \(\mathcal {F}(h/v)\) [54, 70], being v the EW vacuum expectation value. These functions are commonly written as a polynomial expansion in h / v, \(\mathcal {F}(h/v)=1+\alpha (h/v)+\beta (h/v)^2+\cdots \), which follows from the fact that the physical Higgs is an isosinglet scalar of the EW symmetry. The study of the scalar field manifold, depending on the specific \(\mathcal {F}(h)\), can indeed lead to phenomenological consequences, allowing to disentangle between different frameworks. This has been analysed in Refs. [71,72,73].

One could expect that the basis of BNV operators introduced in Eqs. (1) and (2) will not be modified in the HEFT framework, as they are purely fermionic. Indeed, these six operators are simply rewritten in terms of \(SU(2)_L\) and \(SU(2)_R\) fermion doublets. However, the fact that the GB matrix \(\mathbf {U}\) and the chiral scalar field \(\mathbf {T}\) are adimensional allows one to construct additional independent structures with the same canonical dimensions.

The set of operators that constitutes the BNV HEFT basis, at the first order in the expansion on \(\Lambda _{B}\), consists of 12 independent structures:

$$\begin{aligned} \mathcal {R}_{1}&=\bar{Q}^{C}_{Li\alpha }Q_{Lj\beta } \, \bar{Q}^{C}_{Lk\gamma }L_{Ll} \, \epsilon _{il}\, \epsilon _{kj} \, \epsilon _{\alpha \beta \gamma } \, \mathcal {F}_{1}(h)\nonumber \\ \mathcal {R}_{2}&=\bar{Q}^{C}_{Li\alpha }Q_{Lj\beta } \ \bar{Q}^{C}_{Lk\gamma }(\mathbf {T}L_{L})_{l} \, \epsilon _{il}\,\epsilon _{kj} \, \epsilon _{\alpha \beta \gamma } \, \mathcal {F}_{2}(h)\nonumber \\ \mathcal {R}_{3}&=\bar{Q}^{C}_{Ri\alpha }Q_{Rj\beta } \ \bar{Q}^{C}_{Rk\gamma }L_{Rl} \, \epsilon _{il}\,\epsilon _{kj} \, \epsilon _{\alpha \beta \gamma } \, \mathcal {F}_{3}(h)\nonumber \\ \mathcal {R}_{4}&=\bar{Q}^{C}_{Ri\alpha }Q_{Rj\beta } \, \bar{Q}^{C}_{Rk\gamma }(\mathbf {U}^{\dagger }\mathbf {T}\mathbf {U}L_{R})_{l} \, \epsilon _{il}\,\epsilon _{kj} \, \epsilon _{\alpha \beta \gamma } \, \mathcal {F}_{4}(h)\nonumber \\ \mathcal {R}_{5}&=\bar{Q}^{C}_{Ri\alpha }Q_{Rj\beta }\bar{Q}^{C}_{Lk\gamma }L_{Ll} \, \epsilon _{ij}\,\epsilon _{kl} \, \epsilon _{\alpha \beta \gamma } \, \mathcal {F}_{5}(h)\nonumber \\ \mathcal {R}_{6}&=\bar{Q}^{C}_{Ri\alpha }Q_{Rj\beta }\bar{Q}^{C}_{Lk\gamma }(\mathbf {T}L_{L})_{l} \, \epsilon _{ij}\,\epsilon _{kl} \ \epsilon _{\alpha \beta \gamma } \, \mathcal {F}_{6}(h)\nonumber \\ \mathcal {R}_{7}&=(\bar{Q}^{C}_{R\alpha }\mathbf {U}^{t})_{i}(\mathbf {T}\mathbf {U}Q_{R\beta })_{j}\bar{Q}^{C}_{Lk\gamma }L_{Ll} \, \epsilon _{il}\,\epsilon _{kj} \, \epsilon _{\alpha \beta \gamma } \, \mathcal {F}_{7}(h)\nonumber \\ \mathcal {R}_{8}&=(\bar{Q}^{C}_{R\alpha }\mathbf {U}^{t})_{i}(\mathbf {T}\mathbf {U}Q_{R\beta })_{j}\bar{Q}^{C}_{Lk\gamma }(\mathbf {T}L_{L})_{l} \, \epsilon _{il}\,\epsilon _{kj} \, \epsilon _{\alpha \beta \gamma } \, \mathcal {F}_{8}(h)\nonumber \\ \mathcal {R}_{9}&=\bar{Q}^{C}_{Li\alpha }Q_{Lj\beta }\bar{Q}^{C}_{Rk\gamma }L_{Rl} \, \epsilon _{ij}\,\epsilon _{kl} \, \epsilon _{\alpha \beta \gamma } \, \mathcal {F}_{9}(h)\nonumber \\ \mathcal {R}_{10}&=\bar{Q}^{C}_{Li\alpha }Q_{Lj\beta }\bar{Q}^{C}_{Rk\gamma }(\mathbf {U}^{\dagger }\mathbf {T}\mathbf {U}L_{R})_{l} \, \epsilon _{ij}\,\epsilon _{kl} \, \epsilon _{\alpha \beta \gamma } \, \mathcal {F}_{10}(h)\nonumber \\ \mathcal {R}_{11}&=(\bar{Q}^{C}_{L\alpha }\mathbf {U}^*)_i(\mathbf {U}^\dagger \mathbf {T}Q_{L\beta })_ j\bar{Q}^{C}_{Rk\gamma }L_{Rl} \, \epsilon _{il}\,\epsilon _{kj} \, \epsilon _{\alpha \beta \gamma } \, \mathcal {F}_{11}(h)\nonumber \\ \mathcal {R}_{12}&=(\bar{Q}^{C}_{L\alpha }\mathbf {U}^*)_i(\mathbf {U}^\dagger \mathbf {T}Q_{L\beta })_ j\bar{Q}^{C}_{Rk\gamma }(\mathbf {U}^{\dagger }\mathbf {T}\mathbf {U}L_{R})_{l} \, \epsilon _{il}\,\epsilon _{kj}\, \epsilon _{\alpha \beta \gamma } \, \mathcal {F}_{12}(h). \end{aligned}$$
(9)

Other BNV operators can be constructed, but are redundant with respect to the structures in this list. For example, one could consider an operator similar to \(\mathcal {R}_2\), with \(\mathbf {T}\) contracted to the second quark doublet instead than to the lepton doublet: however, \(\mathcal {R}_1\), \(\mathcal {R}_2\) and this alternative operator are not independent among each other and one should choose only two of them. Other examples will be discussed in Sect. 4.

All the operators in this list have canonical mass dimension 6 and therefore are suppressed by \(\Lambda ^2_{B}\). Indeed, the insertion of the scalar chiral field \(\mathbf {T}\) or of the GB matrix does not lead to any additional mass suppression. Among these 12 operators, only four of them are custodial symmetry preserving, \(\mathcal {R}_{1}\), \(\mathcal {R}_{3}\), \(\mathcal {R}_{5}\) and \(\mathcal {R}_{9}\), and thus do not contain the custodial spurion \(\mathbf {T}\).

When ignoring RH neutrinos, the number of independent operators reduces to nine: in particular, \(\mathcal {R}_4\), \(\mathcal {R}_{10}\) and \(\mathcal {R}_{12}\) turn out to be vanishing or redundant with respect to the other structures.

3 Comparison with the SMEFT

The BNV SMEFT operators in Eqs. (1) and (2) and the ones in Eq. (9) present a series of similarities:

  • all the operators can be written in terms of scalar currents, being the other type of contractions vanishing or redundant by the Fierz identity;

  • both bases contain operators classified into four distinct classes: schematically, \(Q_{L}Q_{L}Q_{L}L_{L}\), \(Q_{R}Q_{R}Q_{R}L_{R}\), \(Q_{L}Q_{L}Q_{R}L_{R}\) and \(Q_{R}Q_{R}Q_{L}L_{L}\);

  • the operators in both bases preserve \(B-L\).

On the other side, there is not a one-to-one relation between the two sets of operators, as indeed:

  • the \(d=6\) SMEFT basis consists of only six independent operators, while the HEFT one presents 12 structures;

  • only two combinations of SMEFT operators, \(\mathcal {O}_{4}-\mathcal {O}_{6}\) and \(\mathcal {O}_{2}+\mathcal {O}_{5}\), in Eqs. (1) and (2), contain sources of custodial symmetry breaking; on the other hand, all the operators in Eq. (9) are custodial symmetry breaking, except for \(\mathcal {R}_{1}\), \(\mathcal {R}_{3}\), \(\mathcal {R}_{5}\) and \(\mathcal {R}_{9}\);

  • \(B-L\) non-invariant operators can be found in the SMEFT Lagrangian at dimensions different from six [74], while this is not the case in the HEFT, where indeed \(B-L\) invariance is guaranteed by hypercharge invariance. This follows from two facts: first, hypercharge can be identified with \(B-L\) in theories invariant under the \(SU(2)_L\times SU(2)_R\) symmetry, such as in left–right symmetric models [75, 76]. In these frameworks, as the RH fermions also belong to an SU(2) doublet representation, and they have the same electric charge as their left-handed (LH) counterparts, both LH and RH fields must have the same hypercharge, \(-1\) for leptons and 1 / 3 for quarks, in a given convention. In a compact notation, then the hypercharge can be written as \(B-L\):

    $$\begin{aligned} \psi _L&\rightarrow e^{i(B-L)\theta (x)}\psi _L\nonumber \\ \psi _R&\rightarrow e^{i(B-L)\theta (x)}e^{i\theta (x)\sigma _3}\psi _R,\nonumber \end{aligned}$$
    (10)

    where \(\theta (x)\) is the transformation parameter. The second fact which guarantees the identification of hypercharge and \(B-L\) is that the only spurion breaking \(SU(2)_R\), in the HEFT context is the scalar chiral field \(\mathbf {T}\). As it does not carry hypercharge, its insertion in an operator cannot lead to hypercharge violation, neither of \(B-L\). In the SMEFT, where hypercharge and \(B-L\) are independent, SM gauge invariant operators can violate \(B-L\), and the lowest dimensional example is the so-called Weinberg operator \((\bar{L}^c_L\tilde{\Phi }^*)(\tilde{\Phi }^\dag L_L)\). In HEFT, this operator cannot be constructed, unless other sources of \(SU(2)_R\) violation are considered. As a title of example, one could consider the Pauli matrix \(\sigma _+=(\sigma _1+i\sigma _2)/2\), which allows one to write the equivalent to the Weinberg operator in HEFT [77]:

    $$\begin{aligned} (\bar{L}^c_L\mathbf {U}^*)\sigma _+(\mathbf {U}^\dag L_L). \end{aligned}$$
    (11)

    This operator preserves hypercharge, but violates \(SU(2)_R\) and lepton number by two units, as can be seen by writing explicitly the transformation under hypercharge of the GB matrix:

    $$\begin{aligned} \mathbf {U}(x)\rightarrow \mathbf {U}(x) e^{-i\theta (x)\sigma _3}. \end{aligned}$$
    (12)

    Notice that this is a three dimensional operator and therefore provides a direct mass term for the light active neutrinos. In contrast, the Weinberg operator in the SMEFT is of \(d=5\) and thus suppressed by a power of the mass scale at which lepton number is broken. This is an example of the strong impact of the adimensionality of the GB matrix \(\mathbf {U}\) with respect to the \(SU(2)_L\) doublet Higgs of the SMEFT. In the rest of the paper, no other sources of \(SU(2)_R\) violation will be considered beside \(\mathbf {T}\), consistently with previous studies in the HEFT context.

It is interesting to determine the connection between the operators in Eq. (9) and those in Eqs. (1) and (2), as it will help to identify possible ways to distinguish the two descriptions. The connection for the HEFT operators which do not contain GBs is straightforward:

$$\begin{aligned}&\mathcal {R}_1 \rightarrow \mathcal {O}_3\nonumber \\&\mathcal {R}_3 \rightarrow \mathcal {O}_4+\mathcal {O}_6\nonumber \\&\mathcal {R}_4 \rightarrow -\mathcal {O}_4+\mathcal {O}_6\\&\mathcal {R}_5 \rightarrow -\mathcal {O}_1\nonumber \\&\mathcal {R}_9 \rightarrow \mathcal {O}_2-\mathcal {O}_5\nonumber \\&\mathcal {R}_{10} \rightarrow -\mathcal {O}_2-\mathcal {O}_5.\nonumber \end{aligned}$$
(13)

Notice, indeed, that the combination \(\mathbf {U}^{\dagger }\mathbf {T}\mathbf {U}\) appearing in \(R_4\), \(R_{10}\) and \(R_{12}\) simplifies to \(\sigma _3\) once using the definition of \(\mathbf {T}\) in Eq. (8). This list shows that there is a linear correspondence between six operators of the HEFT basis and the six operators of the \(d=6\) SMEFT one. The other HEFT operators contain interactions that can be described by SMEFT operators with dimension 8. An example is the following:

$$\begin{aligned} \mathcal {R}_2 \rightarrow \bar{Q}^{C}_{Li\alpha }Q_{Lj\beta }\bar{Q}^{C}_{Lk\gamma }[(\tilde{\Phi }\tilde{\Phi }^\dag - \Phi \Phi ^\dag )L_{L}]_l \epsilon _{il}\epsilon _{kj} \ \epsilon _{\alpha \beta \gamma }\quad \end{aligned}$$
(14)

where the h-independent couplings of the combination \(\tilde{\Phi }\tilde{\Phi }^\dag -\Phi \Phi ^\dag \) in the unitary gauge play the same role as the scalar chiral field \(\mathbf {T}\) in \(\mathcal {R}_2\).

The study of the connections between the HEFT and SMEFT operators leads to the conclusion that several correlations typical of the SMEFT are lost in the HEFT and that some couplings that are expected to be strongly suppressed in the SMEFT are instead predicted to be relevant in HEFT. This fact has already been pointed out in Refs. [60, 62, 64] for the B and L invariant couplings and is confirmed here for the B and L non-invariant ones. An example is the comparison between the decay rates of the proton and of the neutron: \(\Gamma (p\rightarrow \pi ^0e^+)\) and \(\Gamma (n\rightarrow \pi ^0\bar{\nu }_e)\). In the \(d=6\) SMEFT framework, the values of these two observables are predicted to be exactly the same, while this correlation can be broken considering \(d=8\) operators. On the other side, in the HEFT context, the operators \(R_2\), \(R_6\), \(R_7\), \(R_8\), \(R_{11}\), \(R_{12}\) contribute differently to the two decay rates, and no correlation arises at any order. An experimental discrepancy among these two observables could then be explained either in terms of the SMEFT, but advocating \(d=8\) contributions, or in terms of the HEFT Lagrangian. The magnitude of the discrepancy is what could tell which is the correct description: a relative difference between the two decay rates larger than about \((v^2/\Lambda _{B}^2)^2\) cannot be compatible with the \(d=8\) SMEFT Lagrangian and, instead, could well be accounted for in the HEFT context.

At present, the non-observation of the proton decay puts a lower bound on the ratio \(\Lambda _{B}/c_i\) of about \(10^{15}\,\mathrm {GeV}\), where \(c_i\) represents the combination of the operator coefficients entering the proton decay rate. As a result, this strategy to disentangle the two frameworks is an interesting feature from the theoretical side, although experimentally is not viable yet. Moreover, it allows one to estimate the order of magnitude of the contributions to these decay rates from the \(d=8\) SMEFT operators of about \(10^{-51}\), with respect to those from the \(d=6\) ones.

4 Flavour contraction counting

The number of independent flavour contractions can be counted directly considering the symmetries of the operators in Eq. (9). Alternatively, one can adopt the Hilbert series technique, which provides a polynomial function whose terms can be matched with the operators in Eq. (9) and the corresponding coefficients count the number of independent flavour contractions. Although the matching is straightforward in the absence of scalar fields, as for the BNV HEFT operators considered here, one should be careful when dealing with structures containing the fields \(\mathbf {T}\) and \(\mathbf {U}\), in order to remove the redundancies due to \(\mathbf {T}^2 =\mathbbm {1}\) and \(\mathbf {U}^\dag \mathbf {U}=\mathbbm {1}\).

The discussion of the number of flavour contractions adopting the Hilbert series technique is presented below, considering in all generality \(N_f\) fermion families.

The counting for \(\mathcal {R}_1\) is \(N_f^2(2N_f^2+1)/3\) and coincides with the one in Ref. [12], where it is discussed in terms of flavour representations by using Young tableaux. The counting of \(\mathcal {R}_2\) is the same as \(\mathcal {R}_1\), as \(\mathbf {T}\) only adds a flip of sign in the second component of the lepton doublet. A few cases with \(\mathbf {T}\) insertions in the \(Q_{L}Q_{L}Q_{L}L_{L}\) (LLLL for brevity) operators are redundant and have been subtracted from the total counting.

For the \(Q_{R}Q_{R}Q_{R}L_{R}\) (RRRR) operators, \(\mathcal {R}_3\) and \(\mathcal {R}_4\), which are written exclusively in terms of \(SU(2)_R\) doublets, the counting simply mirrors that of the LLLL ones and each operator presents \(N_f^2(2N_f^2+1)/3\) flavour contractions. This is not consistent with the results in the SMEFT case (see Refs. [12, 78]), where the total number of flavour contractions for the RRRR structures, \(\mathcal {O}_4\) and \(\mathcal {O}_6\), is \(2N_f^4\). This apparent contradiction is easily solved noticing that the \(SU(2)_R\) symmetry is still partially preserved in the operators \(\mathcal {R}_3\) and \(\mathcal {R}_4\) and prevents part of the possible flavour contractions among four RH singlet fermions. Indeed, rewriting explicitly the flavour indices abcd, one gets

$$\begin{aligned} \mathcal {R}_3^{abcd}&= \mathcal {O}_4^{\{bc\}ad}+\mathcal {O}_6^{\{bc\}ad},\\ \mathcal {R}_4^{abcd}&= -\mathcal {O}_4^{\{bc\}ad}+\mathcal {O}_6^{\{bc\}ad},\nonumber \end{aligned}$$
(15)

where the brackets should read \(\mathcal {O}_i^{\{ab\}cd}\equiv \mathcal {O}_i^{abcd} +\mathcal {O}_i^{bacd}\). This shows that \(\mathcal {R}_3\) and \(\mathcal {R}_4\) only contain the flavour symmetric contractions in b and c of the SMEFT operators. The flavour antisymmetric contractions are instead described by two additional structures:

$$\begin{aligned} \mathcal {R}_3^\prime&=\bar{Q}^{C}_{Ri\alpha }(\mathbf {U}^{\dagger }\mathbf {T}\mathbf {U}Q_{R\beta })_j \ \bar{Q}^{C}_{Rk\gamma }L_{Rl} \, \epsilon _{il}\,\epsilon _{kj} \, \epsilon _{\alpha \beta \gamma },\nonumber \\ \mathcal {R}_4^\prime&=\bar{Q}^{C}_{Ri\alpha }(\mathbf {U}^{\dagger }\mathbf {T}\mathbf {U}Q_{R\beta })_j \, \bar{Q}^{C}_{Rk\gamma }(\mathbf {U}^{\dagger }\mathbf {T}\mathbf {U}L_{R})_{l} \, \epsilon _{il}\,\epsilon _{kj} \, \epsilon _{\alpha \beta \gamma }.\nonumber \end{aligned}$$
(16)

These two operators are redundant with respect to \(R_3\) and \(R_4\) for \(N_f=1\), but they should be added to the list in Eq. (9) for \(N_f>1\) (see Ref. [9] for a similar discussion in the SMEFT). The number of the flavour contractions of these four RRRR operators sums up to \(2N_f^4\) matching the result for the SMEFT case.

The operators \(\mathcal {R}_5\)\(\mathcal {R}_8\) exhibit a \(Q_{R}Q_{R}Q_{L}L_{L}\) (RRLL) structure. Among these, only \(\mathcal {R}_5\) can be directly related to a \(d=6\) operator of the SMEFT Lagrangian. Rewriting the expression for \(\mathcal {R}_5\) in Eq. (13), making explicit the flavour indices, one can see that \(\mathcal {R}_5\) only contains part of the interactions described by \(\mathcal {O}_1\):

$$\begin{aligned} \mathcal {R}_5^{abcd}=-\mathcal {O}_1^{\{ab\}cd}. \end{aligned}$$
(17)

Similarly, the operator \(\mathcal {R}_6^{abcd}\) contains only the flavour contractions symmetric in a and b. It is therefore necessary to introduce two additional operators that completely break the \(SU(2)_R\) structure between the first two \(SU(2)_R\) quark doublets in \(\mathcal {R}_5\) and \(\mathcal {R}_6\):

$$\begin{aligned} \mathcal {R}^\prime _{5}&=\bar{Q}^{C}_{Ri\alpha }(\mathbf {U}^{\dagger }\mathbf {T}\mathbf {U}Q_{R\beta })_j\bar{Q}^{C}_{Lk\gamma }L_{Ll} \, \epsilon _{ij}\,\epsilon _{kl} \, \epsilon _{\alpha \beta \gamma } \, \mathcal {F}_{5}(h).\nonumber \\ \mathcal {R}^\prime _{6}&=\bar{Q}^{C}_{Ri\alpha }(\mathbf {U}^{\dagger }\mathbf {T}\mathbf {U}Q_{R\beta })_j\bar{Q}^{C}_{Lk\gamma }(\mathbf {T}L_{L})_{l} \, \epsilon _{ij}\,\epsilon _{kl} \ \epsilon _{\alpha \beta \gamma } \, \mathcal {F}_{6}(h). \end{aligned}$$
(18)

As for the previous case, these two structures are redundant with \(\mathcal {R}_5\) and \(\mathcal {R}_6\) for \(N_f=1\), otherwise they should be added to the basis. \(\mathcal {R}^\prime _{5}\) and \(\mathcal {R}^\prime _{6}\) contain the interactions with the combinations antisymmetric in a and b. Therefore \(\mathcal {R}_5\) and \(\mathcal {R}^\prime _{5}\) provide altogether the flavour contractions of the SMEFT operator \(\mathcal {O}_1\). On the other hand, the interactions of \(\mathcal {R}^6\) and \(\mathcal {R}^\prime _{6}\) are described by a \(d=8\) operator of the SMEFT Lagrangian.

The independent structures contained in the two remaining RRLL operators, \(\mathcal {R}_7\) and \(\mathcal {R}_8\), read in the unitary gauge

$$\begin{aligned} \bar{u}^C_{R\alpha a}\,u_{R\beta b}\,\bar{d}^C_{L\gamma c}\,e_{L d}\varepsilon _{\alpha \beta \gamma },\qquad \bar{d}^C_{R\alpha a}\,d_{R\beta b}\,\bar{u}^C_{L\gamma c}\,\nu _{L d}\varepsilon _{\alpha \beta \gamma }, \end{aligned}$$
(19)

and they are non-vanishing only for the combinations antisymmetric in a and b. As a result, the number of independent flavour contractions for each of these operators is \(N_f^3(N_f-1)/2\).

The counting for the \(Q_{L}Q_{L}Q_{R}L_{R}\) (LLRR) operators is not fully analogous to that of the RRLL ones. The interactions in \(\mathcal {R}^9\) and \(\mathcal {R}^{10}\) are described by linear combinations of the operators \(\mathcal {O}_2\) and \(\mathcal {O}_5\) of the SMEFT Lagrangian, as in Eq. (13). The number of their flavour contractions is \(N_f^3(N_f+1)/2\) for each of them, in agreement with Refs. [12, 78]. Finally, the counting of the flavour contractions of \(\mathcal {R}_{11}\) and \(\mathcal {R}_{12}\) is analogous to the one for their RRLL counterparts, \(\mathcal {R}_7\) and \(\mathcal {R}_8\): \(N_f^3(N_f-1)/2\).

As a result of the previous discussion, the number of flavour contractions can be summarised as follows:

$$\begin{aligned} \begin{array}{ll} \mathcal {R}_1 \rightarrow N_f^2(2N_f^2+1)/3\\ \mathcal {R}_2 \rightarrow N_f^2(2N_f^2+1)/3\\ \mathcal {R}_3,\,\mathcal {R}^\prime _3 \rightarrow N_f^4\\ \mathcal {R}_4,\,\mathcal {R}^\prime _4 \rightarrow N_f^4\\ \mathcal {R}_5,\, \mathcal {R}^\prime _5 \rightarrow N_f^4\\ \mathcal {R}_6,\,\mathcal {R}^\prime _6 \rightarrow N_f^4 \end{array} \qquad \begin{array}{ll} &{}\mathcal {R}_7 \rightarrow N_f^3(N_f-1)/2\\ &{}\mathcal {R}_8 \rightarrow N_f^3(N_f-1)/2\\ &{}\mathcal {R}_9 \rightarrow N_f^3(N_f+1)/2\\ &{}\mathcal {R}_{10} \rightarrow N_f^3(N_f+1)/2\\ &{}\mathcal {R}_{11} \rightarrow N_f^3(N_f-1)/2\\ &{}\mathcal {R}_{12} \rightarrow N_f^3(N_f-1)/2.\\ \end{array} \end{aligned}$$
(20)

This analysis completes previous studies on the HEFT Lagrangian, which received much attention in the last years because of its relevance in collider searches. This paper provides, for the first time, the complete set of leading operators which are not invariant under baryon and lepton numbers, but do preserve the \(B-L\) combination.

A detailed comparison with the SMEFT Lagrangian is also presented, pointing out a strategy to distinguish between the two approaches. Finally, the Hilbert series technique, which has recently undergone a revival of interest, has been adopted to discuss the number of flavour independent contractions for a generic number of fermion families.