1 Introduction

The CDF collaboration has recently announced a new result of W-boson mass \(m_W|_{\textrm{CDF}}=80.4335\pm 0.0094\) GeV [1] which deviates from the standard model prediction \(m_W|_\textrm{SM}=80.357\pm 0.006\) GeV [2] at \(7\sigma \). Such a high precision measurement of W mass may be a significant indication for the new physics beyond the standard model.

On theoretical grounds, the CDF W-mass anomaly possibly originates from (i) a non-minimal Higgs sector that contains the standard model Higgs field and directly contributes to this mass deviation via relevant Higgs mechanism, (ii) tree level mixings of the standard model Z and even W bosons with new particles that cause the W mass as shifted, and/or (iii) loop-level quantum corrections due to the presence of new particles to gauge boson self-energies that modify the Peskin–Takeuchi parameters STU [3,4,5]. Several efforts have been made in the literature to provide possible solutions to this puzzle, see Refs. [6,7,8,9,10,11,12,13,14,15,16,17,18,19,20,21,22,23,24,25,26,27,28,29,30,31,32,33,34,35,36,37,38,39,40,41,42,43,44,45,46,47,48,49,50,51,52,53] for an incomplete list. In this work, we show that the model based upon \(SU(3)_C\otimes SU(3)_L\otimes U(1)_X\) (called 3-3-1) gauge symmetry [54,55,56,57,58,59] manifestly accommodates the CDF W-mass anomaly. The reason for this model choice is that all the effects dedicated above of the new physics are actually dictated by the 3-3-1 gauge principle, and thus they are very predictive.

One of the fundamental motivations for the 3-3-1 gauge symmetry extension is that it can address the number of fermion families naturally. Indeed, in the standard model, the number of fermion families on the theoretical ground is left arbitrarily. The reason may come from a fact that the gauge anomalies are cancelled out for every fermion family; no correlation between families is needed. This is due to the weak isospin symmetry with the relevant \(SU(2)_L\) trace \({\textrm{Tr}}[\{T_i,T_j\}T_k]=0\) for any fermion representation. The simplest extension of \(SU(2)_L\) to \(SU(3)_L\) yields the corresponding trace nontrivially, which does not vanish for complex representations. With enlargement of respective fermion representations under \(SU(3)_L\), each family now depends on anomaly. The \([SU(3)_L]^3\) anomaly vanishes if all the families must be taken into account, implying that the number of families is a multiple of color number (cf. [55]). The QCD asymptotic freedom demands that the number of families is smaller than or equal to five. It follows that the family number is just three, coinciding with experiment. Hence, the core of the 3-3-1 extension is to signify a higher weak isospin symmetry, \(SU(3)_L\), directly enlarged from \(SU(2)_L\). It is noted that the new \(U(1)_X\) group necessarily included ensures an algebraic closure between electric charge and \(SU(3)_L\), analogously to the standard model that requires the hypercharge. The remaining anomalies associated with \(U(1)_X\) vanish too. Interestingly, besides the fermion family number, this new gauge principle yields a potential solution to the profound questions of electric charge quantization [60,61,62,63,64], discrepancy of third quark family [65,66,67], and strong CP conservation [68,69,70,71]. Additionally, it naturally addresses the issues of neutrino mass generation [72,73,74,75,76,77,78,79,80,81,82,83,84,85,86,87], flavor physics [88,89,90,91,92,93,94], dark matter stability [95,96,97,98,99,100,101,102,103,104,105,106,107,108,109,110,111,112,113], as well as cosmic inflation and baryon asymmetry [114,115,116].

The 3-3-1 model can be classified, based upon the embedding of electric charge operator in the new gauge symmetry, say \(Q=T_3+\beta T_8+X\), through the \(\beta \) parameter, where \(T_j\) (\(j=1,2,3,\ldots ,8\)) and X are \(SU(3)_L\) and \(U(1)_X\) charges, respectively. Generally, the 3-3-1 model possesses a lepton triplet of form \((\nu _L, e_L, E^q_L)\) where E is some field with electric charge q, related to the \(\beta \) parameter as \(\beta =-(1+2q)/\sqrt{3}\). Notice that switching representations with conjugated representations of \(SU(3)_L\), e.g. \((\nu _L,e_L, E^q_L)\rightarrow (e_L,-\nu _L,E^q_L)\), changes \(\beta \rightarrow -\beta \) and leads to a version with rather similar phenomenology, including the W mass, thus not interpreted in this work. There are two typical variants of the 3-3-1 model as far as the lepton sectors are relevant. The minimal 3-3-1 model combines known leptons into a triplet \((\nu _L,e_L,e^c_R)\) for each family, thus \(\beta =-\sqrt{3}\) [54,55,56], whereas the 3-3-1 model with right-hand neutrinos introduces three right-handed neutrinos (\(\nu _R\)’s) to perform \((\nu _L,e_L,\nu ^c_R)\) for each family, thus \(\beta =-1/\sqrt{3}\) [57,58,59]. Particularly-interested theories that modify the minimal 3-3-1 model but keep \(\beta =-\sqrt{3}\) consist of the 3-3-1 model with exotic charged leptons [117], the reduced 3-3-1 model [118], and the simple 3-3-1 model [105]. Also, the theories that modify the 3-3-1 model with right-handed neutrinos but keep \(\beta =-1/\sqrt{3}\) include the economical 3-3-1 model [119, 120] and the 3-3-1 model with neutral (heavy) fermions [77,78,79, 99].Footnote 1 Alternatively, the 3-3-1 model without exotic charge proposes new copies of charged leptons, called heavy charged leptons, forming \((\nu _L, e_L, E^-_L)\) for each family, thus \(\beta =1/\sqrt{3}\) [122, 123]. And, the flipped 3-3-1 model presents a distinct arrangement for fermion representations with \(\beta =1/\sqrt{3}\) [124, 125]. From this point of view, the variants of the 3-3-1 model mainly differ in fermion content and Higgs sector, besides \(\beta \) that specifies the gauge spectrum. Although the fermion content is fixed by the anomaly cancelation, the QCD asymptotic freedom, and the electric charge embedding, the Higgs sector is actually arbitrary, having plenty of multiplets by contrast, as seen from the modified models of the typical 3-3-1 models. According to the existing 3-3-1 theories, each of which may possess two Higgs triplets, three Higgs triplets, three Higgs triplets plus one Higgs sextet, or even many Higgs multiplets if given a flavor symmetry.

It is noted that the same electric charge embedding, i.e. \(\beta \) (or q), cannot distinguish alternative particle contents and vacuum structures, e.g. the 3-3-1 model with right-handed neutrinos vs. the 3-3-1 model with neutral fermions, as well as unwanted Higgs vacuum alignments in these models. The behavior of baryon number minus lepton number (\(B-L\)) symmetry put forward in [104, 105, 107, 109] for this kind of the model may provide insight in signifying the alternative particle contents as well as classifying the nontrivial Higgs vacuum structures. Hence, the 3-3-1 model can be characterized by \(\beta \) (or q) and \(B-L\) behavior, which are important for analysing the sources of the W-mass shift. As a matter of fact, the 3-3-1 models possibly contain new neutral (\(Z',X\)) and charged (Y) gauge bosons presenting interesting mixing phenomena with usual WZ bosons, which along with the Higgs vacuum structures cause the W-mass shift at tree level. Additionally, the new non-Hermitian gauge vector doublet (XY) and inert Higgs multiplets presented in 3-3-1 models for neutrino mass and/or dark matter also contribute to this mass shift at loop level.

The rest of this work is organized as follows. In Sect. 2 we set up a generic 3-3-1 model in which relevant Higgs mechanism important for the W-mass shift is determined and classified by electric charge conservation and \(B-L\) behavior. In Sect. 3, we investigate various novel contributions of the model to the W-mass shift. A remark of scalar sextet contribution to the W-mass shift is given in Sect. 4. The extra important constraints for 3-3-1 model are discussed in Sect. 5. We make a conclusion in Sect. 6.

2 Description of the model

We first present the necessary features of the 3-3-1 model with arbitrary \(\beta \) embedding (or q charge) parameter. We then determine distinct gauge-boson mass spectra according to profiles of Higgs vacuum structures, which affect differently to the W-mass anomaly.

2.1 Particle content

The 3-3-1 gauge symmetry is given by

$$\begin{aligned} SU(3)_C\otimes SU(3)_L\otimes U(1)_X, \end{aligned}$$
(1)

where the first factor is the usual color group, while the last two are directly extended from the electroweak group, as mentioned. The decomposition scheme of the extended gauge sector into the usual gauge groups takes the form,

$$\begin{aligned} SU(3)_L\otimes U(1)_X\rightarrow & {} SU(2)_L\otimes U(1)_{T_8}\otimes U(1)_X\nonumber \\ {}\rightarrow & {} SU(2)_L\otimes U(1)_Y\rightarrow U(1)_Q, \end{aligned}$$
(2)

where \(T_j\,(j=1,2,3,\ldots ,8)\) and X stand for \(SU(3)_L\) and \(U(1)_X\) charges, respectively. Additionally, the hypercharge Y and the electric charge Q are embedded, respectively, as

$$\begin{aligned} Y\equiv & {} \beta T_8+X, \end{aligned}$$
(3)
$$\begin{aligned} Q\equiv & {} T_3+Y=T_3+\beta T_8+X. \end{aligned}$$
(4)

In other words, when the 3-3-1 symmetry is broken down to the standard model symmetry, the hypercharge is composed of the two new diagonal charges, \(T_8\) and X, as broken. When the electroweak symmetry is broken down to electromagnetic symmetry, the electric charge is composed of the third weak-isospin component and the hypercharge, as usual. The coefficient \(\beta \) is related to a basic charge parameter q that is the electric charge of the third component (E) of a lepton triplet, such as \(\beta = -(1+2q)/\sqrt{3}\). It is noted that the 3-3-1 model can possess variants that differ by corresponding \(\beta \) (or q) values, as imposed in Table 1.

Table 1 A roadmap for 3-3-1 versions and relevant vacuum structures. Minimal: The minimal 3-3-1 model and its variants for \(\beta =-{\sqrt{3}}\) (or \(q=1\)); RHNs: The 3-3-1 model with right-handed neutrinos and its variants for \(\beta =-1/{\sqrt{3}}\) (or \(q=0\)); HCLs: The 3-3-1 model with heavy charged leptons and its flipped 3-3-1 variant (see [124, 125] in detail) for \(\beta =1/{\sqrt{3}}\) (or \(q=-1\)); Generic: The generic 3-3-1 model whose \(\beta \) (or q) satisfies \(|\beta |<1.824\) (or \(-2.08<q<1.08\)), except for the previous cases. [Verify Sect. 5 for the \(\beta \) range and Landau pole.] A star “\(*\)” shows a vacuum expectation value viably for the corresponding scalar component. For \(q=0,-1\), the standard vacuum applies for \(N,E^{-}\) versions with \([B-L](N,E^{-})=0\), protected by \(P_{M}\), whereas the abnormal vacuum happens for \(\nu ^{c},E^{-}\) versions with \([B-L](\nu ^{c},E^{-})=1\), not protected by \(P_{M}\)

The fermion content transforms under the gauge symmetry in (1) as

$$\begin{aligned}{} & {} \psi _{aL} = \begin{pmatrix} \nu _{aL}\\ e_{aL}\\ E_{aL}\end{pmatrix}\sim (1,3,-1/3+q/3),\end{aligned}$$
(5)
$$\begin{aligned}{} & {} e_{aR} \sim (1,1,-1),\quad E_{aR}\sim (1,1,q),\end{aligned}$$
(6)
$$\begin{aligned}{} & {} Q_{\alpha L} = \begin{pmatrix} d_{\alpha L}\\ -u_{\alpha L}\\ J_{\alpha L} \end{pmatrix}\sim (3,3^*,-q/3),\end{aligned}$$
(7)
$$\begin{aligned}{} & {} Q_{3 L} = \begin{pmatrix} u_{3L}\\ d_{3L}\\ J_{3L}\end{pmatrix}\sim (3,3,1/3+q/3),\end{aligned}$$
(8)
$$\begin{aligned}{} & {} u_{a R} \sim (3,1,2/3),\quad d_{aR}\sim (3,1,-1/3),\end{aligned}$$
(9)
$$\begin{aligned}{} & {} J_{\alpha R} \sim (3,1,-1/3 -q),\quad J_{3R}\sim (3,1,2/3 +q), \end{aligned}$$
(10)

where \(a=1,2,3\) and \(\alpha =1,2\) are family indices. The new fields \(E_{a}\), \(J_{\alpha }\), and \(J_3\) have been introduced necessarily for completing the fermion representations and canceling all the anomalies. Notice that they possess electric charges, such as \(Q(E_a)=q\), \(Q(J_\alpha )=-1/3-q\), and \(Q(J_3)=2/3+q\). If \(q=1\), we achieve the 3-3-1 model with exotic charged leptons, and we denote \(P_a\equiv E^+_a\) [117]. If \(q=0\), the 3-3-1 model with neutral (heavy) fermions arises, and we define \(N_a\equiv E^0_a\) [121]. For the typical versions, the minimal 3-3-1 model [the 3-3-1 model with right-handed neutrinos] are obtained by replacing \(E_{aL}\) by \((e_{aR})^c\) [\((\nu _{aR})^c\), if imposed], whereas \(E_{aR}\) is suppressed, which possess \(q=1\) [\(q=0\)], respectively. This replacement does not apply for quarks, since the color and spacetime symmetries commute. Despite of the same q, the relevant models have alternative phenomenologies. Specially for \(q=0\), \(N_a\) may gain a charge \(B-L=0\) different from that of \(\nu _{aR}\), revealing a theory for dark matter [99, 107]. For \(q=-1\), we obtain the 3-3-1 model with heavy charged leptons \((E^-_a)\) [122, 123]. Interestingly, this version also implies dark matter stability if \(E^-_{a}\) have \(B-L=0\) different from that of the usual charged leptons (see, e.g., that in [109]). Along the line for \(q=-1\), the flipped 3-3-1 model puts all quark families in antitriplets, while two lepton families in triplets and the remaining lepton family in sextet, which differs from the above arrangement [124, 125]. This kind of the model has an extra chiral fermion triplet resided in the sextet, but it is highly degenerate in mass, negligibly contributing to W mass. The other sources that affect W mass are identical to the unflipped version with \(q=-1\). That said, it is able to collect all the viable lepton sectors (family and left-handed indices omitted, right-handed counterparts if viable are in singlet) in Table 1, while the corresponding quark sectors are not listed, since they have a common form differing only in electric charge for exotic quarks.

Three scalar triplets are generally introduced as

$$\begin{aligned} \rho= & {} \begin{pmatrix} \rho ^+_1\\ \rho ^0_2\\ \rho ^{q+1}_3 \end{pmatrix}\sim (1,3,2/3+q/3),\end{aligned}$$
(11)
$$\begin{aligned} \eta= & {} \begin{pmatrix} \eta ^0_1\\ \eta ^{-}_2\\ \eta ^q_3 \end{pmatrix}\sim (1,3,-1/3+q/3),\end{aligned}$$
(12)
$$\begin{aligned} \chi= & {} \begin{pmatrix} \chi ^{-q}_1\\ \chi ^{-q-1}_2\\ \chi ^0_3 \end{pmatrix}\sim (1,3,-1/3-2q/3), \end{aligned}$$
(13)

for which \(\chi \) breaks the 3-3-1 symmetry down to the standard model, giving mass for new particles, while \(\rho ,\eta \) break the standard model symmetry down to \(SU(3)_C\otimes U(1)_Q\), providing mass for ordinary particles. It is noted that one of the triplets \(\rho ,\eta \) may be excluded [105, 118]. By contrast, in flavor symmetry theories [77,78,79], a large amount of scalar triplets may be introduced. Furthermore, scalar sextets may also be added to the present content [56, 76,77,78,79]. However, the following investigation does not depend on such changes of scalar multiplet number; instead, it results from the vacuum structure of scalar fields. As shown in [126], it is sufficient to consider three triplets (given above) and one sextet, where in this work the sextet will be separately treated, without loss of generality. This vacuum structure depends on q as well as \(B-L\) behavior, studied below in order, along with implied gauge boson masses.

Concerning \(B-L\), it is stressed that \(E_a,J_a\) in fermion multiplets generically have \(B-L\) charge differently from that of the usual leptons and quarks, respectively. Let \([B-L](E_a)=n\). We obtain \(B-L={\textrm{diag}}(-1,-1,n)\) for lepton triplets, which neither commutes nor closes algebraically with \(SU(3)_L\). If \(B-L\) is conserved, an extra \(U(1)_{{\mathcal {N}}}\) group is required by symmetry principles such that \(B-L=\beta ' T_8+{\mathcal {N}}\), where \(\beta '=-2(1+n)/\sqrt{3}\). We achieve \([B-L](J_3)=n+4/3\), \([B-L](J_\alpha )=-n-2/3\), and \([B-L](X,Y,\chi _{1,2})=-[B-L](\eta _3,\rho _3)=-n-1\), while the rest of fields takes usual value. Since \(T_8\) is gauged, \(B-L\) and \({\mathcal {N}}\) must be gauged. We impose \(\nu _{aR}\) for cancelling \(U(1)_{{\mathcal {N}}}\) anomalies and a superheavy scalar singlet \(\phi _{B-L}\) with \(B-L=2\) that couples to \(\nu _{R}\nu _R\) and breaks \(U(1)_{{\mathcal {N}}}\). After symmetry breaking, the neutrinos gain a small mass via canonical seesaw, while there exists a residual matter parity \(P_M=(-1)^{3(B-L)+2s}\) not commuted with \(SU(3)_L\) [107, 109]. The 3-3-1 model with \(n=0\), as mentioned, possesses a nontrivial matter parity for new fields, \(E_a\), \(J_a\), \(\eta _3\), \(\rho _3\), \(\chi _{1,2}\), X, and Y, such as

$$\begin{aligned}{} & {} P_M \left[ \begin{array}{ccc} \nu _a &{} d_\alpha &{} u_3 \\ e_a &{} -u_\alpha &{} d_3 \\ E_a &{} J_\alpha &{} J_3 \end{array}\right] =\left[ \begin{array}{ccc} + &{} + &{} + \\ + &{} + &{} + \\ - &{} -&{} - \end{array}\right] ,\end{aligned}$$
(14)
$$\begin{aligned}{} & {} P_M \left[ \begin{array}{ccc} \rho ^+_1 &{} \eta ^0_1 &{} \chi ^{-q}_1 \\ \rho ^0_2 &{} \eta ^-_2 &{} \chi ^{-q-1}_2 \\ \rho ^{q+1}_3 &{} \eta ^{q}_3 &{} \chi ^{0}_3 \end{array}\right] =\left[ \begin{array}{ccc} + &{} + &{} - \\ + &{} + &{} - \\ - &{} -&{} + \end{array}\right] ,\end{aligned}$$
(15)
$$\begin{aligned}{} & {} P_M \left[ \begin{array}{ccc} (\gamma ,Z,Z') &{} W^+ &{} X^{-q} \\ W^- &{} (\gamma ,Z,Z') &{} Y^{-q-1} \\ X^q &{} Y^{q+1} &{} (\gamma ,Z,Z') \end{array}\right] =\left[ \begin{array}{ccc} + &{} + &{} - \\ + &{} + &{} - \\ - &{} -&{} + \end{array}\right] .\nonumber \\ \end{aligned}$$
(16)

Hence, in spite of \(q=0\) (\(q=-1\)), the relevant scalars \(\eta _3,\chi _1\) (\(\rho _3,\chi _2\)) cannot develop a vacuum expectation value (VEV) due to the matter parity conservation. Alternatively, the 3-3-1 model with \(n=1\), including the minimal 3-3-1 model, the 3-3-1 model with right-handed neutrinos, and even the 3-3-1 model with heavy charged leptons, transform trivially under \(P_M\), i.e. \(P_M=1\) for every field. In this case, the scalars \(\eta _3,\chi _1\) (\(\rho _3,\chi _2\)) if electrically neutral can develop a VEV, not protected by \(P_M\), actually governed by \(B-L\) (or \(P_M\)) violating interactions. With the aid of \(P_M\), a summary of possible vacuum structures for 3-3-1 variants is given in Table 1.

If the \(B-L\) symmetry is approximate, we avoid a \(U(1)_{{\mathcal {N}}}\) extension as given above. In this case, the interactions violating \(B-L\) enter, such as

$$\begin{aligned}{} & {} s^u_{3a} \bar{Q}_{3L}\chi u_{aR}+s^d_{\alpha a}\bar{Q}_{\alpha L}\chi ^* d_{aR} +s^J_{33} \bar{Q}_{3L}\eta J_{3R}\nonumber \\{} & {} \quad +s^J_{\alpha \beta }\bar{Q}_{\alpha L}\eta ^* J_{\beta R}+s^J_{3\alpha }\bar{Q}_{3L}\rho J_{\alpha R} + s^J_{\alpha 3} \bar{Q}_{\alpha L}\rho ^* J_{3R}\nonumber \\{} & {} \quad +\bar{\mu }^2\eta ^\dagger \chi + (\bar{\lambda }_1\eta ^\dagger \chi + \bar{\lambda }_2 \eta ^\dagger \eta \nonumber \\{} & {} \quad +\bar{\lambda }_3 \rho ^\dagger \rho +\bar{\lambda }_4 \chi ^\dagger \chi )\eta ^\dagger \chi +\bar{\lambda }_5 (\eta ^\dagger \rho ) (\rho ^\dagger \chi )+H.c., \end{aligned}$$
(17)

for the 3-3-1 model with right-handed neutrinos (\(q=0,n=1\)), besides the normal Yukawa couplings (h’s) and scalar self-couplings (\(\mu \)’s and \(\lambda \)’s) – as in the usual theory – which conserve \(B-L\). It is easily verified that all these violating couplings violate L by 2 units, except for \(\bar{\lambda }_1\) by 4 units, while preserve B and a lepton parity \((-1)^L\). Thus, the proton stability is protected by B and \((-1)^L\). Furthermore, the violating couplings must be small, e.g. \(\bar{\mu }\ll \mu \), \(\bar{\lambda }\ll \lambda \), and \(s\ll h\), since by contrast L-conservation sets \(\bar{\mu },\bar{\lambda },s=0\) but \(\mu ,\lambda ,h \ne 0\). In this case, the potential minimization gives L-violating VEVs, \(\langle \eta _3,\chi _1 \rangle \sim \bar{\mu }^2/w\), as suppressed, where w is the 3-3-1 breaking scale (cf. [127]). If discarding the unwanted coupling \(\psi _L\psi _L\rho \) by a symmetry \(\rho \rightarrow -\rho \), the neutrinos gain a naturally small mass via L-violating effective interaction, \(\frac{s'}{w}\psi _L \psi _L \eta \eta \), to be \(m_\nu \sim s' u^2/w\), doubly suppressed by \(s'\ll h\) and \(u\ll w\), where u is a weak scale. The presence of small \(\langle \eta _3,\chi _1 \rangle \) and s’s lead to a small mixing between usual quarks and exotic quarks causing flavor changing Z-currents, as studied below (Sect. 5).

2.2 Gauge spectrum for \(q\ne 0,-1\) or \(B-L\) conservation

For \(q\ne 0\) and \(q\ne -1\), the scalar components that are electrically neutral can develop VEVs, such as

$$\begin{aligned} \langle \rho \rangle = \frac{1}{\sqrt{2}} \left( \begin{array}{c} 0\\ v \\ 0 \end{array}\right) ,\quad \langle \eta \rangle = \frac{1}{\sqrt{2}}\left( \begin{array}{c} u \\ 0\\ 0 \end{array}\right) ,\quad \langle \chi \rangle = \frac{1}{\sqrt{2}} \left( \begin{array}{c} 0\\ 0\\ w \end{array}\right) . \end{aligned}$$
(18)

Since w breaks the 3-3-1 symmetry, while uv break the standard model symmetry, we impose \(w\gg v,u\) for consistency. This standard vacuum alignment has been extensively studied in the literature, even applying for the 3-3-1 model with arbitrary q (see Table 1). However, notice that for the model with \(q=0\) (\(q=-1\)), an extra symmetry such as \(B-L\) and its residual matter parity is needed to prevent the other neutral scalars \(\eta _3,\chi _1\) (\(\rho _3,\chi _2\)) from developing a VEV, ensuring the standard vacuum structure, as mentioned [107, 109]. Particularly for \(q=0\), this section applies for the 3-3-1 model with neutral (heavy) fermions, not for the 3-3-1 model with right-handed neutrinos.

The mass spectrum of the gauge bosons is given by

$$\begin{aligned} {\mathcal {L}} \supset \sum _{\Phi }(D^\mu \langle \Phi \rangle )^\dagger (D_\mu \langle \Phi \rangle ), \end{aligned}$$
(19)

where \(\Phi \) runs over the scalar triplets, and the covariant derivative is \(D_\mu = \partial _\mu + i g_s t_j G_{j\mu } + i g T_j A_{j\mu } + i g_X X B_\mu \). Here (\(g_s,g,g_X\)) and (\(G_j,A_j,B\)) are the gauge couplings and gauge bosons of 3-3-1 subgroups, respectively, and \(t_j\) is \(SU(3)_C\) charges.

Define non-Hermitian gauge bosons,

$$\begin{aligned} W^\pm = \frac{A_1\mp i A_2}{\sqrt{2}},\quad X^{\mp q} = \frac{A_4\mp i A_5}{\sqrt{2}},\quad Y^{\mp (1+q)} = \frac{A_6\mp i A_7}{\sqrt{2}}, \end{aligned}$$
(20)

which are coupled to the weight-raising and -lowering operators,

$$\begin{aligned} T_\pm =\frac{T_1\pm i T_2}{\sqrt{2}},\quad U_\pm =\frac{T_4\pm i T_5}{\sqrt{2}},\quad V_\pm =\frac{T_6\pm i T_7}{\sqrt{2}}, \end{aligned}$$
(21)

respectively. \(W^\pm \), \(X^{\mp q}\), and \(Y^{\mp (1+q)}\) are physical fields by themselves with masses

$$\begin{aligned} m^2_{W} = \frac{g^2}{4}(v^2+u^2),\quad m^2_X=\frac{g^2}{4}(w^2+u^2),\quad m^2_Y=\frac{g^2}{4}(w^2+v^2), \end{aligned}$$
(22)

respectively. W is identical to that of the standard model, while (XY) form a new, heavy gauge vector doublet with a mass splitting \(|m^2_Y-m^2_X|< m^2_W\).

Consider neutral gauge bosons. The photon field A that is coupled to the electric charge Q with coupling e is given by \(A/e=(A_3+\beta A_8)/g+B/g_X\), which is determined from \(Q=T_3+\beta T_8+X\) by substituting each generator with corresponding gauge field over coupling. This is a direct result of electric charge conservation, neither depending on VEVs nor necessarily diagonalizing the relevant mass matrix, as proved in [126]. The normalization of photon field implies \(s_W\equiv e/g=g_X/\sqrt{g^2+(1+\beta ^2)g_X^2}\), which matches the sine of the Weinberg angle in the standard model. Hence, the photon field A can be rewritten as

$$\begin{aligned} A = s_W A_3 + c_W \left( \beta t_W A_8 +\sqrt{1-\beta ^2 t^2_W}B\right) , \end{aligned}$$
(23)

where the expression in the parentheses is just the hypercharge field defined by (3). The standard model Z field is given orthogonally to A,

$$\begin{aligned} Z = c_W A_3 - s_W \left( \beta t_W A_8 +\sqrt{1-\beta ^2 t^2_W}B\right) , \end{aligned}$$
(24)

as usual, while a new \(Z'\) field is obtained orthogonally to the hypercharge field,

$$\begin{aligned} Z' = \sqrt{1-\beta ^2 t^2_W} A_8 - \beta t_W B. \end{aligned}$$
(25)

The photon A is massless and decoupled, as a physical field, while Z and \(Z'\) mix via a symmetric mass matrix with elements, given by

$$\begin{aligned} m^2_Z= & {} \frac{g^2}{4c^2_W} (v^2+u^2), \end{aligned}$$
(26)
$$\begin{aligned} m^2_{ZZ'}= & {} \frac{g^2\left[ (\sqrt{3}\beta t^2_W-1) v^2+(\sqrt{3}\beta t^2_W+1)u^2\right] }{4\sqrt{3} c_W\sqrt{1-\beta ^2 t^2_W}}, \end{aligned}$$
(27)
$$\begin{aligned} m^2_{Z'}= & {} \frac{g^2\left[ 4w^2+(\sqrt{3}\beta t^2_W-1)^2v^2+(\sqrt{3}\beta t^2_W+1)^2u^2\right] }{12(1-\beta ^2 t^2_W)}.\nonumber \\ \end{aligned}$$
(28)

Diagonalizing the Z-\(Z'\) mass matrix, we obtain two physical fields,

$$\begin{aligned} Z_1=c_\varphi Z - s_\varphi Z',\quad Z_2=s_\varphi Z + c_\varphi Z', \end{aligned}$$
(29)

where the Z-\(Z'\) mixing angle (\(\varphi \)) and \(Z_{1,2}\) masses are

$$\begin{aligned} t_{2\varphi }= & {} \frac{2m^2_{ZZ'}}{m^2_{Z'}-m^2_Z}\nonumber \\\simeq & {} \frac{\sqrt{3(1-\beta ^2 t^2_W)}\left[ (\sqrt{3}\beta t^2_W-1)v^2+(\sqrt{3}\beta t^2_W+1)u^2\right] }{2w^2 c_W}, \end{aligned}$$
(30)
$$\begin{aligned} m^2_ {Z_1}= & {} \frac{1}{2} \left[ m^2_Z +m^2_{Z'} - \sqrt{(m^2_Z - m^2_{Z'})^2 + 4m^4_ {ZZ'} }\right] \simeq m^2_Z-\frac{m^4_{ZZ'}}{m^2_{Z'}}\nonumber \\\simeq & {} \frac{g^2}{4c^2_W}\left\{ v^2+u^2-\frac{\left[ (\sqrt{3}\beta t^2_W-1)v^2+(\sqrt{3}\beta t^2_W+1)u^2\right] ^2}{4w^2}\right\} ,\nonumber \\ \end{aligned}$$
(31)
$$\begin{aligned} m^2_ {Z_2}= & {} \frac{1}{2} \left[ m^2_Z +m^2_{Z'} + \sqrt{(m^2_Z - m^2_{Z'})^2 + 4m^4_ {ZZ'} }\right] \simeq m^2_{Z'}. \end{aligned}$$
(32)

The mixing angle \(\varphi \) is small, suppressed by \((v,u)^2/w^2\). Additionally, \(Z_1\) has a mass approximating that of Z, called the standard model Z-like boson, while \(Z_2\) is a new, heavy gauge boson with mass proportional to w.

2.3 Gauge spectrum for \(q= 0,-1\) and \(B-L\) violation

For \(q=0\) (\(q=-1\)), the other scalars \(\eta _3,\chi _1\) (\(\rho _3,\chi _2\)) are electrically neutral and possibly develop a VEV in addition to the VEVs uvw, given above [80, 116, 120]. In this case, the \(B-L\) symmetry as mentioned is necessarily violated [104, 105], since \(\eta _3,\chi _1\) (\(\rho _3,\chi _2\)) have nonzero \(B-L\) number. This yields a vacuum structure, called abnormal vacuum, different from the previous model (see Table 1). Particularly for \(q=0\), this section applies for the 3-3-1 model with right-handed neutrinos, not for the 3-3-1 model with neutral (heavy) fermions. Because the physics obtained is the same independent of \(q=0\) or \(-1\) for phenomena interested in this work, we consider only \(q=0\), thus \(\beta =-1/\sqrt{3}\). In this case, although both \(\eta \) and \(\chi \) transform the same under the gauge symmetry, they differ in \(B-L\) numbers, possibly obtaining VEVs at the first and third components. That said, the vacuum alignment under consideration is generically given by

$$\begin{aligned} \langle \rho \rangle = \frac{1}{\sqrt{2}} \left( \begin{array}{c} 0\\ v \\ 0 \end{array}\right) ,\quad \langle \eta \rangle = \frac{1}{\sqrt{2}}\left( \begin{array}{c} u \\ 0\\ w' \end{array}\right) ,\quad \langle \chi \rangle = \frac{1}{\sqrt{2}} \left( \begin{array}{c} u' \\ 0\\ w \end{array}\right) . \end{aligned}$$
(33)

In contrast to u, v, and w that conserve \(B-L\), the remaining VEVs \(u',w'\) break this number, suppressed by relevant violation interactions. To be consistent with the standard model, we impose \(u'\ll u\) and \(w'\ll w\), in addition to \(u,v\ll w\) [76].

Substituting the VEVs in (33) to the Lagrangian, we get

$$\begin{aligned} {\mathcal {L}}&\supset \begin{pmatrix} W^-&Y^- \end{pmatrix} M^2_c \begin{pmatrix} W^+&Y^+ \end{pmatrix}^T+\frac{g^2}{8}(w^2+u^2+w'^2+u'^2)A^2_5\nonumber \\&+\frac{1}{2}\begin{pmatrix}A_3&A_8&B&A_4\end{pmatrix}M^2_0 \begin{pmatrix}A_3&A_8&B&A_4\end{pmatrix}^T, \end{aligned}$$
(34)

where \(W^{\pm }=(A_1\mp iA_2)/\sqrt{2}\) and \(Y^{\pm }=(A_6\pm iA_7)/\sqrt{2}\) defined as before mix, while the real and imaginary parts of \(X^{0,0*}=(A_4\mp i A_5)/\sqrt{2}\) behave differently, i.e. \(A_4\) mixes with \(A_{3,8}\) and B, whereas \(A_5\) does not. That said, \(A_5\) is decoupled, as physical field, with mass,

$$\begin{aligned} m^2_{A_5} = \frac{g^2}{4}(w^2+u^2+w'^2+u'^2). \end{aligned}$$
(35)

The mass matrices of the charged and neutral gauge bosons are given, respectively, by

$$\begin{aligned} M^2_c= & {} \frac{g^2}{4}\left( \begin{array}{cc}v^2+u^2+u'^2 &{} w u'+uw' \\ wu'+uw' &{} w^2+v^2+w'^2\end{array} \right) ,\end{aligned}$$
(36)
$$\begin{aligned} M^2_0= & {} \frac{g^2}{4}\left( \begin{array}{cccc} v^2+u^2+u'^2 &{} \frac{u^2-v^2+u'^2}{\sqrt{3}} &{} -\frac{2(2v^2+u^2+u'^2) t_X}{3} &{} wu'+uw' \\ \frac{u^2-v^2+u'^2}{\sqrt{3}} &{} \frac{4w^2+v^2+u^2+4w'^2+u'^2}{3} &{} \frac{2[2(w^2+v^2+w'^2)-u^2-u'^2]t_X}{3\sqrt{3}} &{} -\frac{wu'+uw'}{\sqrt{3}}\\ -\frac{2(2v^2+u^2+u'^2)t_X}{3} &{} \frac{2[2(w^2+v^2+w'^2)-u^2-u'^2]t_X}{3\sqrt{3}} &{} \frac{4(w^2+4v^2+u^2+w'^2+u'^2)t_X^2}{9} &{} -\frac{4(wu'+uw')t_X}{3}\\ wu'+uw' &{} -\frac{wu'+uw'}{\sqrt{3}} &{} -\frac{4(wu'+uw')t_X}{3} &{} \frac{4}{g^2}m^2_{A_5} \end{array} \right) , \end{aligned}$$
(37)

where \(t_X=g_X/g\). It is noteworthy that both the mixing of W and Y and the mixing of (\(A_3\), \(A_8\), B) and \(A_4\) are caused by \(u',w'\).

Diagonalizing \(M^2_c\), we obtain two physical fields

$$\begin{aligned} W_1=c_\theta W - s_\theta Y,\quad Y_1 = s_\theta W + c_\theta Y, \end{aligned}$$
(38)

where the W-Y mixing angle (\(\theta \)) is given by

$$\begin{aligned} t_{2\theta } = \frac{2(w u'+uw')}{w^2-u^2+w'^2-u'^2}, \end{aligned}$$
(39)

which implies that \(\theta \simeq u'/w+(u/w)(w'/w)\) is very small, because of \(u'\ll u\) and \(u,w'\ll w\). The \(W_1,Y_1\) masses are

$$\begin{aligned} m^2_{W_1}= & {} \frac{g^2}{8}\Big [w^2+2v^2+u^2+w'^2+u'^2\nonumber \\ {}{} & {} -\sqrt{(w^2-u^2+w'^2-u'^2)^2+4(wu'+uw')^2}\Big ]\nonumber \\\simeq & {} \frac{g^2}{4}\left( v^2+u^2-\frac{2uu'w'}{w}-\frac{u^2w'^2}{w^2}\right) ,\end{aligned}$$
(40)
$$\begin{aligned} m^2_{Y_1}= & {} \frac{g^2}{8}\Big [w^2+2v^2+u^2+w'^2+u'^2\nonumber \\ {}{} & {} +\sqrt{(w^2-u^2+w'^2-u'^2)^2+4(wu'+uw')^2}\Big ]\nonumber \\\simeq & {} \frac{g^2}{4}\left( w^2+v^2+w'^2+u'^2\right) . \end{aligned}$$
(41)

\(W_1\) is the standard model W-like boson, whereas \(Y_1\) is a new, heavy charged gauge boson.

To diagonalize \(M^2_0\), we define the photon A, the usual Z, and the new \(Z'\) as in the previous model, but for \(\beta =-1/\sqrt{3}\), such as

$$\begin{aligned} A= & {} s_W A_3 + c_W \left( -\frac{t_W}{\sqrt{3}} A_8 +\sqrt{1-\frac{t^2_W}{3}}B\right) , \end{aligned}$$
(42)
$$\begin{aligned} Z= & {} c_W A_3 - s_W \left( -\frac{t_W}{\sqrt{3}} A_8 +\sqrt{1-\frac{t^2_W}{3}}B\right) , \end{aligned}$$
(43)
$$\begin{aligned} Z^\prime= & {} \sqrt{1-\frac{t^2_W}{3}} A_8 + \frac{t_W}{\sqrt{3}} B, \end{aligned}$$
(44)

where \(s_W=\sqrt{3} g_X/\sqrt{3g^2+4g_X^2}\). In the new basis (A, Z, \(Z'\), \(A_4\)), the photon A is massless and decoupled, as usual, while Z, \(Z'\), and \(A_4\) mix by themselves via the mass matrix,

$$\begin{aligned} M^{\prime 2}_0 \!=\! \frac{g^2}{4}\!\left( \!\!\begin{array}{ccc} \frac{v^2+u^2+u'^2}{c^2_W} &{} \frac{(u^2+u'^2)c_{2W}-v^2}{c^2_W\sqrt{1+2c_{2W}}} &{} \frac{wu'+uw'}{c_W}\\ \frac{(u^2+u'^2)c_{2W}-v^2}{c^2_W\sqrt{1+2c_{2W}}} &{} \frac{4(w^2+w'^2)c^4_W+v^2+(u^2+u'^2)c^2_{2W}}{c^2_W(1+2c_{2W})} &{} -\frac{wu'+uw'}{c_W\sqrt{1+2c_{2W}}}\\ \frac{wu'+uw'}{c_W} &{} -\frac{wu'+uw'}{c_W\sqrt{1+2c_{2W}}} &{} \frac{4}{g^2}m^2_{A_5} \end{array} \!\!\right) \!. \end{aligned}$$
(45)

It is easily verified that \(M^{\prime 2}_0\) (thus \(M^2_0\)) contains an exact eigenvalue,

$$\begin{aligned} m^2_{{\mathcal {A}}_4} = \frac{g^2}{4}(w^2+u^2+w'^2+u'^2), \end{aligned}$$
(46)

with a corresponding exact eigenstate,

$$\begin{aligned} {\mathcal {A}}_4 = s_{\theta '}Z+c_{\theta '}\left( t_{\theta '} \sqrt{3-4s^2_{W}}Z'+\sqrt{1-t^2_{\theta '}(3-4s^2_{W})}A_4\right) , \end{aligned}$$
(47)

where \(s_{\theta '} = s_{2\theta }/2c_W\) is very small as \(\theta \) is, hence \({\mathcal {A}}_4\simeq A_4\). It is noteworthy that \({\mathcal {A}}_4\) and \(A_5\) always have equal masses.Footnote 2 Hence, we identify

$$\begin{aligned} X^{0,0*}_1 =\frac{{\mathcal {A}}_4\mp iA_5}{\sqrt{2}} \end{aligned}$$
(48)

to be a physical non-Hermitian field, with the common mass,

$$\begin{aligned} m^2_{X_1}=\frac{g^2}{4}(w^2+u^2+w'^2+u'^2). \end{aligned}$$
(49)

To diagonalize \(M^{\prime 2}_0\), we choose two gauge vectors orthogonally to \({\mathcal {A}}_4\), such as

$$\begin{aligned} {\mathcal {Z}}= & {} c_{\theta '}Z-s_{\theta '}\left( t_{\theta '}\sqrt{3-4s^2_{W}} Z'+\sqrt{1-t^2_{\theta '}(3-4s^2_{W})}A_4\right) ,\nonumber \\ \end{aligned}$$
(50)
$$\begin{aligned} {\mathcal {Z}}'= & {} \sqrt{1-t^2_{\theta '}(3-4s^2_{W})}Z'-t_{\theta '}\sqrt{3-4s^2_{W}}A_4, \end{aligned}$$
(51)

where \({\mathcal {Z}}\simeq Z\) and \({\mathcal {Z}}'\simeq Z'\) similar to \({\mathcal {A}}_4\simeq A_4\), since \(|\theta '|\ll 1\). In the new basis (\({\mathcal {Z}}\), \({\mathcal {Z}}'\), \({\mathcal {A}}_4\)), the field \({\mathcal {A}}_4\) is decoupled, while \({\mathcal {Z}}\) and \({\mathcal {Z}}'\) mix by themselves via a symmetric mass matrix with elements, given by

$$\begin{aligned} m^2_{{\mathcal {Z}}}= & {} \frac{g^2[4v^2+(u^2+u'^2)(3+c^2_{2\theta })-(w^2+w'^2)s^2_{2\theta }]}{4(4c^2_W-s_{2\theta }^2)},\end{aligned}$$
(52)
$$\begin{aligned} m^2_{{\mathcal {Z}}{\mathcal {Z}}'}= & {} \frac{g^2\{[(u^2+u'^2)c_{2W}-v^2]c_{2\theta }-(wu'+uw')(1+2c_{2W})s_\theta c_\theta \}}{\sqrt{1+2c_{2W}}(4c^2_W-s_{2\theta }^2)}, \nonumber \\\end{aligned}$$
(53)
$$\begin{aligned} m^2_{{\mathcal {Z}}'}= & {} \frac{g^2[(w^2\!+\!w'^2)(16c^4_W\!-\!s^2_{2\theta })\!+\!4v^2c^2_{2\theta }\!+\!(u^2\!+\!u'^2) (4c^2_{2W}\!-\!s^2_{2\theta })]}{4(1\!+\!2c_{2W})(4c^2_W\!-\!s^2_{2\theta })}.\nonumber \\ \end{aligned}$$
(54)
Fig. 1
figure 1

Viable (vw) regimes bounded by the CDF W-boson mass at 1\(\sigma \), 2\(\sigma \), and \(3\sigma \) ranges (in cyans), according to the 3-3-1 model with \(\beta =1/\sqrt{3}\) (left panel), \(\beta =-1/\sqrt{3}\) (middle panel), and \(\beta =-\sqrt{3}\) (right panel), where the FCNC (lower light-red) and Landau pole (upper light-red, if applied) bounds are also included

Diagonalizing this mass matrix, we obtain physical fields,

$$\begin{aligned} Z_1=c_\varphi {\mathcal {Z}} - s_\varphi {\mathcal {Z}}',\quad Z_2=s_\varphi {\mathcal {Z}} + c_\varphi {\mathcal {Z}}', \end{aligned}$$
(55)

where the \({\mathcal {Z}}\)-\({\mathcal {Z}}'\) mixing angle (\(\varphi \)) and \(Z_1,Z_2\) masses are

$$\begin{aligned} t_{2\varphi }= & {} \frac{2m^2_{{\mathcal {Z}}{\mathcal {Z}}'}}{m^2_{{\mathcal {Z}}'}-m^2_{{\mathcal {Z}}}}\simeq \frac{\sqrt{3-4s^2_W}(u^2c_{2W}-v^2)}{2w^2c^4_W}, \end{aligned}$$
(56)
$$\begin{aligned} m^2_ {Z_1}= & {} \frac{1}{2} \left[ m^2_{{\mathcal {Z}}} +m^2_{{\mathcal {Z}}'} - \sqrt{(m^2_{{\mathcal {Z}}} - m^2_{{\mathcal {Z}}'})^2 + 4m^4_ {{\mathcal {Z}}{\mathcal {Z}}'} }\right] \simeq m^2_{\mathcal {Z}}-\frac{m^4_{{\mathcal {Z}}{\mathcal {Z}}'}}{m^2_{{\mathcal {Z}}'}}\nonumber \\\simeq & {} \frac{g^2}{4c^2_W}\left[ v^2+u^2-\frac{2uu'w'}{w}-\frac{u^2w'^2}{w^2}-\frac{(u^2c_{2W}-v^2)^2}{4c^4_W w^2}\right] , \end{aligned}$$
(57)
$$\begin{aligned} m^2_ {Z_2}= & {} \frac{1}{2} \left[ m^2_{{\mathcal {Z}}} +m^2_{{\mathcal {Z}}'} + \sqrt{(m^2_{{\mathcal {Z}}} - m^2_{{\mathcal {Z}}'})^2 + 4m^4_ {{\mathcal {Z}}{\mathcal {Z}}'} }\right] \simeq m^2_{{\mathcal {Z}}'}. \end{aligned}$$
(58)

The \(\varphi \) angle is small, suppressed by \((u,v)^2/w^2\). Additionally, the field \(Z_1\) is the standard model Z-like boson, while \(Z_2\) is a new, heavy gauge boson with mass at w.

3 Sources of the W-mass shift

The W-mass shift in the 3-3-1 model arises from various new physics sources, Z-\(Z'\) mixing, X-Y and Z-\(Z'\)-X mixings, a gauge vector doublet as well as a new Higgs doublet that are not degenerate in mass. They are newly recognized, depending on relevant 3-3-1 model.

Let us remind the reader that as already done in the literature of electroweak precision fit [6, 10, 11, 13, 14, 18, 20, 22, 31], a positive and dominant contribution of the Peskin-Takeuchi T-parameter can generate an enhancement of the W-boson mass consistent with the recent CDF measurement.

3.1 Z-\(Z'\) mixing in the model with \(q\ne 0,-1\) or \(B-L\) conservation

The 3-3-1 model under consideration reveals a tree-level mixing of Z with \(Z'\), while W is retained. Because of the Z-\(Z'\) mixing, the observed \(Z_1\)-boson mass (\(m_{Z_1}\)) is reduced in comparison with the standard model Z-boson mass (\(m_Z\)). This gives rise to a positive contribution to the T parameter at tree level,

$$\begin{aligned} \alpha T= & {} \rho -1=\frac{m^2_W}{c^2_W m^2_{Z_1}}-1=\frac{m^2_Z}{m^2_{Z_1}}-1\nonumber \\\simeq & {} \frac{\left[ (\sqrt{3}\beta t^2_W-1)v^2+(\sqrt{3}\beta t^2_W+1)u^2\right] ^2}{4w^2(v^2+u^2)}, \end{aligned}$$
(59)

where note that \(m_W=m_Z c_W\) [128]. Since \(m_{Z_1}\) is precisely measured and fixed, this enhances the mass of W boson proportionally to \(\alpha T\), such as [3,4,5]

$$\begin{aligned} \Delta m^2_W= & {} \frac{c^4_W m^2_Z}{c^2_W-s^2_W}\left( \frac{m^2_Z}{m_{Z_1}^2}-1\right) \nonumber \\\simeq & {} \frac{c^4_W m^2_Z}{c^2_W-s^2_W}\frac{\left[ (\sqrt{3}\beta t^2_W-1)v^2+(\sqrt{3}\beta t^2_W+1)u^2\right] ^2}{4w^2(v^2+u^2)}.\nonumber \\ \end{aligned}$$
(60)

We input the parameters as \(\alpha \simeq 1/127.955\), \(s_W^2\simeq 0.231\), and \(u^2+v^2=246^2\) GeV\(^2\). Additionally, we consider the three 3-3-1 models according to \(\beta =1/\sqrt{3}\), \(\beta =-1/\sqrt{3}\), and \(\beta =-\sqrt{3}\), where the last two are identical to the 3-3-1 model with neutral (heavy) fermions and the minimal 3-3-1 model, respectively. We make a contour of \(\Delta m^2_{W} = 80.4335^2 - 80.357^2\) GeV\(^2\) taking central values [1, 2] as function of v and w for the mentioned models as in Fig. 1. Here, the black line is for the central value. Additionally, the \(1\sigma , \, 2\sigma \), and \(3\sigma \) ranges are also shown (in cyans). Besides, the excluded region (light red) by the FCNCs and the Landau pole limit if it applies (taken as 5 TeV) have appropriately been included to plot (cf. Sect. 5).

From Fig. 1, we obtain the viable ranges for the new physics and weak scales, namely, (3.9 TeV \(<w<4.4\) TeV and 0 GeV \(<v<65.7\) GeV), (3.9 TeV \(<w<4.4\) TeV and 237.1 GeV \(<v<246\) GeV), and (3.9 TeV \(<w<5\) TeV and 195.2 GeV \(<v<219.1\) GeV) according to the 3-3-1 model with \(\beta =1/\sqrt{3}\), \(\beta =-1/\sqrt{3}\), and \(\beta =-\sqrt{3}\), respectively. Note that the weak scale u depends on v as \(u=\sqrt{(246~\text {GeV})^2-v^2}\).

3.2 W-Y and Z-\(Z'\)-X mixings in the model with \(q=0\) and \(B-L\) violation

It is clear that when \(q=0\), thus \(\beta =-1/\sqrt{3}\), the scalar triplets, \(\eta \) and \(\chi \), in the 3-3-1 model with right-handed neutrinos have two electrically-neutral entries at the top and bottom and they can all develop VEVs. However, the VEVs \(u'\),\(w'\) violate lepton number and should be small, i.e. \(u'\ll u\) and \(w'\ll w\), as given. These \(u',w'\) cause the mixings between the usual W boson and new charged gauge boson Y as well as among \(Z,Z'\) and new neutral gauge boson X. Whereas, the normal VEVs uvw induce Z\(Z'\) mixing similarly to the previous model.

Because of the mixings, both physical masses of W and Z bosons, i.e. \(W_1\) and \(Z_1\) respectively, are reduced in comparison with the standard model values, such as

$$\begin{aligned} m^2_W= & {} \frac{g^2}{4}(v^2+u^2+u'^2)\rightarrow m^2_{W_1}\nonumber \\\simeq & {} \frac{g^2}{4}\left( v^2+u^2-\frac{2uu'w'}{w}-\frac{u^2w'^2}{w^2}\right) , \end{aligned}$$
(61)
$$\begin{aligned} m^2_Z= & {} \frac{g^2}{4c^2_W}(v^2+u^2+u'^2)\rightarrow m^2_{Z_1}\nonumber \\\simeq & {} \frac{g^2}{4c^2_W}\left[ v^2+u^2-\frac{2uu'w'}{w}-\frac{u^2w'^2}{w^2}-\frac{(u^2c_{2W}-v^2)^2}{4c^4_W w^2}\right] ,\nonumber \\ \end{aligned}$$
(62)

a phenomenon occurs similarly to the economical 3-3-1 model [120]. New observation is that the reduction of Z mass is bigger than that of W mass. Correspondingly, this produces a \(\rho \)-parameter bigger than 1, causing the CDF W-mass shift, as measured.

That said, from (61) and (62) we obtain a positive contribution to the T parameter at tree level, such as

$$\begin{aligned} \alpha T=\rho -1=\frac{m^2_{W_1}}{c^2_W m^2_{Z_1}}-1\simeq \frac{(u^2c_{2W}-v^2)^2}{4c^4_W(u^2+v^2)w^2}. \end{aligned}$$
(63)

It is noteworthy that the three terms that do change the WZ masses in (61) and (62) are at the same order, because of \(u'/u\sim w'/w\sim (u,v)/w\). Additionally, the first two of these terms caused by \(u',w'\) come from WY and \((Z,Z')\)X mixings, while the last term suppressed by \((u,v)^2/w^2\) arises from the Z\(Z'\) mixing. Although both kinds of the mixings reduce the WZ masses, only the Z\(Z'\) mixing governs the \(\rho \)-parameter deviation which subsequently affects the W mass as shifted, similar to the previous model.

That said, the enhancement of the W-boson mass is simply given by [3,4,5],

$$\begin{aligned} \Delta m^2_W= & {} \frac{c^4_W m^2_Z}{c^2_W-s^2_W}\left( \frac{m^2_{W_1}}{c^2_W m^2_{Z_1}}-1\right) \simeq \frac{c^4_W m^2_Z}{c^2_W-s^2_W}\frac{(u^2c_{2W}-v^2)^2}{4c^4_W(u^2+v^2)w^2}.\nonumber \\ \end{aligned}$$
(64)

The result in the previous sector for \(\beta =-1/\sqrt{3}\) applies to this model without change, however. Although the phenomenologies of the two mentioned models are distinct, characterized by \(B-L\) conservation or violation, the Z\(Z'\) mixing is crucial to set the CDF W-mass anomaly. The mixing effects caused by \(u',w'\) effectively not contributing to the W-mass shift as observed are probably due to the fact that \(u',w'\) break lepton number, associated with Majoron fields that are eaten by the corresponding non-Hermitian gauge bosons (XY). Hence, \(u',w'\) only affect the XY observables.

3.3 Oblique contributions of the non-degenerate vector doublet

At one-loop level, the two new non-Hermitian gauge bosons predicted by the 3-3-1 model \(X^{\pm q}\) and \(Y^{\pm (1+q)}\), which form an \(SU(2)_L\) doublet according to the decomposition of \(SU(3)_L\) gauge adjoint \(8=3\oplus 2\oplus 2^*\oplus 1\), contribute to the oblique parameters S, T, and U, through transverse self-energies. The W-boson mass shift induced by these oblique corrections can be expressed as follows [3,4,5]

$$\begin{aligned} \Delta m^2_{W} = \frac{c_W^2 m_Z^2\alpha }{c_W^2-s_W^2}\left( -\frac{S}{2}+c_W^2 T+\frac{c_W^2-s_W^2}{4s_W^2}U\right) . \end{aligned}$$
(65)
Fig. 2
figure 2

S, T,  and U parameters plotted as functions of v, where the upper panels (the lower panels) correspond to the 3-3-1 model with \(\beta =-1/\sqrt{3}\) (\(\beta =-\sqrt{3}\)), and w is free to float in the range 3.9–10 TeV (3.9–5 TeV), as appropriate

Fig. 3
figure 3

Contribution of the non-degenerate gauge vector doublet to the W mass shift

For the 3-3-1 model with right-handed neutrinos, i.e. \(\beta =-1/\sqrt{3}\), the oblique corrections have already been computed, given by [129],

$$\begin{aligned} S= & {} \frac{1}{4\pi }\left[ 5\ln \frac{m^2_Y}{m^2_X}+\left( 5+ \frac{4m^2_Y}{3m^2_Z}\right) \bar{F}_0(m^2_Z,m_Y,m_Y)\right. \nonumber \\{} & {} \left. -\left( 1+\frac{4m^2_X}{3m^2_Z}\right) \bar{F}_0(m^2_Z,m_X,m_X) \right] , \end{aligned}$$
(66)
$$\begin{aligned} T= & {} \frac{3\sqrt{2}G_F}{16\pi ^2\alpha }\left( m^2_Y+m^2_X-\frac{2m^2_Y m^2_X}{m^2_Y-m^2_X}\ln \frac{m^2_Y}{m^2_X}\right) \nonumber \\{} & {} +\frac{1}{4\pi s^2_W}\left( \frac{m^2_Y+m^2_X}{m^2_Y-m^2_X}\ln \frac{m^2_Y}{m^2_X}-2+ t^2_W\ln \frac{m^2_Y}{m^2_X}\right) , \end{aligned}$$
(67)
$$\begin{aligned} U= & {} \frac{1}{\pi }\left[ \left( \frac{3}{4} +\frac{m^2_Y}{m^2_Z}+2s^2_W\right) \bar{F}_0(m^2_Z,m_Y,m_Y)\right. \nonumber \\{} & {} \left. +\left( \frac{3}{4} +\frac{m^2_X}{m^2_Z}\right) \bar{F}_0(m^2_Z,m_X,m_X)\right. \nonumber \\{} & {} - \left( \frac{3}{2}+\frac{m^2_Y+m^2_X}{m^2_W}-\frac{(m^2_Y-m^2_X)^2}{2m^4_W}\right) \bar{F}_0(m^2_W,m_Y,m_X)\nonumber \\{} & {} -\left( \frac{m^2_Y+m^2_X}{2m^2_W}+\frac{5(m^2_Y-m^2_X)^2}{2m^4_W}+2\right) \bar{F}_0(0,m_Y,m_X)\nonumber \\{} & {} -\frac{5(m^2_Y+m^2_X)}{4m^2_W}\nonumber \\{} & {} -\left. \frac{2(m^2_Y-m^2_X)^2}{m^4_W}-\left( \frac{m^2_Y-m^2_X}{2m^2_W}-\frac{m^4_Y-m^4_X}{m^4_W}\right. \right. \nonumber \\{} & {} \left. \left. -\frac{5(m^4_Y+m^4_X)}{4m^2_W(m^2_Y-m^2_X)}\right) \ln \frac{m^2_Y}{m^2_X} \right] , \end{aligned}$$
(68)

while for the minimal 3-3-1 model, i.e. \(\beta =-\sqrt{3}\), they are [130, 131]

$$\begin{aligned} S= & {} \frac{3}{\pi }\left[ \frac{7}{4} \ln \frac{m^2_Y}{m^2_X}+\left( \frac{3}{4}+\frac{2(1+2s^2_W)}{3}-\frac{m^2_Y}{m^2_Z}\right) \bar{F}_0(m^2_Z,m_Y,m_Y)\right. \nonumber \\{} & {} -\left. \left( \frac{3}{4}+\frac{1-4s^2_W}{3}-\frac{m^2_X}{m^2_Z}\right) \bar{F}_0(m^2_Z,m_X,m_X) \right] ,\end{aligned}$$
(69)
$$\begin{aligned} T= & {} \frac{3\sqrt{2}G_F}{16\pi ^2\alpha }\left( m^2_Y+m^2_X-\frac{2m^2_Y m^2_X}{m^2_Y-m^2_X}\ln \frac{m^2_Y}{m^2_X}\right) \nonumber \\{} & {} +\frac{1}{4\pi s^2_W}\left( \frac{m^2_Y+m^2_X}{m^2_Y-m^2_X}\ln \frac{m^2_Y}{m^2_X}-2+3 t^2_W\ln \frac{m^2_Y}{m^2_X}\right) ,\end{aligned}$$
(70)
$$\begin{aligned} U= & {} \frac{1}{\pi }\left[ \left( \frac{3}{4} -\frac{2m^2_Y}{m^2_Z}+4s^2_W\right) \bar{F}_0(m^2_Z,m_Y,m_Y)\right. \nonumber \\{} & {} \left. +\left( \frac{3}{4} -\frac{2m^2_X}{m^2_Z}-2s^2_W\right) \bar{F}_0(m^2_Z,m_X,m_X)\right. \nonumber \\{} & {} - \left( \frac{3}{2}-\frac{2(m^2_Y+m^2_X)}{m^2_W}-\frac{(m^2_Y-m^2_X)^2}{2m^4_W}\right) \bar{F}_0(m^2_W,m_Y,m_X)\nonumber \\{} & {} +\left( \frac{3(m^2_Y+m^2_X)}{2m^2_W}+\frac{3(m^2_Y-m^2_X)^2}{2m^4_W}-2\right) \bar{F}_0(0,m_Y,m_X)\nonumber \\{} & {} +\frac{3(m^2_Y+m^2_X)}{4m^2_W}\nonumber \\{} & {} -\left. \frac{(m^2_Y-m^2_X)^2}{m^4_W}-\left( \frac{m^2_Y-m^2_X}{4m^2_W}-\frac{m^4_Y-m^4_X}{2m^4_W}\right. \right. \nonumber \\{} & {} \left. \left. +\frac{3m^2_Ym^2_X}{2m^2_W(m^2_Y-m^2_X)}\right) \ln \frac{m^2_Y}{m^2_X}\right] . \end{aligned}$$
(71)

Above, the function \(F_0\) is defined as in Ref. [129].

In Fig. 2, we plot the \(S,\, T,\, U\) parameters as functions of v (where u is followed by \(u=\sqrt{(246\text { GeV})^2-v^2}\)), where three upper panels correspond to the 3-3-1 model with \(\beta =-1/\sqrt{3}\) for w in the range of \(3.9-10\) TeV, while three lower panels correspond to the 3-3-1 model with \(\beta =-\sqrt{3}\) for w in the range of \(3.9{-}5\) TeV since this model is limited by the Landau pole (cf. Sect. 5). It is clear that the values of \(S,\, T, \, U\) can be negative or positive (except for the rightmost upper panel), depending on the value of v as well as the sign and magnitude of X and Y mass splitting. Additionally, the magnitudes of \(S,\, T, \, U\) parameters are very small compared with the value of the T parameter \((T\sim 0.18)\) in the case of tree-level Z-\(Z'\) and Z-\(Z'\)-X mixings. Hence, the oblique contributions due to the XY gauge vector doublet are not significant, i.e. not enough to accommodate the CDF W-boson mass anomaly if the tree-level sources of mixing are not included.

Indeed, substituting (66)–(68) [(69)–(71)] into (65) and taking \(v=0\)–246 GeV and \(w=3.9\)–10 TeV (\(w=3.9\)–5 TeV) for the 3-3-1 model with \(\beta =-1/\sqrt{3}\) (\(\beta =-\sqrt{3}\)), we obtain the enhancement of the W-boson mass caused by the oblique corrections to be too small, namely, \(\Delta m_W\le 0.00042\) GeV (\(\Delta m_W\le 0.00192\) GeV), as explicitly shown in Fig. 3, incompatible with the experimental measurement.

3.4 Oblique contributions of a non-degenerate scalar doublet

Under the standard model symmetry, the three scalar triplets as given correspondingly contain three \(SU(2)_L\) scalar doublets, since each triplet is decomposed as \(3=2\oplus 1\). One of the doublets is eaten by WZ leaving only the usual physical Higgs field, other one of the doublets is completely eaten by the XY gauge vector doublet. The remaining scalar doublet is really a new physical Higgs doublet, which potentially contributes to the W-boson mass via STU parameters, partly noted in [132]. Such a physical Higgs doublet also exits in the 3-3-1 models for dark matter as the first and second entries of inert scalar triplets [104, 105], or in 3-3-1 models with flavor symmetries as contained in scalar flavon triplets [77,78,79]. Even if one includes a scalar sextet [56, 76,77,78,79], a new Higgs doublet correspondingly arises, since \(6=3\oplus 2\oplus 1\) under \(SU(2)_L\). In this case, the \(SU(2)_L\) scalar triplet also contributes to STU parameters, but this contribution is neglected because of its significant tree-level contribution as discussed below.

That said, a new physical Higgs doublet in addition to the usual Higgs doublet is popularly presented in the 3-3-1 model. Without loss of generality, we consider a generic scalar triplet,

$$\begin{aligned} \phi = (\phi _1,\, \phi _2,\, \phi _3)^T\sim (1,3,X_\phi ), \end{aligned}$$
(72)

which contains the new \(SU(2)_L\) Higgs doublet, i.e. \((\phi _1,\phi _2)\), as desirable. Here \(\phi \) can have an arbitrary X-charge, making a significant contribution to the oblique parameters independent of electric charge. It is easily shown that the contributions of \(\phi \) to SU parameters are more smaller than that to T, if the scalar fields are radically heavier than WZ masses. Indeed, in the 3-3-1 model, both \(\phi _{1,2}\) are typically heavy at TeV scale, while the mass-squared splitting of \(\phi _{1,2}\) is only proportional to the weak scale [104, 105]. In this case, the W-boson mass shift is governed by the T parameter, such as

$$\begin{aligned} \Delta m^2_W = \frac{c_W^4 m_Z^2}{c_W^2-s_W^2} \frac{G_F}{8\sqrt{2}\pi ^2}\left( m^2_1+m^2_2-\frac{2m^2_1 m^2_2}{m^2_1-m^2_2}\ln \frac{m^2_1}{m^2_2}\right) , \end{aligned}$$
(73)

where \(m_{1,2}\) are the masses of \(\phi _{1,2}\), respectively.

Taking the central values of W mass from the CDF experiment [1] and the standard model prediction [2], respectively, in Fig. 4 we contour \(\Delta m^2_W\) as function of \(\delta m=m_2-m_1\) and \(m_1\) by the black line. We obtain the viable value, \(\delta m \simeq 98\) GeV, as appropriate. Further, the contributions to \(\Delta m^2_W\) depend only on the mass splitting \(\delta m\), nearly insensitive to \(m_1\). For clarity, the \(1\sigma \), \(2\sigma \), and \(3\sigma \) ranges of the measured W mass are also shown (in cyans).

Fig. 4
figure 4

CDF-allowed regions contoured in \(\delta m\)\(m_1\) plane

Last, but not least, it is clear that \(m_1\sim \lambda _{\phi -H} v^2/\delta m\) where \(\lambda _{\phi -H}\) relates to the coupling of \(\phi \) to the usual Higgs field. Hence, \(\phi \) obtains a mass at TeV, i.e. \(m_1\sim \) TeV, only if \(\lambda _{\phi -H}\) is at the perturbative limit [52].

4 Tree-level contribution of a sextet

A scalar sextet of type,

$$\begin{aligned} S=\begin{pmatrix} S^0_{11} &{} \frac{S^-_{12}}{\sqrt{2}} &{} \frac{S^q_{13}}{\sqrt{2}}\\ \frac{S^-_{12}}{\sqrt{2}} &{} S^{--}_{22} &{} \frac{S^{q-1}_{23}}{\sqrt{2}}\\ \frac{S^q_{13}}{\sqrt{2}} &{} \frac{S^{q-1}_{23}}{\sqrt{2}} &{} S^{2q}_{33} \end{pmatrix}, \end{aligned}$$
(74)

often studied in the 3-3-1 models [56, 57, 72, 76, 133], can develop a vacuum value such as

$$\begin{aligned} \langle S\rangle =\frac{1}{\sqrt{2}}\begin{pmatrix} \kappa &{} 0 &{} 0\\ 0 &{} 0&{} 0\\ 0 &{} 0 &{} 0 \end{pmatrix}. \end{aligned}$$
(75)

Other fields if electrically neutral can also have a VEV. But they belong to a \(SU(2)_L\) doublet or singlet, giving a contribution similar to the above cases of scalar triplets, and are not interested. Here, the nontrivial vacuum structure is associated with the \(SU(2)_L\) scalar triplet, \((S_{11}, S_{12}, S_{22})\), contained in the sextet, unlike those in the cases of scalar triplets. Unfortunately, this sextet gives a negative contribution to the \(\rho \)-parameter,

$$\begin{aligned} \rho \simeq 1-\frac{2\kappa ^2}{u^2+v^2}, \end{aligned}$$
(76)

as shown in [76], incompatible with the experiment. This case is similar to a scalar triplet in the type II seesaw mechanism added to the standard model.

However, if we consider an alternative scalar sextet of type,

$$\begin{aligned} \sigma =\begin{pmatrix} \sigma ^+_{11} &{} \frac{\sigma ^0_{12}}{\sqrt{2}} &{} \frac{\sigma ^{q+1}_{13}}{\sqrt{2}}\\ \frac{\sigma ^0_{12}}{\sqrt{2}} &{} \sigma ^{-}_{22} &{} \frac{\sigma ^{q}_{23}}{\sqrt{2}}\\ \frac{\sigma ^{q+1}_{13}}{\sqrt{2}} &{} \frac{\sigma ^{q}_{23}}{\sqrt{2}} &{} \sigma ^{2q+1}_{33} \end{pmatrix}, \end{aligned}$$
(77)

studied in the 3-3-1 model for dark matter [105], the \(SU(2)_L\) scalar triplet \((\sigma _{11},\sigma _{12},\sigma _{22})\) contained in the sextet develops a VEV in different way, such as

$$\begin{aligned} \langle \sigma \rangle =\frac{1}{2}\begin{pmatrix} 0 &{} \kappa ' &{} 0\\ \kappa ' &{} 0&{} 0\\ 0 &{} 0 &{} 0 \end{pmatrix}. \end{aligned}$$
(78)

It gives a positive contribution to the \(\rho \)-parameter, such as

$$\begin{aligned} \rho \simeq 1+\frac{4\kappa '^2}{u^2+v^2}. \end{aligned}$$
(79)

Comparing to the W-mass shift, \(\kappa '\) is about 4.5 GeV, similarly in size to \(u'\) bounded above. This case is similar to a scalar triplet with \(Y=0\) added to the standard model [134].

It is stressed that the mentioned scalar sextets contribute to the gauge boson mass spectra differently from the scalar triplets in previous sections. However, the results obtained according to the scalar triplets are easily generalized for the sextets with the aid of [126].

5 Existing bounds

We now present the running coupling and Landau pole limit which place a constraint on \(\beta ,q\) parameters. We also discuss FCNCs which reveal important information on \(B-L\)-violating VEVs and new physics scale. Last, but not least, we examine collider bounds, validating the previous constraints, as well as we summarize the relevant bounds used for model classification and updated with W-mass measurement.

5.1 Running coupling and Landau pole

A gauge coupling commonly denoted as \(g=\{g_s,g,g_X\}\) changes with renormalization scale \(\mu \) through the RG equation,

$$\begin{aligned} \frac{\partial g}{\partial \ln \mu }=\beta (g)=-\frac{g^3}{16\pi ^2}b_g, \end{aligned}$$
(80)

where the 1-loop beta function is given by

$$\begin{aligned} b_g=\frac{11}{3} C_V-\frac{2}{3} \sum _L C_L-\frac{2}{3} \sum _R C_R -\frac{1}{3} \sum _S C_S, \end{aligned}$$
(81)

where VL/RS indicate vector, left/right fermion, and scalar field representations under the relevant gauge group, respectively. For SU(3), \(C_V=3\) and \(C_L=C_R=C_S=1/2\) for (anti)triplets, while for \(U(1)_{X}\), \(C_V=0\) and \(C_{L,R,S}=X^2_{L,R,S}\) for LRS fields, respectively. We obtain \(b_{g_s}=5>0\) and \(b_g=13/2>0\). Thus, \(g_s,g\) decrease when \(\mu \) increases. There is no Landau pole associated with \(g_s,g\). However, because of \(C_V=0\) and \(C_{L,R,S}=X^2_{L,R,S}>0\), we always have \(b_{g_X}<0\). Hence, \(g_X\) increases when \(\mu \) increases.

In contrast to the standard model and grand unification, a finite Landau pole associated with \(g_X\) potentially arises because of \(U(1)_X\) along with \(SU(3)_L\) embedded in \(U(1)_Q\). Indeed, as given before, the electric charge operator defines the photon field that couples to it and that the normalization of the photon field implies

$$\begin{aligned} s^2_W=\frac{g^2_X}{g^2+g^2_X(1+\beta ^2)}<\frac{1}{1+\beta ^2}, \end{aligned}$$
(82)

where note that g is always finitely nonzero. When the energy scale increases, \(g_X/g\) increases as long as \(s^2_W\) approaches the r.h.s of the inequality. The model encounters a Landau pole \(\mu \) at which \(s^2_W(\mu )=\frac{1}{1+\beta ^2}\), or equivalently \(g_X(\mu )= \infty \), where the scale of \(\mu \) depends heavily on \(\beta \) value. It is stressed that the model is valid only if the Landau pole \(\mu \) is larger than the new physics scale, i.e. \(\mu > w\), thus than the weak scale uv. Correspondingly, we have \(s^2_W(\mu )=\frac{1}{1+\beta ^2}>s^2_W(u,v)\). This yields \(|\beta |<\cot _W(u,v)\simeq 1.824\) for \(s^2_W(u,v)\simeq 0.231\), which translates to \(-2.08< q < 1.08\), due to \(\beta =-(1+2q)/\sqrt{3}\).

The electric charge of E is very constrained, taking \(q=-2,-1,0,1\) for integer charges. For \(q=-2,1\), we have \(|\beta |=\sqrt{3}\) and \(s^2_W(\mu )=1/4\) just above that at the weak scale. In this case, the model presents a low Landau pole at \(\mu =4\)–5 TeV, as shown in [135, 136] for the minimal 3-3-1 model. However, if \(q=-1,0\), which include the 3-3-1 model with heavy charged leptons and the 3-3-1 model with right-handed neutrinos, we have \(|\beta |=1/\sqrt{3}\) and \(s^2_W(\mu )=3/4\) that is much beyond the typical value \(s^2_W=3/8\) at the grand unification scale, close to the Planck regime. In this case, the model still possesses a Landau pole, but this pole is beyond the Planck scale.

5.2 FCNCs

There are two sources for FCNCs in the 3-3-1 model that come from the nonuniversality of quark families under the gauge group and the possible mixing of ordinary quarks and exotic quarks, respectively. The former occurs for every q associated with \(Z'\) current, while the latter arises only if \(q=0\) or \(-1\) concerning Z current. Particularly for \(q=0\), it happens with the 3-3-1 model with right-handed neutrinos but not with the 3-3-1 model with neutral (heavy) fermions. These FCNCs are actually induced at tree-level and indeed dangerous. The model with \(q=-1\) happening similarly to the case \(q=0\), as well as the FCNCs that are possibly arisen/coupled to the Higgs and new Higgs fields, will not be discussed.

In the 3-3-1 model with \(q=0\), the ordinary and exotic quarks of up-type \((u_a,J_3)\) and of down-type \((d_a,J_\alpha )\) each mix by themselves due to the lepton-number violating VEVs of \(\eta _3,\chi _1\) as well as the lepton-number violating Yukawa couplings (called s’s) that like ordinary Yukawa couplings (called h’s) but appropriately interchange the right-handed components of ordinary and exotic quarks, as supplied above [76]. We define mixing matrices, \((u_1\ u_2\ u_3\ J_3)^T_{L,R}=V_{uL,R}(u\ c\ t\ T)^T_{L,R}\) and \((d_1\ d_2\ d_3\ J_1\ J_2)^T_{L,R}=V_{dL,R}(d\ s\ b B_1\ B_2)^T_{L,R}\), such that the \(4\times 4\) mass matrix of \((u_a,J_3)\) and the \(5 \times 5\) mass matrix of \((d_a,J_\alpha )\) are diagonalized. Because the ordinary and exotic quarks have different \(T_3\) weak isospin, the tree-level FCNCs of Z boson arise, such as

$$\begin{aligned} {\mathcal {L}}\supset (\pm )\frac{g}{2c_W}\bar{q}_{iL}\gamma ^\mu q_{jL} (V^*_{qL})_{Ii}(V_{qL})_{Ij}Z_\mu , \end{aligned}$$
(83)

i.e. the standard model CKM mechanism does not work [108]. Here, we denote q as either u for up-type or d for down-type quarks, which should not be confused with the q charge parameter used throughout, \(i,j=1,2,3\) label ordinary physical quarks, and \(I=J_3\) and plus sign applies for \(V_{u}\), while \(I=J_\alpha \) and minus sign applies for \(V_d\). Integrating Z out as well as using \(m^2_Z\simeq (g^2/4c^2_W)(u^2+v^2)\), we obtain effective interactions,

(84)

where \(\Delta C_{\bar{d}s}\equiv [(V^*_{dL})_{I1}(V_{dL})_{I2}]^2/(u^2+v^2)\), and “\(\cdots \)” indicates its conjugate as well as other four-quark systems. The strongest bound comes from the \(K^0\)\(\bar{K}^0\) mixing that requires the relevant coupling, \(\Delta C_{\bar{d}s}\), to be smaller than \((10^4~{\textrm{TeV}})^{-2}\) [2], implying

$$\begin{aligned} |(V^*_{dL})_{I1}(V_{dL})_{I2}|\lesssim 10^{-5}. \end{aligned}$$
(85)

The mixing of ordinary and exotic quarks is much smaller than the smallest mixing element of CKM matrix (around \(5\times 10^{-3}\)), which may be understood in the 3-3-1-1 model [107]. For the present model, we safely assume \((V_{qL})_{Ii}\sim u'/ u \sim w' / w \sim s/h\sim 10^{-2}\)\(10^{-3}\).Footnote 3

It is noted that the model with \(q\ne 0,-1\) has only FCNCs associated with nonuniversal \(Z'\) couplings, because the ordinary and exotic quarks do not mix. The following computation would apply for all cases of q-charge, since for \(q=0\) (or \(-1\)) the ordinary and exotic quark mixing negligibly contributes to this kind of the FCNCs, as shown in [108]. Because the third family of quarks transforms under \(SU(3)_L\otimes U(1)_X\) differently from the first two, there must be tree-level FCNCs. Indeed, using \(X=Q-T_3-\beta T_8\), the interaction of the neutral current is \({\mathcal {L}}\supset -g\bar{F}\gamma ^\mu [T_3 A_{3\mu } + T_8 A_{8\mu }+t_X(Q-T_3-\beta T_8)B_\mu ]F\), where F runs over fermion multiplets. It is clear that the terms of \(T_3\) and Q, as well as all terms of \(\nu _a\), \(e_a\), \(E_a\), \(J_\alpha \), and \(J_3\), do not flavor change. The relevant part includes only \(T_8\) with ordinary quarks, such as \({\mathcal {L}}\supset -g\bar{q}_L\gamma ^\mu T_{8q}q_L (A_{8\mu }-\beta t_X B_\mu ) =-g\bar{q}_L\gamma ^\mu T_{8q} q_L Z'_\mu /\sqrt{1-\beta ^2 t^2_W}\), where q denotes either up-type or down-type quarks and \(T_{8q}=\frac{1}{2\sqrt{3}}{\textrm{diag}}(-1,-1,1)\) combines their \(T_8\) charge. Changing to the mass basis, \(q_{aL,R}=(V_{qL,R})_{ai}q_{iL,R}\), we have

$$\begin{aligned} {\mathcal {L}} \supset -\frac{g}{\sqrt{3}\sqrt{1-\beta ^2 t^2_W}}\bar{q}_{iL}\gamma ^\mu q_{jL} (V^*_{qL})_{3i}(V_{qL})_{3j}Z'_\mu , \end{aligned}$$
(86)

which implies FCNCs for \(i\ne j\). Integrating \(Z'\) out and using \(m^2_{Z'}\simeq g^2 w^2/3(1-\beta ^2 t^2_W)\), we obtain effective interactions,

(87)

where \(\Delta C'_{\bar{s}b}\equiv [(V^*_{dL})_{32}(V_{dL})_{33}]^2/w^2\), and “\(\cdots \)” stands for its conjugate and other four-quark systems. The strongest bound comes from the \(B^0_s-\bar{B}^0_s\) system, given by [2]

$$\begin{aligned}{}[(V^*_{dL})_{32}(V_{dL})_{33}]^2\frac{1}{w^2}<\frac{1}{(100\ {\textrm{TeV}})^2}. \end{aligned}$$
(88)

Assuming \(V_{uL}=1\), the CKM factor is \(|(V^*_{dL})_{32}(V_{dL})_{33}|\simeq 3.9\times 10^{-2}\), which translates to \(w>3.9\) TeV. This bound is independent of \(\beta \), i.e. applying for every 3-3-1 model. Last, notice that the \(K^0\)\(\bar{K}^0\) mixing gives a bound, \(w>3.6\) TeV, slightly smaller than the given one, which has not been signified.

Table 2 A summary of existing bounds and their updates with W-mass result

5.3 Collider searches

The LEPII studied the process \(e^+ e^-\rightarrow f\bar{f}\), where f is an ordinary fermion, through exchange of a new heavy gauge boson \(Z'\), described by effective interactions,

$$\begin{aligned} {\mathcal {L}}_{\textrm{eff}}&\supset \frac{g^2}{c^2_Wm^2_{Z'}}\left[ \bar{e}\gamma ^\mu (a^{Z'}_L(e) P_L+a^{Z'}_R(e)P_R)e\right] \nonumber \\&\times \left[ \bar{f}\gamma _\mu (a^{Z'}_L(f)P_L+a^{Z'}_R(f)P_R)f\right] , \end{aligned}$$
(89)

where \(a^{Z'}_{L,R}(f)=\frac{1}{2} [g^{Z'}_V(f)\pm g^{Z'}_A(f)]\), as usual. Considering \(f=\mu ,\tau \), the charged leptons have equal couplings, and we rewrite

$$\begin{aligned} {\mathcal {L}}_{\textrm{eff}}&\supset \frac{g^2[a^{Z'}_L(e)]^2}{c^2_W m^2_{Z'}}(\bar{e}\gamma ^\mu P_L e)(\bar{f}\gamma _\mu P_Lf)\nonumber \\&+(LR)+(RL)+(RR), \end{aligned}$$
(90)

where the successive terms differ from the first one only in chiral structure, and

$$\begin{aligned} a^{Z'}_L (e) = \frac{c_W+\sqrt{3}\beta s_W t_W}{2\sqrt{3}\sqrt{1-\beta ^2 t^2_W}},\quad a^{Z'}_R(e)=\frac{\beta s_W t_W}{\sqrt{1-\beta ^2 t^2_W}}. \end{aligned}$$
(91)

The LEPII supplied bounds for such chiral couplings, typically [137]

$$\begin{aligned} \frac{g^2[a^{Z'}_L(e)]^2}{c^2_W m^2_{Z'}} <\frac{1}{(6~{\textrm{TeV}})^2}. \end{aligned}$$
(92)

Using \(m^2_{Z'}\simeq g^2w^2/[3(1-\beta ^2t^2_W)]\), we get

$$\begin{aligned} w> 3\times (1+\sqrt{3}\beta t^2_W)~{\textrm{TeV}}, \end{aligned}$$
(93)

which are 5.7 TeV, 3.9 TeV, 2.1 TeV, and 0.3 TeV, for \(q=-2\), \(-1\), 0, and 1, respectively. The first case is ruled out by the Landau pole, while the last case means that \(Z'\) negligibly contributes to the process, since w would be at TeV due to the FCNCs above. That said, the 3-3-1 model with \(q=0\) and \(\pm 1\) would be viable, as taken into account.

The LHC searched for dilepton signals through the process \(pp\rightarrow ff^c\) that is contributed by \(Z'\), supplying a bound \(m_{Z'}\sim 4\) TeV for \(Z'\) couplings identical to those of the usual Z boson [138]. It is noticed that \(Z'\) in our model couples similarly to Z, governed by the common g coupling but having a small difference due to \(\beta \). Therefore, the bound as given applies to our model with some extent since the couplings are not identical. The \(Z'\) mass limit thus converts to a w bound comparable to the LEPII.

5.4 Summary of the existing bounds with updates for model building and W mass

The viable range of \(\beta ,q\) and the Landau pole limit determined in Sect. 5.1 have appropriately been supplied to Table 1 for classifying 3-3-1 versions. Additionally, all the relevant, existing bounds of this section, i.e. Sects. 5.15.2, and 5.3, have appropriately been combined with the W-mass measurement in the previous sections.

For convenience in reading, we give a summary of all the bounds obtained in this section as well as their application to the W-mass deviation in the previous sections, as in Table 2. That said, the constraints previously given concerning the W mass, such as (wv) that results from Figs. 12, and 3, are indeed the updated bounds, while the conditions \(u'\ll u\) and \(w'\ll w\) which are obviously used for the gauge spectrum and the W-mass shift are suitable to the bound in this section.

6 Conclusion

We have examined a variety of variants of the 3-3-1 model, characterized by a basic charge parameter q and the behavior of \(B-L\) symmetry through its residual matter parity \(P_M\). The 3-3-1 model generally has a standard vacuum structure, except for the 3-3-1 variant with right-handed neutrinos or with heavy charged leptons which can develop an abnormal vacuum structure, not protected by \(P_M\). We have diagonalized the gauge spectra according to the two kinds of vacuum in detail.

There are two kinds of constraints for this model, which are jointly considered and consistently updated:

New experiment: We have investigated the various contributions of the 3-3-1 model to the CDF W-mass anomaly, and we conclude that the Z\(Z'\) mixing due to the 3-3-1 breakdown by the normal VEVs uvw and a non-degenerate physical Higgs doublet popularly existed in the model can explain this anomaly, separately. Additionally, a scalar sextet that contains a \(SU(2)_L\) Higgs triplet with \(Y=0\) can also solve this puzzle. Furthermore, considering the special 3-3-1 versions with \(q=0,\pm 1\), the viable regimes of uvw have been derived, obeying the FCNC, collider, and relevant Landau-pole limits. For the case of the heavy non-degenerate Higgs doublet, its mass splitting should be at 98 GeV, whereas for the case of the scalar sextet, the tiny VEV should be about 4.5 GeV.

Existing bounds: The running coupling and Landau pole limit imply a bound for q, i.e. \(-2.08<q<1.08\). The minimal 3-3-1 model \((q=1)\) and the 3-3-1 model with \(q=-2\) have a Landau pole at TeV scale, while the 3-3-1 model with right-handed neutrinos/neutral fermions \((q=0)\) and the 3-3-1 model with heavy charged leptons \((q=-1)\) possess a Landau pole beyond the Planck scale. The FCNCs set a constraint on \(B-L\) violating parameters, such as \(u'/u\sim w'/w\sim s/h\sim 10^{-2}\)\(10^{-3}\), for the 3-3-1 model with right-handed neutrinos, while they imply a bound for the new physics scale \(w>3.9\) TeV, valid for every 3-3-1 version. The LEPII gives a strong bound for the 3-3-1 model with \(q=-2\), i.e. \(w>5.7\) TeV, which is bigger than its Landau pole, \(\mu \sim 4\)–5 TeV, hence this version is ruled out. The collider bounds for 3-3-1 versions with \(q=0,\pm 1\) are appropriate to the FCNCs.