1 Introduction

The possible existence of a light neutral spin-1 boson with a small gauge coupling, in the \(\sim \) MeV to hundred MeV mass range and decaying most notably into \(e^+ e^-\) pairs, has been studied for a long time [1,2,3]. It is generally expected not to have a significant effect on neutral current phenomenology at higher \(q^2\), as compared to a heavy \(Z'\), but it could affect the anomalous magnetic moments of the muon or electron, parity-violation effects in atomic physics, or be produced in various decays and beam dump experiments, etc.

In an electroweak theory the new boson, referred to as the U boson, can mix with the Z and with the photon through a \(3\times 3\) matrix, borrowing features from both kinds of particles. The Z weak neutral current as well as the electromagnetic current, and also the B and L currents, can contribute to the new current \(J_U^\mu \), which has in general both vector and axial parts. This could, however, lead to too-strong parity-violation effects [4], with axial couplings also enhancing the cross-sections for longitudinally polarized U bosons, produced much like light pseudoscalars [1].

In a large class of spontaneously broken gauge theories, however, the U couplings to quarks and leptons are naturally vectorial in the small mass limit. The U current, obtained from a mixing of the extra-U(1) current \(J^\mu _F\) with the Z current, ultimately involves a combination of the baryonic, leptonic (or \(B-L\)), and electromagnetic currents [5, 6]. This includes and generalizes, within the framework of an extended electroweak or grand-unified theory, the very specific case of a “dark photon” coupled to electric charges, which has focused much of the experimental attention. This also applies to an extremely light or massless U boson inducing a new long-range force, extremely weak, which could lead to apparent deviations from the equivalence principle [3, 5,6,7].

The U boson may also couple to dark matter, and can mediate sufficient annihilations through stronger-than-weak interactions so as to allow for dark-matter particles to be light [8, 9]. It may provide a possible explanation for the apparent discrepancy between the expected and measured values of \(g_\mu \!-2\,\) [10,11,12], and be at the origin of many interesting effects.

2 A vectorially coupled U boson

In the presence of weak-interactions one may expect the U couplings to be parity-violating. Still this one may be vectorially coupled to quarks and leptons, as in the specific case of a “dark photon”, but allowing for more general situations. This occurs in a large class of models in which the extended electroweak symmetry is broken by a single doublet; or by two (or more) as in supersymmetric theories, but with the same gauge quantum numbers. The symmetry breaking may then be viewed as induced by a single active doublet \(\varphi \), the others being “inert”.

2.1 \(Q_U\) as a combination of \(Q,\,B,\,L\) and \(F_\mathrm{dark}\)

We express the extended electroweak covariant derivative as

$$\begin{aligned} iD_\mu =\,i\partial _\mu \! \underbrace{- \,g\, {{\varvec{T}}}.{{\varvec{V}}\!}_\mu -\,\frac{g'}{2}\,Y\,B_\mu }_{\hbox { (possibly grand-unified)}} -\,\frac{g''}{2}\,F\,C_\mu , \end{aligned}$$
(1)

with the extra \(U(1)_F\) gauge symmetry commuting with \({SU}(3)_C\times {SU}(2)\times U(1)_Y\), or SU(5) in a grand-unified theory. The Lagrangian density, expressed as usual in an orthonormal field basis, includes the couplings of the gauge fields with the corresponding currents,

$$\begin{aligned} \mathcal{L}= & {} -\,{\displaystyle \frac{1}{4}}\, \left( {{\varvec{V}}\!}^{\mu \nu } {{\varvec{V}}\!}_{\mu \nu }+ B^{\mu \nu }B_{\mu \nu }+ C^{\mu \nu }C_{\mu \nu }\right) \nonumber \\&-\ {{\varvec{\mathcal {J}}}}^\mu . \,{{\varvec{V}}\!}_\mu \,- \mathcal{J}_{\,Y}^\mu \,B_\mu \, -\mathcal{J}_{\,F}^\mu \,C_\mu + ...\ . \end{aligned}$$
(2)

When quarks and leptons acquire their masses from a single electroweak doublet as in the standard model, or several but with the same gauge quantum numbers, the gauge invariance of their Yukawa couplings requires the new \(U(1)_F\) quantum number F, and the corresponding current \(\mathcal{J}^\mu _F\), to be expressed as [5, 6]

$$\begin{aligned} F= & {} \alpha _B \,B+\beta _i \,L_i\,+\,\gamma \,Y\,+\,F_d,\nonumber \\ \mathcal{J}^\mu _F= & {} \frac{g''}{2}\,J^\mu _F = \frac{g''}{2} (\alpha _B\,J^\mu _B+\beta _i\,J^\mu _{L_i}+\gamma \,J^\mu _Y+\,J^\mu _{\,d})\,. \end{aligned}$$
(3)

The new U(1) current is naturally expressed as a linear combination of the B and L currents with the weak-hypercharge current,Footnote 1 and a possible dark-matter or extra spin-0 singlet contribution associated with a “hidden sector”. The simultaneous appearance of \(B,\,L\) and Y in (3) is actually required in the framework of grand unification, to ensure that \(U(1)_F\) commutes with the non-abelian grand-unification gauge group [3, 5, 6].

For \(\gamma =0\) the new interaction, coupled to a linear combination of the baryon and lepton numbers, is simple to study in terms of the mass and couplings of the new boson, then unmixed with the Z and the photon. For the theory to be anomaly-free the U current may be taken as the \(B-L\) current (in the presence of \(\nu _R\) fields), possibly combined with the \(L_i-L_j\) and dark-matter currents. The new force may be of infinite or finite range, and may also act on dark-matter particles.

We now concentrate on the more elaborate situation of a \(U(1)_F\) gauge interaction of \(C^\mu \) in (1)–(3) involving the weak-hypercharge generator Y, allowing to normalize \(g''\) and F so that \(\gamma =1\). We may again use \(B-L\) and \(L_i-L_j\) in expression (3) of F for the theory to be anomaly-free, including \(\nu _R\) fields. The v.e.v. \(v/\sqrt{2}\simeq 174\) GeV of the doublet \(\varphi \), with \(Y\!=\!1\,\), breaks the \({SU}(3)_C\times {SU}(2) \times U(1)_Y\times U(1)_F\) gauge symmetry to \({SU}(3)_C\times U(1)_\mathrm{QED}\times U(1)_U\), with \(m_W=gv/2\). The three neutral fields \(W_3,B\), and C are mixed into the massless photon field A, the massive Z field and a new neutral field U, still massless at this stage. They are given by [5, 6]

(4)

The photon field has its usual SM expression in terms of \(\tan \theta =g'/g\). Z and U are obtained by rotating the standard \(Z_\mathrm{sm}=\cos \theta \ W_3- \sin \theta \,B\) and the \(U(1)_F\) gauge field C in the plane orthogonal to A as represented in Fig. 1, according to

$$\begin{aligned} \left\{ \begin{array}{ccl} Z&{}=&{} \cos \xi \ Z_\mathrm{sm} - \sin \xi \ C,\\ U&{}=&{} \sin \xi \ Z_\mathrm{sm} + \cos \xi \ C, \end{array} \right. \end{aligned}$$
(5)

with

$$\begin{aligned} \tan \xi \,= {\displaystyle \frac{g''}{\sqrt{g^2+g'^2}}}\,= \,{\displaystyle \frac{g''}{g_Z}}. \end{aligned}$$
(6)

This leads to the \(3\times 3\) orthogonal mixing matrix in (4).

Fig. 1
figure 1

Mixing with the \({\varvec{Z}}\)”, in the plane orthogonal to the photon field \(A^\mu \). \(U^\mu \!\) is a (small) mixing of \(C^\mu \!\) (coupled to \(F=Y +\alpha _B B +\beta _i L_i+ F_d\)) with \(Z_\mathrm{sm}^\mu \), leading to \(\mathcal{J}^\mu _U\) as in (7) and (8), with \(\tan \xi = g''/\sqrt{g^2+g'^2}\,\)

The U field is still massless at this stage, and coupled to a conserved current. This one is obtained, in the small \(m_U\) limit in which U is almost exactly given by (4), as a combination of the \(U(1)_F\) current \(J^\mu _F\) with the standard weak neutral current \(J^\mu _{Z_\mathrm{sm}}\), using \(\sin \xi \, \sqrt{g^2+g'^2}=g'' \cos \xi \,\) from (6). Including as in (2) and (3) the coupling constants \(g,\,g'/2,\,g''/2\,\) within the currents \(\mathcal{J}^\mu \) we have

$$\begin{aligned} \mathcal{J}_U^\mu= & {} \cos \xi \, \underbrace{\left( \,\frac{g''}{2} \,J_F^\mu \right) }_{{{\mathcal{J}_F^\mu }}} \,+\,\sin \xi \, \underbrace{\sqrt{g^2+g'^2}\,(J_3^\mu \!-\sin ^2\theta \,J^\mu _\mathrm{em})}_{{\mathcal{J}_{Z_\mathrm{sm}}^\mu }}\nonumber \\= & {} g'' \cos \xi \left[ \frac{1}{2}\ J^\mu _Y+{\displaystyle \frac{1}{2}} (\alpha _B \,J^\mu _{B}+\beta _i\,J^\mu _{L_i}+ J_d^\mu )\right. \nonumber \\&\left. + \, (J_3^\mu \!-\sin ^2\theta \,J^\mu _\mathrm{em})\, \right] . \end{aligned}$$
(7)

With Y / 2 (from \(J^\mu _F\)) and \(T_3\) (from \(J^\mu _{Z_\mathrm{sm}}\)) combining into \(Q=\frac{Y}{2}+T_3\) the axial part in the U current for quarks and leptons disappears in the massless U limit. The resulting current is

(8)

It is associated with a conserved charge [5, 6], normalized as

(9)

with coupling constant

$$\begin{aligned} \begin{array}{ccl} g''\, \cos \xi \,\cos ^2 \theta &{}=&{} g''\,{ \displaystyle \sqrt{\frac{g^2+g'^2}{g^2+g'^2+g''^2}}\ \frac{g^2}{g^2+g'^2}}\\ &{}=&{} \epsilon \,e = e\,\tan \chi \,\simeq \,g''\cos ^2 \theta . \end{array} \end{aligned}$$
(10)

This coupling is expressed in terms of the elementary charge \(e=gg'/\!\sqrt{g^2+g'^2}\,\) as \(\epsilon \,e\) with \(\epsilon \!=\tan \chi \). The angle \(\chi \) may be interpreted from another expression of \(U^\mu \), given in a non-orthogonal basis as \((1/\cos \chi )\, {\hat{C}}^\mu \, + \) \(\tan \chi \,A^\mu \) (cf. Eqs. (46), (48), (55), and Fig. 3 in Sect. 5). \({\hat{C}}^\mu \) is in general coupled, not only to dark matter and to a spin-0 field \(\sigma \) in the hidden sector responsible for \(m_U\) [5, 6], but also to B and L as well. \({\hat{C}}^\mu \) and \(A^\mu \), both orthogonal to \(Z^\mu \) in (4), are themselves non-orthogonal, at an angle \(\frac{\pi }{2}+\chi \) with \(\tan \chi =\epsilon \) given by (10), very close to \(\frac{\pi }{2}\) if the extra U(1) coupling \(g''\) is small.

2.2 Relations between U charges

The U charges in (9), expressed for the first generation of quarks and leptons as a linear combination of \(Q,\,B\), and \(L_e\), satisfy the same additivity relations

$$\begin{aligned} Q_U(p)-Q_U(n)= & {} Q_U(u)-Q_U(d) \nonumber \\= & {} Q_U(\nu _e)-Q_U(e)=\,Q_U(W^+)=1.\nonumber \\ \end{aligned}$$
(11)

They express the conservation of \(Q_U\) in the limit of a massless U, associated with an unbroken symmetry \(U(1)_U\), with the \(W^\pm \) carrying \(\pm \,1\) unit of \(Q_U\). The corresponding vector couplings of the U boson, expressed from (8) to (10) as

$$\begin{aligned} \left\{ \, \begin{array}{cclcccc} f_{\nu _e}&{}=&{}\epsilon e \ Q_U(\nu _e), &{}f_p&{}=&{}2f_u\!+\!f_d\,=\, \epsilon e \ Q_U(p), \\ f_e&{}=&{} \epsilon e \ Q_U(e),&{}f_n&{}=&{} f_u+ 2f_d=\, \epsilon e \ Q_U(n), \end{array} \right. \end{aligned}$$
(12)

must therefore verify

$$\begin{aligned} f_n = \,f_p+f_e-f_{\nu _e}. \end{aligned}$$
(13)

This relation expresses in particular that \(Q_U\) remains conserved in the \(\beta \) decay of the neutron, \(n \rightarrow p \ e \ \bar{\nu }_e\).

The U is constrained to interact very weakly with electrons, so that the extra contribution \(\delta _U a_e\) to the electron anomaly be sufficiently small; and with protons, so that the \(\pi ^0\!\rightarrow \! \gamma \, U\) decay amplitude, proportional to \( f_p=2f_u+f_d\), be sufficiently small (cf. Sect. 6).  It should also interact sufficiently weakly with neutrinos so as to satisfy the constraint \(|f_{\nu _e} f_e|/m_U^2\) from low-\(q^2\ \nu _e\)e scattering, i.e., [4, 8]

(14)

valid for \(m_U\) larger than a few MeVs. The additivity property (11)–(13) implies that if \(f_e,\, f_p\), and \(f_{\nu _e}\) are all small, the coupling to the neutron \(f_n= f_p+f_e- f_{\nu _e}\) is expected to be small as well.

The \(U(1)_F\) generator in (3) may well involve B and L through their difference \(B-L\), in view of an anomaly-free theory (including \(\nu _R\)’s), or of grand unification as we shall see. \(Q_U\) may then be expressed as

(15)

More specifically with \(\lambda \simeq 1\),

$$\begin{aligned} Q_U \simeq \, Q-(B-L)\,+\,Q_{U\,\mathrm dark} \end{aligned}$$
(16)

would lead to smaller interactions with the proton and the electron, i.e.,

$$\begin{aligned} \mathrm{small} \,(f_p=- f_e), \mathrm{as \,compared\, to } \ (f_n= -\,f_\nu ), \end{aligned}$$
(17)

very much as found in [3], in the presence of axial couplings. In a similar way \(Q_U\) close to \(Q-B +Q_{Ud}\) (or \(Q-(B-3L_\tau )+Q_{Ud}\) in view of an anomaly-free theory) would lead to small \(f_p\) and \(f_{\nu _e}\), with a larger \(f_n\simeq f_e\), again in agreement with (13).

Such relations, however, may be avoided in other situations, with two spin-0 doublets at least, in which the U current is not naturally vectorial and conserved in the small \(m_U\) limit, so that significant parity-violation effects may have to be expected. A light U in a longitudinal polarization state may then be produced and interact significantly, much like the spin-0 pseudoscalar a associated with the spontaneous breaking of the global \(U(1)_U\) [1]. Both effects restrict significantly the possible size of axial couplings (cf. Sect. 6.1).

2.3 A new long-range force, and equivalence principle tests

The U could stay massless, mediating a new long-range force acting additively  on ordinary particles, proportionally to a linear combination of B (as considered long ago by Lee and Yang [13]) with L and Q. Or it may acquire a mass if the \(U(1)_U\) symmetry gets spontaneously broken. The fact that both B and L can be present simultaneously in the expression of \(Q_U\), and in combination with the electric charge Q, allows for an extension to grand-unified theories. The fact that Q may also appear alone illustrates that the popular dark photon case is included as a specific case of this general analysis.

For ordinary matter \(Q_U\) appears as a combination of the numbers of protons, neutrons and electrons, or \(Z,\,N\) and Q, i.e., effectively Z and N only for ordinary neutral matter [7]. More specifically the new force may act mostly on the number of neutrons N, as in the case of a “protophobic” U boson for which \(Q_U\) is close to \(Q-(B-L)\) [3].

With \(Q_U=Q-\lambda \,(B-L)+Q_{Ud}\) as in (15) (and also in the absence of the Q term, if Y does not appear in expression (3) of F so that the U does not mix with the Z and the photon), one has

$$\begin{aligned} Q_U(p+e)=0,\quad Q_U(n)=-\lambda , \end{aligned}$$
(18)

so that

$$\begin{aligned} Q_U=-\,\lambda \,N, \end{aligned}$$
(19)

for ordinary neutral matter. The interaction potential between two bodies of mass \(m_{i}\) and number of neutrons \(N_{i}\) is then given by

(20)

The ratio between the repulsive U-exchange potential and the gravitational potential between two neutrons at a distance r somewhat larger than \(\hbar /m_Uc\) is about

$$\begin{aligned} \frac{V_U(r)}{V_g(r)}\,\simeq \,-\,(\lambda \epsilon )^2\,\alpha \,\left( \frac{m_{\mathrm{Planck}}}{m_n}\right) ^2 \simeq \,-\,1.23 \ 10^{36}\,(\lambda \epsilon )^2.\nonumber \\ \end{aligned}$$
(21)

For a massless or almost massless U boson the new force could lead to apparent deviations from the equivalence principle [5,6,7], constraining it to be considerably weaker than gravitation, by \(\approx 10^{-10}\) at least, corresponding typically to

(22)

depending also on \(m_U\) and \(\lambda _U=\hbar /m_Uc\,\),  so that the resulting violations of the equivalence principle be [14,15,16,17].

The MICROSCOPE experiment will soon test the validity of this principle at the \(10^{-15}\) level [18]. The additivity property of the new force induced by a spin-1 U boson, following from the linear expression (9) of \(Q_U\) (as opposed to an hypothetical coupling to mass, or strangeness, ...), is also in contrast with the case of a spin-0 mediator [19], for which other contributions to the expression of the new force are generally expected. This may allow for a distinction between spin-1 and spin-0 mediators, should a deviation from the equivalence principle be observed.

2.4 Generating a small mass for the U boson

The U boson can acquire a small mass from a neutral singlet \(\sigma \) with \(Y=0\), directly providing \(A=\sin \theta \,W_3+\) \(\cos \theta \,B\,\) in (4) as the massless photon field [5, 6]. The singlet v.e.v.   \(<\sigma >\ =w/\sqrt{2}\) generates a mass term \(m_C=\) \(g'' F_\sigma w/2\), resulting in a small U mass

$$\begin{aligned} m_U\,\simeq \,m_C\,\cos \xi \ \simeq \,g'' F_\sigma w/2, \end{aligned}$$
(23)

with \(\sqrt{2}\ \mathfrak {R}\, \sigma \) a physical singlet BEH field, possibly (slightly) mixed with the standard one \(\sqrt{2} \ \mathfrak {R}\,\varphi ^0\) taken to describe the new 125 GeV spin-0 boson [20, 21].

The massive U field differs very little from its expression in (4) and (5), through a tiny change in \(\xi \) (from \(\xi _\circ \) to \(\xi _\circ +\delta \xi \)) inducing very small parity-violating contributions \(\approx m_U^2/m_Z^2\) in the U current \(J^\mu _U\). To discuss these small \(m_U\) corrections to the U and Z currents we observe that the theory is invariant under a simultaneous change of sign for \(C^\mu \) and \(g''\), acting as

$$\begin{aligned} C^\mu \rightarrow \,-\,C^\mu ,\quad g''\rightarrow \,-\,g'', \end{aligned}$$
(24)

so that

$$\begin{aligned} \begin{array}{rcc} (A,\,Z,\,U)\ \ &{}\rightarrow &{}\ (A,\,Z,\,-\,U),\\ \hbox {with}\ \ (g'',\xi ,\chi ,m_U) &{}\rightarrow &{}-\, (g'',\xi ,\chi ,m_U). \end{array} \end{aligned}$$
(25)

Corrections to \(\mathcal{J}^\mu _{\,U}\) in (7) and (8), odd in \(g''\) and vanishing with \(m_U\), are thus \(\approx g''\,\cos \xi \ m_U^2/m_Z^2\) (rather than \(g''\,\cos \xi \) \(m_U/m_Z\)), i.e.,

$$\begin{aligned} \approx \,e\,\tan \chi \ m_U^2/m_W^2. \end{aligned}$$
(26)

The Z current differs also very little from its standard SM expression with a small contribution from \(J^\mu _F\), by terms \(\approx m_U^2/m_Z^2\) as obtained from (4) to (8).

2.5 Special case of the dark photon

The special case for which B and L do not participate in expression (3) of the \(U(1)_F\) quantum number, simply reduced to

$$\begin{aligned} F= Y +F_d, \end{aligned}$$
(27)

provides in an electroweak theory a “dark photon”, with the U coupled to standard model particles proportionally to their electromagnetic current, through

$$\begin{aligned} \mathcal{J}^\mu _U = \,\underbrace{g''\cos \xi \,\cos ^2\theta }_{{\displaystyle \epsilon \,e}} \ (J^\mu _\mathrm{em}+ {\displaystyle \frac{1}{2\,\cos ^2\theta }}\,J^\mu _d\,). \end{aligned}$$
(28)

The coupling \(g''\cos \xi \cos ^2\theta \) may be expressed as in (10) as \(\epsilon \,e\) in terms of the elementary charge \(e=gg'/\sqrt{g^2+g'^2}\), with

(29)

in the small \(m_U\) limit.

This simple situation, a special case of the general one, has been obtained in an extended electroweak theory without ever referring to a largely fictitious “kinetic-mixing” term. Such terms are simply associated with a description in a non-orthogonal field basis, independently of the fact that they are gauge invariant or not. Adding them explicitly in an initial Lagrangian density (then under the restrictive condition that they must be gauge invariant [22, 23]) does not provide additional physical freedom. Indeed the notion of scalar product is not generalized by adding to its usual expression \(xx'+yy'\) in orthonormal coordinates a non-diagonal “mixing term” \(\epsilon \,(xy'+yx')\,\). Discussing a theory in a non-orthogonal rather than in an orthonormal field basis has no effect on the results.

The simple situation of a “dark photon” has been the focus of much experimental attention recently [24, 25]. But it appears excessively restrictive as compared to the general situation for a U boson [5, 6], excluding possible contributions from the B and L currents, which ought to be present in a grand-unified theory.

3 The U current in grand-unified theories

Indeed within SU(5)-type grand-unified theories [26, 27], the weak hypercharge Y, now a generator of SU(5), is no longer abelian, while the \(U(1)_F\) generator F in (1) and (3) should commute with SU(5). It may then look like gauge invariance prevents Y from entering in the expression of the \(U(1)_F\) quantum number F for the visible particles, requiring \(\epsilon =\tan \chi \) to vanish at tree level, as commonly believed [25].

This is not true, however, as Y can contribute to F through the SU(5)-invariant combination involving \(B-L\,\) [5, 6],

(30)

One also has \(F=-\frac{5}{2}\) for possible \(\bar{\nu }_L\) singlets describing right-handed neutrinos \(\nu _R\), making the theory anomaly-free, and the \(U(1)_F\) generator traceless. All Yukawa couplings proportional to \(\underline{\bar{\mathbf{5}}}_H.\, \underline{\bar{\mathbf{5}}}.\, \underline{\mathbf{10}}\,\) and \(\underline{\mathbf{5}}_H.\, \underline{\mathbf{10}}.\, \underline{\mathbf{10}}\,\), responsible for down-quark and charged-lepton masses and up-quark masses, respectively, have \(F=0\) and are invariant under \({SU}(5)\times U(1)_F\).

The \(U(1)_F\) current, including its hidden-sector part \(J^\mu _d\), now reads

$$\begin{aligned} J_F^\mu =J_Y^\mu \,-\,\frac{5}{2}\,J^\mu _{B-L} +\,J^\mu _d. \end{aligned}$$
(31)

By combining it with \(J^\mu _{Z_\mathrm{sm}}\) as in (7) and (8) we get the U current

$$\begin{aligned} \mathcal{J}_U^\mu= & {} g'' \cos \xi \left( \frac{1}{2}J^\mu _Y-\frac{5}{4}J^\mu _{B-L}+\frac{1}{2}J_d^\mu + (J_3^\mu -\sin ^2\theta \,J^\mu _\mathrm{em})\right) \nonumber \\= & {} g'' \cos \xi \,\left( \,\cos ^2\theta \ J^\mu _\mathrm{em}-\frac{5}{4}\ J^\mu _{B-L}+\frac{1}{2} \ J_d^\mu \,\right) , \end{aligned}$$
(32)

associated with the charge [5, 6]

(33)

The corresponding current \(J^\mu _U\) involves a term proportional to the electromagnetic current \(J^\mu _\mathrm{em}\), with the same coupling

$$\begin{aligned} g'' \cos \xi \cos ^2 \theta = \epsilon \, e = e\,\tan \chi \end{aligned}$$
(34)

as in (10) and (29), and a term proportional to the \(B-L\) current, plus a dark-matter current.

4 \(Q_U\), the electrostrong symmetry, and \(B-L\)

4.1 \(Q_U\) at the grand-unification scale

To understand better the origin and meaning of the new charge we note that \(Q_U\) in (33), if evaluated at the grand-unification scale with \(g'= g\sqrt{3/5} \) and \(\sin ^2\theta =\) 3 / 8, would read

$$\begin{aligned} \hbox {at GUT scale:}\quad Q_U^\mathrm{\,gut}=\,Q-2\,(B-L) + \frac{4}{5}\,F_d. \end{aligned}$$
(35)

This \(Q_U^\mathrm{\,gut}\) is invariant under the \({SU}(4)_\mathrm{es}\,(\sim \! O(6))\) electrostrong symmetry group unifying directly electromagnetism with strong interactions, including \({SU}(3)_C\times U(1)_\mathrm{QED}\) within SU(5), and commuting with \(U(1)_U\). Indeed, forgetting momentarily about the  \(\underline{\varvec{24}}\) adjoint v.e.v. (or extra dimensions) responsible for the breaking of \({SU}(5)\ [\times U(1)_F]\) into \({SU}(3)_C\times {SU}(2)\times U(1)_Y\ [\times U(1)_F]\), the v.e.v. of the quintuplet \({\underline{\mathbf{5}}}_H\) (including the electroweak doublet \(\varphi \), with \(F=Y=1\)), is responsible for the symmetry breaking

$$\begin{aligned} {SU}(5) \times U(1)_F \mathop {\longrightarrow }\limits ^{ \begin{array}{c} {<{\underline{\mathbf{5}}}_H>}\\ \end{array}}\ {SU}(4)_\mathrm{es}\times U(1)_U. \end{aligned}$$
(36)

It leaves at this stage unbroken the \(U(1)_U\) group generated by \(Q_U^{\,\mathrm gut }\) in (35), commuting with \({SU}(4)_\mathrm{es}\). This remaining \(U(1)_U\) may then be broken by the singlet v.e.v. \(<\sigma >\ =w/\sqrt{2}\), generating a non-vanishing \(m_U\) as in (23).

This U charge, invariant at the GUT scale under the \({SU}(4)_\mathrm{es}\) electrostrong symmetry group, is the same for all components within \({SU}(4)_\mathrm{es}\) representations, i.e., for quarks and leptons

$$\begin{aligned} Q_U^\mathrm{\,gut}\,=\ \left\{ \ \begin{array}{ccc} 1\,:&{}\ \ \underline{\bar{\mathbf{4}}} &{} \ \ \left( \begin{array}{c} \bar{d} \\ \,e^- \end{array}\right) _{L+R},\\ 0\,:&{} \ \ \underline{\mathbf{6}} &{} \ \{u,\,\bar{u}\}_L,\\ 2\,: &{}\ \ \underline{\mathbf{1}} &{} \ \nu _L \,(+\,\nu _R), \end{array}\right. \end{aligned}$$
(37)

with family indices omitted for simplicity. Dirac quark and lepton fields (except possibly for chiral neutrinos) are in vectorial representations of \({SU}(4)_\mathrm{es}\times U(1)_U\), with the U boson interactions, as well as electrostrong interactions, invariant under \({SU}(4)_\mathrm{es}\), preserving parity.

For the 24+1 gauge bosons of \({SU}(5)\times U(1)_F\), which includes \({SU}(4)_\mathrm{es}\times U(1)_U\) as a subgroup, one has

$$\begin{aligned} Q_U^\mathrm{\,gut}=\,\left\{ \ \begin{array}{ccc} 0\,:&{} \ \underline{\mathbf{15}} &{}\ \{ \hbox {gluons},\, \gamma , \,X^{\pm 4/3}\},\\ 0\,: &{} \ \ \underline{\mathbf{1}}+ \underline{\mathbf{1}} &{} Z,\ U,\\ 1\,:&{}\ \ \underline{\mathbf{4}} &{}\ \left( \begin{array}{c} Y^{-1/3} \\ W^+\, \end{array}\right) ,\\ -1\,:&{} \ \ \underline{\bar{\mathbf{4}}} &{}\ \left( \, \begin{array}{c} Y^{1/3} \\ W^-\, \end{array}\right) . \end{array}\right. \end{aligned}$$
(38)

\(Q_U^\mathrm{gut}\) vanishes as required for the \(\{u,\,\bar{u}\}\) self-conjugate sextets (the U being “u-phobic” at the GUT scale), with

$$\begin{aligned} \begin{array}{c} {Q_U^\mathrm{gut}= \,Q-2(B-L):}\ \, \left\{ \begin{array}{ll} Q_U^{\,p}=\ \, Q_U^{\,d}\,=\,-\,1, &{}\ Q_U^{\,\nu }=2,\\ Q_U^{\,n}= 2\,Q_U^{\,d}=\,-\,2,&{}\ Q_U^{\,e}=1,\\ Q_U^{\,u}=0. \end{array}\right. \\ \end{array}\nonumber \\ \end{aligned}$$
(39)
Fig. 2
figure 2

Two of the diagrams responsible for \(p\rightarrow \pi ^0 e^+\) in a \({SU}(5)\times U(1)_F\) theory, showing the values of \(Q_U=Q-\lambda (B-L) \,+Q_\mathrm{dark }\) and illustrating its conservation. \(\lambda =2\) at the GUT scale (with \(Q_U=0\) for u and \(X^{4/3}\)), down to 1.64 at low energy. \(Q_U= 1-\lambda \simeq -.64\) for both the proton and the positron emitted in its decay

4.2 \(Q_U\) at low energy

Below the grand-unification scale expressions (35), (37)–(39) of \(Q_U\) get modified as in (15) according to

$$\begin{aligned} Q_U = Q\,-\,\lambda \,(B-L)\,+\,Q_{U\,\mathrm dark}, \end{aligned}$$
(40)

down to (33),

$$\begin{aligned} Q_U \simeq \,Q\,-\,1.64\,(B-L)\,=\left\{ \begin{array}{ll} p: \ -.64\,;&{}\nu :1.64, \\ n: -\,1.64\,;&{} e:\, 0.64. \end{array}\right. \end{aligned}$$
(41)

\(\lambda =5/(4\cos ^2\theta )\) in (40) decreases from 2 at the GUT scale to \(\simeq 1.64\) at low energy, with \(\sin ^2\theta \simeq .238\,\).

This is in agreement with the conservation of \(Q_U\) (following from those of Q and \(B-L\)) by all interactions, strong, electroweak, and grand-unified, in the massless U limit, with

$$\begin{aligned} Q_U\,=\,\left\{ \begin{array}{ccc} W^\pm \ : &{} \ \pm \,1, \\ Y^{\pm 1/3}: &{} \pm \,(Q_U^{\,u}+Q_U^{\,d}) &{}= \,\pm \,{ \displaystyle \frac{1-2\lambda }{3}}, \\ X^{\pm 4/3}: &{} \pm \,2\,Q_U^{\,u}&{}= \,\pm \,{\displaystyle \frac{4-2\lambda }{3}}, \end{array} \right. \end{aligned}$$
(42)

and

$$\begin{aligned} Q_U\,=\,\left\{ \begin{array}{cccccc} u: &{} \ \ \ {\displaystyle \frac{2-\lambda }{3}}, &{}\ &{} p:&{} 1-\lambda ,\\ d : &{} \ \ {\displaystyle \frac{-1-\lambda }{3}}, &{}\ &{} n:&{} \ -\lambda ,\\ \nu : &{}\ \ \ \ \ \lambda ,\\ e: &{} \ \ \ \lambda -1, \end{array} \right. \end{aligned}$$
(43)

where \(\lambda =5/(4\cos ^2\theta )\simeq 1.64\) at low energy. The conservation of \(Q_U\) by all interactions, including those induced by \(X^{\pm 4/3}\) and \(Y^{\pm 1/3}\) exchanges that could be responsible for proton decay, still unobserved [28], is illustrated in Fig. 2.

Note that \(\epsilon =\tan \chi \simeq (g'' /g')\cos \theta \) is already present at the grand-unification scale. As \(C^\mu \) is decoupled from the visible sector for \(g''=0\,\) (or \(\tan \chi =0\)),Footnote 2 \(\epsilon =\tan \chi \) may be viewed as a measure of the coupling between the visible and hidden sectors. Furthermore a non-vanishing \(\epsilon =\tan \chi \) cannot be generated just from quantum corrections once \(g''=0\,\) so that \(\epsilon \) vanishes at tree level, with \(C^\mu \) interacting exclusively with the hidden sector, being decoupled from the visible one.

5 The U as a dark photon also coupled to B and L

5.1 Another orthogonal basis for neutral gauge fields

The U current has been obtained from the mixing (7) and (8) between the extra-U(1) current \(J^\mu _F\) and the standard weak neutral current \(J^\mu _\mathrm{sm}\), providing, in the small \(m_U\) limit, a vector current \(J^\mu _U\) including a contribution \(\propto J^\mu _\mathrm{em}\), with additional contributions from \(B,\,L\) and dark-matter currents.

This result may also be described in a complementary way, by constructing the same current \(J^\mu _U\) from the electromagnetic current \(J^\mu _\mathrm{em}\) combined with the extra current \(J^\mu _d\) in the hidden sector (as in the specific “dark photon” case), but also in general with the baryonic and leptonic currents \(J^\mu _B\) and \(J^\mu _{L_i}\). Indeed the weak-hypercharge current \(J^\mu _Y\) in (1) and (3) may be viewed as coupled to \(B^\mu \) and \(C^\mu \) through the single hatted combination

$$\begin{aligned} {\hat{B}} =\cos \zeta \,B + \sin \zeta \,C,\quad \hbox {with}\ \tan \zeta = g''/g'. \end{aligned}$$
(44)

The doublet \(\varphi \), with \(F=Y=1\), interacts with \({\hat{B}}\) with the coupling constant \({\hat{g}}'=\sqrt{g'^2+g''^2}\,\). It does not interact with the orthogonal combination \({\hat{C}}\), which remains massless at this stage. \(<\varphi >\,\) generates a spontaneous breaking of \({SU}(2)\times U(1)_{{\hat{Y}}} \rightarrow U(1)_{\hat{\mathrm{QED}}}\), leaving also \({\hat{A}}\) massless. With

$$\begin{aligned} \tan {\hat{\theta }}\,=\,{\hat{g}}'/g=\sqrt{g'^2+g''^2}/g\ \end{aligned}$$
(45)

this leads one to define the orthonormal basis

(46)

with \({\hat{\theta }}\) defined by

$$\begin{aligned} \tan {\hat{\theta }}\,=\,{\hat{g}}'/g\,=\,\sqrt{g'^2+g''^2}/g. \end{aligned}$$
(47)

We can also relate the two orthonormal basis in (4) and (46), which have \(Z={\hat{Z}}\) in common, by writing, for the fields in the orthogonal plane,

$$\begin{aligned} \left\{ \begin{array}{ccr} {\hat{A}}&{}=&{}\cos \chi \,A+\sin \chi \, U,\\ {\hat{C}}&{}=&{}-\sin \chi \,A+\cos \chi \, U. \end{array}\right. \end{aligned}$$
(48)

The angle \(\chi \) between the \({\hat{A}}\,\) field in (46) and the actual photon field A in (4) is obtained from the scalar products

$$\begin{aligned} \cos \chi = A. {\hat{A}}= U.{\hat{C}},\quad \sin \chi = -\,A.{\hat{C}}={\hat{A}}.U, \end{aligned}$$
(49)

so that

$$\begin{aligned} \left\{ \begin{array}{ccccc} \cos \chi &{}=&{} g'\,{\displaystyle \sqrt{\frac{g^2+g'^2+g''^2}{(g^2+g'^2)(g'^2+g''^2)}}} &{}=&{} {\displaystyle \frac{g'{\hat{g}}_Z}{{\hat{g}}'g_Z}}, \\ \sin \chi &{}=&{} g g''/ {\displaystyle \sqrt{(g^2+g'^2)(g'^2+g''^2)}}&{}=&{} {\displaystyle \frac{g\, g''}{{\hat{g}}'g_Z}}, \\ \tan \chi &{}=&{} gg''/ g'{\displaystyle \sqrt{g^2+g'^2+g''^2}}&{}=&{} {\displaystyle \frac{g \,g''}{ g'{\hat{g}}_Z}}, \end{array} \right. \end{aligned}$$
(50)

giving back in particular (10) and (29) for \(\epsilon =\tan \chi \,\).

Fig. 3
figure 3

Mixing with the photon”, in the plane orthogonal to Z. \(\varphi \) interacts with \(B^\mu \) and \(C^\mu \) through \({\hat{B}}^\mu =\) \(\cos \zeta \,B^\mu +\sin \zeta \,C^\mu \), leaving \({\hat{C}}^\mu =-\sin \zeta \,B^\mu +\cos \zeta \,C^\mu \) massless. \(U^\mu \) is a (small) mixing of \({\hat{C}}^\mu \) (coupled to \(B,\,L\) and dark matter)  with the photon field \(A^\mu \), \(U^\mu =({\hat{C}}^\mu /\cos \chi ) + \tan \chi \, A^\mu \), leading to \(\mathcal{J}_U^\mu = \,(\mathcal{J}^\mu _{{\hat{C}}}/\cos \chi )+e\tan \chi \,J^\mu _\mathrm{em}\) as in (56) and (59), and earlier in (8)

5.2 The dark photon case

In the specific dark photon case for which \(F=Y+F_d\), standard model particles do not interact with \({\hat{C}}\), only with \(\varvec{W}\) and \({\hat{B}}\) and thus \({\hat{Z}}\) and \({\hat{A}}\). In the small \(m_U\) limit, \({\hat{A}}\) in (46) appears for SM particles as a photon-like field, coupled to them only through their electromagnetic current, with strength

$$\begin{aligned} {\hat{e}}\,=\,g\,\sin {\hat{\theta }}= g\ {\displaystyle \sqrt{\frac{g'^2+g''^2}{g^2+g'^2+g''^2}}}=\,{\displaystyle \frac{g {\hat{g}}'}{{\hat{g}}_Z}}. \end{aligned}$$
(51)

The photon field, expressed as \( A= \cos \chi \,{\hat{A}}-\,\sin \chi \,{\hat{C}}\,\) where \({\hat{C}}\) is in this case uncoupled to SM particles, is coupled electromagnetically to them with the slightly reduced strength

$$\begin{aligned} {\hat{e}} \,\cos \chi = e=gg'/\sqrt{g^2+g'^2}. \end{aligned}$$
(52)

At the same time \(U=\sin \chi \,{\hat{A}}+\cos \chi \,{\hat{C}}\,\) is coupled with the reduced strength \({\hat{e}}\,\sin \chi =e\,\tan \chi \). The A\({\hat{A}}\) angle \(\chi \), i.e., in fact the UA mixing angle, is obtained from

$$\begin{aligned} \cos \chi =\frac{e}{{\hat{e}}}=g'\ {\displaystyle \sqrt{\frac{g^2+g'^2+g''^2}{(g^2+g'^2)(g'^2+g''^2)}}}{=}\,{\displaystyle \frac{g'{\hat{g}}_Z}{{\hat{g}}'g_Z}} {=} \,{\displaystyle \frac{\sin \theta }{\sin {\hat{\theta }}}},\nonumber \\ \end{aligned}$$
(53)

as found in (50) in the general case. U is here coupled to SM particles only through their electromagnetic current, with strength \({\hat{e}}\, \sin \chi =e\,\tan \chi \), i.e.,

$$\begin{aligned} \epsilon e= e\tan \chi =g''{\displaystyle \frac{g^2}{g^2+g'^2} \sqrt{\frac{g^2+g'^2}{g^2+g'^2+g''^2}}} = g''\cos ^2\theta \cos \xi ,\nonumber \\ \end{aligned}$$
(54)

as directly found in (10) from the mixing of \(J^\mu _F\) with \(J^\mu _{Z_\mathrm{sm}}\).

5.3 Recovering in a non-orthogonal basis the general expression of the U current

The \({\hat{A}}/A/U\) mixing in (46) provides an interpretation for the fact that the combination between the \(J^\mu _F\) and \(J^\mu _{Z_\mathrm{sm}}\) neutral currents provides a U coupling to SM particles through the electromagnetic current. But is it possible to extend this interpretation to the general situation involving also B and L in the U couplings?

To address this question, we express from (48) the U field in terms of the (non-orthogonal) A and \({\hat{C}}\) fields defined in (4) and (46), as

$$\begin{aligned} U^\mu \,=\,\frac{{\hat{C}}^\mu }{\cos \chi }+ \tan \chi \, A^\mu . \end{aligned}$$
(55)

Its equation of motion reads

(56)

With

$$\begin{aligned} \mathcal{J}^\mu _{{\hat{C}}}= & {} -\sin \zeta \,\mathcal{J}^\mu _Y +\cos \zeta \,\mathcal{J}^\mu _F\nonumber \\= & {} \cos \zeta \ {\displaystyle \frac{g''}{2}}\,\left( \alpha J^\mu _B+\beta _i\,J^\mu _{L_i}+J^\mu _d\right) , \end{aligned}$$
(57)

the U(1) gauge field \({\hat{C}}\) is now coupled, not just to the dark-matter current as in the pure “dark photon” case, but to a combination of it with the B and L currents.

Using the identityFootnote 3

$$\begin{aligned} \frac{\cos \zeta }{\cos \chi } =\,{ \displaystyle \sqrt{\frac{g^2+g'^2}{g^2+g'^2+g''^2}}}\,=\,\cos \xi , \end{aligned}$$
(58)

and \(g''\,\cos \xi \cos ^2\theta =e\tan \chi \), we recover from (55) to (57) expression (8) of \({ \mathcal J}_U^\mu \), or equivalently

$$\begin{aligned} \mathcal{J}_U^\mu \,=\,\epsilon \, e \,\left( J^\mu _\mathrm{em}+\frac{1}{2\,\cos ^2\theta }\,(\alpha \,J^\mu _{B}+\beta _i\,J^\mu _{L_i}+J_d^\mu )\,\right) . \end{aligned}$$
(59)

This also reads, in a grand-unified theory,

$$\begin{aligned} \mathcal{J}_U^\mu \,=\,\epsilon \, e \,\left( J^\mu _\mathrm{em}-{\displaystyle \frac{5}{4\,\cos ^2\theta }}\ J^\mu _{B-L}+ {\displaystyle \frac{1}{2\,\cos ^2\theta }} \ J_d^\mu \,\right) , \end{aligned}$$
(60)

in agreement with expression (33) of \(Q_U\), with \(\sin ^2\theta \,\simeq .238\,\) as appropriate for low energies.

The presence of the electromagnetic current, but also of the B and L currents, first found from the mixing (5) and (7) between \(J^\mu _F\) and \(J^\mu _{Z_\mathrm{sm}}\) (cf. Fig. 1) [5, 6], may be interpreted by expressing \(U^\mu \) in terms of the non-orthogonal \(A^\mu \) and \({\hat{C}}^\mu \) fields as in (55) (cf. Fig. 3). \(\mathcal{J}_U^\mu \) is then obtained in terms of \(J^\mu _\mathrm{em}\) and \(J^\mu _{d}\), \(J^\mu _B\) and \(J^\mu _{L_i}\) as in (56)–(60). The specific case of a “dark photon” coupled proportionally to electric charges, simple to discuss and often used as a benchmark model, appears to be too restrictive, and the possible couplings of the U to the B and L currents should be taken into account as well.

5.4 “Kinetic mixing” as the effect of a description in a non-orthogonal field basis

As a side remark, the notion of “kinetic mixing”, popular now, has been used nowhere. It goes without saying that, as in any  theory (including the standard model itself), using a non-orthogonal rather than an orthonormal field basis would introduce in the expression of the Lagrangian density non-diagonal terms, now often referred to  as “kinetic-mixing” terms. These can always be eliminated by returning to an orthonormal basis, without any loss of physical content. Furthermore, and in contrast with a general belief, it is not necessary that these “kinetic-mixing” terms be gauge invariant, provided of course the complete Lagrangian density is invariant, independently of the basis in which it is expressed.

To illustrate this let us rewrite the Lagrangian density defined in terms of \(B^\mu \) and \(C^\mu \) as in (1)–(3), in the non-orthogonal basis (\({\hat{B}}^\mu ,\,C^\mu \)), or (\(B^\mu ,{\hat{C}}^\mu \)), represented in Fig. 3, using from (44) and (46)

$$\begin{aligned} B^\mu =({\hat{B}}^\mu /\cos \zeta )-\tan \zeta \,C^\mu . \end{aligned}$$
(61)

We get

$$\begin{aligned} \begin{array}{ccl} \mathcal{L}_k&{}=&{}-\,{\displaystyle \frac{1}{4}}\, \left( W^{\mu \nu }_{\,3}\,W_{3\,\mu \nu }+ B^{\mu \nu }B_{\mu \nu }+ C^{\mu \nu }C_{\mu \nu }\right) \\ &{}=&{}-\,{\displaystyle \frac{1}{4}}\ W^{\mu \nu }_{\,3}\,W_{3\,\mu \nu } \\ &{}&{} -\,{\displaystyle \frac{1}{4} \,\frac{1}{\cos ^2\zeta }}\left( {\hat{B}}^{\mu \nu }{\hat{B}}_{\mu \nu }+ C^{\mu \nu }C_{\mu \nu }- 2\sin \zeta \, {\hat{B}}^{\mu \nu }C_{\mu \nu }\right) \\ &{}=&{}-\,{\displaystyle \frac{1}{4}}\ W^{\mu \nu }_{\,3}\,W_{3\,\mu \nu } \\ &{}&{} -\,{\displaystyle \frac{1}{4} \,\frac{1}{\cos ^2\zeta }}\left( B^{\mu \nu }B_{\mu \nu }+ {\hat{C}}^{\mu \nu }{\hat{C}}_{\mu \nu }+2\sin \zeta \, B^{\mu \nu }{\hat{C}}_{\mu \nu }\right) . \end{array}\nonumber \\ \end{aligned}$$
(62)

This is immediately rediagonalized by returning to the orthogonal fields \(B^\mu \) and \(C^\mu \) (or \({\hat{B}}^\mu \) and \({\hat{C}}^\mu \)).

The same kinetic terms (62) in \(\mathcal L\) may be reexpressed in other non-orthogonal basis, involving different non-diagonal “kinetic-mixing” terms, this time not even gauge invariant. With

$$\begin{aligned} \begin{array}{rcc} U^\mu &{}=&{}{\displaystyle \frac{{\hat{C}}^\mu }{\cos \chi }}\,+ \,\tan \chi \ A^\mu ,\\ \hbox {or}\ \ \ \ A^\mu &{}=&{}{\displaystyle \frac{{\hat{A}}^\mu }{\cos \chi }}\, - \tan \chi \ U^\mu , \end{array} \end{aligned}$$
(63)

as in (55) one has

$$\begin{aligned} \mathcal{L}_k= & {} -\,{\displaystyle \frac{1}{4}}\, \left( Z^{\mu \nu }\,Z_{\mu \nu }+ A^{\mu \nu }A_{\mu \nu }+ U^{\mu \nu }\,U_{\mu \nu }\right) \nonumber \\= & {} -\,{\displaystyle \frac{1}{4}}\ Z^{\mu \nu }\,Z_{\mu \nu }\nonumber \\&-\,{\displaystyle \frac{1}{4} \,\frac{1}{\cos ^2\chi }}\left( A^{\mu \nu }A_{\mu \nu }+ {\hat{C}}^{\mu \nu }{\hat{C}}_{\mu \nu }+2\sin \chi \, A^{\mu \nu }{\hat{C}}_{\mu \nu }\right) , \nonumber \\= & {} -\,{\displaystyle \frac{1}{4}}\ Z^{\mu \nu }\,Z_{\mu \nu } \nonumber \\&-\,{\displaystyle \frac{1}{4} \,\frac{1}{\cos ^2\chi }}\left( {\hat{A}}^{\mu \nu }{\hat{A}}_{\mu \nu }{+} U^{\mu \nu }U_{\mu \nu }{-}2\sin \chi \, {\hat{A}}^{\mu \nu }U_{\mu \nu }\right) , \end{aligned}$$
(64)

where the “kinetic-mixing” terms, which no longer involve abelian gauge fields only, are not gauge invariant.

The mixing angles between \({\hat{C}}^\mu \) and \(A^\mu \) in (64), or \({\hat{C}}^\mu \) and \(B^\mu \) in (62) are geometrically related by

$$\begin{aligned} \sin \chi =-\,{\hat{C}}.\,A=\cos \theta \, (\,\underbrace{-\,{\hat{C}}.B}_{{\displaystyle \sin \zeta }}\,)-\,\sin \theta \ \underbrace{{\hat{C}}.\,W_3}_{{\displaystyle 0}}, \end{aligned}$$
(65)

thanks to the orthogonality between \({\hat{C}}\) and \(W_3\) (cf. Fig. 3), i.e.,

$$\begin{aligned} \sin \chi = \sin \zeta \cos \theta , \end{aligned}$$
(66)

easily verified from \(\sin \zeta \cos \theta = (g''/{\hat{g}}') \,(g/g_Z)= \sin \chi \).

Altogether there is no real gain in considering such non-diagonal kinetic terms, immediately rediagonalized by returning to the original expressions. Considering kinetic-mixing terms i.e., using non-orthogonal field basis simply appears as a substitute for the introduction of the appropriate couplings of \(C^\mu \) in the covariant derivative (1), involving both the visible and the hidden sectors and leading to the corresponding current \(\mathcal{J}^\mu _F\) in (3). This should not hide that \(C^\mu \) may be coupled to B and L as well as to Y and dark matter, and the U boson to a combination of the electromagnetic with the \(B,\,L\), and dark-matter currents.

In addition, a non-vanishing mixing angle \(\chi \), and coupling \(\epsilon =e\,\tan \chi \) (also relating the visible and hidden sectors), corresponding to \(U^\mu = ({\hat{C}}^\mu /\cos \chi )+\tan \chi \,A^\mu \) as in (55), may be obtained directly even in the presence of a single U(1) gauge group, as in a \({SU}(5)\times U(1)_F\) gauge theory, with the visible and hidden sectors getting totally decoupled for \(\epsilon =\chi =\zeta =\xi =0\).Footnote 4 These non-vanishing \(\epsilon \) and \(\chi \) are obtained here in spite of the non-existence of a gauge-invariant kinetic-mixing term between non-abelian (SU(5)) and abelian (\(U(1)_F\)) gauge fields.

6 Implications for a light U

6.1 Axial couplings are strongly constrained

The axial couplings of the U should satisfy

(67)

for \(m_U\) larger than a few MeVs, expressing that , to avoid too large parity-violation effects in atomic physics [4]. A light U with axial couplings (\(1^+\)) could also be produced in a longitudinal polarization state with enhanced effective pseudoscalar couplings to quarks and leptons [1],

$$\begin{aligned} f_{ql}^P=\,f_{ql}^A\ \ \frac{2\,m_{ql}}{m_U}, \end{aligned}$$
(68)

much like a \(0^+\) pseudoscalar a. The resulting constraints on the axial couplings to heavy quarks, from \(\psi \) or \(\Upsilon \rightarrow \gamma \,U \) and \(K^+\rightarrow \gamma \,U \) decays [3], are rather severe, especially for a light U with invisible decay modes into \(\nu \bar{\nu }\) or LDM particles. In particular the axial couplings to down quarks and charged leptons, universal when they get masses from the same doublet v.e.v. (as in supersymmetric theories) must then verify [10, 29, 30]

(69)

(which is typically for a U in the \(\sim 10\) MeV mass range). This leads, in such cases where axial couplings may occur, one to consider a \(U(1)_F\) symmetry broken at a scale larger than electroweak through a large singlet v.e.v., very much as for an “invisible” axion [2].

The U lifetime may vary considerably between less than \(10^{-15}\) s to many years and even infinity, depending on its mass and couplings. The decay rate for \(U\rightarrow e^+e^-\) is given by [1]

$$\begin{aligned} \Gamma _{ee}\simeq & {} \displaystyle \frac{1}{12\pi }\ \left[ \,(f_e^V)^2\, \left( 1+\frac{2m_e^2}{m_U^2}\right) \,\sqrt{1-\frac{4m_e^2}{m_U^2}}\,\right. \nonumber \\&+\, \displaystyle \left. \,(f_e^A)^2 \, \left( 1-\frac{4m_e^2}{m_U^2}\right) ^{3/2}\ \right] m_U, \end{aligned}$$
(70)

including the phase-space factors \(\frac{3}{2}\,\beta -\frac{1}{2}\,\beta ^3\) and \(\beta ^3\) for the vector and axial production of spin-\(\frac{1}{2}\) particles, with \(\beta =\sqrt{1-4m_e^2/m_U^2}\,\). We also have to take into account the invisible decay modes of the U into ordinary neutrinos, given for three left-handed neutrinos with chiral couplings \(f_\nu \) by

$$\begin{aligned} \Gamma _{\nu _L\,\overline{\nu _L}} \ \simeq \, \frac{f_\nu ^2}{8\pi } \ m_U, \end{aligned}$$
(71)

and other possible invisible decays into right-handed neutrinos and light dark-matter particles, which could decrease significantly the branching ratio for \(U\rightarrow e^+e^-\).

6.2 Vector couplings should not be too large

Let us thus return to a light U vectorially coupled to SM particles, as is the case when a single doublet \(\varphi \) contributes to the electroweak breaking (or several but with the same gauge quantum numbers) [5, 6]. Couplings proportional to a combination of \(B,\,L_i\) and Q as in (9), (40), and (60), rather than just Q, open new possibilities for experimental detection [29]. Experimental results, usually discussed in the \((\log \epsilon , m_U)\) plane, should also be considered in terms of these couplings \(f=\epsilon e\,Q_U\), through the changes in the couplings to SM particles,

$$\begin{aligned} \epsilon e\,Q\ \rightarrow \ f\,=\,\epsilon e\,Q_U =\, \epsilon e\ (Q+\lambda _B B+\lambda _{L_i} L_i), \end{aligned}$$
(72)

leading to \( f_n+f_{\nu _e} = f_p+f_e \) as in (13), and more specifically

$$\begin{aligned} \epsilon e\,Q\ \rightarrow \ \epsilon e \,Q_U = \epsilon e \ (Q- \lambda (B-L)), \end{aligned}$$
(73)

with \(Q_U(e)=\lambda -1\simeq .64\) in a grand-unified theory. This usually results in moderate shifts of the various limits when expressed in terms of \(\log \epsilon \); e.g. for an experiment sensitive to the couplings to the electron, through the change

(74)

The U boson should interact sufficiently weakly with electrons, so that its contribution to the electron anomaly,

$$\begin{aligned} \delta _U a_e\,\simeq \,\frac{f_e^2}{12\pi ^2}\ \frac{m_e^2}{m_U^2}\simeq \,\frac{\alpha }{3\pi }\ [\,\epsilon \,Q_U(e)\,]^2\,\frac{m_e^2}{m_U^2} \end{aligned}$$
(75)

for \(m_U\) larger than a few MeVs, be less than about \(3\ 10^{-12}\) [31] (improved over the earlier \(2\ 10^{-11}\) leading to (MeV) [10]). This requires

(76)

or

(77)

applicable for \(m_U\) above a few MeVs.

The U should also act sufficiently weakly with protons, so that the \(\pi ^0 \rightarrow \gamma \,U\) decay rate be sufficiently small. The corresponding branching ratio is given, for \(m_U\) somewhat below \(m_{\pi ^\circ }\), by \(2\,(2f_u+f_d)^2/e^2 =2\, \epsilon ^2 \,Q_U(p)^2\), replacing the \(2\,\epsilon ^2\) of the pure dark photon case, typically constrained to be for \(m_U\) in the 10–100 MeV range [32]. This experiment provides similar limits for a U boson decaying into \(e^+e^-\), with the replacement

(78)

leading to

(79)

with \(Q_U(p) = 1-\lambda \simeq \,-.64\) in a grand-unified theory.

For a boson with significant invisible decays into neutrinos or light dark-matter particles, we have, from the search for the decay \(\pi ^0\rightarrow \gamma +U_\mathrm{\, inv.}\) [33,34,35], with a branching ratio fixed by \(2\epsilon ^2\, Q_U(p)^2 < \,3.3\ 10^{-5}\) for \(m_U< \) 120 MeV, the limit

(80)

We also have, from a low-\(q^2\ \nu _e\)e scattering experiment [36], the constraint \(|f_{\nu _e} f_e|/m_U^2\) [4, 8],  i.e.,

(81)

valid for \(m_U\) larger than a few MeV (or otherwise), also expressed as

(82)

If \(f_e, f_p\), and \(f_\nu \) are all small as suggested by the above constraints, \(f_n\) may have to be small as well, as a result of (13).

For an anomaly-free theory the currents may be constructed from Q, \(B-L\) (with \(\nu _R\)’s) and \(L_i-L_j\), with the dark-matter current, also vectorial, involving spin-0 or Dirac spin-1/2 dark-matter particles. With a family-independent symmetry \(Q_U\) involves \(B-L\) as in (15), \(Q_U = Q\) \(-\,\lambda \,(B-L)\) \( +\,Q_{U\,\mathrm dark}\), as found in a grand-unified theory, implying

$$\begin{aligned} f_e =-\,f_p= (\lambda -1)\,\epsilon e,\ f_\nu =-f_n=\lambda \, \epsilon e, \end{aligned}$$
(83)

in agreement with (13). More specifically the constraints (81) and (82) read (10 MeV) and typically imply, in a grand-unified theory with \(\lambda \simeq 1.64\) so that \( \sqrt{\left| \lambda (\lambda -1)\right| } \simeq 1\),

(84)

for \(m_U\) larger than a few MeVs.

Still it may be possible, although at the price of elegance, to arrange for \(\nu _e\), or \(\nu _e\) and \(\nu _\mu \), not to interact with U, e.g. through the change \(B-L\rightarrow B-3L_\tau \), leading to

$$\begin{aligned} Q_U = \,Q-\lambda \,(B-3L_\tau ) +Q_{U\,\mathrm dark}, \end{aligned}$$
(85)

still in an anomaly-free theory. More specifically

$$\begin{aligned} Q_U\simeq \,Q-(B-3L_\tau )+Q_{U\,\mathrm dark}\, \end{aligned}$$
(86)

would lead to \(Q_U(n)\simeq Q_U(e) \simeq Q_U(\mu ) \simeq -1\), \(Q_U(p)\simeq \) \(Q_U(\nu _e)\simeq Q_U(\nu _\mu )\simeq 0\), i.e.,

$$\begin{aligned} f_n\simeq f_e\simeq \,-\,\epsilon \,e,\ \ \hbox {with } \ f_p,\,f_{\nu _e},f_{\nu _\mu }\ \ \hbox {very small.} \end{aligned}$$
(87)

The U couplings in (9) offer new opportunities for tentative interpretations of possible anomalies. The decays of excited states of \(^8\)Be have long been viewed as potentially sensitive to the production of light spin-1 U bosons and anomalous production of \(e^+e^-\) pairs that could signal such decays have already been reported [37], although these first indications were not confirmed. A possible new anomaly has been found recently [38], which remains to be better understood before attempting at an interpretation [39, 40]. It does not seem that it can be attributed to a dark photon coupled proportionally to electric charges, which would require a too large \(\epsilon \). Equations (8) and (9) of the U charge and current [5, 6] may help provide an interpretation if the effect is real, possibly with a U interacting more strongly with neutrons and electrons than with protons and neutrinos, keeping in mind relations (11)–(13) associated with a conserved \(Q_U\) in the massless limit.

7 Conclusions

Expression of \(Q_U\) from \(Q,\ B\) and L

The structure presented here for the interactions of a light U boson depends on a very small number of relevant parameters, especially \(\epsilon \) and \(m_U\) with \(Q_U\) expressed as \(Q+\lambda _B B+\lambda _i\,L_i\ [+\,Q_d\,]\,\) or \(Q-\,\lambda \,(B-L)\ [+\,Q_d\,]\,\). It provides a consistent framework to deal with a new interaction, naturally parity-conserving in the visible sector and coupled to a conserved charge \(Q_U\), in the small mass limit. The U may be viewed as mixed with the Z, or mixed with the photon, or both at the same time with a \(3\times 3\,\) mixing matrix extending the \(2\times 2\,\) electroweak one of the standard model.

Kinetic mixing as an effect of a non-orthogonal field basis

What is often referred to as “kinetic mixing” simply corresponds to choosing a description in a non-orthogonal field basis. This also implies that the kinetic-mixing terms associated with this basis are not even required to be gauge invariant, in contrast with a common belief. Furthermore, even with the U viewed as kinematically mixed with the photon, B and L contributions are generally allowed in its couplings, and may even be required as in the case of grand unification.

A non-vanishing \(\epsilon =\tan \chi \,\) within grand unification

The construction is compatible with grand unification, with a charge \(Q_U\) commuting with the electrostrong symmetry between the photon and gluons at the GUT scale, and a non-vanishing \(\epsilon =\tan \chi \,\) already present at the classical level. This occurs in spite of the fact that the \({SU}(5)\times U(1)_F\) gauge group includes a single abelian factor \(U(1)_F\), so that no gauge-invariant kinetic term mixing the SU(5) and \(U(1)_F\) gauge fields may be written. Still it is possible to view the U as (“kinematically”) mixed with the photon as in (55) and (63), with non-gauge-invariant mixing terms in the Lagrangian density as in (64), and a U boson also coupled to the B and L currents.

\(Q_U\) from the \({SU}(4)_\mathrm{es }\times U(1)_U\) electrostrong symmetry

\(Q_U\) evolves, from \(Q-2\,(B-L) + Q_{U\,\mathrm dark}\) at the GUT scale, to

$$\begin{aligned} Q_U\,\simeq \ Q-1.64 \,(B-L) + Q_{U\,\mathrm dark} \end{aligned}$$
(88)

at low energy. This expression, and the more general one of \(Q_U\simeq \,Q-\lambda \,(B-L) + Q_{U\,\mathrm dark}\), motivated by grand unification and by anomaly-cancellation, may be used to display the experimental constraints in the \((\log \epsilon ,\,m_U)\) plane as a function of \(\lambda \,\).

The U current in the visible sector is purely vectorial in the massless U limit, in relation with the fact that the theory may admit at high energy an extended \(U(4)= {SU}(4)_\mathrm{es}\times U(1)_U\) electrostrong symmetry which preserves parity, with the U, photon, gluons, and \(X^{\pm 4/3}\) bosons all coupled to vector currents. The interactions mediated by the Z and the electrostrong quartet \((Y^{\mp 1/3}, \, W^\pm )\), on the other hand, violate parity.

A large variety of interesting effects may occur, in particular for a U in the \(\approx \) MeV to hundred MeV mass range. The U boson, if extremely light or massless, may also lead to a new long range force, extremely weak, which could become manifest through apparent violations of the equivalence principle. This will soon be tested in space to an increased level of precision. The characteristics of the new interaction mediated by such a light neutral spin-1 U boson may also play a role in shedding light on a possible unification of weak, electromagnetic, and strong interactions.