1 Introduction

Initially proposed in 1977, the Peccei–Quinn theory so far is considered to be the most compelling strong CP problem resolution [1, 2]. In this model a CP-violating phase is dynamically driven to zero, giving rise to a new pseudoscalar particle called axion [3, 4]. A key property of the QCD axion is the linear proportionality between its couplings to the Standard Model particles and the axion mass.

During the last four decades numerous attempts have been made to find a signal of this particle, including both lab searches and astronomical observations [5, 6]. Current constraints show that the QCD axion (in case it exists) must be very weakly interacting and thus is called “invisible”, which forces one to concentrate on the possible indirect detection of this particle [7, 8].

In addition to the QCD axion mechanism, various Standard Model extensions with axion-like particles were proposed [9,10,11]. The main difference to the original model is that ALPs are not restricted to a linear mass-coupling relation. Furthermore, ALPs are considered to be promising dark matter candidates as being both very long-lived and weakly-interacting with the mass unconstrained by their interactions with other particles [12, 13]. During the past few years there is a noticeable increase of interest in the MeV to GeV range [14,15,16,17,18]. Although significant progress has been made recently [19, 20], there still remain untested regions in the parameter space, especially when compared to the low-mass region where astrophysical constraints can be applied.

The ALP-photon coupling has been the subject of numerous studies, approached from both theoretical [14, 21] and experimental perspectives [22, 23]. However, these investigations often overlook the ALP-lepton coupling, despite the fact that it is much less constrained experimentally and thus warrants attention (see, for example, the recent work [24], where the significance of the ALP-photon and ALP-lepton couplings interplay is also emphasized).

At first glance, one might assume that additional interactions would amplify the cross section, thereby imposing even stricter bounds. However, this reasoning falls short when an extra decay channel significantly reduces the ALPs lifetime. Our goal is to refine current analyses by incorporating the ALP-electron interaction and assessing its potential impact on the constraints derived from electron–positron colliders.

In this work we investigate the mass and coupling constraints of ALPs in the MeV to GeV range from 3-photon events in \(e^+e^-\) annihilation. We focus on the kinematical setting of the Belle II experiment. Section 2 provides a general overview of the given formalism with the discussion of couplings, matrix elements and cross sections. Section 3 illustrates the main results and provides predictions for Belle II kinematics and constraints which follow from the calculated processes. Section 4 summarizes our work.

2 ALP formalism

In this work we assume that ALPs in the MeV to GeV mass range couple predominantly to electroweak gauge bosons and electrons, i.e. decay only to visible states and are decoupled from the QCD sector. The last statement is justified by the fact that such coupling is strongly constrained by the searches of flavour-changing processes [25].

The following section provides a short review of the relevant ALP interactions. The parameter space includes three variables - the ALP mass \(m_a\) and its couplings to photons and electrons, which are denoted by \(g_{a \gamma \gamma }\) and \(g_{aee},\) respectively.

In this paper, we do not focus on any specific UV-complete ALP model, and the exact origin of these couplings is unspecified. At the same time, our primary objective is to impose constraints on ALPs using Belle II data and it’s important to note that experimental setups can only detect effective couplings rather than tree-level ones [24]. Therefore, the couplings \(g_{a \gamma \gamma }\) and \(g_{aee}\) must be considered as effective, encompassing both their tree-level values and potential loop-induced effects.

In this framework we detail the calculations of ALP contributions to 2- and 3-photon annihilation of \(e^+e^-\) pairs and the interplay of two couplings.

2.1 Electroweak gauge invariance

The generic gauge-invariant Lagrangian of ALPs interaction with electroweak vector bosons has the form [14]

$$\begin{aligned} {\mathcal {L}}_{a-EW} = -\frac{g_{aBB}}{4}aB^{\mu \nu }\tilde{B}_{\mu \nu } -\frac{g_{aWW}}{4}aW_i^{\mu \nu }\tilde{W}_{i, \mu \nu }, \end{aligned}$$
(1)

a stands for the pseudoscalar ALP field, \(B^{\mu \nu }\) and \(W^{\mu \nu }_i\) refer to the \(U\left( 1\right) \) and \(SU\left( 2\right) \) field tensors, respectively. The corresponding dual pseudotensors are defined in the standard way, \(\tilde{B}_{\mu \nu } = \frac{1}{2}\varepsilon _{\mu \nu \lambda \sigma } {B}^{ \lambda \sigma }\) and similarly for \(W^{\mu \nu }_i,\) while \(g_{a BB}\) and \(g_{a WW}\) are the coupling constants with dimension \(\text{ GeV}^{-1}.\)

After symmetry breaking the effective interaction of ALPs and photons is given by

$$\begin{aligned} {\mathcal {L}}_{a \gamma \gamma } = -\frac{g_{a \gamma \gamma }}{4}aF^{\mu \nu }\tilde{F}_{\mu \nu }, \end{aligned}$$
(2)

where the new coupling constant was defined

$$\begin{aligned} g_{a\gamma \gamma } = g_{aBB} \cos ^2{\theta _W}+g_{aWW} \sin ^2{\theta _W}, \end{aligned}$$
(3)

with \(\theta _W\) being the Weinberg angle.

Of course, the Lagrangian (1) also leads to a non-zero aZZaWW and also \(a\gamma Z\) interactions. However, in the domain of interest corresponding to electron–positron collider with center-of-momentum energy below 10 GeV, these channels are irrelevant, since the production of Z and W bosons is prohibited by energy conservation and can safely be neglected.

The matrix element for the \(a \rightarrow 2 \gamma \) decay shown in Fig. 1 is given by

$$\begin{aligned} M_{a \rightarrow \gamma \gamma } \left( k_1, k_2\right)= & {} -ig_{a \gamma \gamma } k_{1,\kappa }k_{2,\beta } \varepsilon ^{\kappa \beta \mu \nu }\nonumber \\{} & {} \times \epsilon ^*_{\mu }\left( k_1, \lambda _1\right) \epsilon ^*_{\nu }\left( k_2, \lambda _2\right) , \end{aligned}$$
(4)

where \(\epsilon _{\mu }\left( k_1, \lambda _1\right) \) and \(\epsilon _{\mu }\left( k_2, \lambda _2\right) \) are the polarization vectors of the photons with 4-momenta \(k_1,\) \(k_2\) and helicities \(\lambda _1,\) \(\lambda _2,\) respectively. Summing over the final helicities, we obtain

$$\begin{aligned} \sum _f \left| M_{a \rightarrow \gamma \gamma }\left( k_1, k_2\right) \right| ^2 = 2 g^2_{a \gamma \gamma } \left( k_1k_2\right) ^2. \end{aligned}$$
(5)
Fig. 1
figure 1

ALP decay to two photons in the lowest order

The corresponding decay width is then obtained as

$$\begin{aligned} \Gamma _{a \gamma \gamma } = \frac{g^2_{a \gamma \gamma }m_a^3}{64\pi }. \end{aligned}$$
(6)

2.2 Interaction with leptons

We next discuss the ALP-fermion coupling and the corresponding decay rate. The generic interaction of ALPs with fermions is of the form

$$\begin{aligned} {\mathcal {L}}_{aff} = -\frac{g_{aff}}{2m_f} \partial _{\mu } a \bar{f} \gamma ^5 \gamma ^{\mu } f, \end{aligned}$$
(7)

where f stands for the fermion field, \(m_f\) denotes its mass and \(g_{aff}\) is the dimensionless coupling constant.

From \({\mathcal {L}}_{aff}\) it is clear that lepton universality requires the large enhancement of ALP coupling to muon, namely

$$\begin{aligned} g_{a \mu \mu } \approx \frac{m_{\mu }}{m_e} g_{a e e}. \end{aligned}$$
(8)

In this paper we follow Alves and Wiener work [26] and consider ALPs coupled only to electrons in order to avoid effects induced by this enhanced coupling, such as \(\left( g-2\right) _{\mu }\) corrections on the muon anomalous magnetic moment.

At tree level \({\mathcal {L}}_{aff}\) can be equivalently reduced to a pseudoscalar coupling

$$\begin{aligned} {\mathcal {L}}_{aff}= -ig_{aff} a \bar{f} \gamma ^5 f. \end{aligned}$$
(9)

We are interested in the \(a \rightarrow e^+ e^-\) decay shown on Fig. 2, which has the amplitude

$$\begin{aligned} M_{a \rightarrow e^+ e^-} = g_{aee} \bar{u} \left( p_-, s_-\right) \gamma ^5 v\left( p_+, s_+\right) , \end{aligned}$$
(10)

where \(u\left( p_-, s_-\right) \) and \(v\left( p_+, s_+\right) \) are the bispinors describing electron and positron with momenta \(p_{\pm }\) and helicities \(s_{\pm },\) respectively.

Fig. 2
figure 2

ALP decay to the lepton–antilepton pair in the lowest order

In the domain of interest we can assume \(m_e \ll m_a.\) After summing over the final helicities, the squared amplitude for this process is given by

$$\begin{aligned} \sum _f \left| M_{a \rightarrow e^+ e^-} \right| ^2 =2g_{aee}^2m_a^2. \end{aligned}$$
(11)

The corresponding decay width then has the form

$$\begin{aligned} \Gamma _{aee} = \frac{g_{aee}^2m_a}{8\pi }. \end{aligned}$$
(12)

In the absence of interaction with other fields, the total ALP decay width is assumed to consist of two contributions

$$\begin{aligned} \Gamma _a = \Gamma _{aee} + \Gamma _{a \gamma \gamma }. \end{aligned}$$
(13)

2.3 ALP production at \(e^+e^-\) colliders

An ALP contributes to the 2-photon annihilation of \(e^+e^-\) through the diagram shown on Fig. 3. The matrix element is

$$\begin{aligned} M_{e^+e^- \rightarrow \gamma \gamma }= & {} ig_{aee}\frac{\bar{v} \left( p_+, s_+\right) \gamma ^5 u\left( p_-, s_-\right) }{s-m_a^2+im_a \Gamma _a} \nonumber \\{} & {} \times M_{a \rightarrow \gamma \gamma }\left( k_1,k_2\right) , \end{aligned}$$
(14)

with \(M_{a \rightarrow \gamma \gamma }\left( k_1,k_2\right) \) given in Eq. (4).

As a function of \(m_a,\) this cross section is significantly different from zero only in a small region around \(m_a^2=s=4E^2,\) where E denotes the initial electron (positron) energy in the center-of-momentum frame. Thus, for a fixed collider energy, the \(e^+e^- \rightarrow \gamma \gamma \) process is not providing constraints on ALP parameters in a broad \(m_a,\) \(g_{a \gamma \gamma },\) \(g_{aee}\) parameter space in \(e^+e^-\) annihilation. Therefore, in the following we investigate 3-photon final states.

Fig. 3
figure 3

\(e^+e^-\) annihilation into two photons through an intermediate ALP

Figure 4 shows the contribution of ALP-photon coupling resulting in 3-photon events. The corresponding amplitudes are given by

$$\begin{aligned}{} & {} M_{e^+e^- \rightarrow \gamma \gamma \gamma \, \left( ALP_1\right) } = i\frac{H_{e^+e^- \rightarrow \gamma ^* \rightarrow a\gamma } \left( k_1\right) }{K_{23}^2 - m_a^2 + i m_a \Gamma _a}\nonumber \\{} & {} \quad \times M_{a \rightarrow \gamma \gamma }\left( k_2,k_3\right) + \text{ crossed } \text{ terms }, \end{aligned}$$
(15)

where \(H_{e^+e^-\rightarrow \gamma ^* \rightarrow a\gamma }\left( k_i\right) \) stands for \(e^+ e^- \rightarrow a \gamma _i\) amplitude

$$\begin{aligned} H_{e^+e^- \rightarrow \gamma ^* \rightarrow a\gamma } \left( k_i\right)= & {} -ieg_{a \gamma \gamma } \, \varepsilon _{\alpha \beta \mu \gamma } q^{\alpha } k_{i}^\beta \epsilon ^{\gamma }\left( k_i, \lambda _i\right) \nonumber \\{} & {} \times \frac{ \bar{v} \left( p_+, s_+\right) \gamma ^{\mu } u\left( p_-, s_-\right) }{s} , \end{aligned}$$
(16)

with e representing the positron charge and \(q=p_++p_-\) standing for the internal photon 4-momentum. We denote the ALP 4-momenta as \(K_{23} = k_2+k_3.\)

Fig. 4
figure 4

\(e^+e^-\) annihilation into three photons involving the \(g_{a\gamma \gamma }\) coupling. Graphs obtained from these by crossing are not shown, but are evaluated too

It is generally assumed that ALPs are long-lived particles, i.e. their decay width \(\Gamma _a\) is a small quantity, typically much smaller than the experimental resolution of the invariant mass of the two-photon system in which the ALP decays. Thus the integration over the phase space gives the main contribution only in a very small range of variables where the invariant mass of the photon pair produced by the ALP is close to \(m_a^2.\)

In such kinematics the interference terms become unobservable and can be omitted. After the integration over the phase space the total cross section can be represented as a cross section obtained from only Feynman diagrams shown in Fig. 4 multiplied by a factor of three to account for the 3 channels. Thus we obtain

$$\begin{aligned}{} & {} \bar{\sum _i} \sum _f \left| H_{e^+e^- \rightarrow \gamma ^*\rightarrow a\gamma } \left( k_1\right) \right| ^2 \nonumber \\{} & {} \quad = \frac{2e^2g^2_{a \gamma \gamma }}{s^2} \left[ \left( k_1p_+\right) ^2+\left( k_1p_-\right) ^2 \right] \left( p_-p_+\right) , \end{aligned}$$
(17)

where \(\bar{\sum _i} \sum _f\) denotes the average over initial helicities states and the sum over final.

As we conclude this part, it’s important to note that the diagram in Fig. 4 incorporates the ALP-photon coupling twice, but in distinct ways. Specifically, one vertex depicts a Primakoff-like process (with one photon being virtual), while the other corresponds with the ALP decay. Given that these couplings must be viewed as effective, they are differentiated by loop correction and, fundamentally, they are not identical, as emphasized in [27]. However, when the limit \(m_e/m_a \ll 1\) is fulfilled, the difference becomes negligible, making the two effective couplings in good approximation equivalent once again. Further details can be found in Appendix A.

Fig. 5
figure 5

\(e^-e^+\) annihilation into three photons involving the \(g_{aee}\) coupling. Graphs which are obtained by crossing are not shown, but are evaluated too

We next discuss the 3-photon production in \(e^+ e^-\) annihilation which results from the ALP-electron coupling contribution, shown in Fig. 5. The corresponding amplitudes can be expressed as

$$\begin{aligned}{} & {} M_{e^+e^- \rightarrow \gamma \gamma \gamma \, \left( ALP_2\right) } \nonumber \\{} & {} \quad = i\frac{H_{e^+e^- \rightarrow a\gamma ,1}\left( k_1\right) + H_{e^+e^- \rightarrow a\gamma ,2}\left( k_1\right) }{K_{23}^2 - m_a^2 + i m_a \Gamma _a}\nonumber \\{} & {} \qquad \times M_{a \rightarrow \gamma \gamma }\left( k_2,k_3\right) + \text{ crossed } \text{ terms }, \end{aligned}$$
(18)

where \(H_{e^+e^- \rightarrow a\gamma ,j}\left( k_i\right) \) denote amplitudes for the corresponding \(2 \rightarrow 2\) process

$$\begin{aligned} H_{e^+e^- \rightarrow a\gamma ,1} \left( k_i\right)= & {} eg_{aee} \,\epsilon _{\eta }^*\left( k_i, \lambda _i\right) \nonumber \\{} & {} \times \bar{v}\left( p_+, s_+\right) \gamma ^{\eta } \frac{\hat{l}_i}{l_i^2} \gamma ^{5} u \left( p_-, s_-\right) ,\end{aligned}$$
(19)
$$\begin{aligned} H_{e^+e^- \rightarrow a\gamma ,2}\left( k_i\right)= & {} eg_{aee} \, \epsilon _{\lambda }^*\left( k_i, \lambda _i\right) \nonumber \\{} & {} \times \bar{v}\left( p_+, s_+\right) \gamma ^5 \frac{\hat{f}_i}{f_i^2} \gamma ^{\lambda } u \left( p_-, s_-\right) ,\nonumber \\ \end{aligned}$$
(20)

with the internal electron momenta

$$\begin{aligned} l_i = k_i-p_+, \quad f_i = p_- - k_i. \end{aligned}$$
(21)

It is worth noticing that there is no interference between this set of diagrams and the diagrams shown in Fig. 4. Using the same arguments as before, we conclude that for the cross section calculation, we only need to evaluate the two topologies shown in Fig. 5, as

$$\begin{aligned}{} & {} \bar{\sum _i} \sum _f \left| H_{e^+e^- \rightarrow a\gamma ,1}\left( k_1\right) + H_{e^+e^- \rightarrow a\gamma ,2}\left( k_1\right) \right| ^2 \nonumber \\{} & {} \quad = e^2g_{aee}^2 \left( \frac{ p_-k_1}{p_+k_1} +\frac{ p_+k_1}{p_-k_1} + 2 \frac{\left( p_+ K_{23}\right) \left( p_-K_{23}\right) }{\left( p_-k_1\right) \left( p_+k_1\right) } \right) .\nonumber \\ \end{aligned}$$
(22)
Fig. 6
figure 6

Angular distributions of \(e^+e^- \rightarrow \gamma \gamma \gamma \) annihilation for a process with an intermediate ALP in Belle II kinematics for two values of \(m_a\) and two values of \(g_{aee}.\) The value \(g_{a \gamma \gamma } = 10^{-4} \, \text{ GeV}^{-1}\) is used for all curves

2.4 Cross section and observables

The cross section of \(e^+e^- \rightarrow \gamma \gamma \gamma \) process is given by the expression

$$\begin{aligned} \sigma= & {} \frac{1}{3!} \int d_{LIPS} \left( 2\pi \right) ^4 \delta ^{\left[ 4\right] }\left( p_- + p_+ - k_1 - k_2 - k_3\right) \nonumber \\{} & {} \times \frac{1}{2s} \bar{\sum _i} \sum _f \left| M_{e^+e^- \rightarrow \gamma \gamma \gamma }\right| ^2, \end{aligned}$$
(23)

where \(d_{LIPS}\) stands for the Lorentz-invariant phase space of the three final photons

$$\begin{aligned} d_{LIPS} = \frac{d^3k_1}{2\omega _1 \left( 2\pi \right) ^3}\frac{d^3k_2}{2\omega _2 \left( 2\pi \right) ^3}\frac{d^3k_3}{2\omega _3 \left( 2\pi \right) ^3}. \end{aligned}$$
(24)

After the integration with the delta function, the phase space can be expressed as

$$\begin{aligned}{} & {} d_{LIPS} \left( 2\pi \right) ^4 \delta ^{\left[ 4\right] }\left( p_- + p_+ - k_1 - k_2 - k_3\right) \nonumber \\{} & {} \quad = \frac{1}{2^8 \pi ^5} \frac{\omega _1 \omega _2}{2E + \omega _1 \left( \cos {\theta _{12}} - 1 \right) } d\omega _1 d\Omega _1 d\Omega _2, \end{aligned}$$
(25)

with \(\theta _{12}\) denoting the angle between \({\textbf {k}}_1\) and \({\textbf {k}}_2\) momenta. The remaining phase space is parameterized as

$$\begin{aligned} d\Omega _1 d\Omega _2 = 2 \pi d\phi \, d\cos {\theta _{1-}}d\cos {\theta _{2-}}, \end{aligned}$$
(26)

where \(\theta _{i-}\) is the angle between \({\textbf {p}}_-\) and \({\textbf {k}}_i,\) which leads to

$$\begin{aligned} \cos {\theta _{12}} = \sin {\theta _{1-}}\sin {\theta _{2-}} \cos {\phi } + \cos {\theta _{1-}}\cos {\theta _{2-}}. \end{aligned}$$
(27)

Furthermore, in the center-of-momentum frame it holds

$$\begin{aligned} {\left\{ \begin{array}{ll} \omega _1 + \omega _2+\omega _3 = 2E,\\ {\textbf {k}}_1+{\textbf {k}}_2+{\textbf {k}}_3=0, \end{array}\right. } \end{aligned}$$
(28)

allowing to express \(\omega _2\) as

$$\begin{aligned} \omega _2 = \frac{2E\left( E-\omega _1\right) }{2E + \omega _1\left( \cos {\theta _{12}}-1\right) }. \end{aligned}$$
(29)

For the ALP-associated process, the photon which is opposite to the ALP in center-of-momentum frame is denoted by \(k_1.\) In this case, we can remove the integration over \(d\omega _1\) using the definition of the delta function

$$\begin{aligned}{} & {} \frac{1}{\left( K_{23}^2-m_a^2\right) ^2+\left( m_a \Gamma _a\right) ^2} \rightarrow \frac{\pi }{m_a \Gamma _a} \delta \left( K_{23}^2-m_a^2\right) \nonumber \\{} & {} \quad = \frac{\pi }{m_a \Gamma _a} \frac{1}{4E} \, \delta \left( \omega _1 - \frac{4E^2-m^2_a}{4E}\right) . \end{aligned}$$
(30)

Due to the resonant behavior of the amplitude, one photon is always emitted with a fixed energy

$$\begin{aligned} \omega = \frac{4E^2-m^2_a}{4E}. \end{aligned}$$
(31)

We note that, as the branching fraction, represented by the ratio \(\Gamma _{a \gamma \gamma } / \Gamma _a,\) is always less than 1,  the total cross-section of the process under investigation with an intermediate ALP may actually become smaller with a non-zero value of \(g_{aee},\) compared to when this quantity is equal to zero.

Fig. 7
figure 7

QED background distributions for the softest, middle and hardest photons in the \(e^+e^- \rightarrow \gamma \gamma \gamma \) process in Belle II kinematics

After the integration over the full phase space, the cross section of the \(2 \rightarrow 3\) process with the intermediate ALP can be written in the compact form

$$\begin{aligned} \sigma _{e^+e^- \rightarrow \gamma \gamma \gamma } = \sigma _{e^+e^- \rightarrow a \gamma } \times \frac{\Gamma _{a \gamma \gamma }}{\Gamma _a}. \end{aligned}$$
(32)

If \(g_{aee}=0,\) the ALP decays directly to photons and this formula can be simplified further (notably, it is independent of s if \(m_a^2 \ll s\)) as

$$\begin{aligned} \sigma _{e^+e^- \rightarrow a \gamma } = \frac{\alpha g^2_{a \gamma \gamma }}{24} \left( 1 - \frac{m_a^2}{s} \right) ^3, \end{aligned}$$
(33)

where \(\alpha \equiv e^2/4\pi .\)

For a realistic detector, one has to integrate the cross section formula over the phase space, restricted by the experimental setup, as discussed below.

3 Results and discussion

The ALP signal detection strategy can be based on searches for a narrow peak in the squared mass distribution \(m_{\gamma \gamma }\) of photon pairs, or a narrow peak in the photon energies distributions, due to the fact that the photon which accompanies the ALP is always monoenergetic in this process.

If no significant ALP signal is observed, it is possible to constrain ALP parameters in the corresponding mass range.

Fig. 8
figure 8

Left panel: Belle II constraints for \(g_{a\gamma \gamma }\) based on the 2018 data set, with the analytical result shown by the black dashed curve. Right panel: projected results on the \(\left( m_a, g_{a\gamma \gamma }\right) \) reach for the future data collection at Belle II corresponding to \(50 \, \text{ ab}^{-1}\) of integrated luminosity. Existing bounds (filled areas) are taken from [28], which in turn are based on the results of [14, 16, 22, 23, 29,30,31,32,33,34,35,36]

In this section we illustrate our results with exclusion plots for the kinematics of an \(e^+e^-\) collider. For this purpose, we first split the total \(e^+e^- \rightarrow \gamma \gamma \gamma \) cross section into three terms

$$\begin{aligned} \sigma _{ALP+B} = \sigma _{ALP}+\sigma _{B}+\sigma _{int}, \end{aligned}$$
(34)

with \(\sigma _{ALP}\) referring to the ALP-associated process (shown in Figs. 4 and 5), while \(\sigma _{B}\) is the background. The interference term \(\sigma _{int}\) does not contribute since the ALP decay width \(\Gamma _a\) is assumed to be much smaller than the experimental resolution of the invariant mass of the final photon pair.

The dominant part of the background originates from QED 3-photon annihilation [14], i.e. \(\sigma _{B}=\sigma _{QED}.\) The aimed sensitivity is then expressed by the formula

$$\begin{aligned} \frac{\sigma _{ALP}}{\sigma _{QED}} = \frac{N}{\sqrt{L \cdot \sigma _{QED}}}, \end{aligned}$$
(35)

where L denotes the integrated luminosity and N is the number of standard deviations that determines whether or not a fluctuation is considered as a signal. We conventionally set \(N = 2,\) which refers to 95% confidence level.

In our study we neglect the potential hadronic background from \(\pi ^0,\) \(\eta \) and \(\eta '\) mesons. In a complete analysis, however, those contributions must be included. Therefore, the parameter space for \(m_a\) in the vicinity of the \(\pi ^0,\) \(\eta \) and \(\eta '\) masses can be expected to be modified.

The \(e^+e^- \rightarrow a \gamma \rightarrow \gamma \gamma \gamma \) cross section is a function of three variables. For purposes of illustration, we use the available independent constraints for \(g_{aee}\) to show two-dimensional projections of \(g_{a\gamma \gamma }\) as a function of \(m_a.\)

Experimental searches in the MeV to GeV region are mostly focused on ALP-muon interaction [17] and therefore not able to constrain \(g_{aee}.\) However, it is possible to convert constraints on visibly decaying dark photons to limits on the ALP-electron mixing [26]. Indeed, the processes of \(X \rightarrow e^+e^-\) and \(a \rightarrow e^+e^-\) achieve comparable signal strengths in case of \(g_{Xee} \sim g_{aee}.\) This relation, of course, is only approximate, since the two processes have different angular distributions, but using it one can estimate \(g_{aee} \lesssim 10^{-4}\) [17].

In the following, we discuss the reach on \(m_a,\) \(g_{aee}\) and \(g_{a\gamma \gamma }\) which can be obtained from \(e^+e^- \rightarrow \gamma \gamma \gamma \) data that are already available from the Belle II experiment or are expected from future running.

3.1 Belle II kinematics

To obtain the exclusion plots for Belle II kinematics, we start by discussing the detector acceptance.

Belle II is an asymmetric collider, for which electron and positron have energies of \(7\, \text{ GeV }\) and \(4\, \text{ GeV },\) respectively. This requires a boost with a relative velocity \(\beta \approx 0.27\) to the center-of-momentum frame, where particles have energies of \(E = 5.29 \, \text{ GeV }.\) The angular coverage of the electromagnetic calorimeter in the lab frame is \(12.4^{\circ }<\theta < 155.1^{\circ }.\)

The angular region \(37.3^{\circ }<\theta < 123.7^{\circ }\) provides the best energy resolution, avoiding regions close to detector gaps, and offers the lowest beam background levels [22]. Following the work of [14], we set the photon energy selection threshold of \(0.25\, \text{ GeV }\) in the center-of-momentum frame. Our analysis requires all three photons to be in this acceptance range and, unless otherwise specified, these experimental cuts are used for all the plots shown below.

The angular distributions for the ALP process are presented in Fig. 6 for two different values of \(m_a\) and two different values of \(g_{aee}.\) For a given \(g_{aee},\) there is more than an order of magnitude difference between \(m_a = 0.3 \, \text{ GeV }\) and \(m_a = 3 \, \text{ GeV }\) curves due to the fact that for a relatively light ALP the decay width is dominated by the \(a \rightarrow e^+ e^-\) channel, see Eqs. (6) and (12). For the particular case of \(g_{a\gamma \gamma } = 10^{-4} \, \text{ GeV}^{-1}\) and \(g_{aee} = 10^{-4},\) one obtains

$$\begin{aligned}&\frac{\Gamma _{a\gamma \gamma }}{\Gamma _a} \approx 0.01, \quad \text{ for } \ m_a = 0.3 \, \text{ GeV }, \\&\frac{\Gamma _{a\gamma \gamma }}{\Gamma _a} \approx 0.53, \quad \text{ for } \ m_a = 3 \, \text{ GeV }. \end{aligned}$$

3.2 QED background

We next discuss the QED background process. The cross section of leading order QED \(e^+e^-\) annihilation in 3 photons in the massless electron limit is given by [37]

$$\begin{aligned}{} & {} \bar{\sum _i} \sum _f \left| M_{e^+e^- \rightarrow \gamma \gamma \gamma \, \left( QED\right) }\right| ^2 = s \left( 4\pi \alpha \right) ^3 \nonumber \\{} & {} \quad \times \frac{\sum _{i=1}^3\left( p_+k_i\right) \left( p_-k_i\right) \left[ \left( p_+k_i\right) ^2+\left( p_-k_i\right) ^2\right] }{\prod _{i=1}^{3} \left( p_+k_i\right) \left( p_-k_i\right) }. \end{aligned}$$
(36)

For the total cross section an additional factor 1/3! must be added due to the 3 identical bosons in the final state.

Figure 7 shows the corresponding QED background angular and energy distributions. In contrast to the ALP-related process (see Fig. 6), which exhibits a rather uniform angular distribution, the QED three-photon annihilation is characterized by an enhanced angular distribution in both the forward and backward directions. The presence of a distinct peak in the photon energy distribution would serve as an indication of ALP creation.

3.3 Belle II results from 2018 data set

In the 2018 data run Belle II achieved an integrated luminosity of \(445 \, \text{ pb}^{-1}\) [22], which was used for the ALP searches in a simplified way by converting the cross section limit to the coupling using Eq. (33). The latter formula does not take into consideration the fact that all three photons in the ALP-associated process must be detected in the acceptance range of the electromagnetic calorimeter.

We require three resolved photons with energies higher than \(0.65 \, \text{ GeV }\) in the center-of-momentum frame as a criteria for this event selection. These requirements are slightly different from those used in the Belle II report [22], where the selection of photons with energies above \(0.65 \, \text{ GeV }\) (for \(m_a > 4 \, \text{ GeV }\)) and \(1 \, \text{ GeV }\) (for \(m_a \le 4 \,\text{ GeV }\)) in the lab frame was performed. The difference is negligible since \(g_{a \gamma \gamma }\) is sensitive to \(\sigma _{QED}^{-1/4}.\)

Fig. 9
figure 9

Projected results on the \(\left( g_{aee}, g_{a\gamma \gamma }\right) \) reach for three different \(m_a\) hypotheses for the future data collection at Belle II corresponding to \(50 \, \text{ ab}^{-1}\) of integrated luminosity. Existing bounds, based on the Belle II 2018 run, can be found in [24]

Our result based on Eq. (33) is shown on Fig. 8 (left panel) by the black curve. It shows a good agreement with the analysis of [22] in the higher ALP mass region. In the lower mass region some deviations are seen. This is expected because in the case of a light ALP the invariant mass of a photon pair also becomes low, i.e. two photons travel in a very narrow cone with each other, oppositely to the third photon. This produces very asymmetric kinematics and the QED background becomes suppressed. In our analysis we do not take this into consideration, but more detailed investigation can be performed in future work.

3.4 Belle II projection from upcoming data collection

Belle II is expected to reach an integrated luminosity of \(50 \, \text{ ab}^{-1}\) after around 10 years of running. The resulting constraints which such future data would yield were investigated in [14] for the case where ALPs are only coupled to photons (i.e. for \(g_{aee} = 0\)). Figure 8 right panel shows the projected sensitivity in two scenarios: ALPs coupled only to photons and ALPs coupled to photons and electrons with different \(g_{aee}\) coupling strength.

Our results for the scenario \(g_{aee} = 0\) are in reasonably good agreement with the exclusion limits deduced in [14] in the high \(m_a\) region. For lower values of \(m_a\) one expects a similar deviation as for the 2018 Belle II data discussed above. Figure 8 also shows that the inclusion of a non-zero interaction of ALPs with electrons significantly affects the final result, especially in ALP mass range \(m_a \lesssim 2 \, \text{ GeV }.\) The assumption \(g_{aee}=0\) generally leads to an overestimated \(g_{a\gamma \gamma }\) limit, which may be incorrect if the ALP has other decay channels besides the 2-photon mode.

Notably, such an interplay of two couplings becomes relevant only with higher statistics and does not seriously affect Belle II 2018 results. Indeed, at \(m_a \sim 1 \, \text{ GeV },\) \(g_{aee} \sim 10^{-4}\) and \(g_{a\gamma \gamma } \gtrsim 10^{-3} \, \text{ GeV}^{-1}\) the branching ratio can be estimated as:

$$\begin{aligned} \frac{\Gamma _{a\gamma \gamma }}{\Gamma _a} \gtrsim 0.9, \end{aligned}$$

i.e. the decay into electron–positron pair is largely suppressed. However, with the several orders of magnitude increase in integrated luminosity, the reach on the \(\left( m_a,g_{a\gamma \gamma }\right) \) is considerably improved and the opposite becomes true (see the end of the Sect. 3.1 for more details). In such case, more detailed models with additional parameters are required to constrain invisible particles in a more rigorous way.

Figure 9 shows the reach on \(\left( g_{aee}, g_{a\gamma \gamma }\right) \) for three different values of \(m_a,\) which can be achieved with the Belle II upcoming data collection.

4 Conclusion

In this paper we discussed ALPs coupled to electrons and photons in a minimal way. The contributions of ALP states to 2- and 3-photon \(e^+e^-\) annihilation events were calculated. In this way, we obtained new constraints for possible ALPs in the MeV to GeV mass range, which can be tested at \(e^+e^-\) colliders. Results were shown for Belle II kinematics both from existing data and from forthcoming data with projected integrated luminosity of \(50 \, \text {ab}^{-1}.\)

Our results indicate that the \(g_{a \gamma \gamma }\) limits can be vastly affected in the presence of an additional decay mode, especially in the lower \(m_a\) region. The difference between single \(g_{a \gamma \gamma }\) and combined \(g_{a \gamma \gamma }\)/\(g_{aee}\) scenarios is minimal for Belle II 2018 data, but is expected to become substantial with the data currently being collected. Consequently, the constraints on lower mass ALP will be less stringent.

Using current best limits for \(g_{aee},\) it is possible to improve the \(g_{a \gamma \gamma }\) limits by at least an order of magnitude, which allows to significantly narrow down the search area for potential ALPs in the MeV to GeV mass range. This result can be improved further if a better way to constrain \(g_{aee}\) independently is available.

There are many possible ways to further extend this work. First of all, a more precise background modeling and experimental analysis must be performed in order to refine the exclusion plots, especially around the \(\pi ^0,\) \(\eta \) and \(\eta '\) masses. The comparison with the Belle II 2018 data analysis shows that with more detailed background analysis it is possible to even further improve \(g_{a \gamma \gamma }\) constraints in the \(m_a^2 \ll s \) case.

Additionally, in this paper we assumed that ALPs interact only with the electrons and photons. Hidden decay channels were not considered. However, the contribution of light dark matter particles of sub-GeV masses may change the obtained constraints if the pair production threshold is surpassed. At the same time, the inclusion of such particles makes it more complicated to set any constraints, as new free parameters appear.

Finally, we restricted ourselves with ALPs which are not coupled to muons (and taus). However, lepton universality leads to an increase in the coupling constant \(g_{a\mu \mu }\) by around two orders of magnitude compared to \(g_{aee}.\) It may notably affect the results if \(m_a\) is larger than \(2m_{\mu }.\) The parameter space in such case will get additional restrictions from the requirement of compatibility with current \(\left( g-2\right) _{\mu }\) data [38, 39].