1 Introduction

Since the existence of dark matter (DM) is confirmed by many astrophysical observations [1], identifying the properties such as their masses and spins and couplings of the DM is one of the most important tasks of particle physics. The most often considered thermal DM candidate is the weakly interacting massive particle (WIMP), which has the mass around \(\mathcal {O}(100)\) GeV and interacts with Standard Model (SM) particles via the electroweak force. Thus it can be produced directly in collider experiments in principle. The DM signal at colliders can be probed as the momentum imbalance at the detector if it is produced with recoiling against visible objects. Probing the DM signals at colliders could elucidate the particle physics properties of DM without suffering from astrophysical uncertainties thus becomes one of the main object of the current and future colliders.

There are three theoretical frameworks [2] that are used for describing the DM phenomena at the colliders, each has its own advantages and limitations:

  • The DM effective field theory (EFT) [3,4,5] is the low energy approximation of a renormalizable theory after integrating out the heavy particle that mediates the DM-SM particles interactions. The number of free parameters in the EFT is minimal, i.e. only two parameters are relevant for each operator, the coefficient of the effective operator and the DM mass. However, the EFT descriptions of DM interactions are valid only when momentum transfer is much smaller than the mass of the mediator such as in DM direct detection experiments. Meanwhile in the collider experiments, where the momentum transfers can be quite high, the kinematic distributions that are predicted by the underlying UV completion are not correctly captured by the EFT [6,7,8,9,10], especially in the region with light mediator or heavy DM.

  • In the DM simplified model [11,12,13], the DM is neutral under the SM gauge group and interacts with the SM particles via the portal of a single particle. Models in this class usually contain five free parameters: DM mass \(m_\chi \), mediator mass \(m_{\text {med}}\), DM-mediator coupling \(g_\chi \), SM particle-mediator coupling \(g_{\text {SM}}\) and the mediator decay width \(\Gamma _{\text {med}}\). Considering mediators of different masses makes it possible to consider different kinematic distributions that cannot be mapped onto effective operators, thus providing a more general framework for describing the DM phenomena. However, simplified DM models with a single scalar mediator often violate the SM gauge symmetry [14, 15] and perturbative unitarity, thus may become invalid for describing some sort of UV-complete models.

  • There is growing interest in a second generation simplified DM model that respects the SM gauge symmetry and renormalizability [14,15,16,17,18,19,20,21,22,23,24,25]. Among them, the simplest ones are singlet DM extension of SM with Higgs portal. In these models, depending on the DM spin, the gauge invariant DM-SM interactions may require at least two mediators. Even though in the parameter region where only the contribution of one mediator dominates, the prediction of this model coincide with that of the simplified model with a single mediator. We have shown in Ref. [15] that the interference effect between the two mediators can affect the exclusion bounds considerably in some parameter space.Footnote 1 While the models in this class give more realistic predictions regarding a UV completion, there will be ad hoc constraints from many experiments, which may be quite specific and only applicable to certain UV completed models. For example, for a singlet fermion DM extension of SM, the constraints and the prediction of the model with \(h_{\text {SM}}\) + singlet scalar portal are quite different from those of the model with two-Higgs-doublet portal.

All the above frameworks have been widely used in studying DM phenomenology at colliders. Each case contains quite a lot of possible operators/models that describe the nature of DM and its couplings. If any excess is observed within a given theoretical framework, it will be important to ask which operator/model provides the best description, i.e. characterize the DM properties. There are several studies [26,27,28] devoted to distinguishing the DM EFT operators and its spin at the LHC. In the framework of DM simplified model with single mediator, much current work [29,30,31,32,33,34,35,36,37] are mainly focused on distinguishing the spin of the mediator and identifying the coupling forms between the mediator and SM particles. Because here the DM is dominantly produced by the decay of the on-shell mediator, those visible final states do not carry any information of the DM nature.

In this work, based on the gauge invariant and renormalizable DM models with Higgs portal, we will study the fermion DM (FDM) discovery prospects and its spin discriminations against scalar DM (SDM) and vector DM (VDM) at the ILC. A very preliminary study along a similar direction has been given by one of us in Ref. [38], where the detector effects were completely ignored in discussions of the DM discovery and only qualitative arguments were given regarding the spin discrimination. By curing these two problems, we find the hadronic channel of DM production provides a better sensitivity for DM discovery than the leptonic channel. Taking the FDM model as a reference model, the FDM with the coupling in a wide range can be discovered in the hadronic channel when the second mediator is relatively light. In this region, the spin discriminating from SDM is always quite promising, because the SDM model is intrinsically different from the FDM model with only one mediator being introduced. However, the spin discriminating from VDM is much more difficult; it becomes possible only in the region where the coupling between the DM and the second mediator is approaching the perturbative limit.

This paper is organized as follows. In Sect. 2, we briefly describe the renormalizable and gauge invariant Higgs portal DM models for scalar, fermion and vector DM. Their complete Lagrangians as well as the interaction Lagrangians that are relevant to the DM search at collider are provided. Section 3 details the analysis for the DM discovery and the strategy for the DM spin discrimination based on a benchmark scenario. Similar methods are then applied to the leptonic channel of the benchmark scenario and the hadronic channel with different couplings in Sects. 4 and 5, respectively. Then we conclude the work in Sect. 6.

2 Higgs portal DM models

In this section, we define the Higgs portal DM models with SM gauge invariance and renormalizability, where the DMs are scalar, fermion and vector particle, respectively.

The SDM model can be constructed by simply introducing a new scalar S in addition to the SM [39,40,41]

$$\begin{aligned} \mathcal{L}_\mathrm{SDM} = \frac{1}{2} \partial _\mu S \partial ^\mu S - \frac{1}{2} m_{0}^2 S^2 - \lambda _{HS} H^\dagger H S^2 - \frac{\lambda _S}{4 !} S^4, \end{aligned}$$
(2.1)

where H is the SM Higgs doublet and S is assumed to be odd under a \(Z_2\) symmetry and thus becomes a DM candidate. After the electroweak (EW) symmetry breaking \(H \rightarrow (0, (v_h+h)/\sqrt{2})^T\) and assuming \(\langle S \rangle =0\), we can write down the interaction Lagrangian for DM production at the ILC as

$$\begin{aligned} \mathcal {L}^{\text {int}}_{\text {SDM}} = - h \left( \frac{2 m^2_W}{v_h} W^+_\mu W^{- \mu } + \frac{m^2_Z}{v_h} Z_\mu Z^\mu \right) - \lambda _{HS} v_h h S^2. \end{aligned}$$
(2.2)

In this model, the DM can only be pair produced through the SM Higgs (h) mediation.

The simplest Higgs portal singlet FDM model with SM gauge invariance and renormalizability contains a SM singlet Dirac fermion DM \(\chi \) and a real singlet scalar mediator S Footnote 2 in addition to the SM particles [16, 17]:

(2.3)

where the singlet scalar S cannot have direct renormalizable couplings to the SM particles due to the SM gauge symmetry and the singlet Dirac fermion \(\chi \) is assumed to be odd under a \(Z_2\) dark parity \(\chi \rightarrow - \chi \). When both scalar fields H and S develop nonzero vacuum expectation values (VEVs), \(v_h\) and \(v_s\), so that

$$\begin{aligned} H = \left( \begin{array}{c} G^+ \\ \frac{1}{\sqrt{2}} (v_h + h + iG^0) \end{array} \right) , \quad S= v_s +s , \end{aligned}$$
(2.4)

the two scalar fields mix:

$$\begin{aligned} \left( \begin{array} {c} h \\ s \end{array} \right) = \left( \begin{array}{l l} \cos \alpha &{} \sin \alpha \\ -\sin \alpha &{} \cos \alpha \end{array} \right) \left( \begin{array}{c} H_1 \\ H_2 \end{array} \right) , \end{aligned}$$
(2.5)

giving \(H_1\) and \(H_2\) fields in mass eigenstate. The mixing angle can be expressed in terms of parameters in the scalar potential,

$$\begin{aligned} \tan 2\alpha = - \frac{2 \lambda _{HS} v_s v_h + 2 \mu _{HS} v_h}{ 2 \lambda _S v^2_s - \frac{\mu _0^3}{v_s} - \mu _S v_s - \frac{\mu _{HS} v^2_h}{2 v_s} - 2\lambda _{H} v^2_h } . \end{aligned}$$
(2.6)

The interaction Lagrangian of interest can be written in the mass eigenstates as

$$\begin{aligned} \mathcal {L}^{\text {int}}_{\text {FDM}}&= - (H_1 \cos \alpha + H_2 \sin \alpha ) \left( \sum _f \frac{m_f}{v_h} \bar{f} f \right. \nonumber \\&\quad \left. - \frac{2m^2_W}{v_h} W^+_\mu W^{-\mu } - \frac{m_Z^2}{v_h} Z_\mu Z^\mu \right) \nonumber \\&\quad + g_\chi (H_1 \sin \alpha - H_2 \cos \alpha ) \bar{\chi } \chi . \end{aligned}$$
(2.7)

In contrast to the SDM model, there are two scalar bosons that mediate the DM production in the fermion DM model. The interference effects between two mediators can lead to interesting applications to DM searches at colliders [14, 15]. If the \(H_1\) is assumed to be the 125 GeV Higgs boson [42, 43] with its measured strengths [44, 45], the mixing angle should be small, \(\sin \alpha \lesssim 0.4\) [46,47,48].

As for constructing a renormalizable and gauge invariant model for vector (VDM), we need to introduce an abelian dark gauge group \(U(1)_{X}\) and a dark Higgs field \(\Phi \) [23, 49]:

$$\begin{aligned} \mathcal{L}_{\text {VDM}}= & {} -\frac{1}{4} V_{\mu \nu } V^{\mu \nu } + D_\mu \Phi ^\dagger D^\mu \Phi - \lambda _\Phi \left( \Phi ^\dagger \Phi - \frac{v_\phi ^2}{2} \right) ^2 \nonumber \\&-\;\lambda _{H\Phi } \left( H^\dagger H - \frac{v_h^2}{2} \right) \left( \Phi ^\dagger \Phi - \frac{v_\phi ^2}{2} \right) , \end{aligned}$$
(2.8)

where the VEV of \(\Phi = \frac{1}{\sqrt{2}} (v_\phi +\phi )\) will provide mass to the vector DM \(V_\mu \). The covariant derivative is defined as \(D_\mu \Phi = ( \partial _\mu + i g_V Q_\Phi V_\mu ) \Phi \) where the \(U(1)_X\) charge of \(\Phi \) will be taken as \(Q_\Phi =1\) throughout the paper. In this model, a \(Z_2\) symmetry (\(V_\mu \rightarrow - V_\mu \)) and charge conjugation symmetry have been imposed by hand, thereby forbidding the kinetic mixing between \(V_\mu \) and the SM \(U(1)_Y\) gauge boson and making the vector boson \(V_\mu \) stable. It can also be implemented by some unbroken local dark gauge symmetry as proposed in Ref. [50].

Similarly to the FDM model with Higgs portal, there are two scalar mass eigenstates (\(H_{1/2}\)) that originate from the mixing of SM Higgs h and dark Higgs \(\phi \), with the mixing angle given by

$$\begin{aligned} \tan 2 \alpha = \frac{ \lambda _{H\Phi } v_h v_\phi }{\lambda _\Phi v^2_\phi - \lambda _H v^2_h}. \end{aligned}$$
(2.9)

Then the interaction Lagrangian that is relevant to the collider study can be written as

$$\begin{aligned} \mathcal {L}^{\text {int}}_{\text {VDM}}&= - (H_1 \cos \alpha + H_2 \sin \alpha ) \left( \sum _f \frac{m_f}{v_h} \bar{f} f\right. \nonumber \\&\quad \left. -\;\frac{2m^2_W}{v_h} W^+_\mu W^{-\mu } - \frac{m_Z^2}{v_h} Z_\mu Z^\mu \right) \nonumber \\&\quad -\;\frac{1}{2} g_V m_V (H_1 \sin \alpha - H_2 \cos \alpha ) V_\mu V^\mu . \end{aligned}$$
(2.10)

So far we have derived the relevant interaction Lagrangians for scalar, fermion and vector DMs with Higgs portal in Eqs. (2.2), (2.7), and (2.10), respectively. Note that there is only one scalar mediator (h) in the scalar DM model, while there are two scalar mediators (\(H_{1/2}\)) in fermion and vector DM models. The difference in the number of mediators can lead to quite different kinematic distributions, which can be used to discriminate scalar DM model against fermion/vector DM models. On the other hand, distinguishing fermion DM models from vector DM models is more involved. First of all, if the DM production is dominated by on-shell \(H_{1/2}\) production with subsequent invisible decay, it will be impossible to observe any differences in the final state distribution. The spin discrimination between fermion and vector DM is possible only if the off-shell contributions become important. Then, given the same decay width of \(H_{1/2}\), the fermion and vector DM model will predict different DM production rates as well as final state kinematics.

3 A benchmark study

At the ILC, the Higgs portal DM is dominantly produced through the Higgs-strahlung process

$$\begin{aligned} e^+ e^- \rightarrow Z H_{1/2} \left( \rightarrow DD \right) , \end{aligned}$$
(3.1)

where \(D = S, \chi , V_\mu \) for scalar, fermion and vector DM, respectively. The Z boson can decay either leptonically or hadronically. We will show later that the leptonic mode which is suffering from branching ratio suppression is less sensitive than the hadronic mode. In this section, we will focus on the discovery prospect of the hadronic mode of fermion DM and discuss its spin discrimination against vector/scalar DM. Note that, for scalar DM, only one mediator \(H_1 = h_{\text {SM}}\) is introduced.

In order to guarantee a sufficient DM production rate at colliders, while being consistent with current measurements, the relevant parameters for the fermion DM production are chosen as

$$\begin{aligned} g_\chi =3, \quad \sin \alpha =0.3, \quad m_{\chi } =80 \text { GeV}. \end{aligned}$$
(3.2)

Four benchmark points with different \(m_{H_2}=\) (200, 300, 400, 500) GeV will be studied, which are denoted as FDM200, FDM300, FDM400 and FDM500, respectively. For each benchmark point, we assume that the decay width for heavier scalar \(H_2\) into the \(H_1\) pair is negligible.Footnote 3 Then we can express the minimal decay width of \(H_2\) as

$$\begin{aligned} \Gamma ^{\text {FDM}}_{\min }(H_2)&= \Gamma (H_2 \rightarrow \chi \chi )+ \Gamma \left( H_2 \rightarrow WW/ZZ \right) \nonumber \\&\quad +\, \Gamma ( H_2 \rightarrow ff ) \nonumber \\&= \cos ^2 \alpha \cdot g^2_\chi \frac{m_{H_2}}{8 \pi } \left( 1- \frac{4 m^2_\chi }{m^2_{H_2}} \right) ^{3/2}\nonumber \\&\quad +\, \sin ^2 \alpha \cdot \frac{G_\mu m^3_{H_2}}{16\sqrt{2} \pi } \delta _V \sqrt{1-4 \frac{m_V^2}{m^2_{H_2}}}\nonumber \\&\quad \times \, \left( 1-4 \frac{m_V^2}{m^2_{H_2}} +12 \frac{m_V^4}{m^4_{H_2}} \right) \nonumber \\&\quad +\, \sin ^2 \alpha \cdot \left( \frac{m_f}{v} \right) ^2 \frac{3 m_{H_2}}{8 \pi } \left( 1-\frac{4 m^2_f}{m^2_{H_2}} \right) ^{3/2} , \end{aligned}$$
(3.3)

where f is the SM fermion, \(V=Z, W\) and \(\delta _V=1(2)\) for \(Z(W^{\pm })\).

To study the spin discrimination, the parameters for the vector DM production are chosen accordingly:

$$\begin{aligned} \sin \alpha =0.3, \quad m_{V} =80 \text { GeV} \end{aligned}$$
(3.4)

and \(g_V\) is chosen such that the total decay width of \(H_2\) is the same as that in the fermion DM case, since one can rely on other methods to measure the total decay width of \(H_2\). We give the total decay widths of \(H_2\) for four benchmark points in the FDM model as well as the corresponding \(g_V\) of the VDM model in Table 1. Due to the different dependencies of the \(\Gamma _{H_2}\) on the \(m_{H_2}\) in FDM and VDM models, the \(g_V\) can be quite different from the \(g_\chi \) (= 3). In the VDM model, a heavier \(H_2\) requires a smaller \(g_V\) to keep the decay width of \(H_2\) the same with that in FDM. The decay width of \(H_2\) here is similar to the Eq. (3.3) with the first term \(\Gamma (H_2 \rightarrow \chi \chi )\) replaced by \(\Gamma (H_2 \rightarrow V V)\).

Table 1 First two rows are the masses and decay widths of the \(H_2\) for four benchmark points in the FDM model. The last row gives the \(g_V\) in the VDM model which produce the \(\Gamma _{\min } (H_2)\) in the second row

The scalar DM model is much simpler, since there are only two parameters: \(m_S\) and \(\lambda _{HS}\). In studying the spin discrimination, we will fix \(m_S = 80\) GeV and take appropriate \(\lambda _{HS}\) such that the number of signal events after all selections are kept the same as that of each benchmark point of the FDM model. However, changing the \(\lambda _{HS}\) can only lead to total rescaling of the cross section and will not affect the differential distribution of kinematic variables. In the following, we will fix \(\lambda _{HS}\) when discussing the kinematic shapes without loss of generality.

Fig. 1
figure 1

Dominant background processes for hadronic mode of our signal

The SM processes with any species of neutrino in the final state could mimic the DM signal. The dominant SM background processes to Eq. (3.1) are shown in Fig. 1. Among them, the first and the second diagram (including three species of neutrino) give similar amount of contributions, while the third one is slightly smaller. At the ILC with \(\sqrt{s}=500\) GeV and unpolarized beams, the total production cross section including the interference effects between different diagrams is 219 fb. Since the left and right handed fermions have different electroweak charges, the background cross section, especially the contribution from vector boson fusion (VBF) process, strongly depends on the beam polarization. The ILC will be able to provide highly polarized electron beam (80%) and moderately polarized positron beam (30%) [51]. The background cross sections with respect to the varying beams polarizations are plotted in the left panel of Fig. 2, where we have used the positive sign for right handed polarization and negative sign for left handed polarization. We can see that the background cross section is largest for electron polarization \(P_{e^-}=-80\%\) and positron polarization \(P_{e^+}=30\%\), while it is smallest for \((P_{e^-} , P_{e^+})= (80,-30\%)\). Meanwhile, the cross sections of signal processes also mildly depend on the beam polarization. Taking the benchmark point of FDM200 for illustration, the signal-to-background ratio with respect to the varying beams polarizations are given in the right panel of Fig. 2, where the values have been normalized to unit at \((P_{e^-} , P_{e^+}) = (0,0)\). It can be seen that the signal-to-background ratio can be either reduced by a factor of 0.7 or enhanced by a factor of \(\sim 3\) compared to the value at \((P_{e^-} , P_{e^+}) = (0,0)\). Although polarized beams improve the sensitivity, we report the results with the unpolarized beam in this work.

Fig. 2
figure 2

The background cross section (left) and signal-to-background ratio (right) with varying electron and position beam polarization at the \(\sqrt{s}=500\) GeV. In the right panel, the benchmark point of FDM200 has been taken as signal for illustration. The signal-to-background ratios have been normalized at \((P_{e^-} , P_{e^+}) = (0,0)\)

In this work, the cross sections and events for signal and background are generated by MadGraph5_aMC@NLO_v2.4.3 [52]. The Pythia6 [53] is used for parton showering and hadronization. The final state jets are clustered using the Fastjet [54]. We also include the detector effects by using Delphes_v3.4.1 [55] with input of the ILD card [56]. The track momentum and calorimeter energy resolutions of the card are listed in Table 2. It should be noted that a more realistic detector simulation should also consider the energy spectra of incoming beams, the effect of which is neglected in our simulation.

Table 2 The resolutions for track momentum (\(\sigma _{1/p_T}\)), electromagnetic calorimeter (\(\sigma _{1/E}\)) and hadronic calorimeter (\(\sigma _{1/E}\))

3.1 Features of DM spin

For our signal processes at the ILC, the 4-momentum of the DM pair system can be solved as

$$\begin{aligned} P^\mu _{DD}&=P^\mu _{e^+} + P^{\mu }_{e^-} - P^\mu _Z \nonumber \\&= ( \sqrt{s}- E_Z, - \vec {p}_Z ), \end{aligned}$$
(3.5)

where the \(\sqrt{s}\) is the collision energy and \(E_Z\) (\(\vec {p}_Z\)) is the energy (momentum) of the Z boson. Therefore the invariant mass of the DM system is an observable at the ILC:

$$\begin{aligned} m^2_{DD} = s +m^2_Z - 2 E_Z \sqrt{s} . \end{aligned}$$
(3.6)

The differential cross section with respect to \(m^2_{DD}\) for scalar, fermion and vector DM production have been calculated in Ref. [38]. It can be factorized as an off-shell mediator production and decay:

$$\begin{aligned} \frac{\mathrm{d} \sigma _D}{ \mathrm{d} t} = \frac{1}{2 \pi } \sigma _{h^* Z} \left( s, t \right) \cdot G_D \left( t \right) , \end{aligned}$$
(3.7)

where \(t \equiv m^2_{DD}\) and \(D= S,\chi ,V\) for scalar, fermion, vector DM, respectively. The off-shell mediator production cross section

$$\begin{aligned}&\sigma _{h^* Z} \left( s, t \right) \nonumber \\ {}&\quad =\mathcal {P}_{ee} \frac{1}{6 s} \frac{m^4_Z}{v_h^4} \left| \frac{s}{s- m^2_Z + i m_Z \Gamma _Z} \right| ^2 \frac{\hat{\beta }}{8 \pi } \left( \hat{\beta }^2 + \frac{12 m_Z^2}{s} \right) \end{aligned}$$
(3.8)

is universal for all DM spins. In the above equation, \(\mathcal {P}_{ee} = \left( -\frac{1}{2} + 2 \sin \theta _W \right) ^2+ \left( -\frac{1}{2} \right) ^2 \) with \(\theta _W\) being the Weinberg mixing angle is the averaged spin factor for initial electron and positron; \(\hat{\beta } = \lambda ^{1/2} (1, m^2_Z /s, t/s )\) with \(\lambda (a,b,c ) = a^2+b^2+c^2 -2 (ab+bc+ca )\). The \(G_D \left( t \right) \) in Eq. (3.7) which is different from spin to spin shows the spin dependent behavior of the differential cross section:

$$\begin{aligned} G_S(t)&= \frac{\beta _S}{8 \pi } \cdot \left| \frac{\lambda _{HS} v_h}{t - m^2_h + i m_h \Gamma _h} \right| ^2, \end{aligned}$$
(3.9)
$$\begin{aligned} G_\chi (t)&= \frac{\beta _\chi ^3}{8 \pi } 2 g_\chi t \cdot \left| \frac{1}{t- m_{H_1}^2 + i m_{H_1} \Gamma _{H_1}}\right. \nonumber \\&\quad \left. - \frac{1}{t- m_{H_2}^2 + i m_{H_2} \Gamma _{H_2}} \right| ^2, \end{aligned}$$
(3.10)
$$\begin{aligned} G_V(t)&= \frac{\beta _V}{16\pi } \frac{g_V^2 t^2}{4 m^2_V} \left( 1- \frac{4 m^2_V}{t} + \frac{12 m^4_V}{t^2} \right) \cdot \left| \frac{1}{t- m_{H_1}^2 + i m_{H_1} \Gamma _{H_1}}\right. \nonumber \\&\quad \left. -\;\frac{1}{t- m_{H_2}^2 + i m_{H_2} \Gamma _{H_2}}\right| ^2 , \end{aligned}$$
(3.11)

where \(\beta _{S/\chi /V} = \sqrt{ (1- 4m^2_{S/\chi /V} / t)}\).

We can see from above that different DM spins can lead to different collision energy \(\sqrt{s}\) dependences of the production cross sections and different distributions of the DM pair invariant mass \(m_{DD}\). Especially the threshold behaviors (\(t \gtrsim 4 m_{DD}^2\)) or the large-t behaviors clearly depend on the DM spin. In Fig. 3, we show the DM total production cross section in SDM, FDM and VDM models by integrating over t in Eq. (3.7). The cross sections of benchmark points in FDM and VDM increase faster than that in SDM, due to the contributions from the second mediator. Comparing FDM and VDM, we can find that the VDM has slightly larger cross section than FDM when the \(m_{H_2} \lesssim 200\) GeV, while it can have a much smaller cross section for heavy \(H_2\). The differences are largest when the collision energy is relatively small \(\sqrt{s} \sim [400,500]\) GeV. In the following discussion, we will study the collider phenomenology with fixed \(\sqrt{s}=500\) GeV, so that FDM and VDM may possibly be distinguished by their production rate directly.

Fig. 3
figure 3

Production cross section of \(e^+ e^- \rightarrow Z (\rightarrow j j ) DD\) for benchmark points in FDM and VDM models with varying collision energy \(\sqrt{s}\) as well as that in SDM with \(\lambda _{HS}=1\). The meaning of the lines with different colors is indicated in the legend. The dashed lines correspond to the benchmark points in the VDM model which have the same \(H_2\) mass as the points in the FDM model, which is shown by the solid line with the same color

Fig. 4
figure 4

Invariant mass of DM (neutrino) pair for signal (background). The left panel shows parton level distributions. Right panel shows the detector level distributions. The meaning of the lines is the same as in Fig. 3

For fixed \(\sqrt{s}\), a powerful spin discriminator at the ILC is the invariant mass of the DM pair \(m_{DD} \equiv \sqrt{t}\). We plot the \(m_{DD}\) distributions for signals with different DM spins as well the background both at parton level (left panel) and at detector level (right panel) in Fig. 4.

At parton level, the \(m_{DD}\) for the SM background corresponds to the invariant mass of the neutrino pair in the final state, since they will mimic the missing energy from the DM pair at detector level. As we have discussed before, there is a large fraction of background events in which the neutrino pair is produced from Z boson decays. Thus the \(m_{DD}\) will show a sharp peak at \(m_Z\), which is a SM background. The \(m_{DD}\) is usually quite large for the VBF background process (first panel in Fig. 1), which gives another broad peak at \(m_{DD} \sim 400\) GeV. In the SDM model, the DM with \(m_S=80\) GeV is pair produced through the off-shell SM Higgs mediation. The \(m_{SS}\) will peak at \(2 m_S\) and decrease as \(1/m_{SS}^4\) with increasing \(m_{SS}\). In FDM and VDM models, there is another resonant enhancement at \(m_{DD} \sim m_{H_2}\) because of the existence of the additional scalar mediator, especially when the mass of \(H_2\) is relatively light and decay width of the \(H_2\) is small. This explains the clear peaks for FDM200 and VDM200. The peaks become much broader for \(m_{H_2}=300\) GeV, since the decay width of \(H_2\) is large. As the on-shell \(H_2\) production is (almost) kinematically closed for \(m_{H_2}=400/500\) GeV, the peaks no longer exist. The FDM and VDM also show distinguishable structures in the \(m_{DD}\) distributions. When the second scalar mediator is light, the VDM has more events in the small \(m_{DD}\) region than the FDM while this becomes opposite when the second mediator is heavy.

The features at parton level can be smeared out to some extent by the detector effects. First of all, the momenta of DMs/neutrinos are not observables. One can only calculate the \(m_{DD}\) from Eq. (3.6) by using the momentum of the Z boson, which is identified as the vector sum of the momenta of two leading jets. In some cases, only one of two jets from the Z boson decay is reconstructible at the detector (\(p_T(j) >20\) GeV and \(|\eta (j)| < 3.0\)). These events will be dropped. The detector level distribution of the \(m_{DD}\) is given in the right panel of Fig. 4. We can see that the peaks are broadened and the edges get ambiguous. In particular, for the background process, the peak at Z boson mass is almost disappearing and the distribution of \(m_{DD}\) is quite flat, rending the discovery of signal processes difficult. The edges for signal distributions at \(2 m_D\) and \(\sqrt{s}-m_{Z}\) are less steep. Nevertheless, we are still able to observe distinguishable distributions between signal and background as well as between signals with different DM spins. These features can be used to search and characterize the signal as will be discussed in the following.

Fig. 5
figure 5

Kinematic variables distributions after detector simulation. The meaning of the lines is the same as in Fig. 3

3.2 Discovery prospect of FDM and spin discriminating power

A signal has to be discovered with high significance before being characterized. In this section, based on the benchmark scenarios that we have set at the beginning of this section, we will study the discovery prospects of the FDM and discuss its spin discriminating power against SDM and VDM at the ILC with \(\mathcal {L}=1000\) fb\(^{-1}\) and \(\sqrt{s}=500\) GeV.

In the event reconstruction, leptons are required to have \(p_T(\ell )>10\) GeV, \(|\eta (\ell )|<2.4\) Footnote 4 and be isolated, which means the scalar sum of transverse momenta of all particles with \(p_T > 0.5\) GeV that lie within a cone of radius \(R = 0.5\) around the \(e(\mu )\) is less than 12% (25%) of the transverse momentum of the \(e(\mu )\). Jets are reconstructed from particle flow objects from Delphes using the anti-kt jet clustering algorithm [57] with a radius parameter \(R = 0.5\). Only jet candidates with \(p_T(j) > 20\) GeV and \(|\eta | < 3.0\) are considered as signal jets in our analysis. The missing transverse momentum \(p_T^{\text {miss}}\) is defined as the negative vector sum of the transverse momenta of all identified physics objects at the detector. Candidate events should pass the pre-selection cuts: (1) no leptons in the final state; (2) exactly two jets in the final state; (3) \(E^{\text {miss}}_T \equiv |p_T^{\text {miss}}| > 50\) GeV.Footnote 5

The cross sections of the benchmark points in the FDM model before and after the pre-selection are given in Table  3, where we have taken into account the Z boson hadronic decay branching ratio. It can be seen that the total cross section decreases quickly with increasing the mediator mass. The pre-selection efficiency is relatively flat (\(\sim \) 0.7–0.8) and is smallest when \(m_{H_2} =400\) GeV. This is because for each event, the DM pair recoil energy (\(E^{\text {miss}}_T\)) is in inverse proportion to the invariant mass of dark matter pair (\(m_{DD}\)). As can be seen clearly in the right panel of Fig. 4, the distribution of \(m_{DD}\) is hardest for FDM400, while it is decreasing with either larger or smaller \(m_{H_2}\).

On the other hand, the production cross sections of the SM background before and after the pre-selection are found to be 219 and 109.1 fb, respectively, which are typically more than two orders of magnitude larger than that of our signals. Such small signals can easily be hidden in the background with relatively large uncertainty. One would rely on more refined cuts to improve the signal-to-background ratio as well as the signal significance.

Fig. 6
figure 6

Left: the signal-to-background ratio with varying BDT cut for FDM benchmark points. Right: the signal significance at the ILC with integrated luminosity of 1000 fb\(^{-1}\)

From the left panel of Fig. 4, we know the \(m_{DD}\) can play an important role in signal and background discrimination. Moreover, in signal processes, the DM pair is produced with recoiling against a Z boson which decays into two detectable jets. The two DM particles are flying along a similar direction. Meanwhile in the background process, in particular the first and third diagrams in Fig. 1, the momenta of two neutrinos are unlikely to align with each other leading to a cancellation in missing transverse momentum. As a result, both the missing transverse energy (\(E^{\text {miss}}_T\)) and the transverse momentum of the Z boson (\(p_T(Z)\)) and the leading jet (\(p_T(j_1)\)) get softened for the background, as demonstrated in Fig. 5. We note that the distributions of \(E^{\text {miss}}_T\), \(p_T(Z)\) and \(p_T(j_1)\) are highly correlated: hardest for VDM200 and SDM; softest for VDM400 and SM background.

Another useful and less correlated discriminator is the azimuthal angle separation between the \(p^{\text {miss}}_T\) and the momentum of the closer jets:

$$\begin{aligned} \Delta \phi ^{\min } = \min _{i=1,2} \Delta \phi (p^{\text {miss}}_T, p (j_i ) ) . \end{aligned}$$
(3.12)

In the signal process, the DM pair is flying around the opposite direction of an energetic Z boson, which decays to two collinear jets. The \(\Delta \phi ^{\min }\) is distributed toward \(\sim \pi \). As for background processes, where the Z boson energy is much smaller, the \(\Delta \phi ^{\min }\) distribution is flatter.

We will adopt the BDT method [58, 59] that takes into account all the above variables as well as the transverse momentum of the second leading jet (\(p_T(j_2)\)) and the invariant mass of jet pair (\(m_{jj}\)) in order to discriminate each signal benchmark point against the SM background. The BDT method uses a 100 tree ensemble that requires a minimum training events in each leaf node of 2.5% and a maximum tree depth of three. For each benchmark point, it is trained on the half of the pre-selected signal and backgrounds events and is tested over the rest of the events. To avoid overtraining, the Kolmogorov–Smirnov test [60] in the BDT training is required to be greater than 0.01.

After the BDT training, one can assign a BDT response to each event, which is usually larger for signal than for background. Distinguishable distributions of BDT response for signal and background can be obtained by taking into account a large number of events. Then a cut on the BDT distribution can help to improve the signal purity. We plot the signal-to-background ratios (\(N_S/N_B\)) and the signal significances (\(N_S/\sqrt{N_S+N_B}\)) with varying cuts on the BDT distributions for FDM benchmark points in Fig. 6. We can see that the cuts on BDT can improve the \(N_S/N_B\) by at least one order of magnitude, while improvements on the signal significance are only significant for benchmark points with relatively light mediator mass.

The corresponding cut on BDT for each benchmark point in the FDM model that maximizes the signal significance is given in the Table 3, where we also provide the numbers of signal and background events and the signal significance after the BDT cut. We find that detections on the benchmark points of FDM200 and FDM300 can be made at 3-\(\sigma \) level at the ILC with collision energy of \(\sqrt{s}=500\) GeV and integrated luminosity of 1000 fb\(^{-1}\). This would allow us to perform the spin discrimination for those two benchmark points.

Table 3 The total production cross section (\(\sigma ^0\)), cross section after pre-selection (\(\epsilon ^{\text {pre}}\)), the chosen BDT cut (BDT), number of signal (\(N_S\)) and background (\(N_B\)) events after BDT cut and the signal significance (\(N_S/\sqrt{N_S+N_B}\)) at the ILC with \(\sqrt{s}=500\) GeV and \(\mathcal {L}=\)1000 fb\(^{-1}\) for benchmark points in FDM model
Table 4 The total production cross section (\(\sigma ^0\)), cross section after pre-selection (\(\epsilon ^{\text {pre}}\)) and number of signal (\(N_S\)) at the ILC with \(\sqrt{s}=500\) GeV and \(\mathcal {L}=\)1000 fb\(^{-1}\) for benchmark points in SDM and VDM model. The last row gives the spin discriminating significances of FDM with \(m_{H_2} =200/300\) GeV against corresponding SDM and VDM
Fig. 7
figure 7

Distributions of \(m_{DD}\) after the BDT cut for each signal plus background. Left: discriminating the spin of the benchmark point FDM200. Right: discriminating the spin of the benchmark point FDM300

The procedure of the spin discrimination can be described as the following. Firstly, events are simulated and production cross sections are calculated for benchmark points in SDM model (SDM200, SDM300) and in the VDM model (VDM200, VDM300). The SDM200 (SDM300) denotes benchmark point in SDM model that has the same signal yields after the event selection as the FDM200 (FDM300) and the VDM200 (VDM300) denotes the benchmark point in the VDM model that has the second mediator mass of 200 (300) GeV. Next, after the event reconstruction, the same pre-selection cuts as for FDM are applied. The cross sections as well as the pre-selection efficiencies for those benchmark points are provided in Table 4. Note that the pre-selection efficiencies for SDM200 and SDM300 are the same, since the only free parameter \(\lambda _{HS}\) in the SDM model can not change the kinematic features of the final state. Then we apply the BDT that has been trained on the benchmark point FDM200 (FDM300) to the corresponding benchmark point SDM200 (SDM300) and VDM200 (VDM300). Finally, we apply the BDT cuts as given in the fourth row of Table 3 to the corresponding benchmark points in SDM and VDM model. The event numbers at \(\mathcal {L}=1000\) fb\(^{-1}\) for those benchmark points are given in the fourth row of Table 4.

Table 5 Relic densities and direct detection rates of benchmark points

The survived events are used to plot the distributions of \(m_{DD}\) for different models. In Fig. 7, we give the 5-bin distributions of \(m_{DD}\) after applying the BDT cut for signals of different DM spin adding to the SM background. We can observe that the \(m_{DD}\) distributions of benchmark points in FDM and SDM model have visible difference, while that of benchmark points in FDM and VDM are almost the same. To assess the degree of difference between the benchmark points in FDM and SDM, we construct the \(\chi ^2\) statistic

$$\begin{aligned} \delta \chi ^2 = \sum _{i=1}^5 \left( \frac{N_i^{\text {FDM+SM}} - N_i^{\text {SDM+SM}} }{\sqrt{N_i^{\text {FDM+SM}}}} \right) ^2 \end{aligned}$$
(3.13)

where \(N_i^{\text {FDM+SM}}\) (\(N_i^{\text {SDM+SM}}\)) is the number of FDM (SDM) signal plus background events in the ith bin and the i runs over five bins of the histograms in Fig. 7. The \(\delta \chi ^2\) value is compared to the \(\chi ^2\) distribution with 4 degrees of freedom to calculate the p value, which can be further transformed to the significance level (\(\mathcal {S}\)) from a Gaussian distribution. The \(\mathcal {S}\) for each benchmark point in SDM model is given in the fifth row of Table 4. Both benchmark points in SDM model can be distinguished from the benchmark points in FDM at significance level of more than 2-\(\sigma \). We note that the number of events after the BDT cut contains not only the information of normalization of the \(m_{DD}\) distribution but also the information of its shape, since the BDT selection used the \(m_{DD}\) distribution. Therefore, for discriminating FDM and VDM, the significance level will be simply estimated by \(\mathcal {S}= \left| N_S^{\text {FDM}} - N_S^{\text {VDM}} \right| /\sqrt{N_B}\), with \(N_S^{\text {FDM}}\) (\(N_S^{\text {VDM}}\)) being the number of FDM (VDM) signal events as given in Table 3 (Table 4), and \(N_B\) being the number of background events after applying BDT cut. We find both benchmark points in the VDM model can only be distinguished from the benchmark points in FDM with significance level below 1-\(\sigma \).

3.3 DM properties of benchmark points

In this subsection, we will briefly discuss the DM relic density [1] and DM direct detection bound [61] for our benchmark pointsFootnote 6 These values are calculated numerically by micrOMEGAs [63] with the CalcHep/CompHEP [64] model files that are written by FeynRules [65, 66]. For all benchmark points, the DMs are dominantly annihilating into \(W W^{*}\) through scalar mediator(s) where \(W^*\) is the off-shell W boson. Due to the relatively large couplings between the mediator and DMs being chosen, the relic abundances of our DM particles are always below the measurement (\(\Omega h_0^2 = 0.1198\)) as can be seen in Table 5, rendering our DM particle only a component of a full DM sector. Among DM spins, the fermion DM has suppressed s-wave annihilation, thus largest relic density.

In comparison between the DM–proton scattering cross section in our model and the LUX constraint, the cross section (\(\sigma _p^{\text {SI}}\)) calculated in micrOMEGAs should be rescaled by a factor of \(\frac{\Omega h^2}{0.1198}\) with \(\Omega h^2\) being the calculated relic density of each benchmark point. According to Ref. [61], the current LUX measurement has excluded \(\sigma _p^{\text {SI}} \cdot \frac{\Omega h^2}{0.1198} > 1.4 \times 10^{-10}\) pb for \(m_{\text {DM}} = 80\) GeV, which means all of our benchmark points should have been excluded already. However, the direct detection limits rely on assumptions as regards the local dark matter density and velocity distributions, which are expected to vary from the standard assumptions used in the experimental results [67,68,69,70]. Moreover, if there is indeed a DM sector, our DM particle can either decay or be annihilated into other dark particles, so that the direct detection can be evaded. It should be noted that those modifications will not lead to any effects in the collider phenomenology of DM searches.

4 The leptonic channel

As we have seen in Fig. 4, the hadronic channel is suffering from the large uncertainty in jet momentum measurement, leading to smearing effects in the \(m_{DD}\) distributions. On the other hand, much better lepton (\(e/\mu \)) momentum resolution of the leptonic channel may help to improve the discovery sensitivity as well as the spin discriminating power.

However, the main drawbacks of the leptonic channel are its small production cross section and relatively large SM background. The Z boson in the leptonic channel is required to decay into electron or muon pair, the decay branching ratio of which is around one order of magnitude below that of hadronic mode: Br\((Z\rightarrow \ell ^+\ell ^-)\) = 6.7% with \(\ell = e, \mu \), Br\((Z\rightarrow qq)\) = 69.9% with \(q=u,d,c,s,b\). Moreover, aside from the background processes listed in Fig. 1 with j being replaced by \(\ell \), there are new SM backgrounds such as the single W and W boson pair productions where the W bosons are decaying leptonically. The total production cross section of the SM process \(e^+ e^- \rightarrow \ell \ell \nu \nu \) is 505 fb at the \(\sqrt{s}=500\) GeV ILC, which we find is dominated by the contributions from processes with W boson in the final state.

Fig. 8
figure 8

Invariant mass of DM (neutrino) pair for signal (background) in the leptonic channel. The left panel shows parton level distributions. The right panel shows the detector level distributions. The meaning of the lines is the same as in Fig. 3

In Fig. 8, we plot the \(m_{DD}\) distribution for the leptonic channels of signals and background at parton level (left panel) and detector level (right panel). We can find that the shapes of \(m_{DD}\) distributions are largely unaltered after taking into account the detector effects, i.e. peaks are sharp and edges are steep even at the detector level. Compared to Fig. 4, the main features of the signal distributions are kept the same as that in the hadronic channel, since the two channels only differ in the Z boson decay final state. As for background, the Z peak in the leptonic channel is less notable because the processes with W in the final state are dominating. We note that in some events, only one of the two leptons in the final state is reconstructible at the detector (\(p_T(\ell )>10\) GeV and \(|\eta (\ell )|<2.5\)). Those events correspond to those with \(m_{DD}=0\) GeV in the right panel of Fig. 8.

Events for the leptonic channel are reconstructed with the same method as adopted for the hadronic channel. The candidate signal events are selected with the following pre-selection cuts: (1) exactly two opposite sign same flavor leptons in the final state; (2) no jet in the final state; (3) \(E^{\text {miss}}_T > 50\) GeV; (4) two leptons invariant mass around the Z pole \(m_{\ell \ell } \in [75,105]\) GeV; (5) DM pair invariant mass above twice of the DM mass \(m_{DD} > 160\) GeV. The total cross sections of the leptonic channels of benchmark points in the FDM model and their pre-selection efficiencies are given in Table 6. We also find the corresponding pre-selection efficiency of the background is \(\sim 0.029\) which is much smaller than that of the signal. Nevertheless, after the pre-selection, the production rates of our signals are still around 2–3 orders of magnitude smaller below that of the background.

Table 6 The meaning of each row is the same as in Table 3, with the leptonic channel instead

To increase the signal significance, we follow a similar strategy to the hadronic channel, i.e. adopting the BDT method. The discriminating variables that are used in the leptonic channels are

$$\begin{aligned}&p_T (\ell _1 ), \ p_T (\ell _2 ), \ E^{\text {miss}}_T, \ m_{\ell \ell }, \nonumber \\ {}&\quad m_{DD}, \ p_T\left( Z \right) , \ \Delta r ( \ell ,\ell ), \ \Delta \phi ^{\min }, \end{aligned}$$
(4.1)

where \(\Delta r (\ell ,\ell ) \equiv \sqrt{ (\Delta \eta (\ell ,\ell ) )^2+ ( \Delta \phi (\ell ,\ell ) )^2}\) is the angular distance between two leptons and \(\Delta \phi ^{\min } \equiv \min _{i=1,2} \Delta \phi (p^{\text {miss}}_T, p (\ell _i ) )\) is the azimuthal angular separation between the missing transverse momentum and the closer lepton.

After training the BDT on each benchmark point in the FDM model, we can obtain the distributions of BDT response for signal and background. The cut on the BDT distributions is chosen such that the signal significance (\(N_S/\sqrt{N_S+N_B}\)) of each benchmark points is maximized. The corresponding BDT cuts, the number of signal and background events as well as the signal significance after BDT cuts are given in Table 6. Only the FDM200 is discoverable at the ILC with \(\sqrt{s}=500\) GeV and \(\mathcal {L}=1000\) fb\(^{-1}\). For all benchmark points, the signal significances of the leptonic channel are 2–3 times smaller than those of the hadronic channel.

We can also discuss the spin discriminating of the FDM against SDM and VDM for the benchmark point FDM200. The production cross sections and pre-selection efficiencies of benchmark points SDM200 and VDM200 are given in the second and third row of Table 7. As in the hadronic channel, the significance levels (\(\mathcal {S}\)) of the spin discriminations between FDM and SDM and between FDM and VDM are calculated with two different methods. The results are given in the fifth row of Table 7. The FDM200 can be distinguished from SDM200 at significance level of around 2-\(\sigma \), while it is impossible to be distinguished from VDM200. We can conclude that the hadronic channel provides better sensitivities in both signal discovery and spin discrimination than the leptonic channel.

Table 7 The meaning of each row is the same as in Table 4, with the leptonic channel instead
Fig. 9
figure 9

Invariant mass of DM (neutrino) pair for signal (background) corresponding to two different choices of \(g_\chi \) in FDM model. The left panel: \(g_\chi =1\). The right panel: \(g_\chi =10\). The meanings of lines are same as in Fig. 3

5 Varying the coupling in the hadronic channel

So far, we have studied the benchmark points with \(g_\chi =3\) in the FDM model. In this section, we will survey the discovery and spin characterizing prospects of benchmark points with \(g_\chi =1\) and \(g_\chi =10\) in the FDM model, while we keep \(\sin \alpha \) and \(m_\chi \) unchanged. For each \(g_\chi \), four different choices of \(m_{H_2}\) = (200, 300, 400, 500) GeV will be considered. As has been done for the \(g_\chi =3\) case, the corresponding benchmark points in the VDM model are chosen such that the decay widths of \(H_2\) are kept the same as the ones in the FDM model. We note that the branching ratio of \(H_2 \rightarrow H_1 H_1\) is assumed to be negligible in calculating the decay width of a \(H_2\). Benchmark points in the SDM model are chosen with the criterion that the signal yields after the event selection for signal process is the same as that of benchmark points in the FDM model by tuning the free parameter \(\lambda _{HS}\).

The most important effect of changing the \(g_\chi \) is that the total decay widths of the \(H_2\) become different in the FDM and VDM models. As shown in Fig. 9, for FDM and VDM, the peaks in the \(m_{DD}\) distribution are quite sharp when the \(g_\chi =1\). Especially, when \(m_{H_2}=400\) GeV, the contribution from the on-shell \(H_2\) is still dominating even with the small kinematic phase space. This is in contrast to Fig. 4 where the decay width of \(H_2\) is much wider, leading to the disappearance of the \(H_2\) peak. We note that differences in the distributions of \(m_{DD}\) between the FDM and VDM only occur in the off-shell \(H_2\) processes. Otherwise, it is simply the on-shell \(H_2\) production with subsequent invisible decay, which leaves no information of DM spin in the visible products. This explains why the \(m_{DD}\) distributions for FDM and VDM almost overlap when \(H_2\) is light, while the difference becomes visible in the region \(m_{H_2} \gtrsim 300\) GeV where the off-shell contribution is sizable. For \(g_\chi =10\), which is close to the perturbative limit, the decay width of \(H_2\) is so wide that the off-shell \(H_2\) contribution is important when \(m_{H_2} \lesssim 200\) GeV and is dominant when \(m_{H_2} \gtrsim 300\) GeV. Then it is possible to distinguish the FDM against VDM in the full range of \(m_{H_2}\). From the right panel of Fig. 9, we can also see that \(m_{DD}\) distributions for FDM (VDM) with \(m_{H_2} \gtrsim 300\) GeV are almost identical, because the signal events are occupying the lower side of the off-shell \(H_2\) propagator irrespective of the \(H_2\) mass and decay width.

Table 8 The meaning of each row is the same as in Table 3, but the \(g_\chi \) of benchmark points are changed to 1 and 10 for the upper half and lower half of the table, respectively
Table 9 The meaning of each row is the same as in Table 4, but the benchmark points have been changed to those which are corresponding to benchmark points with \(g_\chi =1\) in FDM model
Table 10 The meaning of each row is the same as in Table 4, but the benchmark points have been changed to those which are corresponding to benchmark points with \(g_\chi =10\) in the FDM model

The signals are searched with the same method as used for benchmark points with \(g_\chi =3\). We will only discuss the hadronic channel, since we have shown that it has better sensitivity than the leptonic channel. We first list the production cross sections of the benchmark points in the hadronic channel and the corresponding pre-selection efficiencies in Table 8. Compared to Table 3, we can find that all benchmark points in the FDM have similar total production rate when the \(H_2\) is relatively light. Meanwhile, for \(m_{H_2} \gtrsim 300\) GeV, the production cross section increases with the coupling \(g_\chi \). The increase is more dramatic for heavier \(H_2\). Eventually, the signal production cross sections approach the same value when \(g_\chi \) is close to the perturbative limit due to the dominance of the off-shell \(H_2\) contribution. The pre-selection efficiencies for most of the benchmark points are similar, i.e. between 0.7–0.8, except for the FDM400 with \(g_\chi =1\). For this benchmark point with \(g_\chi =1\), the final state particles are a Z boson (\(m_{Z}=91.2\) GeV) and an almost on-shell \(H_2\) (\(m_{H_2} =400\) GeV), rendering the kinetic energy of final states quite small, \(E^{\text {kin}} \sim \mathcal {O}(10)\) GeV. The pre-selection condition \(E^{\text {miss}}_T >50\) GeV can cut out a large number of events.

The same BDT method that has been used in Sect. 3.2 for benchmark points with \(g_\chi =3\) is adopted here. The BDT is trained on the pre-selected events of each benchmark point with given \(g_\chi \) and \(m_{H_2}\) in the FDM model and the SM background. A cut on the BDT responses of signal and background can be applied later to improve the signal significance. The BDT cut for each benchmark point that maximizes the signal significance \((N_S/\sqrt{N_S+N_B})\) is given in Table 8. We can find that at the ILC with \(\sqrt{s}=500\) GeV and \(\mathcal {L}=1000\) fb\(^{-1}\), for \(g_\chi =1\), only the benchmark points FDM200 and FDM300 can be discovered at more than 3-\(\sigma \) level, while, for \(g_\chi =10\), all of the benchmark points can be discovered with signal significance great than \(\sim 8\)-\(\sigma \).

The production cross sections and the pre-selection efficiencies of benchmark points in SDM and VDM models corresponding to those in the FDM model with \(g_\chi =1\) and \(g_\chi =10\) are listed in Tables 9 and 10, respectively. For the case of \(g_\chi =10\), the benchmark points in the VDM model has much larger (smaller) production cross section than those in the FDM model when the \(H_2\) is light (heavy). Thus it is possible to distinguish FDM and VDM even by using the production rates of signal alone. The number of signal events for each benchmark point in the SDM and VDM model after applying the BDT cut as well their significance level \(\mathcal {S}\) of spin discrimination are calculated with the same strategy as introduced in Sect. 3.2. In the case of \(g_\chi =1\), we can see in Table 9 that only benchmark points SDM200 model can be distinguished from FDM model with \(\mathcal {S} > 3\), while it is impossible to discriminate the FDM benchmark points against the VDM benchmark points. When the \(g_\chi \) is close to the perturbative limit, the spin discrimination is quite promising as given in Table 10. The DM spin of our benchmark points with \(H_2\) in the full mass region of interests can be identified with high significance level. Owing to the considerable difference in the production rate between the FDM and VDM, the VDM has better discriminating power against FDM than the SDM.

6 Conclusion

In this paper, we have considered the DM discovery prospect and its spin discrimination at the ILC in the theoretical framework of gauge invariant and renormalizable Higgs portal DM models for the first time. The gauge invariances of the FDM model and the VDM model require another new scalar field (in addition to the SM Higgs boson) that mediates the DM and SM particles interaction, while the gauge invariant SDM model only needs one mediator, the SM Higgs boson.

Taking the FDM model with \(g_\chi =3\) as a benchmark scenario, we study the discovery prospects and spin discriminating powers of both its hadronic channel and leptonic channel at the ILC with \(\sqrt{s}=500\) GeV and \(\mathcal {L}=1000\) fb\(^{-1}\). In the hadronic channel, we first employ the BDT method with input of a few discriminative kinematic variables such as the DM pair invariant mass \(m_{DD}\) and the azimuthal angular separation between the missing transverse momentum and the closer jet \(\Delta \phi ^{\min }\) to improve the signal sensitivity. We find the benchmark points with \(m_{H_2} \lesssim 300\) GeV can be probed at more than 3-\(\sigma \) level. For those discoverable benchmark points in the FDM model, the spin discriminating against SDM can be made with \(\gtrsim \) 3-\(\sigma \) level, due to the intrinsic difference between the FDM model and the SDM model, i.e. the FDM model contains two mediators, while the SDM model only gets one. However, the spin discriminating against VDM is almost impossible, with the significance level below one for all discoverable benchmark points. The leptonic channel is considered with a similar strategy. We find that the leptonic channel has worse discovery potential than the hadronic channel. Only benchmark points of FDM model with the mediator mass \(m_{H_2} \lesssim 200\) GeV is discoverable. As with the hadronic channel, the spin discrimination between FDM and SDM can be made while it is quite difficult to distinguish FDM and VDM.

We also survey the discovery and the spin characterizing prospects of the benchmark points in the FDM model with varying \(g_\chi \). Choosing smaller \(g_\chi \) does not reduce the DM production cross section in benchmark points with small \(m_{H_2}\) much as long as the \(H_2 \rightarrow \chi \chi \) branching is dominating. Furthermore, the smaller \(g_\chi \) which gives narrower decay width of \(H_2\) will increase the difference between the \(m_{\chi \chi }\) distributions of the FDM and the SDM models. Thus benchmark points with \(g_\chi =1\) even have better signal significances and spin discriminating powers than those with \(g_\chi =3\). As for benchmark points with \(g_\chi \) approaching the perturbative limit, the off-shell \(H_2\) contribution becomes quite important, leading to the increased production rate especially for those with heavy \(H_2\). We find that the benchmark points with \(H_2\) in the full mass region of interest are discoverable. The spin discriminating against both the SDM and the VDM are quite promising.

It should be noted that, for FDM/VDM comparison throughout the work, the benchmark points of VDM are chosen such that the decay widths of \(H_2\) are kept the same as the ones in the FDM model. This can be possible provided that the decay width of \(H_2\) can be measured elsewhere. Then the normalization of the \(m_{DD}\) distribution become an important handle for FDM and VDM discrimination. We also considered the FDM/VDM comparisons without the information of normalization and find the discriminations are impossible except for the cases of \(g_\chi =10\). The \(\mathcal {S}\) calculated from Eq. (3.13) are 1.07, 1.24, 1.56 and 1.48 for FDM200/VDM200, FDM300/VDM300, FDM400/VDM400 and FDM500/VDM500, respectively.