1 Introduction

The Standard Model (SM) of elementary particles has emerged as a significant achievement explaining plenty of phenomena. The existence of the Higgs boson, as one of the most striking predictions of the SM, attracted much attention even before its experimental verification [1, 2]. The prediction of the Higgs boson was a direct consequence of the assumed scalar structure which was chosen to be the simplest possible one. To be specific, the SM assumes a single SU(2) doublet with four degrees of freedom as the Higgs field leading to the prediction of a single Higgs boson [3,4,5,6,7,8]. However, there are some important motivations, namely the SM inability to explain the universe baryon asymmetry [9], supersymmetry [10], axion models [11], etc., which provide possibility of an extended scalar structure which leads to the prediction of existence of additional Higgs bosons. Assuming a scalar structure based on two SU(2) Higgs doublets, as one of the simplest possibilities, leads to the Two-Higgs-Doublet model (2HDM) [12,13,14,15,16,17,18,19] which provides interesting environment and phenomenological features. The two assumed Higgs doublets in the 2HDM carry eight degrees of freedom, three of which are absorbed by the three of the electroweak gauge bosons and the remaining five degrees of freedom finally lead to the prediction of five Higgs bosons. The lightest Higgs boson (h) is assumed to be the same as the discovered SM Higgs boson (the SM-like scenario) and the four additional Higgs bosons are thought of as yet undiscovered particles. These Higgs bosons include a neutral scalar (H), a neutral pseudoscalar (A) and two charged (\(H^\pm \)) Higgs bosons. The present paper concentrates on the H and A Higgs bosons and investigates observability of these particles in several benchmark points in the parameter space of the 2HDM at SM-like scenario.

The 2HDM, in its general formulation, predicts tree level flavor-changing neutral currents (FCNCs) which is not in agreement with experimental observations. However, imposing the discrete \(Z_2\) symmetry, the tree level FCNCs are well avoided and four 2HDM types with different Higgs–fermion coupling scenarios which naturally conserve flavor are obtained. Different types with different coupling scenarios provide different interesting characteristics [18]. The Type-X 2HDM was studied earlier to investigate the observability of the H Higgs boson with the help of large enhancements the Higgs–lepton coupling provides at high \(\tan \beta \) values and led to promising results [20]. In this study, the Type- I 2HDM is chosen as the theoretical framework and the signal process through which the Higgs bosons observability is investigated is assumed to be \(e^- e^+ \rightarrow A H\rightarrow ZHH \rightarrow jj b{\bar{b}}b{\bar{b}}\) where jj is a di-jet resulting from the Z boson decay and \(b{\bar{b}}\) is a b quark pair resulting from the scalar Higgs H decay. The pseudoscalar Higgs boson A undergoes the decay mode \(A\rightarrow ZH\) which is dominant due to the SM-like assumption and also the non-zero mass splitting assumed between the A and H Higgs bosons. Scenarios with equal Higgs masses were also considered earlier leading to promising results [21]. The assumed signal process is also motivated by the large enhancement the scalar Higgs decay mode \(H\rightarrow b{\bar{b}}\) receives in low \(\tan \beta \) regime. Such an enhancement is due to the H–fermion coupling in the Type- I which depends on \(\cot \beta \). Moreover, the A-Z-H vertex depends on \(\sin (\beta -\alpha )\) which is set to unity because of the SM-like assumption. Hence, the process \(e^- e^+ \rightarrow A H\rightarrow ZHH\) is independent of \(\tan \beta \) and thus, the scalar Higgs decay mode \(H\rightarrow b{\bar{b}}\) may receive large enhancements at low \(\tan \beta \) values without any destructive effect on the ZHH production process.

Several benchmark points with different mass hypotheses are assumed and investigation of the Higgs bosons observability is done separately for each scenario. Masses of the Higgs bosons are chosen from intermediate and heavy mass regions and thus, the center-of-mass energy of 1000 GeV provides observation possibility in a relatively large portion of the space parameter. However, the center-of-mass energy of 500 GeV is also studied in this study since this energy is easily accessible to future linear colliders and it will be shown that a few scenarios can also be observed at this center-of-mass energy. The LHC can easily provide the center-of-mass energies considered here. However, since linear colliders provide a cleaner environment with less background processes and underlying events, a linear collider is assumed in this study.

Event generation is performed for each scenario independently and both beams are assumed to be unpolarised. The beamstrahlung effects [22] are taken into account and the detector response is simulated based on the SiD detector at the International Linear Collider (ILC) [23]. Simulated signal and relevant background events are analyzed to reconstruct Higgs bosons by first performing jet clustering and b-tagging and then applying proper selection cuts to enrich the signal events. Identifying proper \(b{\bar{b}}\) and \(jjb{\bar{b}}\) combinations and computing their invariant masses, candidate mass distributions of the Higgs bosons are obtained at the assumed integrated luminosities. The integrated luminosity is set to 500 \(fb^{-1}\) for all of the benchmark scenarios except for one scenario for which the integrated luminosity of 1000 \(fb^{-1}\) is assumed (for search for the A Higgs boson). It will be shown that, in all of the scenarios under consideration, both of the Higgs bosons H and A are observable with signals exceeding \(5\sigma \). Moreover, reconstructing masses of the Higgs bosons, it will be shown that masses of both of the Higgs bosons are measurable. In what follows, a brief introduction to the 2HDM and its different types is provided, and then analysis and results will be discussed.

2 Two-Higgs-doublet model

Employing two SU(2) Higgs doublets, the potential

$$\begin{aligned} \begin{aligned} {\mathcal {V}}&=\, m_{11}^2\Phi _1^\dagger \Phi _1+m_{22}^2\Phi _2^\dagger \Phi _2 -\Big [m_{12}^2\Phi _1^\dagger \Phi _2+\mathrm {h.c.}\Big ]\\&\quad +\frac{1}{2}\lambda _1\Big (\Phi _1^\dagger \Phi _1\Big )^2 +\frac{1}{2}\lambda _2\Big (\Phi _2^\dagger \Phi _2\Big )^2 +\lambda _3\Big (\Phi _1^\dagger \Phi _1\Big )\Big (\Phi _2^\dagger \Phi _2\Big )\\&\quad +\lambda _4\Big (\Phi _1^\dagger \Phi _2\Big )\Big (\Phi _2^\dagger \Phi _1\Big ) +\Big \{\frac{1}{2}\lambda _5\Big (\Phi _1^\dagger \Phi _2\Big )^2 +\Big [\lambda _6\Big (\Phi _1^\dagger \Phi _1\Big ) \\&\quad +\lambda _7\Big (\Phi _2^\dagger \Phi _2\Big ) \Big ]\Big (\Phi _1^\dagger \Phi _2\Big ) +\mathrm {h.c.}\Big \}, \end{aligned} \end{aligned}$$
(1)

where \(\Phi _1\) and \(\Phi _2\) are SU(2) Higgs doublets, is assumed as the Higgs potential in a general 2HDM. Using one additional Higgs doublet in this model leads to the prediction of five Higgs bosons. The lightest Higgs boson (h) is assumed to be the same as the observed Standard Model Higgs boson and the four additional Higgs bosons are thought of as yet undiscovered Higgs bosons. Additional Higgs bosons include a neutral scalar (H), a neutral pseudoscalar (A) and two charged (\(H^\pm \)) Higgs bosons. To respect experimental observations, the discrete \(Z_2\) symmetry is imposed to avoid tree level flavor-changing neutral currents [14,15,16]. Consequently, it is implied that the parameters \(\lambda _6\), \(\lambda _7\) and \(m_{12}^2\) must be zero. However, letting \(m_{12}^2\) be non-zero, \(Z_2\) symmetry is softly broken in this model. Assigning a value to \(\tan \beta \) which is defined as the ratio of the vacuum expectation values of the two Higgs doublets, the parameters \(m_{11}^2\) and \(m_{22}^2\) are obtained by the minimization conditions for a minimum of the vacuum. Setting \(\lambda _6\) and \(\lambda _7\) to zero to respect the discrete \(Z_2\) symmetry and working in the “physical basis”, \(\tan \beta \), mixing angle \(\alpha \), \(m_{12}^2\) and physical masses of the Higgs bosons must be determined to specify the model completely [12]. As a result of the imposed \(Z_2\) symmetry, Higgs–fermion coupling scenarios are limited. Table 1 provides Higgs coupling to up-type and down-type quarks and leptons in the allowed types which naturally conserve flavor. The types “X” and “Y” are also called “lepton-specific” and “flipped” respectively. Assuming \(\sin (\beta -\alpha )=1\) [12], we choose the model to be SM-like in order for the h Higgs boson to be thought of as the observed SM Higgs boson. Consequently, h coupling to fermions reduce to corresponding couplings in the SM Yukawa Lagrangian. Consequently, the neutral Higgs part of the Yukawa Lagrangian takes the form [12, 24]

Table 1 Higgs coupling to up-type quarks, down-type quarks and leptons in types allowed by the \(Z_2\) symmetry. The superscript i is a generation index
$$\begin{aligned} {\mathcal {L}}_{ Yukawa}= & {} -v^{-1} \Big (\, m_d\, {\bar{d}}d\, +\, m_u\, {\bar{u}}u\, +\, m_\ell \, {\bar{\ell }}\ell \, \Big )\ h \nonumber \\&+v^{-1} \Big (\, \rho ^dm_d\, {\bar{d}}d\, +\, \rho ^um_u\, {\bar{u}}u\, +\, \rho ^\ell m_\ell \, {\bar{\ell }}\ell \, \Big )\, H \nonumber \\&+iv^{-1}\Big (-\rho ^dm_d\, {\bar{d}}\gamma _5d\, +\, \rho ^um_u\, {\bar{u}}\gamma _5u\,\nonumber \\&- \rho ^\ell m_\ell \, {\bar{\ell }}\gamma _5\ell \, \Big ) A, \end{aligned}$$
(2)

where \(\rho ^X\) factors are provided in Table 2.

Table 2 \(\rho ^X\) factors corresponding to different flavor-conserving 2HDM types

As seen in Table 2, coupling factors of different types differ dramatically leading to considerable differences in phenomenological features of different types [18]. According to Table 2, factors of the type-I acquire the same values (\(\cot \beta \)) resulting in an interesting environment in both low and high \(\tan \beta \) regions. Searching for the Higgs bosons in the type-I in this study is highly motivated by the large enhancement the \(H\rightarrow b{\bar{b}}\) channel receives at low \(\tan \beta \) values.

3 Signal process

The process chain \(e^- e^+ \rightarrow A H\rightarrow ZHH \rightarrow jj b{\bar{b}}b{\bar{b}}\) is assumed as the signal process in this study and the Type- I 2HDM is assumed as the theoretical framework to benefit from possible enhancements in low \(\tan \beta \) regime. jj is a pair of jets resulting from the Z boson hadronic decay \(Z\rightarrow jj\). After the pseudoscalar Higgs boson undergoes the decay channel \(A\rightarrow ZH\), both of the scalar CP-even Higgs bosons experience the decay mode \(H\rightarrow b{\bar{b}}\) which receives a large enhancement due to the \(\cot \beta \) factor in the Higgs–fermion coupling factors as shown in Table 2 and thus, is dominant in low \(\tan \beta \) regime as long as the scalar Higgs mass \(m_H\) is below the threshold of the on-shell top quark pair production. The initial \(e^-e^+\) collision is assumed to occur at the center-of-mass energies of 500 and 1000 GeV at a linear collider.

3.1 Benchmark points

Several benchmark points with different mass hypotheses are assumed in the parameter space of the 2HDM as shown in Table 3. Benchmark points corresponding to the two assumed center-of-mass energies are simulated and analyzed separately. The scalar Higgs boson mass \(m_H\) is assumed to vary in range 150–250 GeV and the mass splitting between the H and A Higgs bosons is assumed to range from 50 to 100 GeV. \(\tan \beta \) is set to 10 in all scenarios resulting in a considerable enhancement in the assumed scalar Higgs boson decay channel.

Table 3 Benchmark points assumed for the center-of-mass energies of a 500 and b 1000 GeV. \(m_h,m_H, m_A\) and \(m_{H^\pm }\) are physical masses of the Higgs bosons and the provided \(m^2_{12}\) range is the range satisfying theoretical requirements. The signal cross section is also provided for each scenario

3.2 Theoretical constraints

To ensure that the considered scenarios are consistent with theoretical constraints, potential stability [25], perturbativity and unitarity [26,27,28,29] of each scenario is checked with the use of 2HDMC 1.7.0 [30, 31] and the allowed range for \(m^2_{12}\) parameter is provided in Table 3.

3.3 Experimental constraints

The assumed masses of the Higgs bosons are consistent with results of 86 analyses as checked by HiggsBounds 4.3.1 [32] and HiggsSignals 1.3.0 [33].

The experimental constraint [34, 35], based on the measurement performed at LEP [36], limits the deviation of the parameter \(\rho =m_W^2(m_Z\cos \theta _W)^{-2}\) from its SM value. To respect this constraint, the Higgs bosons A and \(H^\pm \) are assumed to have the same masses in all of the benchmark points. This is because of the demonstration provided in [37, 38] that concludes the deviation of the \(\rho \) parameter from its SM value is negligible if any of the conditions

$$\begin{aligned} m_A=m_{H^\pm },\quad m_H=m_{H^\pm }, \end{aligned}$$
(3)

is met.

As reported in [39], the ATLAS collaboration has excluded mass regions \(m_A=310-410,\,335-400\), 350–400 GeV for \(m_H=150,\,200, \,250\) GeV respectively at \(\tan \beta =10\) in the type-I. Both CMS and ATLAS also put the limit \(m_A>350\) on the pseudoscalar Higgs mass for \(\tan \beta <5\) in the type-I [40, 41]. Another study by ATLAS collaboration excludes the mass range \(m_H=170-360\) GeV for \(\tan \beta <1.5\) in this type [42]. It is obvious from Table 3 that the assumed scenarios are consistent with the current constraints resulting from direct LHC observations. In Fig. 1 the selected benchmark points are shown in the (\(m_H\),\(m_A\)) plane together with the current LHC exclusion contour at \(\tan \beta =10\) [39]. What is important here is that the selected points are chosen in a way to cover regions where the mass difference between the scalar and pseudo-scalar Higgs bosons is below the Z boson mass. Starting from any of the selected BPs, BR(\(A\rightarrow ZH\)) increases by moving upward to higher values of \(m_A\) and will be close to unity for \(m_A-m_H\ge m_Z\). Therefore the only reason for the signal significance to decrease is the phase space saturation when heavy Higgs bosons are considered.

Fig. 1
figure 1

The selected benchmark points and the current LHC excluded area [39]

3.4 Flavour physics constraints

In the context of the type-II, flavor physics data [43, 44] puts the constraints \(m_{H^\pm }>570\) GeV (\(\tan \beta >2\)) and \(m_{H^\pm }>700\) GeV (\(\tan \beta <2\)) on the charged Higgs mass. However, as indicated in [44], no condition limits the Type- I for \(\tan \beta >2\). Moreover, it is shown by a review of LHC, LEP and Tevatron results that there is no exclusion around \(\sin (\beta -\alpha )=1\) for \(m_{H/A/H^\pm } = 500\) GeV in the Type- I 2HDM [45].

3.5 Constraints from precision measurements of the couplings

The SM Higgs boson couplings with fermions and gauge bosons have been measured within reasonable uncertainties at LHC. However, future experiments such as HL-LHC [46,47,48] and ILC [49, 50] are expected to achieve more precision on those observables.

In models with extended Higgs sectors, the couplings of SM Higgs with other particles may in general deviate from the corresponding SM predictions due to loop corrections which involve extra Higgs bosons.

Although the current results are consistent with SM predictions within the uncertainties, a deviation may be found at future colliders where higher precision measurements are expected. Therefore calculating radiative corrections to the couplings of the SM Higgs boson with SM particles can fingerprint extended Higgs sectors [51, 52]. Such calculations can then be used to set limits on the masses of the extra Higgs bosons of the new model [53]. In order to achieve this goal, coupling constants have been calculated at loop level including radiative corrections. Assuming a 2HDM type-II as the theoretical framework and choosing different scenarios for the coupling constant uncertainties, upper limits have been obtained on the masses of the extra Higgs bosons as a function of \(\tan \beta \) [53].

Results of the analysis reported in [53] can be translated to a 2HDM type-I, however, since in type-I, the Higgs couplings with fermions are proportional to \(\cot \beta \), no serious constraint is expected at high \(\tan \beta \) values which are assumed in this work.

3.6 Decay channels

The assumed decay channel \(A \rightarrow ZH\) in the signal process receives an enhancement due to the assumption \(\sin (\beta -\alpha )=1\) which is needed for the SM-like scenario, since the A-Z-H vertex depends on \(\sin (\beta -\alpha )\) in the Type- I 2HDM. Moreover, the assumed non-zero mass splitting between A and H Higgs bosons in the range 50-100 GeV facilitates the possibilities for this decay channel. On average, we obtain BR(\(A \rightarrow ZH\)) \(\simeq 0.72\) for the assumed benchmark points of Table 3 using 2HDMC 1.7.0. The A-Z-H vertex appears twice in the signal process. First in the production process \(e^+e^- \rightarrow Z^* \rightarrow HA\) and then in the assumed decay mode for the pseudoscalar Higgs boson. Therefore, due to the assumption \(\sin (\beta -\alpha )=1\), the signal process is independent of \(\tan \beta \) as long as the scalar Higgs decay mode \(H\rightarrow b{\bar{b}}\) is not considered. This fact provides an opportunity for the signal to benefit from the enhancement received by the decay channel \(H\rightarrow b{\bar{b}}\) at low \(\tan \beta \) values, since lowering \(\tan \beta \) value doesn’t result in any decrease in the cross section of the process \(e^+e^- \rightarrow Z^* \rightarrow AH\rightarrow ZHH\). Setting the value of 10 for \(\tan \beta \) for all of the scenarios, on average, we obtain BR(\(H \rightarrow b{\bar{b}}\)) \(\simeq 0.61\) for the assumed benchmark points using 2HDMC 1.7.0.

The b quark pairs resulting from the H decays annihilate into hadronic jets and are identified by first performing jet reconstruction and then performing a proper b-tagging algorithm. The identified b-jets are then used to reconstruct the H Higgs bosons. Although choosing the hadronic decay channel \(Z\rightarrow jj\) for the Z boson may give rise to more errors in the final results due to the uncertainties arising from jet reconstruction and b-tagging algorithms, the hadronic decay channel is chosen since the large branching ratio of this channel (\(\hbox {BR}_{\, Z \rightarrow jj}\) \(\simeq 0.69\)) may fully compensate for the potential arising errors. Reconstructing the Z boson by the identified jets, the A Higgs boson is also reconstructed with the help of the reconstructed H Higgs boson.

Background processes relevant to the considered signal process include \(W^\pm \) pair production, \(Z/\gamma \) production, Z pair production and top quark pair production. Cross sections of the signal and relevant background processes are computed at the center-of-mass energies of 500 and 1000 GeV by CompHEP 4.5.2 [54] and are provided in Tables 3 and 4. According to Table 3, observing scenarios with heavier Higgs masses must be more challenging since the Higgs masses and cross section behave oppositely to each other.

Table 4 Background cross sections

4 Event generation

In order to take the beamstrahlung effects into account and simulate the detector response, event generation is performed in several steps for each benchmark scenario. Hard scattering part of the signal and background events (parton-level part which doesn’t include parton showers, hadronization, etc.) are generated using CompHEP 4.5.2. Simulation of the beamstrahlung effects is also performed by CompHEP with the use of the beam parameters provided in Table 5 with the assumption that both beams are unpolarised. To generate the remaining part of the events including multi-particle interactions, parton showers and hadronization, the SLHA (SUSY Les Houches Accord) files generated by 2HDMC 1.7.0 as well as the parton-level events generated by CompHEP are passed to PYTHIA 8.2.15 [56]. The SLHA files contain basic parameters of the Type- I 2HDM including coupling constants, branching ratios, etc. Using the input files, PYTHIA performs further processing to complete the events. The generated events are then internally used by DELPHES 3.4 [57] to simulate detector response with the use of the DSiD detector card which is based on the full simulation performance of the SiD detector at the ILC. The anti-\(k_t\) algorithm [58] from FASTJET 3.1.0 package [59, 60] is used to perform jet reconstruction with the cone size \(\Delta R=\sqrt{(\Delta \eta )^2+(\Delta \phi )^2}=0.4\), where \(\eta =- ln \tan (\theta /2)\) and \(\phi \) (\(\theta \)) is the azimuthal (polar) angle with respect to the beam axis. DELPHES output data is stored as ROOT files [61] and contains reconstructed jets and also b-tagging flags by which b-jets are identified.

Table 5 Beam parameters corresponding to the center-of-mass energies of 500 and 1000 GeV taken from Tab. 8.2 of ILC technical design report v3.II [55]

5 Event selection and analysis

Analysis begins by counting the number of reconstructed jets satisfying the kinematic thresholds

$$\begin{aligned} \begin{aligned}&\varvec{{p_T}}_{\varvec{jet}}\ge 10\ GeV,\quad \vert \varvec{\eta }_{\varvec{jet}} \vert \le 3, \\ \end{aligned} \end{aligned}$$
(4)

where \(p_T\) is the transverse momentum. Based on the jet multiplicity distributions of Fig. 2 which are obtained for the two assumed center-of-mass energies, the selection cut

Fig. 2
figure 2

Jet multiplicity distributions corresponding to different signal and background processes at the center-of-mass energies of a 500 and b 1000 GeV

$$\begin{aligned} \varvec{N}_{{{\varvec{jet}}}} \ge 5, \end{aligned}$$
(5)

where \(N_{{{\varvec{jet}}}}\) is the number of jets, is applied. Using b-tagging flags, b-jets are identified and the b-jet multiplicity distributions of Fig. 3 are obtained for the b-jets satisfying the lower threshold \(\varvec{{p_T}}_{{{\varvec{b-jet}}}}\ge 20\ GeV\). The condition

Fig. 3
figure 3

b-jet multiplicity distributions corresponding to different signal and background processes at the center-of-mass energies of a 500 and b 1000 GeV

$$\begin{aligned} \varvec{N}_{{{\varvec{b-jet}}}} \ge 3, \end{aligned}$$
(6)

where \(N_{{{\varvec{b-jet}}}}\) is the number of b-jets, is then applied to rule out events with less than three b-jets. b-jets present in each events are analyzed to find the true b-jet pairs which originate from the H Higgs bosons decays. In events with three b-jets, the value of \(\Delta R_{\, bb}\), where \(\Delta R\) follows the definition

Table 6 Event selection efficiencies corresponding to a,b) signal and c,d) background processes assuming different benchmark scenarios at the center-of-mass energies of 500 and 1000 GeV
$$\begin{aligned} \Delta R=\sqrt{(\Delta \eta )^2+(\Delta \phi )^2}, \end{aligned}$$
(7)

is computed for the three possible b-jet pairs and the pair which has the minimum \(\Delta R_{\, bb}\) value is considered as the true b-jet pair if the condition

$$\begin{aligned} \varvec{\Delta R_{\, bb}}\ge {\left\{ \begin{array}{ll} 1.7, &{} \sqrt{s} = 500\, \text {GeV} \\ 0.8, &{} \sqrt{s} = 1000\, \text {GeV} \end{array}\right. } \end{aligned}$$
(8)

is satisfied. The b-jets are expected to make a more collinear pair at 1 TeV due to the higher energy of decaying particle (the Higgs boson). Therefore \(\Delta R_{bb}\) tends to smaller values at 1 TeV. In events with at least four b-jets, all possible combinations of two b-jet pairs are considered and the difference between the invariant masses of the two b-jet pairs \(\vert M_{\, b_1b_2}-M_{\, b_3b_4}\vert \) is computed for each combination, and the b-jet pairs of the combination with minimum invariant mass difference are identified as true pairs if the condition 8 is satisfied. Identifying true pairs, the selection cut

$$\begin{aligned} \varvec{N_{b{\bar{b}}}} \ge 1, \end{aligned}$$
(9)

where \(N_{b{\bar{b}}}\) is the number of identified true b-jet pairs, is imposed and the H Higgs boson is reconstructed by the identified b-jet pairs. Counting the remaining jets which have not participated in the reconstruction of the H Higgs boson, events with less than two remaining jets are vetoed to rule out events with no reconstructable Z boson. Reconstructing the Z boson by the remaining pair of jets, the A Higgs boson can be reconstructed by the reconstructed Z and H bosons. At this stage, each event contains one or two reconstructed H Higgs bosons and one reconstructed Z boson. In events with one reconstructed H Higgs boson, the combination ZH is identified as the true combination if the condition

$$\begin{aligned} \varvec{\Delta R_{\, ZH}}\le {\left\{ \begin{array}{ll} 2, &{} \sqrt{s} = 500\, \text {GeV} \\ 1, &{} \sqrt{s} = 1000\, \text {GeV} \end{array}\right. } \end{aligned}$$
(10)

where \(\Delta R\) follows the definition of Eq. (7), is satisfied. Again the condition is tighter at 1 TeV due to the more collinearity of the Z and H bosons at 1 TeV. In events with two reconstructed H Higgs bosons, \(\Delta R_{ZH}\) is computed for each one of the two possible ZH combinations and the combination with smaller \(\Delta R_{ZH}\) value is identified as the true combination. Having true combinations identified, the selection cut

$$\begin{aligned} \varvec{N_{\, ZH}} =1, \end{aligned}$$
(11)

where \(N_{\, ZH}\) is the number of identified true ZH combinations, is applied and the A Higgs boson is reconstructed using the identified ZH combination. Applying the selection cuts, event selection efficiencies are obtained for different signal and background processes at \(\sqrt{s}=\) 500 and 1000 GeV and the results are provided in Table 6. According to the analysis, the Higgs bosons H and A are reconstructed after three and four selection cuts respectively. Thus, total efficiencies corresponding to the first three selection cuts and all of the four selection cuts are also provided.

Fig. 4
figure 4

Candidate mass distributions of the a, b H and c, d A Higgs bosons with corresponding fitting results and errors in different scenarios at \(\sqrt{s}=500\) GeV

Fig. 5
figure 5figure 5

Candidate mass distributions of the af H and gl A Higgs bosons with corresponding fitting results and errors in different scenarios at \(\sqrt{s}=1000\) GeV

Computing the invariant mass of the identified \(b{\bar{b}}\) and \(jjb{\bar{b}}\) combinations for events surviving the selection cuts, Higgs candidate mass distributions of Figs. 4 and 5 are obtained for \(\sqrt{s}= 500\) and 1000 GeV respectively. Distributions corresponding to different scenarios are shown separately for each one of the Higgs bosons. Signal and different background contributions are also shown separately. As seen, the \(t{\bar{t}}\) process has the most contribution among the background processes, and is, however, well under control. In all of the distributions, signal contribution is seen as a significant excess of data on top of the total background. The distributions are normalized based on \(L\times \sigma \times \epsilon \), where L is the integrated luminosity, \(\sigma \) is the cross section and \(\epsilon \) is the selection efficiency. The integrated luminosity is set to 500 \(fb^{-1}\) for all of the distributions except for the distribution of Fig. 5k which uses the integrated luminosity of 1000 \(fb^{-1}\). Signal cross sections are obtained by multiplying the total cross sections provided in Table 3 by corresponding branching ratios of the decay modes \(A\rightarrow ZH\), \(Z\rightarrow jj\) and \(H\rightarrow b{\bar{b}}\). Background cross sections are taken from Table 4. Selection efficiencies for A mass distributions are taken from Table 6 and efficiencies corresponding to H mass distributions are obtained by computing the total number of signal reconstructed H Higgs bosons divided by twice the number of simulated signal events.

Proper fit functions are fitted to the total background (B) and signal+total background (S+B) distributions of Figs. 4 and 5 and results are shown with associated error bars. Fitted curves show significant peaks near the generated Higgs masses. Fitting is performed by ROOT 5.34 [62]. The fit function used for B distributions is a polynomial function and the fit function used for S+B distributions is the combination of a polynomial and a Gaussian function. The Gaussian function covers the signal and the polynomial covers the total background. First, the polynomial function is fitted to the total background, and then the parameters of the fitted polynomial are used as input for S+B fit. The “Mean” parameter is one of the Gaussian fit function parameters and provides the center of the signal peak. The values obtained for the “Mean” parameter are considered as the reconstructed masses (\(m_{\,Rec.}\)) of the Higgs bosons and are provided in Table 7. The generated masses (\(m_{\,Gen.}\)) are also shown for comparison. According to Table 7, a difference is seen between the generated and reconstructed masses which can be explained by the uncertainties arising from jet clustering algorithm and jet mis-identification, jet mis-tag rate, fitting method and choice of the fit function, errors in energy and momentum of the particles, etc. Optimization of the jet clustering algorithm, b-tagging algorithm and fitting method may reduce the errors. However, since such optimizations lie beyond the scope of this paper, a simple off-set correction is applied to reduce the errors in this study. The off-set correction is applied as follows. According to Table 7a, on average, the reconstructed masses of the H and A Higgs bosons are 22.85 and 29.55 GeV smaller than the corresponding generated masses respectively. Calculating the same values for Table 7b, we obtain 17.85 and 36.17 GeV for the Higgs bosons H and A respectively. To reduce the errors, reconstructed H and A Higgs masses are increased by the same values and the results are provided in Table 7 as corrected reconstructed masses (\(m_{\,\,Corr.\,\, rec.}\)). According to the results, corrected reconstructed masses are obtained with few GeVs difference from the generated masses in the considered scenarios and it is concluded that the masses of the Higgs bosons H and A are measurable in all of the assumed scenarios.

Table 7 Generated mass (\(m_{\,Gen.}\)), reconstructed mass (\(m_{\,Rec.}\)) and corrected reconstructed mass (\(m_{\,\,Corr.\,\, rec.}\)) of the Higgs bosons H and A with associated uncertainties at \(\sqrt{s}=\) (a) 500 and (b) 1000 GeV. The mass values are provided in GeV unit

6 Signal significance

To assess the observability of the Higgs bosons in the considered scenarios, signal significance is computed for each of the candidate mass distributions of Figs. 4 and 5 by counting the number of signal and background candidate masses in the whole mass range. Computation of the signal significance is based on the integrated luminosity of 500 \(fb^{-1}\) for all of the distributions except for the distribution of Fig. 5k which uses the integrated luminosity of 1000 \(fb^{-1}\). Computation results, namely total signal selection efficiency, number of signal (S) and background (B) candidate masses, signal to background ratio and signal significance at the center-of-mass energies of 500 and 1000 GeV are provided in Table 8. Results indicate that, in all of the considered scenarios, both of the Higgs bosons H and A are observable with signals exceeding \(5\sigma \). To be specific, at the center-of-mass energy of 500 GeV, both of the Higgs bosons H and A are observable in the region of parameter space with \(m_H=150\) GeV and \(200\le m_A \le 250\) GeV at the integrated luminosity of 500 \(fb^{-1}\). Also, at the center-of-mass energy of 1000 GeV, the H Higgs boson is observable at the region with \(150\le m_H \le 250\) GeV and \(200\le m_A \le 330\) GeV with a mass splitting of 50-100 GeV between the H and A Higgs bosons at the same integrated luminosity. The A Higgs boson is also observable at \(\sqrt{s}=1000\) GeV at the same region of parameter space at an integrated luminosity up to 1000 \(fb^{-1}\).

Table 8 Total signal selection efficiency (\(\epsilon _{\, Total}\)), number of signal (S) and background (B) candidate masses, signal to background ratio, signal significance and the assumed integrated luminosity in different scenarios at the center-of-mass energies of (a) 500 and (b) 1000 GeV

7 Discovery contours

The results described in the previous section are limited to the selected benchmark points. In this section an extrapolation is made to a wider region in the parameter space. In order to do so, starting from each value of \(m_H\) (in smaller increments than the chosen benchmark points with mass increments of 10 GeV), the \(m_A\) is scanned from \(m_H+50\) GeV up to a point where the signal significance decreases to 5\(\sigma \) as the minimum. This approach is first based on \(\tan \beta =10\) but can be done for other values. Figure 6a shows the color plot for \(\tan \beta =10\) keeping information of the significance at each point. The border of the blue region is obviously the limit of the 5\(\sigma \). In Fig. 6b, the contour in \(m_A\) vs \(\tan \beta \) plane has been obtained with \(m_H\) being 50 GeV lighter than \(m_A\) at every point. As seen, higher \(\tan \beta \) values result in no sizable change in the signal significances and the signal is only sensitive to low \(\tan \beta \) values. Therefore the plot of Fig. 6a can be viewed as a contour for \(\tan \beta >10\).

Fig. 6
figure 6

Discovery contour in a (\(m_H,m_A\)) plane, b (\(m_A,\tan \beta \))

8 The signal sensitivity at CLIC 1.5 and 3 TeV

Since the signal cross section and the signal to background ratio are high enough to provide signal exceeding 5\(\sigma \) at \(\sqrt{s}=0.5\) and 1 TeV, it is interesting to investigate the signal sensitivity of a higher energy collider like CLIC at 1.5 or 3 TeV to the signals studied in this analysis or even heavier Higgs bosons which may be out of the reach of ILC. To this aim the total cross section of the signal (each BP separately) and background processes were extrapolated to the CLIC region at 3 TeV. As Fig. 7a, b show, the signal and background cross sections decrease but the signal to background ratio remains almost the same as in 1 TeV (\(\sim 10^{-3}\)). Assuming the signal significance measure as \(S/\sqrt{B}\), the new signal significance of each selected BP at 3 TeV would be S / B times \(\sqrt{B}\) and since S / B shows no sizable change, the signal significance decreases like \(\sqrt{B}\). Therefore a higher energy collider may not improve these results. Other parameter space points involving heavier Higgs bosons will also have smaller cross sections than the “light” points selected here. Therefore their situation is not better than the selected BPs in the current analysis either. As a conclusion, it is expected that a 1 TeV ILC works better for the light and moderate Higgs masses unless the \(m_H+m_A\) reaches the kinematic threshold of the collider (i.e., \(\sqrt{s}\)) in which case using a higher energy collider like CLIC is inevitable.

Fig. 7
figure 7

Signal and background cross sections down to the CLIC area

9 Conclusions

The signal process \(e^- e^+ \rightarrow A H\rightarrow ZHH \rightarrow jj b{\bar{b}}b{\bar{b}}\) based on two Higgs doublet model type I was staudied at a future linear collider like ILC at 0.5 and 1 TeV. The integrated luminosity was set to 500 \(fb^{-1}\) for all selected benchmark points. Detector simulation was performed by DELPHES using SiD detector card. Results indicate that all selected points are in the reach of ILC with the possibility of mass reconstruction for both scalar and pseudo-scalar Higgs bosons at the same time with a single analysis. The coverage in \(\tan \beta \) direction reaches \(\tan \beta =60\) with almost the same signal sensitivity for \(\tan \beta \) higher than 10. The signal significance in \(m_H,m_A\) plane was also obtained by extending the analysis to higher values of \(m_A\). A comparison with the current LHC results shows that a very larger area can be covered by the current analysis with LHC contour embedded in a part of the whole 5 \(\sigma \) contour. Extrapolation to the CLIC center of mass energy of 1 and 3 TeV shows no gain in the signal significance with increasing the center of mass energy to higher values than 1 TeV unless forced by the kinematic requirements. Therefore the analysis is more relevant to ILC at 1 TeV.