1 Introduction

This paper presents a search for pairs of light Higgs bosons (\({\textrm{H}} _1\)) produced in supersymmetric (SUSY) [1,2,3,4,5,6,7,8] cascade decays in final states with small missing transverse momentum (\(p_{\textrm{T}} ^\text {miss}\)). Such events can arise from the pair production of squarks (\({\tilde{\textrm{q}}}\)) and gluinos (\({\tilde{\textrm{g}}}\)) in the next-to-minimal supersymmetric extension of the standard model (SM) [9] when the lightest SUSY particle (LSP) is a singlino-like neutralino (\(\tilde{\chi }_{\text {S}}^0\)) of small mass [10]. The \(\tilde{\chi }_{\text {S}}^0\) mass eigenstate is dominated by the singlino component and has only small couplings to other SUSY particles, suppressing direct squark or gluino decays to the \(\tilde{\chi }_{\text {S}}^0\). Squarks and gluinos decay via the next-to-LSP \(\tilde{\chi }^{0}_{2}\)   into a \(\tilde{\chi }_{\text {S}}^0\) and a Higgs, Z, or W  boson [10, 11]. The case of a singlet-like CP-even \({\textrm{H}} _1\), shown in Fig. 1, is the focus of this search. When the \(\tilde{\chi }_{\text {S}}^0\) has a far smaller mass than the \({\textrm{H}} _1\) and the phase space for the decay \(\tilde{\chi }^{0}_{2}\rightarrow {\textrm{H}} _1 +\tilde{\chi }_{\text {S}}^0 \) is small, the \({\textrm{H}} _1\) carries much larger momentum than the \(\tilde{\chi }_{\text {S}}^0\). In such \(p_{\textrm{T}} ^\text {miss}\)-suppressed scenarios, the key signature for the pair production of squarks and gluinos is a pair of Lorentz-boosted \({\textrm{H}} _1\) bosons.

Fig. 1
figure 1

Diagram of squark pair production and subsequent cascade decay in the benchmark signal model. The particle \(\tilde{\chi }^{0}_{2}\)   is the next-to-LSP, \(\tilde{\chi }_{\text {S}}^0\) is the singlino-like LSP, and \({\textrm{H}} _1\) is the CP-even singlet-like Higgs boson

This search targets events with two highly boosted \({\textrm{H}} _1\) bosons that decay into pairs that are reconstructed as large-radius jets using substructure techniques. This is the first search at the LHC to focus on this type of event, where particles invisible to the detector have only small transverse momentum (\(p_{\textrm{T}}\)) and therefore the events are not selected by searches requiring significant \(p_{\textrm{T}} ^\text {miss}\)  [10, 12]. Previous searches by the ATLAS and CMS experiments with similar final states have considered events with two boosted SM Higgs bosons and large values of \(p_{\textrm{T}} ^\text {miss}\)  [13, 14], or two SM Higgs bosons in resolved final states where each of the four b quarks is reconstructed as a separate jet, with either small [15] or large [14,15,16,17] values of \(p_{\textrm{T}} ^\text {miss}\). This search uses data from \(\hbox {pp}\) collisions collected by the CMS detector at \(\sqrt{s}=13\,\text {TeV} \) during 2016–2018, corresponding to an integrated luminosity of 138\(\,\text {fb}^{-1}\)  [18,19,20].

2 Benchmark signal model

A benchmark signal model is established following the work of Ellwanger and Teixeira [10, 11]. The eight first- and second-generation squarks are assumed mass-degenerate at the mass \(m_{\text {SUSY}}\), and the gluino mass is set at 1% larger. The small gluino-squark mass gap means that the kinematics of the final-state particles are very similar in the \({\tilde{\textrm{q}}}{\tilde{\textrm{q}}}\), \({\tilde{\textrm{q}}}{\tilde{\textrm{g}}}\), and \({\tilde{\textrm{g}}}{\tilde{\textrm{g}}}\) production modes, as little momentum is transferred to the quark in the \({\tilde{\textrm{g}}}\rightarrow {\tilde{\textrm{q}}}+{\textrm{q}} \) decay. All SUSY particles other than gluinos and those shown in Fig. 1 are assumed decoupled.

This search targets squarks and gluinos with \(m_{\text {SUSY}} > 1200 \,\text {GeV} \). Less massive squarks and gluinos can be probed by \(p_{\textrm{T}} ^\text {miss}\)-based searches, owing to their larger pair-production cross sections [12]. Smaller \(m_{\text {SUSY}}\) values can also lead to smaller \(p_{\textrm{T}}\) of the \({\textrm{H}} _1\) than is necessary for the pair to be merged in a single jet. The cross sections (\(\sigma \)) for the signal probed in this search, calculated at next-to-leading order (NLO) accuracy in the strong coupling constant (\(\alpha _\textrm{S}\)) including approximate next-to-NLO (NNLO) corrections and next-to-next-to-leading logarithmic (NNLL) soft gluon corrections [21,22,23,24,25,26,27,28,29], are shown in Table 1.

Table 1 Inclusive pair-production cross sections calculated at approximately NNLO and NNLL in \(\alpha _\textrm{S}\)  [21,22,23,24,25,26,27,28,29] for squark mass \(m_{\text {SUSY}}\) and gluino mass 1% larger. The quoted uncertainty is obtained from variations in the choice of scales, parton distribution functions, and \(\alpha _\textrm{S}\)

The values considered of the \({\textrm{H}} _1\) mass (\(m_{{\textrm{H}} _1}\)) and the corresponding branching fractions (\(\mathcal {B}\)) are shown in Table 2. Only events where both \({\textrm{H}} _1\) bosons decay into pairs are used as signal. The \(\mathcal {B}\) values are chosen to be those of an SM-like Higgs boson (\({\textrm{H}} _{\text {SM}}\)) of the corresponding mass [10], as calculated using \(\textsc {HDECAY}\) 6.61 [30, 31]. The \(\mathcal {B}\) values decrease for larger \({\textrm{H}} _1\) masses as the virtual \({\textrm{W}} {\textrm{W}} ^{*}\) and \({\textrm{Z}} {\textrm{Z}} ^{*}\) decay channels, both of which have sizeable leptonic branching fractions, become more accessible. The region \(m_{{\textrm{H}} _1} < m_{{\textrm{Z}}} \) is therefore where the \(p_{\textrm{T}} ^\text {miss}\)-suppressed all-jet signature is of greatest experimental importance. Nevertheless, to preserve generality, this search attempts to probe as much of the region \(m_{{\textrm{H}} _1} < 125\,\text {GeV} \) as possible.

Table 2 The \(m_{{\textrm{H}} _1}\) values in this search and corresponding branching fractions

In addition to \(m_{{\textrm{H}} _1}\) and \(m_{\text {SUSY}}\), there are two other unknown masses in the benchmark model: those of the \(\tilde{\chi }_{\text {S}}^0\) and the \(\tilde{\chi }^{0}_{2}\). The corresponding degrees of freedom are parameterised by \(R_{m} \equiv m_{{\textrm{H}} _1}/ m_{\tilde{\chi }^{0}_{2}} \) and \(\varDelta _{m} \equiv m_{\tilde{\chi }^{0}_{2}}- m_{{\textrm{H}} _1}- m_{\tilde{\chi }_{\text {S}}^0} \). The \(p_{\textrm{T}} ^\text {miss}\)-suppressed signature arises for values of \(R_{m}\) close to unity, provided \(\varDelta _{m} >0\) to permit the \(\tilde{\chi }^{0}_{2}\rightarrow {\textrm{H}} _1 +\tilde{\chi }_{\text {S}}^0 \) decay. In this case, the phase space for the \(\tilde{\chi }^{0}_{2}\)decay is small and the \(\tilde{\chi }_{\text {S}}^0\) has much smaller mass than the \({\textrm{H}} _1\), so the \(\tilde{\chi }_{\text {S}}^0\) always carries much less momentum than the \({\textrm{H}} _1\). The \(p_{\textrm{T}} ^\text {miss}\)-suppressed signature probed in this search is representative of a significant part of the model parameter space since the momenta of reconstructed objects do not exhibit a strong dependence on \(R_{m}\) and \(\varDelta _{m}\) in the region \(R_{m} >0.9\). Models with smaller \(R_{m}\) can be probed by \(p_{\textrm{T}} ^\text {miss}\)-based searches [10, 12]. For the benchmark model, the values \(R_{m} = 0.99\) and \(\varDelta _{m} = 0.1\,\text {GeV} \) are assumed.

Branching fractions of unity are assumed for the decays \({\tilde{\textrm{q}}}\rightarrow {\textrm{q}} +\tilde{\chi }^{0}_{2}\) and \(\tilde{\chi }^{0}_{2}\rightarrow {\textrm{H}} _1 +\tilde{\chi }_{\text {S}}^0 \). In the \(R_{m}\) and \(\varDelta _{m}\) region of the benchmark model, this is true except where \(m_{\tilde{\chi }^{0}_{2}} > m_{{\textrm{Z}}} + m_{\tilde{\chi }_{\text {S}}^0} \). In that case, the \(\tilde{\chi }^{0}_{2}\rightarrow {\textrm{Z}} +\tilde{\chi }_{\text {S}}^0 \) decay is permitted if the \(\tilde{\chi }^{0}_{2}\)   has a higgsino component [11]. However, the \(\tilde{\chi }^{0}_{2}\)   is expected to be mainly bino-like for relevant values of its mass [10]. For configurations where the \({\textrm{H}} _1\) mass is close to that of the \({\textrm{H}} _{\text {SM}}\), the decay \(\tilde{\chi }^{0}_{2}\rightarrow {\textrm{H}} _{\text {SM}} +\tilde{\chi }_{\text {S}}^0 \) is also possible. The signatures for such \({\textrm{H}} _1\) and \({\textrm{H}} _{\text {SM}}\) bosons are indistinguishable in this search. The assumption that the branching fraction for \(\tilde{\chi }^{0}_{2}\rightarrow {\textrm{H}} _1 +\tilde{\chi }_{\text {S}}^0 \) decay is 100% can therefore be relaxed to the assumption that the branching fractions to \({\textrm{H}} _1\) and \({\textrm{H}} _{\text {SM}}\) sum to unity.

3 The CMS detector

The central feature of the CMS apparatus is a superconducting solenoid of \(6\,{\textrm{m}}\) internal diameter, providing a magnetic field of \(3.8\,{\textrm{T}}\). A silicon pixel and strip tracker, a lead tungstate crystal electromagnetic calorimeter, and a brass and scintillator hadron calorimeter, each composed of a barrel and two endcap sections, reside within the solenoid volume. Forward calorimeters extend the pseudorapidity (\(\eta \)) coverage provided by the barrel and endcap detectors. Muons are measured in gas-ionisation detectors embedded in the steel flux-return yoke outside the solenoid. Events of interest are selected using a two-tiered trigger system. The first level, composed of custom hardware processors, uses information from the calorimeters and muon detectors to select events at a rate of around \(100\,{\textrm{kHz}}\) within a fixed latency of about 4\(\,\mu \text {s}\)  [32]. The second level, known as the high-level trigger, consists of a farm of processors running a version of the full event reconstruction software optimised for fast processing, and reduces the event rate to around \(1\,{\textrm{kHz}}\) before data storage [33]. A more detailed description of the CMS detector, together with a definition of the coordinate system and the kinematic variables, can be found in Ref. [34].

4 Event simulation

The primary background in this search originates from multijet production. Simulated multijet events are used to validate the multijet background estimation based on data (described in Sect. 6), but are not used for any of the final predictions. The remaining significant background is from events with vector bosons that decay into quark–antiquark pairs. Simulated events are used to determine the contributions from \({\textrm{t}}\bar{\textrm{t}}\), Z+jets, and W+jets production. The expected yields from all other SM sources of background are found to be negligible.

The multijet, Z+jets, and W+jets processes are simulated at leading order (LO) in perturbative quantum chromodynamics (QCD) using MadGraph 5_amc@nlo  2.4.2 [35] with up to four additional partons at the matrix element (ME) level. Simulated signal events for each pair of \(m_{{\textrm{H}} _1}\) and \(m_{\text {SUSY}}\) values of the benchmark model are generated at LO at the ME level with up to one additional parton using MadGraph 5_amc@nlo  2.3.3. The MLM [36] prescription is used to match partons from the LO ME calculations to those from the parton showers. Simulated \({\textrm{t}}\bar{\textrm{t}}\) events are produced at NLO in QCD at the ME level with the powheg \(\text {v}2.0\) [37,38,39,40] generator. The NNPDF2.3, NNPDF3.0, and NNPDF3.1 [41,42,43,44] parton distribution functions (PDFs) are used for the signal, 2016 background, and 2017–2018 background simulations, respectively. The parton shower and hadronisation are performed via pythia  8.2 [45]. The CUETP8M1 [46, 47] tune is used for the signal and 2016 background simulations, while the CP5 tune [48] is used for the 2017 and 2018 background simulations. The cross section used to normalise the \({\textrm{t}}\bar{\textrm{t}}\) simulation is calculated at NNLO+NNLL in QCD [49], and those for Z+jets and W+jets are calculated at NNLO in QCD [50,51,52]. Additional \(\hbox {pp}\) interactions within the same or nearby bunch crossings (pileup) are simulated for all events according to the distribution of the number of interactions observed in each bunch crossing [53]. The interactions of particles with the CMS detector are simulated using Geant4  [54].

5 Object reconstruction and event selection

The data are collected using triggers based on the scalar sum of jet \(p_{\textrm{T}}\) (\(H_{\textrm{T}}\)), with a requirement of \(H_{\textrm{T}}>900\,\text {GeV} \) (2016) and \(H_{\textrm{T}}>1050\,\text {GeV} \) (2017 and 2018). Events are reconstructed offline using a particle-flow (PF) algorithm [55] that reconstructs and identifies each individual particle (PF candidate) in an event using an optimised combination of information from the components of the CMS detector.

Jets are reconstructed by clustering the PF candidates using the anti-\(k_{\textrm{T}}\) clustering algorithm [56], as implemented in the FastJet package [57]. A distance parameter of 0.4 or 0.8 is used for standard- and large-radius jets, referred to as AK4 and AK8 jets, respectively. The jet momentum is defined as the vectorial sum of all particle momenta in the jet. To mitigate the effect of pileup, constituent charged PF candidates identified to be originating from vertices other than the primary \(\hbox {pp}\) interaction vertex are not used in the clustering algorithm. The primary vertex is taken to be the vertex corresponding to the hardest scattering in the event, evaluated using tracking information alone, as described in Section 9.4 of Ref. [58]. For AK4 jets, an offset correction is applied to correct for remaining pileup contributions. For AK8 jets, the pileup-per-particle identification algorithm [59, 60] is used to rescale the momenta of constituent neutral particles according to the probability they originated from the primary vertex. This probability is based on a local shape variable that distinguishes between collinear and soft diffuse distributions of the surrounding charged particles that are compatible with the primary vertex. For all jets, jet energy corrections are derived from simulation to bring the measured average response of jets to that of particle-level jets. In situ measurements of the momentum balance in dijet, photon+jet, Z+jet, and multijet events are used to account for any residual differences in jet energy scale and resolution between data and simulation [61, 62]. Additional criteria are imposed to reject jets from spurious sources, such as electronics noise and detector malfunctions [63, 64].

The identification of AK8 jets originating from two collimated b quarks (double-b tagging) is integral to the reconstruction of the \({\textrm{H}} _1\). A discriminant is calculated for each jet using a double-b tagging algorithm that combines tracking and vertexing information in a multivariate approach with no strong dependence on jet mass or \(p_{\textrm{T}}\)  [65].

The event preselection requires two AK8 jets with \(p_{\textrm{T}} > 170\,\text {GeV} \) and \(|\eta | < 2.4\) (so that they are within the acceptance of the tracker). If there are more than two candidate AK8 jets, the two with the largest double-b  tag discriminants are selected as most likely to have originated from decays. For the offline analysis, \(H_{\textrm{T}}\)is defined as the scalar \(p_{\textrm{T}}\) sum of all AK4 jets with \(p_{\textrm{T}} > 40\,\text {GeV} \) and \(|\eta | < 3.0\), including AK4 jets with PF candidates clustered into AK8 jets. The \(H_{\textrm{T}}\)distributions for various simulated signal and background processes are shown in Fig. 2, after implementing all preselection requirements. Since the final state contains only jets, the average signal event \(H_{\textrm{T}}\)depends significantly on \(m_{\text {SUSY}}\), and signal events with \(m_{\text {SUSY}} > 1200 \,\text {GeV} \) tend to have \(H_{\textrm{T}}>1500\,\text {GeV} \).

Fig. 2
figure 2

The \(H_{\textrm{T}}\)distribution in signal events with different values of \(m_{\text {SUSY}}\), and in the simulated SM backgrounds, normalised to unit area. The uncertainties are statistical. All events satisfy the preselection

Additional requirements based on the expected kinematic properties of signal events are applied after the preselection. They define the kinematic event selection:

  1. 1.

    Both selected AK8 jets must have \(p_{\textrm{T}} > 300\,\text {GeV} \) and \(|\eta | < 2.4\), characteristic of the jets originating from decay in signal events.

  2. 2.

    There must be at least one AK4 jet with \(p_{\textrm{T}} > 300\,\text {GeV} \) and \(|\eta | < 3.0\), characteristic of the quarks from squark decays in signal events. Such jets must be separated by \(\varDelta R \equiv \sqrt{\smash [b]{(\varDelta \phi )^2 + (\varDelta \eta )^2}} > 1.4\) from both selected AK8 jets, to avoid being constructed from the same PF candidates.

  3. 3.

    The event \(H_{\textrm{T}}\)must exceed 1500\(\,\text {GeV}\).

Although the offline \(H_{\textrm{T}}\)resolution is better than that of the trigger-level variable, the offline \(H_{\textrm{T}}\)threshold is comfortably above the trigger-level \(H_{\textrm{T}}\)requirements. The trigger efficiency for this analysis is measured using events collected with a single muon trigger with a muon \(p_{\textrm{T}}\) threshold between 24 and 27\(\,\text {GeV}\). The efficiency for each data-taking year is nearly 100%. For the 2018 data, the \(|\eta |\) selection for the AK4 jets is reduced from 3.0 to 2.4 to avoid a region of the endcap electromagnetic calorimeters affected by large losses in crystal transparency, and therefore increased energy-equivalent electronics noise [66]. This change has a negligible effect on signal acceptance for all considered masses.

The fraction of signal events that satisfy the kinematic selection is essentially independent of \(m_{{\textrm{H}} _1}\). It increases from about 60 to 80% as \(m_{\text {SUSY}}\) increases from 1200 to 2000\(\,\text {GeV}\), after which it remains approximately constant.

5.1 Double-b tag based event selection

The two AK8 jets that are classified as the candidates in each event are randomly assigned the labels “A” and “B”. Their double-b  tag discriminants define a two-dimensional (2D) parameter space, shown with simulated signal and multijet event distributions in Fig. 3. The signal events are expected to contain two decays and therefore accumulate in the region where both double-b  tag discriminants are large. The signal-enhanced tag region (TR) is defined as the region where the sum of the two double-b  tag discriminants exceeds 1.3, illustrated by the shaded triangle in Fig. 3. Two additional regions are defined in Fig. 3 for use in the multijet background estimation and validation: the control region (CR), a multijet-dominated region with negligible signal; and the validation region (VR), a more signal-like region where one of the two jets has a large double-b  tag discriminant. The VR is defined sufficiently far from the TR for the signal contamination to be negligible.

Fig. 3
figure 3

Distributions of simulated signal and multijet events in the 2D double-b  tag discriminant plane, where the fractions of events in each bin are represented by the areas of the filled red and open blue squares, respectively. The signal parameters are \(m_{{\textrm{H}} _1} = 70\,\text {GeV} \) and \(m_{\text {SUSY}} = 2000\,\text {GeV} \). The kinematic selection is implemented with the masses of the two AK8 jets required to be within the set of signal and sideband mass regions defined in Sect. 5.2. The green, yellow, and orange shaded areas represent the tag region (TR), control region (CR), and validation region (VR), respectively. Of the plotted signal events, 65% fall within the TR

About 50% of the signal events that satisfy the kinematic selection populate the TR, with variation at the level of \(\pm 10\%\) across the \(m_{{\textrm{H}} _1}\) and \(m_{\text {SUSY}}\) parameter space considered. Since the multijet background is dominated by light-flavour quark and gluon initiated jets, only about 3% of these events populate the TR. For the \({\textrm{t}}\bar{\textrm{t}}\), Z+jets, and W+jets backgrounds, the corresponding figures are 13, 6, and 3%, respectively.

5.2 Soft-drop mass based signal and sideband regions

In signal events, both selected AK8 jets are likely to originate from decays and therefore have a jet mass close to \(m_{{\textrm{H}} _1}\). The multijet background has no resonant mass peak, while the other backgrounds are only expected to exhibit peaks near the known top quark and vector bosons masses, which means that an accurate reconstruction of the jet mass is important in distinguishing signal from background. The AK8 jet masses are evaluated using the “soft-drop” algorithm [67] (with a soft-drop threshold of \(z_{\text {cut}} = 0.1\) and angular exponent of \(\beta = 0\)), in which wide-angle soft radiation is removed recursively from a jet. In signal events this algorithm achieves a relative jet mass resolution from 10% for \(m_{{\textrm{H}} _1} =125\,\text {GeV} \) to 20% for \(m_{{\textrm{H}} _1} =30\,\text {GeV} \).

The soft-drop masses of the two AK8 jets define a 2D parameter space, shown in Fig. 4, in which 10 signal regions (\(S_i\)) and 10 sideband regions (\(U_{i}\)) are defined. The \(S_i\) contain events in which the two \({\textrm{H}} _1\)-candidate jets have approximately the same soft-drop mass. The width of each \(S_i\) corresponds to about four times the experimental soft-drop mass resolution for the relevant simulated value of \(m_{{\textrm{H}} _1}\).

Fig. 4
figure 4

Map of mass regions used in the 2D soft-drop mass plane. The regions labelled \(S_i\) are the signal mass regions, and the disjoint regions \(U_{i}\) form the corresponding sidebands

Fig. 5
figure 5

The normalised distribution of events in the 2D soft-drop mass plane overlaid by the map of mass regions. The upper left, upper right, and middle left panels correspond to signal events for \(m_{\text {SUSY}} = 2000\,\text {GeV} \) and \(m_{{\textrm{H}} _1}\) values of 40, 70, and 125\(\,\text {GeV}\), respectively. The panels at middle right, lower left, and lower right correspond to simulated multijet, \({\textrm{t}}\bar{\textrm{t}}\), and vector boson backgrounds, respectively. All events satisfy the TR requirement and the kinematic selection

The event distributions for a set of signal models with different \(m_{{\textrm{H}} _1}\) values are shown in Fig. 5, with the signal and sideband mass regions overlaid. The peaks in the signal distributions where one or both AK8 jets have a soft-drop mass close to zero result from a selected jet originating from a single parton or one of the decays lying outside the acceptance of the jet reconstruction algorithm. The latter can happen when the angular separation of the b  quarks exceeds the AK8 jet distance parameter, or when the ratio of the b  quark \(p_{\textrm{T}}\) values is larger than 9 (such that the softer b  quark would not satisfy the \(z_{\text {cut}}\) threshold in the soft-drop algorithm). For signal models with \(40< m_{{\textrm{H}} _1} < 125\,\text {GeV} \), \(\approx \!50\%\) of the events that satisfy the kinematic and TR selection fall within any of the \(S_i\). However, for \(m_{{\textrm{H}} _1} < 35\,\text {GeV} \) the bulk of the distribution is lower in mass than \(S_1\), leading to a rapid decrease in signal acceptance.

The distributions of background events are also shown in Fig. 5. The majority of multijet events contain at least one AK8 jet evaluated to have a small soft-drop mass, reflecting the characteristic one-prong structure of quark and gluon jets. After applying the kinematic and TR selection criteria, approximately 5% of multijet events fall within any of the \(S_i\), with greater probability at small masses. For the vector boson and \({\textrm{t}}\bar{\textrm{t}}\) backgrounds the corresponding figures are 7 and 19%, respectively, concentrated in the \(S_i\) corresponding to masses between the W  boson and top quark masses.

For each \(S_i\) there are two corresponding sideband regions, \(U_{i}\), used for the multijet background estimation described in Sect. 6. The sideband regions \(U_{1}\) have a triangular form to avoid the region of very small soft-drop masses, where the density from multijet events increases sharply.

5.3 Categorisation in \(H_{\textrm{T}}\)and expected yields

The selected events are classified according to three \(H_{\textrm{T}}\) categories: 1500–2500, 2500–3500, and above \(3500\,\text {GeV} \). Each \(H_{\textrm{T}}\)category is divided into the 10 mass signal regions \(S_i\) defined in Fig. 4, resulting in a total of 30 search regions for each data-taking year. As can be seen in Fig. 6 for TR data summed over the three data-taking years, the search region yields can be visualised through a 30-bin histogram where bins 1–10 represent the \(S_i\), in ascending order, for the first \(H_{\textrm{T}}\)category. The subsequent two sets of 10 bins represent the results for the second and third \(H_{\textrm{T}}\)categories. The primary background is from multijet events, estimated from data using the method described in Sect. 6. The expected contribution from \({\textrm{t}}\bar{\textrm{t}}\) events is also significant, particularly in the larger soft-drop mass regions populated by jets from hadronic top quark or W boson decays. The \({\textrm{t}}\bar{\textrm{t}}\) simulation is validated in a dedicated \({\textrm{t}}\bar{\textrm{t}}\)-enriched control region in data. In Fig. 4 this is the triangular region of the parameter space with both jet masses below 200\(\,\text {GeV}\) and above the upper boundary of mass region 10. The yields from Z+jets and W+jets production are small in comparison. All expected SM backgrounds tend to exhibit small values of \(H_{\textrm{T}}\)compared to signal.

Fig. 6
figure 6

Observed and expected yields in the TR for each of the 30 search regions, summed over the three data-taking years. The multijet background is estimated from data using the method described in Sect. 6, while the other backgrounds are simulated. Example signal distributions are shown for \(m_{{\textrm{H}} _1} = 70\,\text {GeV} \) and \(m_{\text {SUSY}} =1200\), 2000, and 2800\(\,\text {GeV}\). The error bars represent the statistical uncertainties and the hatched bands the systematic uncertainties

The distributions in signal events for \(m_{{\textrm{H}} _1} = 70\,\text {GeV} \) and \(m_{\text {SUSY}} =1200\), 2000 and 2800\(\,\text {GeV}\) are also shown in Fig. 6. Although the production cross section decreases quickly with increasing \(m_{\text {SUSY}}\), the fraction of events in the larger-\(H_{\textrm{T}}\)categories increases. Within each \(H_{\textrm{T}}\)category, the distribution of events in the 10 \(S_i\) bins is described by a peak with a width of about three bins, centred near the model value of \(m_{{\textrm{H}} _1}\).

6 Multijet background estimation from data

The mass sideband regions \(U_{i}\) form a basis for using data to estimate the multijet background. The density of the multijet background is approximately uniform within each of the 10 mass regions (spanning \(S_i\) and \(U_{i}\) for each region i illustrated in Fig. 4). Apart from \(U_{1}\), each \(U_{i}\) is constructed to have the same area as \(S_i\) such that the corresponding multijet yields, respectively denoted \(\hat{U}_{i}\) and \(\hat{S}_{i}\), are approximately equal. The observed ratios of \(S_i\) to \(U_{i}\) yields, \(F_{i}\), are measured in CR data. The \(F_{i}\) factors are found to be close to unity except for the \(F_{1}\) values which are approximately 1.5.

The multijet background in the TR is estimated independently for each signal region \(S_i\):

$$\begin{aligned} \hat{S}_{i}^{\text {TR}} = F_{i} \, \hat{U}_{i} ^{\text {TR}}, \end{aligned}$$
(1)

where \(\hat{U}_{i} ^{\text {TR}}\) is the observed TR yield in sideband region \(U_{i}\) after subtracting the contributions from the other simulated backgrounds. In rare cases where the prediction \(\hat{S}_{i}^{\text {TR}}\) is negative, it is set equal to zero.

Since the \(F_i\) factors are measured and applied in different regions of double-b  tag discriminant space, any correlation between the soft-drop mass and the double-b  tag discriminant of AK8 jets can bias the prediction of Eq. (1). Using a sample of data satisfying an alternative kinematic event selection with the requirement for one or more AK4 jets inverted, the variation of \(F_i\) between the TR and the CR is found to be less than 10%.

The overall accuracy of the multijet estimation is assessed through closure tests. First the method is applied to simulated multijet events in the TR where, within statistical uncertainties, the predicted yields are consistent with the simulated yields for each data-taking year. Second the method is applied in the multijet-dominated VR data (defined in Fig. 3) by making the appropriate modification to Eq. (1): \(\hat{S}_{i}^{\text {VR}} = F_i \, \hat{U}_{i} ^{\text {VR}}\). The resulting predicted and observed VR yields are consistent within uncertainties, as shown in Fig. 7. Based on the results of the closure tests, a systematic uncertainty of 15 (30%) is assigned in the lower two \(H_{\textrm{T}}\)categories (upper \(H_{\textrm{T}}\)category).

Fig. 7
figure 7

A comparison of the predicted and observed multijet yields in the validation region (VR), after subtraction of the other simulated backgrounds. The prediction is made separately for the three data-taking years, and the results are summed. The error bars on the data points represent their statistical uncertainties. The uncertainties in the predicted yields (statistical and systematic) are indicated by the hatched bands

7 Systematic uncertainties

The simulated events for signal and the \({\textrm{t}}\bar{\textrm{t}}\), Z+jets, and W+jets backgrounds are affected by various systematic uncertainties. The efficiency for tagging (mistagging) a jet originating from two b quarks (a light-flavour quark or gluon) is corrected to match that observed in data [65]. The uncertainty in this correction corresponds to \(\approx \!10\%\) in the simulated signal and background yields. The uncertainties related to the jet energy corrections are applied to the jet properties in bins of \(p_{\textrm{T}}\) and \(\eta \). These uncertainties affect the event \(H_{\textrm{T}}\), leading to an \(\approx \!4\%\) migration of events between adjacent \(H_{\textrm{T}}\)categories. The uncertainty in the soft-drop mass scale in simulation relative to data leads to a migration of events between adjacent \(S_i\) and \(U_{i}\) regions of up to 10%. The uncertainty in the simulated soft-drop mass resolution affects the widths of the simulated mass peaks. This effect is larger for signal models with small \(m_{{\textrm{H}} _1}\) and can reduce the \(S_i\) selection efficiency by up to 20%.

The systematic uncertainties are assumed to be fully correlated among the data-taking years except for the 2016 double-b tagging uncertainties, which are assumed uncorrelated because the CMS pixel detector was upgraded prior to 2017 data-taking. Changing these correlation assumptions is found to have only a small effect on the final results. Systematic uncertainties related to integrated luminosity, pileup, PDFs, renormalisation and factorisation scales, modelling of initial-state radiation, and background cross sections were also evaluated, along with the statistical uncertainties in the simulation, and were found to make negligible contributions to the total uncertainty.

Systematic uncertainties in multijet yields arise from the systematic uncertainties in the \(F_i\) factors. As described in Sect. 6, an uncertainty of \(15\%\) is applied to the \(F_i\) in the lower two \(H_{\textrm{T}}\)categories and \(30\%\) in the upper \(H_{\textrm{T}}\)category, uncorrelated among different \(F_i\). Except in the lowest \(H_{\textrm{T}}\)category, the total uncertainty in the multijet yield is dominated by the statistical uncertainty in \(\hat{U}_{i} ^{\text {TR}}\).

8 Results

Binned maximum likelihood fits to the data in all 30 search regions \(S_i\) for each data-taking year are carried out under background-only and signal+background hypotheses. The corresponding sideband regions \(U_{i}\) are fitted simultaneously, thereby constraining the multijet contributions to the search region yields through Eq. (1). The likelihood functions are defined through the product of \(90 \times 2\) Poisson distributions [68], one for each search region and one for each sideband region, with additional constraint terms for the “nuisance” parameters that account for the systematic uncertainties summarised in Sect. 7. Figure 8 compares the result of the background-only fit to the yields in the search regions for the combination of 2016, 2017, and 2018 data. There is no evidence for deviations of the data from the fitted background. The values and uncertainties of most nuisance parameters are unchanged in the fit, but the ones corresponding to the \(F_i\) are constrained through Eq. (1) when the yields \(\hat{S}_{i}^{\text {TR}}\) and \(\hat{U}_{i} ^{\text {TR}}\) are sufficiently large.

Fig. 8
figure 8

Yields in all search regions after the background-only fit, summed over the three data-taking years. Example signal contributions used in the signal+background fits are shown for \(m_{\text {SUSY}} =2200\,\text {GeV} \), and \(m_{{\textrm{H}} _1} =50\), 90, and 125\(\,\text {GeV}\). The error bars represent the statistical uncertainties and the hatched bands the systematic uncertainties

Signal+background fits are used to set 95% confidence level (\(\textrm{CL}\)) upper limits on the product \(\sigma \mathcal {B} ^2\) for the mass points in the benchmark signal model. The limits are set using the modified frequentist \(\textrm{CL}_{{s}}\) criterion [69, 70], with the profile likelihood ratio as test statistic [68]. The observed and expected 95% \(\textrm{CL}\)upper limits on \(\sigma \mathcal {B} ^2\) are shown in Fig. 9, as functions of \(m_{{\textrm{H}} _1}\) for constant \(m_{\text {SUSY}}\). The upper limits are weaker for models with \(m_{{\textrm{H}} _1} <35\,\text {GeV} \), for which the signal-event distribution in the 2D soft-drop mass plane peaks outside the signal regions. The limits have no significant dependence on \(m_{\text {SUSY}}\) for models with \(m_{\text {SUSY}} >2000\,\text {GeV} \), whose signal events mostly populate the upper \(H_{\textrm{T}}\)category (as shown in Fig. 6).

Fig. 9
figure 9

Upper limits at 95% \(\textrm{CL}\)on \(\sigma \mathcal {B} ^2\) as a function of \(m_{{\textrm{H}} _1}\), for \(m_{\text {SUSY}}\) values of 1200 (upper), 2000 (middle), and 2800\(\,\text {GeV}\) (lower). The solid and dashed black lines indicate the observed and median expected limits, respectively. The inner (green) band and the outer (yellow) band indicate the regions containing 68 and 95%, respectively, of the distribution of limits expected under the background-only hypothesis. The solid and dashed red lines show the theoretical value of \(\sigma \mathcal {B} ^2\) and its uncertainty [21,22,23,24,25,26,27,28,29,30]. In the upper plot, these \(\sigma \mathcal {B} ^2\) values are beyond the maximum of the vertical axis

The \(\sigma \mathcal {B} ^2\) upper limits are used in conjunction with the theoretical \(\sigma \) and \(\mathcal {B}\) values from Sect. 2 to exclude ranges of masses in \(m_{{\textrm{H}} _1}\) and \(m_{\text {SUSY}}\) in the benchmark model. The observed 95% \(\textrm{CL}\)upper limits on r, the ratio of measured and theoretical values of \(\sigma \mathcal {B} ^2\), are shown in Fig. 10, with the corresponding exclusion contours at \(r=1\). Masses \(1200< m_{\text {SUSY}} < 2500\,\text {GeV} \) are excluded within the range \(40< m_{{\textrm{H}} _1} < 120\,\text {GeV} \). Expected exclusion contours for the background-only scenario agree within one standard deviation with the observed contours. In the region \(110< m_{{\textrm{H}} _1} < 125\,\text {GeV} \), \(\mathcal {B}\) starts to decrease more quickly (as shown in Table 2), leading to a corresponding reduction in sensitivity. Most of the sensitivity at large \(m_{\text {SUSY}}\) comes from the \(H_{\textrm{T}}> 3500\,\text {GeV} \) region, where the statistical uncertainties in the observed yields are dominant over systematic uncertainties. This search does not explore the region outside of that shown in Fig. 10.

Fig. 10
figure 10

The observed 95% \(\textrm{CL}\)upper limit on \(\sigma \mathcal {B} ^2 / (\sigma \mathcal {B} ^2)_{\text {theory}}\), quantified by the colour scale as a function of \(m_{{\textrm{H}} _1}\) and \(m_{\text {SUSY}}\). The solid and dashed red lines indicate the observed excluded region and its theoretical uncertainty, respectively. The solid and dashed black lines respectively represent the expected excluded region and its 68% \(\textrm{CL}\) interval, under the background-only hypothesis

To aid reinterpretation of the search by reducing the model-dependence, limits evaluated using only the upper \(H_{\textrm{T}}\)category are presented in Appendix A. Tabulated results are provided in the HEPData record for this analysis [71].

9 Summary

This paper presents a search for pairs of light Higgs bosons (\({\textrm{H}} _1\)) produced in supersymmetric cascade decays. The targeted final states have small amounts of missing transverse momentum and two decays that are reconstructed as large-radius jets using substructure techniques. The search is based on data from \(\hbox {pp}\) collisions collected by the CMS experiment at \(\sqrt{s}=13\,\text {TeV} \) during 2016–2018, corresponding to an integrated luminosity of 138\(\,\text {fb}^{-1}\).

With no evidence found for an excess of events beyond the background expectations of the standard model (SM), the results are interpreted in the next-to-minimal supersymmetric extension of the SM (NMSSM), where a “singlino” of small mass leads to squark and gluino cascade decays that can predominantly end in a highly Lorentz-boosted singlet-like \({\textrm{H}} _1\) and a singlino-like neutralino of small transverse momentum.

Upper limits are set on the product of the production cross section and the square of the branching fraction of the \({\textrm{H}} _1\) for an NMSSM benchmark model with almost mass-degenerate gluinos and light-flavour squarks and branching fractions of unity for the cascade decays ending with the \({\textrm{H}} _1\). Under the assumption of an SM-like branching fraction, \({\textrm{H}} _1\) bosons with masses in the range 40–120\(\,\text {GeV}\) arising from the decays of squarks or gluinos with a mass of 1200 to 2500\(\,\text {GeV}\) are excluded at 95% confidence level.