1 Introduction

Astrophysical and cosmological observations [1,2,3] provide strong support for the existence of dark matter (DM), which could originate from physics beyond the standard model (BSM). In a large class of BSM models, DM consists of stable, weakly-interacting massive particles (WIMPs). In collider experiments, WIMPs (\(\chi \)) could be pair-produced through the exchange of new mediating fields that couple to DM and to standard model (SM) particles. Following their production, the WIMPs would escape detection, thereby creating an imbalance of transverse momentum (missing transverse momentum, \(p_{\mathrm {T}} ^\text{miss} \)) in the event.

If the new physics associated with DM respects the principle of minimal flavor violation [4, 5], the interactions of spin-0 mediators retain the Yukawa structure of the SM. This principle is motivated by the apparent lack of new flavor physics at the electroweak (EWK) scale. Because only the top quark has a Yukawa coupling of order unity, WIMP DM couples preferentially to the heavy top quark in models with minimal flavor violation. In high energy proton-proton collisions, this coupling leads to the production of \({t}\overline{{t}} +\chi \overline{\chi } \) at lowest-order via a scalar (\(\phi \)) or pseudoscalar (a) mediator (Fig. 1), and to the production of so-called mono-X final states through a top quark loop [6,7,8,9,10,11,12,13,14]. At the CERN Large Hadron Collider (LHC), the \({t}\overline{{t}} +\chi \overline{\chi } \) process can be probed directly via the \({t}\overline{{t}} +p_{\mathrm {T}} ^\text{miss} \) and \({b} \overline{{b}} +p_{\mathrm {T}} ^\text{miss} \) signatures. The \({b} \overline{{b}} +p_{\mathrm {T}} ^\text{miss} \) signature provides additional sensitivity to the \({b} \overline{{b}} +\chi \overline{\chi }\) process for models in which mediator couplings to up-type quarks are suppressed, as can be the case in Type-II two Higgs doublet models [15].

This paper describes a search for DM produced with a \({t}\overline{{t}} \) or \({b} \overline{{b}} \) pair in pp collisions at \(\sqrt{s}=13\,\text{TeV} \) with the CMS experiment at the LHC. A potential DM signal is extracted from simultaneous fits to the \(p_{\mathrm {T}} ^\text{miss} \) distributions in the \({b} \overline{{b}} +p_{\mathrm {T}} ^\text{miss} \) and \({t}\overline{{t}} +p_{\mathrm {T}} ^\text{miss} \) search channels. Data from control regions enriched in SM \({t}\overline{{t}} \), \(\text{W}+\text{jets}\), and \({Z} +\text{jets}\) processes are included in the fits, to constrain the major backgrounds. The top quark nearly always decays to a W boson and a b quark. The W boson subsequently decays leptonically (to charged leptons and neutrinos) or hadronically (to quark pairs). The dileptonic, lepton(\(\ell \))+jets, and all-hadronic \({t}\overline{{t}} \) final states consist, respectively, of events in which both, either, or neither of the W bosons decay leptonically. Each of these primary \({t}\overline{{t}} \) final states are explored.

Previous LHC searches for DM produced with heavy-flavor quark pairs were interpreted using effective field theories that parameterize the DM-SM coupling in terms of an interaction scale \(M_{*}\) [16,17,18]. An earlier search by the CMS Collaboration investigated the \(\ell +\text{jets}\) \({t}\overline{{t}} \) final state using \({19.7}{\,\text{fb}^{-1}} \) of data collected at \(\sqrt{s} = 8\,\text{TeV} \) [19]. That search excluded values of \(M_{*}\) below 118\(\,\text{GeV}\), assuming \(m_{\chi } = 100\,\text{GeV} \). The ATLAS Collaboration performed a similar search separately for the all-hadronic and \(\ell +\text{jets}\) \({t}\overline{{t}} \) final states and obtained comparable limits on \(M_{*}\) [20]. More recently, the limitations of effective field theory interpretations of DM production at the LHC has led to the development of simplified models that remain valid when the mediating particle is produced on-shell [21]. This analysis adopts the simplified model framework to provide the first interpretation of heavy-flavor search results in terms of the decays of spin-0 mediators with scalar or pseudoscalar couplings. This paper also reports the first statistical combination of dileptonic (\(\text{ee} \), \(\text{e}\mu \), \(\mu \mu \)), \(\ell +\text{jets}\) (e, \(\mu \)), and all-hadronic \({t}\overline{{t}} +\chi \overline{\chi } \) searches, as well as the first combination of \({t}\overline{{t}} +\chi \overline{\chi } \) and \({b} \overline{{b}} +\chi \overline{\chi }\) search results.

Fig. 1
figure 1

A leading order Feynman diagram describing the production of a pair of DM particles (\(\chi \)) with heavy-flavor (top or bottom) quark pairs via scalar (\(\phi \)) or pseudoscalar (\(\mathrm {a}\)) mediators

The paper is organized as follows. Section 2 reviews the properties of the CMS detector and the particle reconstruction algorithms used in the analysis. Section 3 describes the modeling of \({t}\overline{{t}} +\chi \overline{\chi } \) and \({b} \overline{{b}} +\chi \overline{\chi }\) signal and SM background events, and Sect. 4 provides the selections applied to data and simulation. Section 5 discusses the techniques used to extract a potential DM signal in the \({t}\overline{{t}} +p_{\mathrm {T}} ^\text{miss} \) and \({b} \overline{{b}} +p_{\mathrm {T}} ^\text{miss} \) search channels. Section 6 describes the systematic uncertainties considered in the analysis. The results of the search and their interpretation within a simplified DM framework are presented in Sect. 7. Section 8 concludes with a summary of the results.

2 CMS detector and event reconstruction

The CMS detector [22] is a multipurpose apparatus optimized for the study high transverse momentum (\(p_{\mathrm {T}} \)) physics processes in pp and heavy ion collisions. A superconducting solenoid surrounds the central region, providing a magnetic field of 3.8\(\text{\,T}\) parallel to the beam direction. Charged particle trajectories are measured using the silicon pixel and strip trackers, which cover the pseudorapidity region of \(|\eta |< 2.5\). A lead tungstate crystal electromagnetic calorimeter (ECAL) and a brass and scintillator hadron calorimeter (HCAL) surround the tracking volume, and cover the region with \(|\eta |< 3\). Each calorimeter is composed of a barrel and two endcap sections. A steel and quartz-fiber Cherenkov forward hadron calorimeter extends the coverage to \(|\eta |< 5\). The muon system consists of gas-ionization detectors embedded in the steel flux return yoke outside the solenoid, and covers the region of \(|\eta |< 2.4\). The first level of the CMS trigger system is composed of special hardware processors that select the most interesting events in less than 4\(\,\mu \text{s}\) using information from the calorimeters and muon detectors. This system reduces the event rate from 40\(\text{\,MHz}\) to approximately 100\(\text{\,kHz}\). The high-level trigger processor farm performs a coarse reconstruction of events selected by the first-level trigger, and applies additional selections to reduce the event rate to less than 1\(\text{\,kHz}\) for storage.

Event reconstruction is based on the CMS Particle Flow (PF) algorithm [23, 24], which combines information from all CMS subdetectors to identify and reconstruct the individual particles emerging from a collision: electrons, muons, photons, and charged and neutral hadrons. Interaction vertices are reconstructed using the deterministic annealing algorithm [25]. The primary vertex is selected as that with the largest sum of \(p_{\mathrm {T}} ^{2}\) of its associated charged particles. Events are required to have a primary vertex that is consistent with being in the luminous region.

Jets are reconstructed by clustering PF candidates using the anti-\(k_{\mathrm {T}}\) algorithm [26, 27] with a distance parameter of 0.4. Corrections based on jet area are applied to remove the energy from additional collisions in the same or neighboring bunch crossing (pileup) [28]. Energy scale calibrations determined from the comparison of simulation and data are then applied to correct the four momenta of the jets [29]. Jets are required to have \(p_{\mathrm {T}} > 30\,\text{GeV} \), \(|\eta |< 2.4\), and to satisfy a loose set of identification criteria designed to reject events arising from spurious detector and reconstruction effects.

The combined secondary vertex b tagging algorithm (CSVv2) is used to identify jets originating from the hadronization of bottom quarks [30, 31]. Jets are considered to be b-tagged if the CSVv2 discriminant for that jet passes a requirement that roughly corresponds to efficiencies of 70% to tag bottom quark jets, 20% to mistag charm quark jets, and 1% to misidentify light-flavor jets as b jets. Efficiency scale factors in the range of 0.92–0.98, varying with jet \(p_{\mathrm {T}}\), are applied to simulated events in order to reproduce the b tagging performance for bottom and charm quark jets observed in data. A scale factor of 1.14 is applied to simulation to reproduce the measured mistag rate for light-flavor quark and gluon jets.

The \(p_{\mathrm {T}} ^\text{miss} \) variable is initially calculated as the magnitude of the vector sum of the \(p_{\mathrm {T}}\) of all PF particles. This quantity is adjusted by applying jet energy scale corrections. Detector noise, inactive calorimeter cells, and cosmic rays can give rise to events with severely miscalculated \(p_{\mathrm {T}} ^\text{miss} \). Such events are removed via a set of quality filters that take into account the timing and distribution of signals from the calorimeters, missed tracker hits, and global characteristics of the event topology.

Electron candidates are reconstructed by combining tracking information with energy depositions in the ECAL [32]. The energy of the ECAL clusters is required to be compatible with the momentum of the associated electron track. Muon candidates are reconstructed by combining tracks from the inner silicon tracker and the outer muon system [33]. Tracks associated with muon candidates must be consistent with a muon originating from the primary vertex, and must satisfy a set of quality criteria [33]. Electrons and muons are selected with \(p_{\mathrm {T}} > 30\,\text{GeV} \) and \(|\eta |< 2.1\) for consistency with the coverage of the single-lepton triggers, and are required to be isolated from hadronic activity, to reject hadrons misidentified as leptons. Relative isolation is defined as the scalar \(p_{\mathrm {T}}\) sum of PF candidates within a \(\Delta {R} = \sqrt{\smash [b]{\eta ^{2} + \phi ^{2}}}\) cone of radius 0.4 or 0.3 centered on electrons or muons, respectively, divided by the lepton \(p_{\mathrm {T}} \). Relative isolation is nominally required to be less than 0.035 (0.065) for electrons in the barrel (endcap), respectively, and less than 0.15 for muons. Identification requirements, based on hit information in the tracker and muon systems, and on energy depositions in the calorimeters, are imposed to ensure that candidate leptons are well-measured. These restrictive isolation and identification criteria are used to select events from the dileptonic \({t}\overline{{t}} \), \(\ell +\text{jets}\) \({t}\overline{{t}} \), \(\text{W}(\ell \nu )+\text{jets}\), and \({Z} (\ell \ell )+\text{jets}\) processes.

The efficiencies of the requirements for electrons (muons) with \(p_{\mathrm {T}} > 30\,\text{GeV} \) range from 52 to 83% (91 to 96%), for increasing lepton \(p_{\mathrm {T}} \). Less restrictive lepton isolation and identification requirements are used to reject events containing additional leptons with \(p_{\mathrm {T}} > 10\,\text{GeV} \). Efficiencies for these requirements range from 66 to 96% for electrons and 73 to 99% for muons, for increasing lepton \(p_{\mathrm {T}} \). Electron and muon selection efficiency scale factors are applied in simulation to match the efficiencies measured in data using the tag-and-probe procedure [34]. Averaged over lepton \(p_{\mathrm {T}} \), the electron and muon efficiency scale factors for the more restrictive selection requirements are 98 and 99%, respectively.

The “resolved top tagger” (RTT) is a multivariate discriminant that uses jet properties and kinematics to identify top quarks that decay into three resolved jets. The input observables are the values of the quark/gluon discriminant [35], which combines track multiplicity, jet shape, and fragmentation information for each jet, values of the b tagging discriminants, and the opening angles between the candidate b jet and the two jets from the candidate W boson. Within each jet triplet, the b candidate is considered to be the jet with the largest value of the b tagging discriminant. The RTT discriminant also utilizes the \(\chi ^{2}\) value of a simultaneous kinematic fit to the top quark and W boson masses [36]. The fit attempts to satisfy the mass constraints by allowing the jet momenta and energies to vary within their measured resolutions. The RTT is implemented as a boosted decision tree using the TMVA framework [37], and is trained on simulated \(\ell +\text{jets}\) \({t}\overline{{t}} \) events using correct (incorrect) jet combinations as signal (background).

The performance of the RTT discriminant is characterized with data enriched in SM \(\ell +\text{jets}\) \({t}\overline{{t}} \) events containing four or more jets. At least one of these jets is required to be b-tagged. The output discriminant for these events is plotted in Fig. 2. Each entry in the plot corresponds to the jet triplet with the highest RTT score in the event. Data are modeled using simulated \(\ell +\text{jets}\) \({t}\overline{{t}} \) signal events, and simulated events for each of the primary backgrounds (dileptonic \({t}\overline{{t}} \), \(\text{W}+\text{jets}\), single t). The simulation is split into three classes that correspond to correctly tagged jet triplets and the two possibilities for mistagging, as explained below. Simulation describes the data well. A jet triplet is considered as a tagged top quark decay when the RTT discriminant value is greater than zero.

Fig. 2
figure 2

The distribution of the RTT discriminant in data enriched in \(\ell +\text{jets}\) \({t}\overline{{t}} \) events. Simulated \(\ell +\text{jets}\) \({t}\overline{{t}} \) events in which jets from the all-hadronic top quark decay are correctly chosen are labeled “\({t}\overline{{t}} (1\ell )\) with matched jets”. Simulated \(\ell +\text{jets}\) \({t}\overline{{t}} \) events in which an incorrect combination of jets is chosen are labeled “\({t}\overline{{t}} (1\ell )\) combinatorial”. Events from processes that do not contain a hadronically-decaying top quark, such as dileptonic \({t}\overline{{t}} \), are labeled “other background”. The uncertainties shown in the ratios of data to simulation are statistical only. Jet triplets in the all-hadronic \({t}\overline{{t}} +p_{\mathrm {T}} ^\text{miss} \) search are considered to be top quark tagged if their RTT discriminant value is larger than zero

There are three efficiencies associated with the RTT selection, which correspond to the three classes of events in Fig. 2: \(\ell +\text{jets}\) \({t}\overline{{t}} \) events in which the hadronically-decaying top quark is correctly identified (“\({t}\overline{{t}} (1\ell )\) matched”), \(\ell +\text{jets}\) \({t}\overline{{t}} \) events in which an incorrect combination of jets is tagged (“\({t}\overline{{t}} (1\ell )\) combinatorial”), and events with no hadronically-decaying top quarks that contain a mistagged jet triplet (“other background”). Dileptonic \({t}\overline{{t}} \) events are used to extract the nonhadronic mistag rate in data. Then, \(\ell +\text{jets}\) \({t}\overline{{t}} \) events are used to extract the tagging and mistagging efficiencies for hadronically-decaying top quarks through a fit to the trijet mass distribution. Mass templates obtained from simulation are associated with each efficiency term in the fit. The efficiency of the RTT \(>0\) selection for events determined to be \({t}\overline{{t}} (1\ell )\) matched, \({t}\overline{{t}} (1\ell )\) combinatorial, or other background are \(0.97 \pm 0.03\), \(0.80 \pm 0.05\), and \(0.69 \pm 0.02\), respectively. Corresponding data-to-simulation scale factors are found to be consistent with unity.

The \({b} \overline{{b}} +p_{\mathrm {T}} ^\text{miss} \) search includes vetoes on hadronically-decaying \(\tau \) leptons, which are reconstructed from PF candidates using the “hadron plus strips” algorithm [38]. The algorithm combines one or three charged pions with up to two neutral pions. Neutral pions are reconstructed by the PF algorithm from the photons that arise from \(\pi ^0\rightarrow \gamma \gamma \) decay. Photons are reconstructed from ECAL energy clusters, which are corrected to recover the energy deposited by photon conversions and bremsstrahlung. Photons are identified and distinguished from jets and electrons using cut-based criteria that include the isolation and transverse shape of the ECAL deposit, and the ratio of HCAL/ECAL energies in a region surrounding the candidate photon.

3 Modeling and simulation

The associated production of DM and heavy-flavor quark pairs provides rich detector signatures that include significant \(p_{\mathrm {T}} ^\text{miss} \) accompanied by high-\(p_{\mathrm {T}} \) jets, bottom quarks, and leptons. The largest backgrounds in the \({t}\overline{{t}} +p_{\mathrm {T}} ^\text{miss} \) and \({b} \overline{{b}} +p_{\mathrm {T}} ^\text{miss} \) searches are SM \({t}\overline{{t}} \) events, inclusive W boson production in which the W decays leptonically (\(\text{W}(\ell \nu )+\text{jets}\)), and inclusive Z boson production in which the Z decays to neutrinos (\({Z} (\nu \bar{\nu })+\text{jets}\)). Simulated events are used throughout the analysis to determine signal and background expectations. Where possible, corrections determined from data are applied to the simulations.

Monte Carlo (MC) samples of SM \({t}\overline{{t}} \) and single t backgrounds are generated at next-to-leading order (NLO) in quantum chromodynamics (QCD) using Powhegv2 and Powhegv1 [39,40,41], respectively. As with all MC generators subsequently described, Powheg is interfaced with Pythia8.205 [42] for parton showering using the CUETP8M1 tune [43]. Samples of \({Z} +\text{jets}\), \(\text{W}+\text{jets}\), and QCD multijet events are produced at leading order (LO) using MG5_amc@nlo v2.2.2 [44] with the MLM prescription [45] for matching jets from the matrix element calculation to the parton shower description. The \(\text{W}+\text{jets}\) and \({Z} +\text{jets}\) samples are corrected using EWK and QCD NLO/LO K-factors calculated with MG5_amc@nlo as functions of the generated boson \(p_{\mathrm {T}} \). The simulation of \({t}\overline{{t}} +\gamma \), \({t}\overline{{t}} +\text{W}\), and \({t}\overline{{t}} +{Z} \) events makes use of NLO matrix element calculations implemented in MG5_amc@nlo , and the FxFx [46] prescription to merge multileg processes. Diboson processes (WW, WZ, and ZZ) are generated at NLO using either MG5_amc@nlo or Powhegv2.

The signal processes are simulated using simplified models that were developed in the LHC Dark Matter Forum (DMF) [21]. The DM particles \(\chi \) are assumed to be Dirac fermions, and the mediators are spin-0 particles with scalar (\(\phi \)) or pseudoscalar (\(\mathrm {a}\)) couplings. The coupling strength of the mediator to SM fermions is assumed to be \(g_{{q} {q}}= g_{{q}} y_{{q}}\) where: \(y_{{q}}=\sqrt{2}m_{{q}}/v\) is the SM Yukawa coupling, \(m_{{q}}\) is the quark mass, and \(v = 246\,\text{GeV} \) is the Higgs field vacuum expectation value. As per the recommendations of the LHC Dark Matter Working Group [47], \(g_{{q}} \) is taken to be flavor universal and equal to 1. Likewise, the coupling strength of the mediator to DM, \(g_{\chi }\), is set to 1 and is independent of the DM mass. The LHC DMF spin-0 models do not account for mixing between the \(\phi \) scalar and the SM Higgs boson [48]. As is discussed in [21], the \(p_{\mathrm {T}} ^\text{miss} \) spectra of both the scalar and pseudoscalar mediated processes broaden with increasing mediator mass. For \(m_{\phi /\mathrm {a}}\) larger than twice the top quark mass (\(m_{\text{top}}\)), the \(p_{\mathrm {T}} ^\text{miss} \) distributions of the scalar and pseudoscalar processes are essentially identical. As \(m_{\phi /\mathrm {a}}\) decreases below \(2m_{\text{top}} \), the \(p_{\mathrm {T}} ^\text{miss} \) spectra of the two processes increasingly differ, with the distribution of the scalar process peaking at lower \(p_{\mathrm {T}} ^\text{miss} \) values [49, 50]. For all mediator masses, the total cross section of the scalar process is larger than that of the pseudoscalar equivalent [50]. This analysis focuses on the \(m_{\chi } = 1\,\text{GeV} \) LHC DMF benchmark point, which provides a convenient signal reference for both low and high mass mediators.

The \({t}\overline{{t}} +\chi \overline{\chi } \) and \({b} \overline{{b}} +\chi \overline{\chi }\) signals are generated at LO in QCD using MG5_amc@nlo with up to one additional jet in the final state. Jets from the matrix element calculations are matched to the parton shower descriptions using the MLM prescription. Angular correlations in the decays of the top quarks are included using MadSpinv 2.2.2 [51]. Minimum decay widths are assumed for the mediators, and are calculated from the partial width formulas given in Ref. [52]. This calculation assumes that the spin-0 mediators couple only to SM quarks and the DM fermion \(\chi \). Simulated signal samples are produced for a DM mass of \(m_{\chi } = 1\,\text{GeV} \) and for mediator masses in the range of 10–500\(\,\text{GeV}\). The relative width of the scalar (pseudoscalar) mediator varies between 4 and 6% (4–8%) for this mediator mass range. The predicted rates of the \({b} \overline{{b}} +\chi \overline{\chi }\) process, which is generated in the 4-flavor scheme, are adjusted to match the cross sections calculated in the 5-flavor scheme [21, 53].

All samples generated at LO and NLO use corresponding NNPDF3.0 [54] parton distribution function (PDF) sets. All signal and background samples are processed using a detailed simulation of the CMS detector based on Geant4 [55]. The samples are reweighted to account for the distribution of pileup observed in data.

Table 1 Overview of the selection criteria used to define the eight \({t}\overline{{t}} +p_{\mathrm {T}} ^\text{miss} \) and \({b} \overline{{b}} +p_{\mathrm {T}} ^\text{miss} \) signal regions. The signal region selections (including the definitions of the variables \(M_{\mathrm {T}} \) and \(M^{\text{W}}_{\mathrm {T2}} \)) are described in detail in Sect. 4.1. Vetoes are applied in the dileptonic \({t}\overline{{t}} +p_{\mathrm {T}} ^\text{miss} \) signal region to remove overlaps with the \(\ell +\text{jets}\) \({t}\overline{{t}} +p_{\mathrm {T}} ^\text{miss} \) and \({b} \overline{{b}} +p_{\mathrm {T}} ^\text{miss} \) control regions. These control regions are summarized in Table 2 and discussed in Sect. 4.2

4 Event selection

Signal events are expected to exhibit both large \(p_{\mathrm {T}} ^\text{miss} \) from the production of two noninteracting DM particles and event topologies consistent with the presence of top quarks or b quark jets. Data are therefore collected using triggers that select events containing large \(p_{\mathrm {T}} ^\text{miss} \) or high-\(p_{\mathrm {T}} \) leptons. Data for the dileptonic and \(\ell +\text{jets}\) \({t}\overline{{t}} +p_{\mathrm {T}} ^\text{miss} \) searches are obtained using single-lepton triggers that require an electron (muon) with \(p_{\mathrm {T}} \ge 27 ~ (20)\,\text{GeV} \). These trigger selections are more than \(90\%\) efficient for PF-reconstructed electrons and muons that satisfy the \(p_{\mathrm {T}} \), identification, and isolation requirements imposed. The trigger used for the \({b} \overline{{b}} +p_{\mathrm {T}} ^\text{miss} \) and all-hadronic \({t}\overline{{t}} +p_{\mathrm {T}} ^\text{miss} \) searches selects events based on the amount of \(p_{\mathrm {T}} ^\text{miss} \) and \({ H_{\mathrm {T}}^{\text{miss}}} \) reconstructed using a coarse version of the PF algorithm. The \({ H_{\mathrm {T}}^{\text{miss}}} \) variable is defined as the magnitude of the vector sum of the \(p_{\mathrm {T}}\) of all jets in the event with \(p_{\mathrm {T}} > 20\,\text{GeV} \), \(|\eta |<5.0\). Jets reconstructed from detector noise are removed in the \({ H_{\mathrm {T}}^{\text{miss}}} \) calculation by additionally requiring neutral hadron energy fractions of less than 0.9. The \(p_{\mathrm {T}} ^\text{miss} \) and \({ H_{\mathrm {T}}^{\text{miss}}} \) requirements for this trigger are 120\(\,\text{GeV}\). The trigger is nearly 100% efficient for events that satisfy subsequent selections based on fully-reconstructed PF \(p_{\mathrm {T}} ^\text{miss} \).

Additional selections, described in Sect. 4.1 and summarized in Table 1, are applied to define eight independent regions of data that are sensitive to DM signals: two \({b} \overline{{b}} +p_{\mathrm {T}} ^\text{miss} \), one \(\ell +\text{jets}\) \({t}\overline{{t}} +p_{\mathrm {T}} ^\text{miss} \), three dileptonic \({t}\overline{{t}} +p_{\mathrm {T}} ^\text{miss} \), and two all-hadronic \({t}\overline{{t}} +p_{\mathrm {T}} ^\text{miss} \) regions. Control regions (CRs) enriched in various background processes are also defined and are used to improve background estimates in the aforementioned signal regions (SRs). In the CRs, individual signal selection requirements are inverted to enhance background yields and to prevent event overlaps with the SRs. Collectively, the SRs and CRs associated with the individual \({t}\overline{{t}} +\chi \overline{\chi } \) and \({b} \overline{{b}} +\chi \overline{\chi }\) production and decay modes are referred to as “channels”. The \({b} \overline{{b}} +\chi \overline{\chi }\) channel and the three \({t}\overline{{t}} +\chi \overline{\chi } \) channels are used in simultaneous \(p_{\mathrm {T}} ^\text{miss} \) fits (described in Sect. 5) to extract a potential DM signal. The fits allow the background-enriched CRs to constrain the contributions of SM \({t}\overline{{t}} \), \(\text{W}+\text{jets}\), and \({Z} +\text{jets}\) processes within the CRs and SRs of each channel. The selections used to define the SRs and CRs are described in Sects. 4.1 and 4.2, respectively. Tables 1 and 2 briefly summarize these selections. Table 2 defines a CR labeling scheme that is extensively used in subsequent sections.

Table 2 Overview of the selection criteria used to define the background control regions associated with the \({t}\overline{{t}} +p_{\mathrm {T}} ^\text{miss} \) and \({b} \overline{{b}} +p_{\mathrm {T}} ^\text{miss} \) signal regions. The control region selections are described in detail in Sect. 4.2

4.1 Signal region selections

Dileptonic \({\varvec{{t}\overline{{t}} +p_{\mathrm {T}} ^\text{miss} }}\) Events in the dileptonic \({t}\overline{{t}} \) SR are required to contain exactly two leptons that satisfy stringent identification and isolation requirements. One of the leptons must have \(p_{\mathrm {T}} > 30\,\text{GeV} \), while the second must have \(p_{\mathrm {T}} > 10\,\text{GeV} \). Events containing additional, loosely identified leptons with \(p_{\mathrm {T}} > 10\,\text{GeV} \) are rejected. Events are also required to have \(p_{\mathrm {T}} ^\text{miss} > 50\,\text{GeV} \), and to contain two or more jets, at least one of which must satisfy b tagging requirements. Overlaps between the dileptonic SR and the dileptonic and \({Z} +\text{jets}\) CRs of the \(\ell +\text{jets}\) \({t}\overline{{t}} +p_{\mathrm {T}} ^\text{miss} \) and \({b} \overline{{b}} +p_{\mathrm {T}} ^\text{miss} \) channels (discussed in Sect. 4.2) are removed by vetoing events that satisfy the selections for those CRs. These vetoes remove 2.5% of the events from the dileptonic \({t}\overline{{t}} +p_{\mathrm {T}} ^\text{miss} \) SR. The azimuthal opening angle between the \(p_{\mathrm {T}} \) vector of the dilepton system and the \(p_{\mathrm {T}} ^\text{miss} \) vector, \(\Delta \phi ({\mathbf {p}}_{\mathrm {T}} ^{\ell \ell },{\mathbf {p}}_{\mathrm {T}}^{\text{miss}}) \), is required to be larger than 1.2 radians. This requirement preferentially selects events consistent with a \({t}\overline{{t}} \) system recoiling against the invisibly decaying DM mediator. The dilepton mass, \(m_{\ell \ell }\), is required to be larger than \(20\,\text{GeV} \). In dielectron and dimuon events, \(m_{\ell \ell }\) is also required to be at least \(15\,\text{GeV} \) away from the Z boson mass [56]. These requirements reduce backgrounds from low-mass dilepton resonances and from leptonic Z boson decays.

Events that satisfy these criteria are divided among three SR categories that correspond to the flavor assignments of the two selected leptons: \(\text{ee} \), \(\text{e}\mu \), and \(\mu \mu \). Signal efficiencies for the dileptonic \({t}\overline{{t}} +p_{\mathrm {T}} ^\text{miss} \) SR event selections range from \(6 \times 10^{-3}\) to \(10^{-2}\) for mediator masses between \(10\,\text{GeV} \) and \(500\,\text{GeV} \). The denominator used in the efficiency calculation is the total number of signal events, irrespective of the \({t}\overline{{t}} \) final state. The low efficiencies result primarily from the small dileptonic branching fraction.

\({\varvec{\ell }}+\mathbf jets {\varvec{{t}\overline{{t}} +p_{\mathrm {T}} ^\text{miss} }}\) Events in the \(\ell +\text{jets}\) \({t}\overline{{t}} \) SR are selected by requiring \(p_{\mathrm {T}} ^\text{miss} >160\,\text{GeV} \), exactly one lepton, and three or more jets, of which at least one must satisfy the b tagging criteria. The lepton is required to have \(p_{\mathrm {T}} > 30\,\text{GeV} \), and to pass tight identification criteria. Events must not contain additional leptons with \(p_{\mathrm {T}} > 10\,\text{GeV} \) that satisfy a looser set of identification requirements. To reduce SM \(\ell +\text{jets}\) \({t}\overline{{t}} \) and \(\text{W}+\text{jets}\) backgrounds, the transverse mass, calculated from \({\mathbf {p}}_{\mathrm {T}}^{\text{miss}} \) and the lepton momentum (\(\mathbf {p}_{\mathrm {T}}^{\ell }\)) as:

$$\begin{aligned} M_{\mathrm {T}} =\sqrt{2p_{\mathrm {T}} ^{\ell }p_{\mathrm {T}} ^\text{miss} (1-\cos \Delta \phi (\mathbf {p}_{\text{T}}^{\ell },{\mathbf {p}}_{\mathrm {T}}^{\text{miss}}))}, \end{aligned}$$
(1)

is required to be larger than \(160\,\text{GeV} \).

Following these selections, the remaining background events primarily consist of dileptonic \({t}\overline{{t}} \) final states in which one of the leptons is not identified. Because of the requirement of \(p_{\mathrm {T}} ^\text{miss} > 160\,\text{GeV} \), this background tends to contain events with Lorentz-boosted top quark decays in which the b jet is closely aligned with the direction of the neutrino. This background is suppressed by requiring that the smallest azimuthal angle formed from the missing transverse momentum vector and each of the two highest \(p_{\mathrm {T}} \) jets in the event, \(\min \Delta \phi (\overrightarrow{p} _{{\text{T}}}^{{{\text{jet}}_{\text{i}} }}, \overrightarrow{p} _{{\text{T}}}^{{{\text{miss}}}} ) \) where \(i=1,2\), be larger than 1.2 radians. In addition, the \(M^{\text{W}}_{\mathrm {T2}} \) variable [57] is required to be larger than \(200\,\text{GeV} \). This variable is defined as:

$$\begin{aligned} M_{\mathrm {T}2}^{\text{W}} = \min \left\{ m_{y} \text{ consistent with: } \left[ \begin{aligned}&\mathbf {p}_{1}^{T} + \mathbf {p}_{2}^{T} = {\mathbf {p}}_{\mathrm {T}}^{\text{miss}}, p_{1}^{2}=0, (p_{1} + p_{l})^{2} = p_{2}^{2} = M^{2}_{\text{W}}, \\&(p_{1}+p_{l}+p_{b1})^{2} = (p_{2} + p_{b2})^{2} = m^{2}_{y} \end{aligned} \right] \right\} \end{aligned}$$
(2)

where \(m_{y}\) is the mass of two parent particles that each decay to bW(\(\ell \nu \)). One of the W decays is assumed to produce a lepton that is not reconstructed. For the W decay that does produce a reconstructed lepton, the neutrino and lepton 4-momenta are denoted \(p_{1}\) and \(p_{\ell }\), respectively. The 4-momentum of the W that produces the unreconstructed lepton is denoted \(p_{2}\), while the momenta of the two b candidates are referred to as \(p_{b1}\) and \(p_{b2}\). Assuming perfect measurements, the \(M^{\text{W}}_{\mathrm {T2}} \) has a kinematic end-point at \(m_{\text{top}}\)   for \({t}\overline{{t}} \) events, whereas signal events lack this feature because both the neutrino and DM particles contribute to \(p_{\mathrm {T}} ^\text{miss} \).

The efficiency of the \(\ell +\text{jets}\) \({t}\overline{{t}} +p_{\mathrm {T}} ^\text{miss} \) SR event selections for the \({t}\overline{{t}} +\chi \overline{\chi } \) process range from \(10^{-4}\) for mediator masses of the order of \(10\,\text{GeV} \), to \(10^{-3}\) for masses of about 500\(\,\text{GeV}\). Signal efficiencies are low because of the stringent \(p_{\mathrm {T}} ^\text{miss} \) requirement applied. The efficiency improves with increasing mediator mass because of the broadening of the \(p_{\mathrm {T}} ^\text{miss} \) spectrum.

All-hadronic \({\varvec{{t}\overline{{t}} +p_{\mathrm {T}} ^\text{miss} }}\) Any event with a loosely identified lepton with \(p_{\mathrm {T}} > 10\,\text{GeV} \) is vetoed from the all-hadronic \({t}\overline{{t}} +p_{\mathrm {T}} ^\text{miss} \) SRs. The \(p_{\mathrm {T}} ^\text{miss} \) value must be larger than \(200\,\text{GeV} \), and four or more jets are required, at least one of which must satisfy b tagging criteria. Spurious \(p_{\mathrm {T}} ^\text{miss} \) can arise in multijet events due to jet energy mismeasurement. In such cases, the reconstructed \(p_{\mathrm {T}} ^\text{miss} \) tends to align with one of the jets. Multijet background is suppressed by requiring that \(\min \Delta \phi (\overrightarrow{p} _{{\text{T}}}^{{{\text{jet}}_{\text{i}} }}, \overrightarrow{p} _{{\text{T}}}^{{{\text{miss}}}} ) >0.4\) or 1 radian (depending on the number of RTT tags, as described below) for all jets in the event. The \(\min \Delta \phi (\overrightarrow{p} _{{\text{T}}}^{{{\text{jet}}_{\text{i}} }}, \overrightarrow{p} _{{\text{T}}}^{{{\text{miss}}}} ) \) selections also help to reduce \(\ell +\text{jets}\) \({t}\overline{{t}} \) background, for which the \(p_{\mathrm {T}} ^\text{miss} \) vector is typically aligned with a b jet.

Following these selection requirements, the dominant residual background is \(\ell +\text{jets}\) SM \({t}\overline{{t}} \) production. By contrast, selected signal typically includes events in which both top quarks decay hadronically. The resolved top quark tagger (RTT, introduced in Sect. 2) is employed to suppress the \(\ell +\text{jets}\) background by identifying potential hadronic top quark decays. The RTT is applied to the all-hadronic search region to define a category of events with two hadronic top quark decays. In this double-tag (2 RTT) category, one or more b-tagged jets are required and \(\min \Delta \phi (\overrightarrow{p} _{{\text{T}}}^{{{\text{jet}}_{\text{i}} }}, \overrightarrow{p} _{{\text{T}}}^{{{\text{miss}}}} ) >0.4\) radians is imposed for all jets in the event. The 2 RTT category implicitly requires at least six jets in the event. A second category is defined for events with 0 or 1 top quark tags (0, 1 RTT), four or more jets with at least two b-tagged jets, and a tighter requirement of \(\min \Delta \phi (\overrightarrow{p} _{{\text{T}}}^{{{\text{jet}}_{\text{i}} }}, \overrightarrow{p} _{{\text{T}}}^{{{\text{miss}}}} ) >1\) radian.

The selection efficiency for \({t}\overline{{t}} +\chi \overline{\chi } \) events in the all-hadronic \({t}\overline{{t}} +p_{\mathrm {T}} ^\text{miss} \) SRs ranges from \(10^{-3}\) for mediator masses of the order of 10\(\,\text{GeV}\) to \(10^{-1}\) for masses near \(500\,\text{GeV} \). These values are larger than the corresponding efficiencies of the dileptonic and \(\ell +\text{jets}\) SR selections because of the larger branching fraction to the all-hadronic final state.

\({\varvec{{b} \overline{{b}} +p_{\mathrm {T}} ^\text{miss} }}\) Events with \(p_{\mathrm {T}} ^\text{miss} >200\,\text{GeV} \) are selected for the SRs of this final state. Events containing identified and isolated electrons or muons with \(p_{\mathrm {T}} \) larger than 10\(\,\text{GeV}\) or identified \(\tau \) leptons with \(p_{\mathrm {T}} > 18\,\text{GeV} \) are rejected. Multijet background is reduced by requiring \(\min \Delta \phi (\overrightarrow{p} _{{\text{T}}}^{{{\text{jet}}_{\text{i}} }}, \overrightarrow{p} _{{\text{T}}}^{{{\text{miss}}}} ) > 0.5\) radians for all jets in the event.

Following these selections, two exclusive event categories are defined using the number of jets and b-tagged jets in the event. The single b-tagged jet category provides high efficiency for \({b} \overline{{b}} +\chi \overline{\chi }\) signal and requires at most two jets. At least one of these jets must have \(p_{\mathrm {T}} > 50\,\text{GeV} \), and exactly one must satisfy b tagging requirements. The second category allows exactly two b-tagged jets. This SR selects \({b} \overline{{b}} +\chi \overline{\chi }\) signal and partially recovers \({t}\overline{{t}} +\chi \overline{\chi } \) events that are not selected in the all-hadronic \({t}\overline{{t}} +p_{\mathrm {T}} ^\text{miss} \) categories. At most three jets are allowed in the 2 b tag SR, and at least two of these jets must have \(p_{\mathrm {T}} > 50\,\text{GeV} \).

The efficiency of the \({b} \overline{{b}} +p_{\mathrm {T}} ^\text{miss} \) SR event selections for the \({b} \overline{{b}} +\chi \overline{\chi }\) process range from \(10^{-6}\) for mediator masses of the order of 10\(\,\text{GeV}\), to \(10^{-2}\) for masses of 500\(\,\text{GeV}\). The selection efficiency for the \({t}\overline{{t}} +\chi \overline{\chi } \) process is found to be less dependent on the mediator mass, and varies from \(10^{-4}\) to \(10^{-3}\) for the same mass range.

4.2 Background control region selections

Figure 3 shows the simulated background yields in each of the SRs following the selections of Sect. 4.1. Clearly, the dominant backgrounds in the SRs are from the SM \({t}\overline{{t}} \), \(\text{W}+\text{jets}\), and \({Z} +\text{jets}\) processes. The estimation of backgrounds in the SRs is improved through the use of corresponding data CRs enriched in these processes. Independent CRs are defined for each of the \(\ell +\text{jets}\) \({t}\overline{{t}} +p_{\mathrm {T}} ^\text{miss} \), all-hadronic \({t}\overline{{t}} +p_{\mathrm {T}} ^\text{miss} \) and \({b} \overline{{b}} +p_{\mathrm {T}} ^\text{miss} \) SRs. In some cases, multiple CRs are used to constrain a given background process in a SR. In this section we describe the main \({t}\overline{{t}} \), \(\text{W}+\text{jets}\), and \({Z} +\text{jets}\) backgrounds and the selections used to define the CRs. The CR selections are designed to ensure that these regions are both mutually exclusive and exclusive of the SRs as well. The contributions of multijet, diboson, single t, and \({t}\overline{{t}} +{{Z}/\text{W}/}\gamma \) processes in the SRs are either subdominant or insignificant after the SR selections. The residual backgrounds from these processes are modeled with simulation. Dilepton background events from Drell–Yan and processes in which jets are misidentified as leptons are estimated using the sideband techniques described in Ref. [58].

Fig. 3
figure 3

Simulation-derived background expectations in the \({t}\overline{{t}} +p_{\mathrm {T}} ^\text{miss} \) and \({b} \overline{{b}} +p_{\mathrm {T}} ^\text{miss} \) signal regions

The remainder of this section describes how the contributions of SM backgrounds in the SRs are estimated using the CRs. The discussion utilizes the CR labeling convention defined in Table 2, for ease of reference. The CRs for the \(\ell +\text{jets}\) \({t}\overline{{t}} +p_{\mathrm {T}} ^\text{miss} \) SR are denoted slA and slB, those for the all-hadronic \({t}\overline{{t}} +p_{\mathrm {T}} ^\text{miss} \) SRs are hadA–hadG, and those for the \({b} \overline{{b}} +p_{\mathrm {T}} ^\text{miss} \) SRs are bbA–bbJ.

Section 5 describes how the CRs are simultaneously fit with the SRs to constrain the predicted normalization of the \({t}\overline{{t}} \), \(\text{W}+\text{jets}\), and \({Z} +\text{jets}\) background processes. Figures 6, 7 and 8 compare the integrated yields in each CR before and after background-only fits to the CR \(p_{\mathrm {T}} ^\text{miss} \) distributions. Reasonable agreement is found between the observed and predicted CR yields. In general, the expected and observed \(p_{\mathrm {T}} ^\text{miss} \) distributions in the CRs also agree. Regions for which the distributions of data and of the initial (“prefit”) MC disagree are noted in the text.

Dileptonic \({\varvec{{t}\overline{{t}}}}\) Dileptonic \({t}\overline{{t}} \) background in the \(\ell +\text{jets}\) \({t}\overline{{t}} \) SR consists of events in which only one of the leptons is identified. A dileptonic CR (slA) for the \(\ell +\text{jets}\) \({t}\overline{{t}} +p_{\mathrm {T}} ^\text{miss} \) search region is defined by requiring an additional lepton with respect to the \(\ell +\text{jets}\) selection, and by removing the selections on \(M_{\mathrm {T}} \), \(M^{\text{W}}_{\mathrm {T2}} \), and \(\min \Delta \phi (\overrightarrow{p} _{{\text{T}}}^{{{\text{jet}}_{\text{i}} }}, \overrightarrow{p} _{{\text{T}}}^{{{\text{miss}}}} ) \). Both leptons from dileptonic \({t}\overline{{t}} \) decays in the \(\ell +\text{jets}\) SR are typically within the detector acceptance. The lepton momenta are therefore included in the \(p_{\mathrm {T}} \) vector sum for this CR, so as to simulate the \(p_{\mathrm {T}} ^\text{miss} \) distribution expected for the dileptonic \({t}\overline{{t}} \) background in the \(\ell +\text{jets}\) SR. Mutual exclusion with the dileptonic \({t}\overline{{t}} \) and \({Z} +\text{jets}\) CRs of the \({b} \overline{{b}} +p_{\mathrm {T}} ^\text{miss} \) search region (described below) is ensured by vetoing events that additionally satisfy the selection requirements of those CRs.

The \({t}\overline{{t}} \) background in the \({b} \overline{{b}} +p_{\mathrm {T}} ^\text{miss} \) SRs consists of dileptonic and \(\ell +\text{jets}\) \({t}\overline{{t}} \) events in which no leptons are identified. Dileptonic \({t}\overline{{t}} \) CRs (bbE, bbJ) are formed for the 1 b tag and 2 b tag \({b} \overline{{b}} +p_{\mathrm {T}} ^\text{miss} \) SRs by requiring two opposite-charge, different-flavor leptons with \(p_{\mathrm {T}} >30 \,\text{GeV} \). Tight (loose) identification and isolation criteria are imposed on the leading \(p_{\mathrm {T}} \) (subleading \(p_{\mathrm {T}} \)) lepton. In contrast to the dileptonic background in the \(\ell +\text{jets}\) \({t}\overline{{t}} +p_{\mathrm {T}} ^\text{miss} \) SR, the leptons from \({t}\overline{{t}} \) in the \({b} \overline{{b}} +p_{\mathrm {T}} ^\text{miss} \) SRs typically fall outside of the detector acceptance. The momentum of the selected leptons in the \({b} \overline{{b}} +p_{\mathrm {T}} ^\text{miss} \) CRs is therefore subtracted from the \(p_{\mathrm {T}} ^\text{miss} \) observable in order to mimic the \(p_{\mathrm {T}} ^\text{miss} \) distribution in the SR. The SR requirements on \(\min \Delta \phi (\overrightarrow{p} _{{\text{T}}}^{{{\text{jet}}_{\text{i}} }}, \overrightarrow{p} _{{\text{T}}}^{{{\text{miss}}}} ) \), which primarily remove multijet background, are not imposed. All other selections from the \({b} \overline{{b}} +p_{\mathrm {T}} ^\text{miss} \) SRs are applied.

Dileptonic \({t}\overline{{t}} \) production is the dominant SM background in the dileptonic \({t}\overline{{t}} +p_{\mathrm {T}} ^\text{miss} \) SRs. Corresponding CRs are not employed for this search channel because dileptonic \({t}\overline{{t}} \) events are found to be well-modeled by simulation and are selected with high efficiency in the dileptonic SR.

\({\varvec{\ell }}+\mathbf jets {\varvec{{t}\overline{{t}}}}\) The most significant source of background in the hadronic \({t}\overline{{t}} +p_{\mathrm {T}} ^\text{miss} \) SRs is \(\ell +\text{jets}\) \({t}\overline{{t}} \) production. This process contributes to the hadronic \({t}\overline{{t}} +p_{\mathrm {T}} ^\text{miss} \) search when the lepton is not identified. Control regions for \(\ell +\text{jets}\) \({t}\overline{{t}} \) (hadA, hadE) are defined by selecting events with exactly one identified lepton with \(p_{\mathrm {T}} >30 \,\text{GeV} \), and by requiring \(M_{\mathrm {T}} <160 \,\text{GeV} \) in order to avoid overlaps with the SR of the \(\ell +\text{jets}\) channel. All other requirements used to define the hadronic SRs are applied, and the CR is split into 0,1 RTT and 2 RTT categories.

The dileptonic \({t}\overline{{t}} \) CRs for the \({b} \overline{{b}} +p_{\mathrm {T}} ^\text{miss} \) search (described above) provide stringent constraints on \({t}\overline{{t}} \) backgrounds in the corresponding SRs. Additional constraints on \({t}\overline{{t}} \) background in this channel are provided through four single-lepton CRs (bbA, bbB, bbF, and bbG). A single-electron (muon) CR for the 1 b tag SR requires exactly one electron (muon) with \(p_{\mathrm {T}} >30 \,\text{GeV} \). The lepton must satisfy tight isolation and identification criteria. The \(M_{\mathrm {T}} \) observable calculated from the lepton momenta and \(p_{\mathrm {T}} ^\text{miss} \) must satisfy \(50< M_{\mathrm {T}} < 160 \,\text{GeV} \). Except for the requirement on \(\min \Delta \phi (\overrightarrow{p} _{{\text{T}}}^{{{\text{jet}}_{\text{i}} }}, \overrightarrow{p} _{{\text{T}}}^{{{\text{miss}}}} ) \), each of the selection criteria for the 1 b tag signal category must also be satisfied. Analogous CRs for the 2 b tag signal category are formed by applying the corresponding signal selection criteria. As in the dileptonic \({t}\overline{{t}} \) CRs for the \({b} \overline{{b}} +p_{\mathrm {T}} ^\text{miss} \) searches, the lepton is removed from the \(p_{\mathrm {T}} ^\text{miss} \) calculation.

\({\varvec{{\bf W}+{\bf jets}}}\) A \(\text{W}+\text{jets}\) CR for the \(\ell +\text{jets}\) \({t}\overline{{t}} +p_{\mathrm {T}} ^\text{miss} \) search (slB) is created by requiring zero b tags. The \(M_{\mathrm {T}} >160 \,\text{GeV} \) requirement from the \(\ell +\text{jets}\) signal selection is maintained, however, the cuts on \(M^{\text{W}}_{\mathrm {T2}} \) and \(\min \Delta \phi (\overrightarrow{p} _{{\text{T}}}^{{{\text{jet}}_{\text{i}} }}, \overrightarrow{p} _{{\text{T}}}^{{{\text{miss}}}} ) \) are removed.

Control regions enriched in both \(\text{W}+\text{jets}\) and \({Z} +\text{jets}\) (hadB, hadF) are formed for the all-hadronic \({t}\overline{{t}} +p_{\mathrm {T}} ^\text{miss} \) categories by modifying the SR selections to require zero b tags. In addition, dedicated \(\text{W}+\text{jets}\) CRs (hadC, hadG) are defined by requiring the presence of an isolated, identified lepton with \(p_{\mathrm {T}} > 30 \,\text{GeV} \) and \(M_{\mathrm {T}} < 160 \,\text{GeV} \). The \(\text{W/Z}+\text{jets}\) and \(\text{W}+\text{jets}\) CRs are both categorized using the number of RTTs, as in the corresponding SRs. The prefit yields and \(p_{\mathrm {T}} ^\text{miss} \) distributions in the hadB and hadC regions are observed to differ from those of data. The discrepancy is due to a mismodeling of hadronic activity in the simulation, which leads to an overestimation of the selection efficiency for the Z+jets and W+jets processes. Reasonable agreement is achieved through the fit, as is shown in Figs 4. and 7.

Fig. 4
figure 4

Observed data, and prefit and fitted background-only \(p_{\mathrm {T}} ^\text{miss}\)  distributions in two control regions (hadB and hadC in Table 2) for the 0,1 RTT hadronic \({t}\overline{{t}} +p_{\mathrm {T}} ^\text{miss} \) signal region with 0 leptons (upper) and with 1 lepton (lower) and 0 b tags. The 0 lepton control region is used to constrain \(\text{W}+\text{jets}\) and \({Z} +\text{jets}\) backgrounds. The 1 lepton CR provides an additional constraint on \(\text{W}+\text{jets}\) background. The last bin contains overflow events. The lower panels show the ratios of observed data to fitted background yields. In both panels, the statistical uncertainties of the data are indicated as vertical error bars and the fit uncertainties are indicated as hatched bands. Prefit yields and the ratios of prefit to fitted background expectations are shown as dashed magenta histograms

The \(\text{W}+\text{jets}\) process contributes the second-largest background in the 1 b tag SR of the \({b} \overline{{b}} +p_{\mathrm {T}} ^\text{miss} \) channel. This background is constrained via the single-lepton CRs (bbA, bbB, bbF, bbG) of the \({b} \overline{{b}} +p_{\mathrm {T}} ^\text{miss} \) channel, which were introduced previously in the context of constraints on \(\ell +\text{jets}\) \({t}\overline{{t}} \) backgrounds.

\({\varvec{{\bf Z} +{\bf jets}}}\) The \({Z} (\nu \bar{\nu })+\text{jets}\) process is a significant source of background in the all-hadronic \({t}\overline{{t}} +p_{\mathrm {T}} ^\text{miss} \) SRs. This background is partially controlled via the \(\text{W/Z}+\text{jets}\) CRs (hadB, hadF) described previously. An additional constraint is derived from a distinct \({Z} (\ell \ell )+\text{jets}\) CR (hadD), in which two oppositely-charged, same-flavor leptons are required to pass tight isolation and identification requirements. The mass of the lepton pair must fall between 60 and 120 \(\,\text{GeV}\). A prediction for the \(p_{\mathrm {T}} ^\text{miss} \) distribution in the hadronic SRs is obtained by subtracting the lepton momenta in the \(p_{\mathrm {T}} ^\text{miss} \) calculation. The \({Z} (\ell \ell )+\text{jets}\) CR is not categorized in the number of RTTs because of the negligible yields obtained with two RTT tags. The selections for jets and \(p_{\mathrm {T}} ^\text{miss} \) used in the 0,1 RTT SR are applied in the \({Z} (\ell \ell )+\text{jets}\) CR, with those on \(p_{\mathrm {T}} ^\text{miss} \) applied to lepton-subtracted \(p_{\mathrm {T}} ^\text{miss} \). The requirements on \(\min \Delta \phi (\overrightarrow{p} _{{\text{T}}}^{{{\text{jet}}_{\text{i}} }}, \overrightarrow{p} _{{\text{T}}}^{{{\text{miss}}}} ) \) and b tags are removed to increase \({Z} +\text{jets}\) yields. Figure 5 demonstrates that the lepton-subtracted \(p_{\mathrm {T}} ^\text{miss} \) distribution observed in the \({Z} (\ell \ell )+\text{jets}\) CR of the all-hadronic channel is not well described by the prefit expectation. Agreement substantially improves following the fit.

Fig. 5
figure 5

Observed data, and prefit and fitted background-only, lepton-subtracted \(p_{\mathrm {T}} ^\text{miss}\)  distributions in the dileptonic control region (hadD in Table 2) for the all-hadronic \({t}\overline{{t}} +p_{\mathrm {T}} ^\text{miss} \) signal regions. This control region is used to constrain \({Z} (\nu \bar{\nu })+\text{jets}\) background. The selections for jets and \(p_{\mathrm {T}} ^\text{miss} \) used in the 0,1 RTT signal region are applied, with those on \(p_{\mathrm {T}} ^\text{miss} \) applied to lepton-subtracted \(p_{\mathrm {T}} ^\text{miss} \). The signal region requirements on \(\min \Delta \phi (\overrightarrow{p} _{{\text{T}}}^{{{\text{jet}}_{\text{i}} }}, \overrightarrow{p} _{{\text{T}}}^{{{\text{miss}}}} ) \) and b tags are removed to increase \({Z} +\text{jets}\) yields. The last bin contains overflow events. The lower panel shows the ratios of observed data to fitted background yields. In both panels, the statistical uncertainties of the data are indicated as vertical error bars and the fit uncertainties are indicated as hatched bands. Prefit yields and the ratios of prefit to fitted background expectations are shown as dashed magenta histograms

The \({Z} (\nu \bar{\nu })+\text{jets}\) process is also a significant background in the \({b} \overline{{b}} +p_{\mathrm {T}} ^\text{miss} \) SRs. This background is constrained with four distinct CRs: bbC, bbD, bbH, and bbI. The \({Z} (\text{ee})\) and \({Z} (\mu \mu )\) CRs require two electrons and two muons with \(p_{\mathrm {T}} >30\,\text{GeV} \), respectively. The isolation and identification criteria applied on the leading-\(p_{\mathrm {T}} \) lepton are identical to those used in the \(\text{W}+\text{jets}\) CRs for the \({b} \overline{{b}} +p_{\mathrm {T}} ^\text{miss} \) channel. The subleading lepton is required to satisfy a looser set of isolation and identification criteria, as in the dileptonic CRs. The leptons must be consistent with the decay of a Z boson; opposite-charge, same-flavor requirements are imposed, and the leptons must satisfy a constraint on the dilepton mass of \(70< m_{\ell \ell }< 110\,\text{GeV} \). As in the \(\text{W}+\text{jets}\) and dileptonic \({t}\overline{{t}} \) CRs, events must also satisfy all but the \(\min \Delta \phi (\overrightarrow{p} _{{\text{T}}}^{{{\text{jet}}_{\text{i}} }}, \overrightarrow{p} _{{\text{T}}}^{{{\text{miss}}}} ) \) selection criteria of the corresponding 1 b tag or 2 b tag signal category. As in the \({Z} +\text{jets}\) CR for all-hadronic \({t}\overline{{t}} \) channel, lepton momenta are subtracted in the \(p_{\mathrm {T}} ^\text{miss} \) calculation to approximate the distribution of \(p_{\mathrm {T}} ^\text{miss} \) from \({Z} (\nu \bar{\nu })+\text{jets}\) expected in the \({b} \overline{{b}} +p_{\mathrm {T}} ^\text{miss} \) SRs.

Fig. 6
figure 6

Observed data, and prefit and fitted background-only event yields in the control regions associated with the \(\ell +\text{jets}\) \({t}\overline{{t}} +p_{\mathrm {T}} ^\text{miss} \) signal region. The 2 lepton, \(\ge \) 0 b tag region (slA in Table 2) is used to constrain the dileptonic \({t}\overline{{t}} \) background in the \(\ell +\text{jets}\) \({t}\overline{{t}} +p_{\mathrm {T}} ^\text{miss} \) signal region, while the 1 lepton, 0 b tag control region (slB) constrains \(\text{W}+\text{jets}\) background. The lower panel shows the ratios of observed to fitted background yields. In both panels, the statistical uncertainties of the data are indicated as vertical error bars and the fit uncertainties as hatched bands. Prefit yields and the ratios of prefit to fitted background expectations are shown as dashed magenta histograms

Fig. 7
figure 7

Observed data, and prefit and fitted background-only event yields in the control regions associated with the 0,1 RTT (upper) and 2 RTT (lower) all-hadronic \({t}\overline{{t}} +p_{\mathrm {T}} ^\text{miss} \) signal regions. The 1 lepton, \(\ge 2\) b tag control region (hadA in Table 2) constrains \(\ell +\text{jets}\) \({t}\overline{{t}} \) background in the 0,1 RTT signal region. This process is constrained in the 2 RTT signal region using the 1 lepton, \(\ge \) 1 b tag control region (hadE). The \(\le \)1 lepton, 0 b tag control regions (hadB, hadC, hadF, hadG) constrain \(\text{W}+\text{jets}\) and \({Z} +\text{jets}\) backgrounds, while the 2 lepton, 0 b tag control region (hadD) provides an additional constraint on the \({Z} +\text{jets}\) background. The lower panels show the ratios of observed to fitted background yields. In both panels, the statistical uncertainties of the data are indicated as vertical error bars and the fit uncertainties as hatched bands. Prefit yields and the ratios of prefit to fitted background expectations are shown as dashed magenta histograms

Fig. 8
figure 8

Observed data, and prefit and fitted background-only event yields in the control regions associated with the \({b} \overline{{b}} +p_{\mathrm {T}} ^\text{miss} \) signal region with 1 b tag (upper) and with 2 b tags (lower). The 1 lepton, \(\ge \) 1 b control regions (bbA, bbB, bbF and bbG in Table 2) are used to constrain \(\text{W}+\text{jets}\) and \({t}\overline{{t}} \) backgrounds in the \({b} \overline{{b}} +p_{\mathrm {T}} ^\text{miss} \) signal regions. The dileptonic control regions (bbC-bbE, bbH-bbJ) are used to constrain \({Z} +\text{jets}\) and \({t}\overline{{t}} \) backgrounds. The lower panels show the ratio of observed to fitted background yields. In both panels, the statistical uncertainties of the data are indicated as vertical error bars and the fit uncertainties as hatched bands. Prefit yields and the ratios of prefit to fitted background expectations are shown as dashed magenta histograms

5 Signal extraction

A potential DM signal could be revealed as an excess of events relative to SM expectations in a region of high \(p_{\mathrm {T}} ^\text{miss} \). The shape of the observed \(p_{\mathrm {T}} ^\text{miss}\)  distribution provides additional information that is used in this analysis to improve the sensitivity of the search. A potential signal is searched for via simultaneous template fits to the \(p_{\mathrm {T}} ^\text{miss}\)  distributions in the SRs and the associated CRs defined in Sects. 4.1 and 4.2. Signal and background \(p_{\mathrm {T}} ^\text{miss} \) templates are derived from simulation and are parameterized to allow for constrained shape and normalization variations in the fits.

The fits are performed using the RooStats statistical software package [59]. The effects of uncertainties in the normalizations and in the \(p_{\mathrm {T}} ^\text{miss} \) shapes of signal and background processes are represented as nuisance parameters. Uncertainties that only affect normalization are modeled using nuisance parameters with log-normal probability densities. Uncertainties that affect the shape of the \(p_{\mathrm {T}} ^\text{miss} \) distribution, which may also include an overall normalization effect, are incorporated using a template “morphing” technique. These treatments, as well as the approach used to account for MC statistical uncertainties on template predictions, follow the procedures described in Ref. [60].

Within each search channel, additional unconstrained nuisance parameters scale the normalization of each dominant background process (\({t}\overline{{t}} \), \(\text{W}+\text{jets}\), and \({Z} +\text{jets}\)) across the SRs and CRs. For example, a single parameter is associated with the contribution of the \(\ell +\text{jets}\) \({t}\overline{{t}} \) process in the all-hadronic \({t}\overline{{t}} +p_{\mathrm {T}} ^\text{miss} \) SRs and CRs. A separate parameter is associated with the \(\ell +\text{jets}\) \({t}\overline{{t}} \) background in the \({b} \overline{{b}} +p_{\mathrm {T}} ^\text{miss} \) SRs and CRs. These nuisance parameters allow the data in the background-enriched CRs to constrain the background estimates in the SRs to which they correspond. Because separate nuisance parameters are used for each search channel, a given normalization parameter cannot affect background predictions in unassociated search channels. The yields and \(p_{\mathrm {T}} ^\text{miss} \) shapes of subdominant backgrounds vary in the fit only through the constrained nuisance parameters. Signal yields in the SRs and associated CRs are scaled simultaneously by signal strength parameters (\(\mu \)), defined as the ratio of the signal cross section to the theoretical cross section, \(\mu =\sigma /\sigma _\text{TH}\). The \(\mu \) parameters scale signal normalization coherently across regions, and thus account for signal contamination in the CRs.

Signal extraction is performed for the individual search channels as well as for their combination. The separate fits to the individual signal and associated CRs provide independent estimates of \({b} \overline{{b}} +\chi \overline{\chi }\) and \({t}\overline{{t}} +\chi \overline{\chi } \) contributions in each channel. In this fitting scenario, separate signal strength parameters are used for each of the search channels. The \({b} \overline{{b}} +\chi \overline{\chi }\) process is considered as a potential signal in the 1 b tag and 2 b tag regions of the \({b} \overline{{b}} +p_{\mathrm {T}} ^\text{miss} \) channel. The \({t}\overline{{t}} +\chi \overline{\chi } \) process is searched for in all SRs of the \({b} \overline{{b}} +p_{\mathrm {T}} ^\text{miss} \) and \({t}\overline{{t}} +p_{\mathrm {T}} ^\text{miss} \) channels separately. The contribution of the \({b} \overline{{b}} +\chi \overline{\chi }\) process in the all-hadronic \({t}\overline{{t}} +p_{\mathrm {T}} ^\text{miss} \) channel is negligible due to the jet multiplicity requirement. An inclusive fit to all signal and CRs is also performed. This fit uses a single signal strength parameter to extract the combined contribution of \({t}\overline{{t}} +\chi \overline{\chi } \) and \({b} \overline{{b}} +\chi \overline{\chi }\) in data. Additional details on the per-channel and combined fits are provided in Sect. 7.

6 Systematic uncertainties

Table 3 summarizes the uncertainties considered in the signal extraction fits. The procedures used to evaluate the uncertainties are described later in this section. Normalization uncertainties are expressed relative to the predicted central values of the corresponding nuisance parameters. These uncertainties are used to specify the widths of the associated log-normal probability densities. The integrated luminosity, b tagging efficiency, \(p_{\mathrm {T}} ^\text{miss} \) trigger efficiency, pileup, and multijet/single t background normalization uncertainties are taken to be fully correlated across SRs and CRs. Shape uncertainties are expressed in Table 3 as the change in the prefit yields of the lowest and highest \(p_{\mathrm {T}} ^\text{miss} \) bins resulting from a variation of the corresponding nuisance by ± 1 standard deviation (s.d.). These uncertainties are propagated to the fit by using the full \(p_{\mathrm {T}} ^\text{miss} \) spectra obtained from ±1 s.d. variations of the corresponding nuisance parameters [60]. The PDF and jet energy scale shape uncertainties are taken to be fully correlated across SRs and CRs. In general, the uncertainty estimation is performed in the same way for signal and background processes; however, the uncertainty from missing higher-order corrections for signal processes, which is approximately 30% at LO in QCD, is not considered to facilitate a comparison with other CMS DM results.

Table 3 Summary of systematic uncertainties in the signal regions of each search channel. The values given for uncertainties that are not process specific correspond to the dominant background in each signal region (i.e. \({Z} +\text{jets}\) in the 1 b tag \({b} \overline{{b}} +p_{\mathrm {T}} ^\text{miss} \) region, and \({t}\overline{{t}} \) in all others). The systematic uncertainties are categorized as affecting either the normalization or the shape of the \(p_{\mathrm {T}} ^\text{miss} \) distribution. For shape uncertainties, the ranges quoted give the uncertainty in the yield for the lowest \(p_{\mathrm {T}} ^\text{miss} \) bin and for the highest \(p_{\mathrm {T}} ^\text{miss} \) bin. Sources of systematic uncertainties that are common across channels are considered to be fully correlated in the channel combination fit
Table 4 Fitted background yields for a background-only hypothesis in the \({t}\overline{{t}} +p_{\mathrm {T}} ^\text{miss} \) and \({b} \overline{{b}} +p_{\mathrm {T}} ^\text{miss} \) signal regions. The yields are obtained from separate fits to the \({b} \overline{{b}} +p_{\mathrm {T}} ^\text{miss} \) and individual \({t}\overline{{t}} +p_{\mathrm {T}} ^\text{miss} \) search channels. Prefit yields for DM produced via a pseudoscalar mediator with mass \(m_{\mathrm {a}}=50 \,\text{GeV} \) and a scalar mediator with mass \(m_{\phi }=100 \,\text{GeV} \) are also shown. Mediator couplings are set to \(g_{{q}} =g_{\chi }=1\), and a DM particle of mass \(m_{\chi }=1 \,\text{GeV} \) is assumed. Uncertainties include both statistical and systematic components
Fig. 9
figure 9

The \(p_{\mathrm {T}} ^\text{miss} \) distributions in the following signal regions: dileptonic \({t}\overline{{t}} +p_{\mathrm {T}} ^\text{miss} \) in the \(\text{ee} \) signal region (upper left), in the \(\mu \mu \) region (upper right), in the \(\text{e}\mu \) region (lower left), and in \(\ell +\text{jets}\) \({t}\overline{{t}} +p_{\mathrm {T}} ^\text{miss} \) region (lower right). The \(p_{\mathrm {T}} ^\text{miss} \) distributions of background correspond to background-only fits to the individual \({t}\overline{{t}} +p_{\mathrm {T}} ^\text{miss} \) signal regions and associated background control regions. The prefit \(p_{\mathrm {T}} ^\text{miss} \) distribution of an example signal (pseudoscalar mediator, \(m_{\mathrm {a}} = 300 \,\text{GeV} \) and \(m_{\chi } = 1 \,\text{GeV} \)) is scaled up by a factor of 20. The last bin contains overflow events. The lower panels of each plot show the ratio of observed data to fitted background. The uncertainty bands shown in these panels are the fitted values, and the magenta lines correspond to the ratio of prefit to fitted background expectations

Fig. 10
figure 10

The \(p_{\mathrm {T}} ^\text{miss} \) distributions in the following signal regions: all-hadronic \({t}\overline{{t}} +p_{\mathrm {T}} ^\text{miss} \) with 0 or 1 RTTs (upper left), all-hadronic \({t}\overline{{t}} +p_{\mathrm {T}} ^\text{miss} \) with 2 RTTs (upper right), \({b} \overline{{b}} +p_{\mathrm {T}} ^\text{miss} \) with 1 b tag (lower left), and \({b} \overline{{b}} +p_{\mathrm {T}} ^\text{miss} \) with 2 b tags (lower right). The \(p_{\mathrm {T}} ^\text{miss} \) distributions of background correspond to background-only fits to the individual \({t}\overline{{t}} +p_{\mathrm {T}} ^\text{miss} \) and \({b} \overline{{b}} +p_{\mathrm {T}} ^\text{miss} \) signal regions and associated background control regions. The prefit \(p_{\mathrm {T}} ^\text{miss} \) distribution of an example signal (pseudoscalar mediator, \(m_{\mathrm {a}} = 300 \,\text{GeV} \) and \(m_{\chi } = 1\,\text{GeV} \)) is scaled up by a factor of 20. The last bin contains overflow events. The lower panels of each plot show the ratio of observed data to fitted background. The uncertainty bands shown in these panels are the fitted values, and the magenta lines correspond to the ratio of prefit to fitted background expectations

The following sources of uncertainty correspond to constrained normalization nuisance parameters in the fit:

  • Integrated luminosity An uncertainty of 2.7% is used for the integrated luminosity of the data sample [61].

  • Pileup modeling Systematic uncertainties due to pileup modeling are taken into account by varying the total inelastic cross section used to calculate the data pileup distributions by ± 5%. Normalization differences in the range of 0.2–1.4% result from reweighting the simulation accordingly.

  • W/Z + heavy-flavor fraction The uncertainty in the fraction of W/Z + heavy-flavor jets is assigned to account for the usage of CRs dominated by light-flavor jets in constraining the prediction of \(\text{W}+\text{jets}\) and \({Z} +\text{jets}\) in SRs that require b tags. The flavor fractions for the \(\text{W}+\text{jets}\) and \({Z} +\text{jets}\) processes are allowed to vary independently within 20% [62,63,64,65].

  • Drell–Yan background: The uncertainties in the data-driven Drell–Yan background estimates for the dileptonic channels are 64% (\(\text{ee} \)) and 43% (\(\mu \mu \)). These uncertainties are dominated by the statistical uncertainties in quantities used to extrapolate yields from a region near the Z boson mass to regions away from it. Again, these relatively large uncertainties have little effect on the sensitivity of the search.

  • Multijet background normalization Uncertainties of 50–100% (depending on the SR) are applied in the normalization of multijet backgrounds to cover tail effects that are not well modeled by the simulation.

  • Misidentified-lepton background The sources of uncertainty in the misidentified-lepton background for the dileptonic search stem from the uncertainty in the measured misidentification rate, and from the statistical uncertainty of the single-lepton control sample to which the rate is applied. The uncertainties per channel are 200% (\(\text{ee} \)), 48% (\(\text{e}\mu \)), 30% (\(\mu \mu \)), and are dominated by the statistical uncertainty associated with the single-lepton control sample. Because the misidentified lepton background is small, these relatively large uncertainties do not significantly degrade the sensitivity of the search.

  • RTT efficiency Jet energy scale and resolution uncertainties are propagated to the RTT efficiency scale factors by using modified shape templates in the efficiency extraction fit. A systematic uncertainty due to the choice of parton showering scheme is estimated by comparing the efficiencies obtained with default and alternative \(p_{\mathrm {T}} ^\text{miss} \) templates. The default simulation is showered using Pythia8.205, which implements dipole-based parton showering. The alternative templates are derived from simulated events that are showered with Herwig [66], which uses an angular-ordered shower model. Overall, statistical plus systematic uncertainties of 6, 3, and 3% are assigned for the hadronic tag, hadronic mistag, and nonhadronic mistag scale factors, respectively. These correspond to an overall normalization uncertainty for the \({t}\overline{{t}} +p_{\mathrm {T}} ^\text{miss} \) SRs of 4%.

  • b tagging efficiency The b tagging efficiency and its uncertainty are measured using independent control samples. Uncertainties from gluon splitting, the b quark fragmentation function, and the selections used to define the control samples are propagated to the efficiency scale factors [31]. The corresponding normalization uncertainty ranges from 2.2 to 12%.

  • Lepton identification and trigger efficiency: The uncertainty in lepton identification and triggering efficiency is measured with samples of Z bosons decaying to dielectrons and dimuons [34]. The corresponding normalization uncertainty ranges from 2 to 4%.

  • \(p_{\mathrm {T}} ^\text{miss} \) trigger Uncertainties of 0.3–2% (depending on the SR) are associated with the efficiency scale factors of the \(p_{\mathrm {T}} ^\text{miss} \) trigger. The efficiency of this trigger is measured using data collected with the single-lepton triggers. For values of \(p_{\mathrm {T}} ^\text{miss} >200\,\text{GeV} \), these data primarily consist of \(\text{W}+\text{jets}\) events.

Table 5 Observed and expected 95% CL upper limits on the ratios (\(\mu \)) of the observed \({t}\overline{{t}} +\chi \overline{\chi } \) and \({b} \overline{{b}} +\chi \overline{\chi }\) cross sections to the simplified model expectations. The limits correspond to separate fits to the \({b} \overline{{b}} +p_{\mathrm {T}} ^\text{miss} \) and individual \({t}\overline{{t}} +p_{\mathrm {T}} ^\text{miss} \) search channels. DM mediators with scalar couplings of \(g_{{q}} =g_{\chi }=1\) are assumed
Table 6 Same as Table 5, but for DM mediators with pseudoscalar couplings. Again, mediator couplings correspond to \(g_{{q}} =g_{\chi }=1\)

The following sources of uncertainty correspond to constrained \(p_{\mathrm {T}} ^\text{miss}\)  shape nuisance parameters in the fit:

  • PDF uncertainties Uncertainties due to the choice of PDFs are estimated by reweighting the samples with the ensemble of PDF replicas provided by NNPDF3.0 [67]. The standard deviation of the reweighted \(p_{\mathrm {T}} ^\text{miss} \) shapes is used as an estimate of the uncertainty.

  • Jet energy scale Reconstructed jet four-momenta in the simulation are simultaneously varied according to the uncertainty in the jet energy scale [29]. Jet energy scale uncertainties are coherently propagated to all observables including \(p_{\mathrm {T}} ^\text{miss} \).

  • Top quark \(p_{\mathrm {T}} \) reweighting Differential measurements of top quark pair production show that the measured \(p_{\mathrm {T}} \) spectrum of top quarks is softer than that of simulation. Scale factors to cover this effect have been derived in previous CMS measurements [68] and are applied to all simulated SM \({t}\overline{{t}} \) samples by default. The uncertainty in the top quark \(p_{\mathrm {T}} \) spectrum is estimated from a comparison with the spectrum obtained without reweighting.

  • Higher-order QCD corrections The uncertainties due to missing higher-order QCD corrections in the LO samples are estimated by generating alternative event samples in which the factorization and renormalization scale parameters (\(\mu _{\text{F}} {},\mu _{\text{R}} \)) are simultaneously increased or decreased by a factor of two. These uncertainties are correlated across the bins of the \(p_{\mathrm {T}} ^\text{miss} \) distribution. Uncertainties in the NLO K-factors applied to \(\text{W}+\text{jets}\) and \({Z} +\text{jets}\) simulation are determined by recalculating the K-factor with \(\mu _{\text{F}} \) and \(\mu _{\text{R}} \) independently varied by a factor of two up or down.

  • EWK corrections Uncertainties in the K-factors applied to \(\text{W}+\text{jets}\) and \({Z} +\text{jets}\) simulation from missing higher-order EWK corrections are estimated by taking the difference in results obtained with and without the EWK correction applied.

  • Simulation statistics: Shape uncertainties due to the limited sizes of the simulated signal and background samples are included via the method of Barlow and Beeston [60, 69]. This approach allows each bin of the \(p_{\mathrm {T}} ^\text{miss} \) distributions to independently fluctuate according to Poisson statistics.

7 Results and interpretation

Separate signal strength parameters are first determined from fits to each of the \({b} \overline{{b}} +p_{\mathrm {T}} ^\text{miss} \) and \({t}\overline{{t}} +p_{\mathrm {T}} ^\text{miss} \) channels. These fits use the predicted cross sections and \(p_{\mathrm {T}} ^\text{miss} \) shapes from the LHC DMF signal models with \(g_{{q}} = g_{\chi } = 1\). The fits result in independent upper limits on signal yields for the \({b} \overline{{b}} +\chi \overline{\chi }\) and \({t}\overline{{t}} +\chi \overline{\chi } \) processes, which are reported in Sect. 7.1.

Next, all SRs and CRs are simultaneously fit under the hypothesis of combined \({t}\overline{{t}} +\chi \overline{\chi } \) and \({b} \overline{{b}} +\chi \overline{\chi }\) contributions. In this case, a single signal strength parameter is used, which results in a combined best fit estimate of the \({t}\overline{{t}} +\chi \overline{\chi } \) and \({b} \overline{{b}} +\chi \overline{\chi }\) signal yields. Again, cross section predictions for \({t}\overline{{t}} +\chi \overline{\chi } \) and \({b} \overline{{b}} +\chi \overline{\chi }\) assume \(g_{{q}} =g_{\chi }=1\). Results from this fit are reported in Sect. 7.2.

The most interesting DM scenarios to explore at the LHC involve on-shell mediator decays to \(\chi \overline{\chi } \), which corresponds to \(m_{\phi /\mathrm {a}} > 2m_{\chi }\). Kinematic variables and cross sections are independent of \(m_{\chi }\) in this regime [21]. The \(m_{\chi } < 10\,\text{GeV} \) region is of particular interest because of the strong phenomenological and theoretical motivations for low-mass DM [70] and the relative strength of collider experiments in this mass range [71]. For these reasons, the DM mass has been fixed to \(m_{\chi }=1\,\text{GeV} \) in all signal extraction fits. The results obtained with \(m_{\chi }=1\,\text{GeV} \) are valid for other values of \(m_{\chi } < m_{\phi /\mathrm {a}}/2\) provided they are not too near the kinematic threshold.

7.1 Individual search results

Table 4 provides the background yields in the SRs obtained from background-only fits to the \({b} \overline{{b}} +p_{\mathrm {T}} ^\text{miss} \) and individual \({t}\overline{{t}} +p_{\mathrm {T}} ^\text{miss} \) search channels. Relative nuisance parameter shifts – defined as \((\text{p}_{\text{fit}} - \text{p}_{\text{prefit}}) / \sigma _{\text{p}}\), where \(\text{p}\) represents the parameter value and \(\sigma _{\text{p}}\) its fit uncertainty – do not indicate any particular tension in these fits. The largest shifts correspond to the nuisance parameters for the EWK correction for the \(\text{W}+\text{jets}\) and \({Z} +\text{jets}\) processes in the \({b} \overline{{b}} +p_{\mathrm {T}} ^\text{miss} \) channel (\(+ 0.8\)), to the \(\mu _F\,,\mu _R\) scale uncertainty in the \({t}\overline{{t}} \) process in the \(\ell +\text{jets}\) \({t}\overline{{t}} +p_{\mathrm {T}} ^\text{miss} \) channel (+0.6), and to the lepton efficiency in the all-hadronic \({t}\overline{{t}} +p_{\mathrm {T}} ^\text{miss} \) channel (\(-1.9\)). The nuisance parameter shifts account for residual mismodeling of the yields by the simulation in the background-enriched regions. The background-only fitted \(p_{\mathrm {T}} ^\text{miss} \) distributions in the eight SRs are shown in Figs. 9 and 10.

Fig. 11
figure 11

The ratio (\(\mu \)) of 95% CL upper limits on the \({b} \overline{{b}} +\chi \overline{\chi }\) and \({t}\overline{{t}} +\chi \overline{\chi } \) cross sections to simplified model expectations. The limits are obtained from fits to the individual \({b} \overline{{b}} +p_{\mathrm {T}} ^\text{miss} \) and \({t}\overline{{t}} +p_{\mathrm {T}} ^\text{miss} \) search channels for the hypothesis of a scalar mediator (upper) or a pseudoscalar mediator (lower). A fermionic DM particle with a mass of \(1\,\text{GeV} \) is assumed in both panels. Mediator couplings correspond to \(g_{{q}} =g_{\chi }=1\)

Table 7 Observed and expected 95% CL upper limits on the ratio (\(\mu \)) of the combined \({t}\overline{{t}} +\chi \overline{\chi } \) and \({b} \overline{{b}} +\chi \overline{\chi }\) cross sections to the simplified model expectation. The limits are obtained from a combined fit to all signal and background control regions. DM mediators with scalar or pseudoscalar couplings are assumed. Mediator couplings correspond to \(g_{q}=g_{\chi }=1\)

The fitted background-only \(p_{\mathrm {T}} ^\text{miss} \) distributions of the individual search channels are assessed using the likelihood ratio for the saturated model, which provides a generalization of the \(\chi ^{2}\) goodness-of-fit test [72, 73]. Pseudodata are generated from the fitted MC yields to determine the distribution of the likelihood ratio. The p-values obtained are larger than 0.5 for each channel except for the all-hadronic \({t}\overline{{t}} +p_{\mathrm {T}} ^\text{miss} \) channel, for which a low p-value of 0.01 is determined. This value appears to result from the scatter in the 0,1 RTT CRs. No significant excess in the individual search channels is observed.

Upper limits are set on the \({b} \overline{{b}} +\chi \overline{\chi }\) and \({t}\overline{{t}} +\chi \overline{\chi } \) production cross sections. The limits are calculated using a modified frequentist approach (CLs) with a test statistic based on the profile likelihood in the asymptotic approximation [74,75,76]. For each signal hypothesis, 95% confidence level (CL) upper limits on the signal strength parameter \(\mu \) are determined. Tables 5 and 6 list the expected limits on \(\mu \) obtained for various signal hypotheses. Figure 11 shows the expected and observed limits on \(\mu \) as a function of the mediator mass for \(m_{\chi }=1\,\text{GeV} \).

The all-hadronic and \(\ell +\text{jets}\) \({t}\overline{{t}} +p_{\mathrm {T}} ^\text{miss} \) channels provide the highest sensitivity to the \({t}\overline{{t}} +\chi \overline{\chi } \) process for all mediator masses considered. Expected limits on the \({t}\overline{{t}} +\chi \overline{\chi } \) process from the \({b} \overline{{b}} +p_{\mathrm {T}} ^\text{miss} \) channel are comparable with those of the dileptonic \({t}\overline{{t}} +p_{\mathrm {T}} ^\text{miss} \) channel. The only relevant search channel for the \({b} \overline{{b}} +\chi \overline{\chi }\) process is \({b} \overline{{b}} +p_{\mathrm {T}} ^\text{miss} \), from which observed upper limits of \(\mu \ge 26\) are obtained for the pseudoscalar mediator hypothesis (see Table 6). The relatively weak sensitivity of the \({b} \overline{{b}} +p_{\mathrm {T}} ^\text{miss} \) channel in the search is due, in part, to the specific signal model considered; the performance of this channel would improve in models in which the mediator couplings to up-type quarks are suppressed.

In all search channels, the expected sensitivity to low-mass scalar mediators is better than that for low-mass pseudoscalars. This reflects the higher predicted cross section for the low-mass scalar, which is approximately 40 times larger than that of the pseudoscalar for a mediator mass of 10\(\,\text{GeV}\)  [50]. Scalar and pseudoscalar cross sections become comparable at mediator masses of around 200\(\,\text{GeV}\) and above. The expected scalar limits therefore rise quickly with increasing mass, while the limits for the pseudoscalar mediator change less, as can be seen from Tables 5 and 6.

7.2 Combined search results

Signal region yields obtained from a simultaneous background-only fit of all of the search channels are similar to those listed in Table 4. Fitted \(p_{\mathrm {T}} ^\text{miss} \) distributions in the eight SRs are nearly indistinguishable from those of Figs. 9 and 10. The nuisance parameter shifts in the combined fit are consistent with those of the individual channel fits, while the fit uncertainty in the b tagging efficiency nuisance parameter becomes more tightly constrained. The p value of the saturated likelihood goodness-of-fit test is 0.11, which indicates no significant deviation with respect to background predictions.

A simultaneous signal+background fit is performed using all SRs and CRs, and 95% CL upper limits are set on the cross section ratio \(\mu \) for DM produced in association with heavy-flavor quark pairs. Table 7 provides limits obtained for the scalar and pseudoscalar mediator hypotheses. These limits are presented graphically in Fig. 12. The combination of \({t}\overline{{t}} +p_{\mathrm {T}} ^\text{miss} \) and \({b} \overline{{b}} +p_{\mathrm {T}} ^\text{miss} \) search channels enhances sensitivity to both the scalar and the pseudoscalar mediator scenarios.

Signal cross sections may be scaled to larger values of \(g_{{q}} \) and \(g_{\chi }\) using the relationship given in Ref. [21]. This simple scaling approximation is valid as long as the mediator width remains below 20% of its mass. With \(g_{{q}} =g_{\chi }=1.5\), the relative width of the 500\(\,\text{GeV}\) scalar (pseudoscalar) mediator is 14% (18%). The relative width decreases with decreasing mediator mass. For coupling values of \(g_{{q}} =g_{\chi }=1.5\), the \(p_{\mathrm {T}} ^\text{miss} \) distributions of the various mediator hypotheses are also unchanged with respect to those obtained with \(g_{{q}} =g_{\chi }=1\), thus the limits of Fig. 7 may be scaled accordingly [21]. Assuming coupling values of \(g_{{q}} =g_{\chi }=1.5\), the observed (expected) 95% CL exclusions are \(m_{\phi } < 124\,(105)\,\text{GeV} \) for a scalar mediator, and \(m_{\mathrm {a}} < 128~(76)\,\text{GeV} \) for a pseudoscalar mediator.

Fig. 12
figure 12

The ratios (\(\mu \)) of the 95% CL upper limits on the combined \({t}\overline{{t}} +\chi \overline{\chi } \) and \({b} \overline{{b}} +\chi \overline{\chi }\) cross section to simplified model expectations. The limits are obtained from combined fits to the \({t}\overline{{t}} +p_{\mathrm {T}} ^\text{miss} \) and \({b} \overline{{b}} +p_{\mathrm {T}} ^\text{miss} \) signal and background control regions for the hypothesis of a scalar mediator (upper) and a pseudoscalar mediator (lower). A fermionic DM particle with a mass of \(1\,\text{GeV} \) is assumed in both panels. Mediator couplings correspond to \(g_{{q}} =g_{\chi }=1\)

8 Summary

A search for an excess of events with large missing transverse momentum (\(p_{\mathrm {T}} ^\text{miss}\)) produced in association with a pair of heavy-flavor quarks has been performed with a sample of proton-proton interaction data at a center-of-mass energy of 13 TeV. The data correspond to an integrated luminosity of 2.2\(\,\text{fb}^{-1}\) collected with the CMS detector at the CERN LHC. The analysis explores \({b} \overline{{b}} +p_{\mathrm {T}} ^\text{miss} \) and the dileptonic, \(\ell +\text{jets}\), and all-hadronic \({t}\overline{{t}} +p_{\mathrm {T}} ^\text{miss} \) final states. A resolved top quark tagger is used to categorize events in the all-hadronic channel. No significant deviation from the standard model background prediction is observed. Results are interpreted in terms of dark matter (DM) production, and constraints are placed on the parameter space of simplified models with scalar and pseudoscalar mediators. The DM search channels are considered both individually and, for the first time, in combination. The combined search excludes production cross sections larger than 1.5 or 1.8 times the values predicted for a 10\(\,\text{GeV}\) scalar mediator or a 10\(\,\text{GeV}\) pseudoscalar mediator, respectively, for couplings of \(g_{{q}} =g_{\chi }=1\). The limits presented are the first achieved on simplified models of dark matter produced in association with heavy-flavor quark pairs.