Search for dark matter produced in association with heavy-flavor quark pairs in proton-proton collisions at \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\sqrt{s}= 13\,\text{TeV} $$\end{document}s=13TeV

A search is presented for an excess of events with heavy-flavor quark pairs (\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${t}\overline{{t}} $$\end{document}tt¯ and \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${b} \overline{{b}} $$\end{document}bb¯) and a large imbalance in transverse momentum in data from proton–proton collisions at a center-of-mass energy of 13\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\,\text{TeV}$$\end{document}TeV. The data correspond to an integrated luminosity of 2.2\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\,\text{fb}^{-1}$$\end{document}fb-1 collected with the CMS detector at the CERN LHC. No deviations are observed with respect to standard model predictions. The results are used in the first interpretation of dark matter production in \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${t}\overline{{t}} $$\end{document}tt¯ and \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${b} \overline{{b}} $$\end{document}bb¯ final states in a simplified model. This analysis is also the first to perform a statistical combination of searches for dark matter produced with different heavy-flavor final states. The combination provides exclusions that are stronger than those achieved with individual heavy-flavor final states.


Introduction
Astrophysical and cosmological observations [1][2][3] provide strong support for the existence of dark matter (DM), which could originate from physics beyond the standard model (BSM). In a large class of BSM models, DM consists of stable, weakly-interacting massive particles (WIMPs). In collider experiments, WIMPs (χ ) could be pair-produced through the exchange of new mediating fields that couple to DM and to standard model (SM) particles. Following their production, the WIMPs would escape detection, thereby creating an imbalance of transverse momentum (missing transverse momentum, p miss T ) in the event. If the new physics associated with DM respects the principle of minimal flavor violation [4,5], the interactions of spin-0 mediators retain the Yukawa structure of the SM. This principle is motivated by the apparent lack of new flavor physics at the electroweak (EWK) scale. Because only the top quark has a Yukawa coupling of order unity, WIMP DM couples preferentially to the heavy top quark in models with minimal flavor violation. In high energy proton-proton collisions, this e-mail: cms-publication-committee-chair@cern.ch coupling leads to the production of tt + χ χ at lowest-order via a scalar (φ) or pseudoscalar (a) mediator (Fig. 1), and to the production of so-called mono-X final states through a top quark loop [6][7][8][9][10][11][12][13][14]. At the CERN Large Hadron Collider (LHC), the tt + χ χ process can be probed directly via the tt + p miss T and bb + p miss T signatures. The bb + p miss T signature provides additional sensitivity to the bb+χ χ process for models in which mediator couplings to up-type quarks are suppressed, as can be the case in Type-II two Higgs doublet models [15]. This paper describes a search for DM produced with a tt or bb pair in pp collisions at √ s = 13 TeV with the CMS experiment at the LHC. A potential DM signal is extracted from simultaneous fits to the p miss T distributions in the bb + p miss T and tt + p miss T search channels. Data from control regions enriched in SM tt, W + jets, and Z + jets processes are included in the fits, to constrain the major backgrounds. The top quark nearly always decays to a W boson and a b quark. The W boson subsequently decays leptonically (to charged leptons and neutrinos) or hadronically (to quark pairs). The dileptonic, lepton( )+jets, and all-hadronic tt final states consist, respectively, of events in which both, either, or neither of the W bosons decay leptonically. Each of these primary tt final states are explored.
Previous LHC searches for DM produced with heavyflavor quark pairs were interpreted using effective field theories that parameterize the DM-SM coupling in terms of an interaction scale M * [16][17][18]. An earlier search by the CMS Collaboration investigated the + jets tt final state using 19.7 fb −1 of data collected at √ s = 8 TeV [19]. That search excluded values of M * below 118 GeV, assuming m χ = 100 GeV. The ATLAS Collaboration performed a similar search separately for the all-hadronic and + jets tt final states and obtained comparable limits on M * [20]. More recently, the limitations of effective field theory interpretations of DM production at the LHC has led to the development of simplified models that remain valid when the mediating particle is produced on-shell [21]. This analysis adopts the simplified model framework to provide the first interpreta- tion of heavy-flavor search results in terms of the decays of spin-0 mediators with scalar or pseudoscalar couplings. This paper also reports the first statistical combination of dileptonic (ee, eμ, μμ), + jets (e, μ), and all-hadronic tt + χ χ searches, as well as the first combination of tt + χ χ and bb + χ χ search results.
The paper is organized as follows. Section 2 reviews the properties of the CMS detector and the particle reconstruction algorithms used in the analysis. Section 3 describes the modeling of tt +χ χ and bb +χ χ signal and SM background events, and Sect. 4 provides the selections applied to data and simulation. Section 5 discusses the techniques used to extract a potential DM signal in the tt + p miss T and bb + p miss T search channels. Section 6 describes the systematic uncertainties considered in the analysis. The results of the search and their interpretation within a simplified DM framework are presented in Sect. 7. Section 8 concludes with a summary of the results.

CMS detector and event reconstruction
The CMS detector [22] is a multipurpose apparatus optimized for the study high transverse momentum ( p T ) physics processes in pp and heavy ion collisions. A superconducting solenoid surrounds the central region, providing a magnetic field of 3.8 T parallel to the beam direction. Charged particle trajectories are measured using the silicon pixel and strip trackers, which cover the pseudorapidity region of |η| < 2.5. A lead tungstate crystal electromagnetic calorimeter (ECAL) and a brass and scintillator hadron calorimeter (HCAL) surround the tracking volume, and cover the region with |η| < 3. Each calorimeter is composed of a barrel and two endcap sections. A steel and quartz-fiber Cherenkov forward hadron calorimeter extends the coverage to |η| < 5. The muon system consists of gas-ionization detectors embedded in the steel flux return yoke outside the solenoid, and covers the region of |η| < 2.4. The first level of the CMS trigger system is com-posed of special hardware processors that select the most interesting events in less than 4 μs using information from the calorimeters and muon detectors. This system reduces the event rate from 40 MHz to approximately 100 kHz. The highlevel trigger processor farm performs a coarse reconstruction of events selected by the first-level trigger, and applies additional selections to reduce the event rate to less than 1 kHz for storage.
Event reconstruction is based on the CMS Particle Flow (PF) algorithm [23,24], which combines information from all CMS subdetectors to identify and reconstruct the individual particles emerging from a collision: electrons, muons, photons, and charged and neutral hadrons. Interaction vertices are reconstructed using the deterministic annealing algorithm [25]. The primary vertex is selected as that with the largest sum of p 2 T of its associated charged particles. Events are required to have a primary vertex that is consistent with being in the luminous region.
Jets are reconstructed by clustering PF candidates using the anti-k T algorithm [26,27] with a distance parameter of 0.4. Corrections based on jet area are applied to remove the energy from additional collisions in the same or neighboring bunch crossing (pileup) [28]. Energy scale calibrations determined from the comparison of simulation and data are then applied to correct the four momenta of the jets [29]. Jets are required to have p T > 30 GeV, |η| < 2.4, and to satisfy a loose set of identification criteria designed to reject events arising from spurious detector and reconstruction effects.
The combined secondary vertex b tagging algorithm (CSVv2) is used to identify jets originating from the hadronization of bottom quarks [30,31]. Jets are considered to be b-tagged if the CSVv2 discriminant for that jet passes a requirement that roughly corresponds to efficiencies of 70% to tag bottom quark jets, 20% to mistag charm quark jets, and 1% to misidentify light-flavor jets as b jets. Efficiency scale factors in the range of 0.92-0.98, varying with jet p T , are applied to simulated events in order to reproduce the b tagging performance for bottom and charm quark jets observed in data. A scale factor of 1.14 is applied to simulation to reproduce the measured mistag rate for light-flavor quark and gluon jets.
The p miss T variable is initially calculated as the magnitude of the vector sum of the p T of all PF particles. This quantity is adjusted by applying jet energy scale corrections. Detector noise, inactive calorimeter cells, and cosmic rays can give rise to events with severely miscalculated p miss T . Such events are removed via a set of quality filters that take into account the timing and distribution of signals from the calorimeters, missed tracker hits, and global characteristics of the event topology.
Electron candidates are reconstructed by combining tracking information with energy depositions in the ECAL [32]. The energy of the ECAL clusters is required to be compatible with the momentum of the associated electron track. Muon candidates are reconstructed by combining tracks from the inner silicon tracker and the outer muon system [33]. Tracks associated with muon candidates must be consistent with a muon originating from the primary vertex, and must satisfy a set of quality criteria [33]. Electrons and muons are selected with p T > 30 GeV and |η| < 2.1 for consistency with the coverage of the single-lepton triggers, and are required to be isolated from hadronic activity, to reject hadrons misidentified as leptons. Relative isolation is defined as the scalar p T sum of PF candidates within a R = √ η 2 + φ 2 cone of radius 0.4 or 0.3 centered on electrons or muons, respectively, divided by the lepton p T . Relative isolation is nominally required to be less than 0.035 (0.065) for electrons in the barrel (endcap), respectively, and less than 0.15 for muons. Identification requirements, based on hit information in the tracker and muon systems, and on energy depositions in the calorimeters, are imposed to ensure that candidate leptons are well-measured. These restrictive isolation and identification criteria are used to select events from the dileptonic tt, + jets tt, W( ν) + jets, and Z ( ) + jets processes. The efficiencies of the requirements for electrons (muons) with p T > 30 GeV range from 52 to 83% (91 to 96%), for increasing lepton p T . Less restrictive lepton isolation and identification requirements are used to reject events containing additional leptons with p T > 10 GeV. Efficiencies for these requirements range from 66 to 96% for electrons and 73 to 99% for muons, for increasing lepton p T . Electron and muon selection efficiency scale factors are applied in simulation to match the efficiencies measured in data using the tagand-probe procedure [34]. Averaged over lepton p T , the electron and muon efficiency scale factors for the more restrictive selection requirements are 98 and 99%, respectively.
The "resolved top tagger" (RTT) is a multivariate discriminant that uses jet properties and kinematics to identify top quarks that decay into three resolved jets. The input observables are the values of the quark/gluon discriminant [35], which combines track multiplicity, jet shape, and fragmentation information for each jet, values of the b tagging discriminants, and the opening angles between the candidate b jet and the two jets from the candidate W boson. Within each jet triplet, the b candidate is considered to be the jet with the largest value of the b tagging discriminant. The RTT discriminant also utilizes the χ 2 value of a simultaneous kinematic fit to the top quark and W boson masses [36]. The fit attempts to satisfy the mass constraints by allowing the jet momenta and energies to vary within their measured resolutions. The RTT is implemented as a boosted decision tree using the TMVA framework [37], and is trained on simulated + jets tt events using correct (incorrect) jet combinations as signal (background).
The performance of the RTT discriminant is characterized with data enriched in SM + jets tt events containing four

Fig. 2
The distribution of the RTT discriminant in data enriched in + jets tt events. Simulated + jets tt events in which jets from the all-hadronic top quark decay are correctly chosen are labeled "tt(1 ) with matched jets". Simulated + jets tt events in which an incorrect combination of jets is chosen are labeled "tt(1 ) combinatorial". Events from processes that do not contain a hadronically-decaying top quark, such as dileptonic tt, are labeled "other background". The uncertainties shown in the ratios of data to simulation are statistical only. Jet triplets in the all-hadronic tt + p miss T search are considered to be top quark tagged if their RTT discriminant value is larger than zero or more jets. At least one of these jets is required to be btagged. The output discriminant for these events is plotted in Fig. 2. Each entry in the plot corresponds to the jet triplet with the highest RTT score in the event. Data are modeled using simulated +jets tt signal events, and simulated events for each of the primary backgrounds (dileptonic tt, W + jets, single t). The simulation is split into three classes that correspond to correctly tagged jet triplets and the two possibilities for mistagging, as explained below. Simulation describes the data well. A jet triplet is considered as a tagged top quark decay when the RTT discriminant value is greater than zero.
There are three efficiencies associated with the RTT selection, which correspond to the three classes of events in Fig. 2: + jets tt events in which the hadronically-decaying top quark is correctly identified ("tt(1 ) matched"), + jets tt events in which an incorrect combination of jets is tagged ("tt(1 ) combinatorial"), and events with no hadronicallydecaying top quarks that contain a mistagged jet triplet ("other background"). Dileptonic tt events are used to extract the nonhadronic mistag rate in data. Then, + jets tt events are used to extract the tagging and mistagging efficiencies for hadronically-decaying top quarks through a fit to the trijet mass distribution. Mass templates obtained from simulation are associated with each efficiency term in the fit. The effi-ciency of the RTT > 0 selection for events determined to be tt(1 ) matched, tt(1 ) combinatorial, or other background are 0.97 ± 0.03, 0.80 ± 0.05, and 0.69 ± 0.02, respectively. Corresponding data-to-simulation scale factors are found to be consistent with unity.
The bb + p miss T search includes vetoes on hadronicallydecaying τ leptons, which are reconstructed from PF candidates using the "hadron plus strips" algorithm [38]. The algorithm combines one or three charged pions with up to two neutral pions. Neutral pions are reconstructed by the PF algorithm from the photons that arise from π 0 → γ γ decay. Photons are reconstructed from ECAL energy clusters, which are corrected to recover the energy deposited by photon conversions and bremsstrahlung. Photons are identified and distinguished from jets and electrons using cut-based criteria that include the isolation and transverse shape of the ECAL deposit, and the ratio of HCAL/ECAL energies in a region surrounding the candidate photon.

Modeling and simulation
The associated production of DM and heavy-flavor quark pairs provides rich detector signatures that include significant p miss T accompanied by highp T jets, bottom quarks, and leptons. The largest backgrounds in the tt + p miss T and bb+ p miss T searches are SM tt events, inclusive W boson production in which the W decays leptonically (W( ν) + jets), and inclusive Z boson production in which the Z decays to neutrinos (Z (νν)+jets). Simulated events are used throughout the analysis to determine signal and background expectations. Where possible, corrections determined from data are applied to the simulations.
Monte Carlo (MC) samples of SM tt and single t backgrounds are generated at next-to-leading order (NLO) in quantum chromodynamics (QCD) using Powhegv2 and Powhegv1 [39][40][41], respectively. As with all MC generators subsequently described, Powheg is interfaced with Pythia8.205 [42] for parton showering using the CUETP8M1 tune [43]. Samples of Z + jets, W + jets, and QCD multijet events are produced at leading order (LO) using MG5_amc@nlo v2.2.2 [44] with the MLM prescription [45] for matching jets from the matrix element calculation to the parton shower description. The W + jets and Z + jets samples are corrected using EWK and QCD NLO/LO K-factors calculated with MG5_amc@nlo as functions of the generated boson p T . The simulation of tt + γ , tt + W, and tt + Z events makes use of NLO matrix element calculations implemented in MG5_amc@nlo, and the FxFx [46] prescription to merge multileg processes. Diboson processes (WW, WZ, and ZZ) are generated at NLO using either MG5_amc@nlo or Powhegv2.
The signal processes are simulated using simplified models that were developed in the LHC Dark Matter Forum (DMF) [21]. The DM particles χ are assumed to be Dirac fermions, and the mediators are spin-0 particles with scalar (φ) or pseudoscalar (a) couplings. The coupling strength of the mediator to SM fermions is assumed to be g qq = g q y q where: y q = √ 2m q /v is the SM Yukawa coupling, m q is the quark mass, and v = 246 GeV is the Higgs field vacuum expectation value. As per the recommendations of the LHC Dark Matter Working Group [47], g q is taken to be flavor universal and equal to 1. Likewise, the coupling strength of the mediator to DM, g χ , is set to 1 and is independent of the DM mass. The LHC DMF spin-0 models do not account for mixing between the φ scalar and the SM Higgs boson [48]. As is discussed in [21], the p miss T spectra of both the scalar and pseudoscalar mediated processes broaden with increasing mediator mass. For m φ/a larger than twice the top quark mass (m top ), the p miss T distributions of the scalar and pseudoscalar processes are essentially identical. As m φ/a decreases below 2m top , the p miss T spectra of the two processes increasingly differ, with the distribution of the scalar process peaking at lower p miss T values [49,50]. For all mediator masses, the total cross section of the scalar process is larger than that of the pseudoscalar equivalent [50]. This analysis focuses on the m χ = 1 GeV LHC DMF benchmark point, which provides a convenient signal reference for both low and high mass mediators.
The tt + χ χ and bb + χ χ signals are generated at LO in QCD using MG5_amc@nlo with up to one additional jet in the final state. Jets from the matrix element calculations are matched to the parton shower descriptions using the MLM prescription. Angular correlations in the decays of the top quarks are included using MadSpin v2.2.2 [51]. Minimum decay widths are assumed for the mediators, and are calculated from the partial width formulas given in Ref. [52]. This calculation assumes that the spin-0 mediators couple only to SM quarks and the DM fermion χ . Simulated signal samples are produced for a DM mass of m χ = 1 GeV and for mediator masses in the range of 10-500 GeV. The relative width of the scalar (pseudoscalar) mediator varies between 4 and 6% (4-8%) for this mediator mass range. The predicted rates of the bb + χ χ process, which is generated in the 4-flavor scheme, are adjusted to match the cross sections calculated in the 5-flavor scheme [21,53].
All samples generated at LO and NLO use corresponding NNPDF3.0 [54] parton distribution function (PDF) sets. All signal and background samples are processed using a detailed simulation of the CMS detector based on Geant4 [55]. The samples are reweighted to account for the distribution of pileup observed in data.

Event selection
Signal events are expected to exhibit both large p miss T from the production of two noninteracting DM particles and event topologies consistent with the presence of top quarks or b quark jets. Data are therefore collected using triggers that select events containing large p miss T or highp T leptons. Data for the dileptonic and +jets tt + p miss T searches are obtained using single-lepton triggers that require an electron (muon) with p T ≥ 27 (20) GeV. These trigger selections are more than 90% efficient for PF-reconstructed electrons and muons that satisfy the p T , identification, and isolation requirements imposed. The trigger used for the bb + p miss T and all-hadronic tt + p miss T searches selects events based on the amount of p miss T and H miss T reconstructed using a coarse version of the PF algorithm. The H miss T variable is defined as the magnitude of the vector sum of the p T of all jets in the event with p T > 20 GeV, |η| < 5.0. Jets reconstructed from detector noise are removed in the H miss T calculation by additionally requiring neutral hadron energy fractions of less than 0.9. The p miss T and H miss T requirements for this trigger are 120 GeV. The trigger is nearly 100% efficient for events that satisfy subsequent selections based on fully-reconstructed PF p miss T . Additional selections, described in Sect. 4.1 and summarized in Table 1, are applied to define eight independent regions of data that are sensitive to DM signals: two bb + p miss T , one + jets tt + p miss T , three dileptonic tt + p miss T , and two all-hadronic tt + p miss T regions. Control regions (CRs) enriched in various background processes are also defined and are used to improve background estimates in the aforementioned signal regions (SRs). In the CRs, individual signal selection requirements are inverted to enhance background yields and to prevent event overlaps with the SRs. Collectively, the SRs and CRs associated with the individual tt + χ χ and bb + χ χ production and decay modes are referred to as "channels". The bb + χ χ channel and the three tt + χ χ channels are used in simultaneous p miss T fits (described in Sect. 5) to extract a potential DM signal. The fits allow the background-enriched CRs to constrain the contributions of SM tt, W + jets, and Z + jets processes within the CRs and SRs of each channel. The selections used to define the SRs and CRs are described in Sects. 4.1 and 4.2, respectively. Tables 1 and 2 briefly summarize these selections. Table 2 defines a CR labeling scheme that is extensively used in subsequent sections.

Signal region selections
Dileptonic t t + p miss T Events in the dileptonic tt SR are required to contain exactly two leptons that satisfy stringent identification and isolation requirements. One of the leptons must have p T > 30 GeV, while the second must have p T > 10 GeV. Events containing additional, loosely identified leptons with p T > 10 GeV are rejected. Events are also required to have p miss T > 50 GeV, and to contain two or more jets, at least one of which must satisfy b tagging requirements. Overlaps between the dileptonic SR and the dileptonic and Z + jets CRs of the + jets tt + p miss T and bb + p miss T channels (discussed in Sect. 4.2) are removed by vetoing events that satisfy the selections for those CRs. These vetoes Table 1 Overview of the selection criteria used to define the eight tt + p miss  Table 2 Dileptonic tt control region veto μμ Z + jets control region veto remove 2.5% of the events from the dileptonic tt + p miss T SR. The azimuthal opening angle between the p T vector of the dilepton system and the p miss , is required to be larger than 1.2 radians. This requirement preferentially selects events consistent with a tt system recoiling against the invisibly decaying DM mediator. The dilepton mass, m , is required to be larger than 20 GeV. In dielectron and dimuon events, m is also required to be at least 15 GeV away from the Z boson mass [56]. These requirements reduce backgrounds from low-mass dilepton resonances and from leptonic Z boson decays.
Events that satisfy these criteria are divided among three SR categories that correspond to the flavor assignments of the two selected leptons: ee, eμ, and μμ. Signal efficiencies for the dileptonic tt + p miss T SR event selections range from 6 × 10 −3 to 10 −2 for mediator masses between 10 GeV and 500 GeV. The denominator used in the efficiency calculation is the total number of signal events, irrespective of the tt final state. The low efficiencies result primarily from the small dileptonic branching fraction. + jets t t + p miss T Events in the +jets tt SR are selected by requiring p miss T > 160 GeV, exactly one lepton, and three or more jets, of which at least one must satisfy the b tagging criteria. The lepton is required to have p T > 30 GeV, and to pass tight identification criteria. Events must not contain additional leptons with p T > 10 GeV that satisfy a looser set of identification requirements. To reduce SM + jets tt and W + jets backgrounds, the transverse mass, calculated from − → p miss T and the lepton momentum ( − → p T ) as: is required to be larger than 160 GeV. Following these selections, the remaining background events primarily consist of dileptonic tt final states in which one of the leptons is not identified. Because of the requirement of p miss T > 160 GeV, this background tends to contain events with Lorentz-boosted top quark decays in which the b jet is closely aligned with the direction of the neutrino. This background is suppressed by requiring that the smallest azimuthal angle formed from the missing transverse momentum vector and each of the two highest p T jets in the event, where m y is the mass of two parent particles that each decay to bW( ν). One of the W decays is assumed to produce a lepton that is not reconstructed. For the W decay that does produce a reconstructed lepton, the neutrino and lepton 4-momenta are denoted p 1 and p , respectively. The 4momentum of the W that produces the unreconstructed lepton is denoted p 2 , while the momenta of the two b candidates are referred to as p b1 and p b2 . Assuming perfect measurements, the M W T2 has a kinematic end-point at m top for tt events, whereas signal events lack this feature because both the neutrino and DM particles contribute to p miss T . The efficiency of the + jets tt + p miss T SR event selections for the tt + χ χ process range from 10 −4 for mediator masses of the order of 10 GeV, to 10 −3 for masses of about 500 GeV. Signal efficiencies are low because of the stringent p miss T requirement applied. The efficiency improves with increasing mediator mass because of the broadening of the p miss T spectrum. All-hadronic t t + p miss T Any event with a loosely identified lepton with p T > 10 GeV is vetoed from the allhadronic tt + p miss T SRs. The p miss T value must be larger than 200 GeV, and four or more jets are required, at least one of which must satisfy b tagging criteria. Spurious p miss T can arise in multijet events due to jet energy mismeasurement. In such cases, the reconstructed p miss T tends to align with one of the jets. Multijet background is suppressed by requiring that min φ( − → p Following these selection requirements, the dominant residual background is + jets SM tt production. By contrast, selected signal typically includes events in which both top quarks decay hadronically. The resolved top quark tagger (RTT, introduced in Sect. 2) is employed to suppress the + jets background by identifying potential hadronic top quark decays. The RTT is applied to the all-hadronic search region to define a category of events with two hadronic top quark decays. In this double-tag (2 RTT) category, one or more btagged jets are required and min φ( − → p jet i T , − → p miss T ) > 0.4 radians is imposed for all jets in the event. The 2 RTT category implicitly requires at least six jets in the event. A second category is defined for events with 0 or 1 top quark tags (0, 1 RTT), four or more jets with at least two b-tagged jets, and a tighter requirement of min φ( − → p The selection efficiency for tt + χ χ events in the allhadronic tt + p miss T SRs ranges from 10 −3 for mediator masses of the order of 10 GeV to 10 −1 for masses near 500 GeV. These values are larger than the corresponding efficiencies of the dileptonic and + jets SR selections because of the larger branching fraction to the all-hadronic final state. bb + p miss T Events with p miss T > 200 GeV are selected for the SRs of this final state. Events containing identified and isolated electrons or muons with p T larger than 10 GeV or identified τ leptons with p T > 18 GeV are rejected. Multijet background is reduced by requiring min φ( − → p jet i T , − → p miss T ) > 0.5 radians for all jets in the event. Following these selections, two exclusive event categories are defined using the number of jets and b-tagged jets in the event. The single b-tagged jet category provides high efficiency for bb + χ χ signal and requires at most two jets. At least one of these jets must have p T > 50 GeV, and exactly one must satisfy b tagging requirements. The second category allows exactly two b-tagged jets. This SR selects bb + χ χ signal and partially recovers tt + χ χ events that are not selected in the all-hadronic tt + p miss T categories. At most three jets are allowed in the 2 b tag SR, and at least two of these jets must have p T > 50 GeV.
The efficiency of the bb + p miss T SR event selections for the bb + χ χ process range from 10 −6 for mediator masses of the order of 10 GeV, to 10 −2 for masses of 500 GeV. The selection efficiency for the tt + χ χ process is found to be less dependent on the mediator mass, and varies from 10 −4 to 10 −3 for the same mass range. Figure 3 shows the simulated background yields in each of the SRs following the selections of Sect. 4.1. Clearly, the dominant backgrounds in the SRs are from the SM tt, W + jets, and Z + jets processes. The estimation of backgrounds in the SRs is improved through the use of corresponding data CRs enriched in these processes. Independent CRs are defined for each of the + jets tt + p miss T , all-hadronic tt + p miss T and bb + p miss T SRs. In some cases, multiple CRs are used to constrain a given background process in a SR. In this section we describe the main tt, W + jets, and Z + jets backgrounds and the selections used to define the CRs. The CR selections are designed to ensure that these regions are both mutually exclusive and exclusive of the SRs as well. The contributions of multijet, diboson, single t, and tt + Z /W/γ processes in the SRs are either subdominant or insignificant after the SR selections. The residual backgrounds from these processes are modeled with simulation. Dilepton background events from Drell-Yan and processes in which jets are misidentified as leptons are estimated using the sideband techniques described in Ref. [58]. The remainder of this section describes how the contributions of SM backgrounds in the SRs are estimated using the CRs. The discussion utilizes the CR labeling convention defined in Table 2, for ease of reference. The CRs for the + jets tt + p miss T SR are denoted slA and slB, those for the all-hadronic tt + p miss T SRs are hadA-hadG, and those for the bb + p miss T SRs are bbA-bbJ. Section 5 describes how the CRs are simultaneously fit with the SRs to constrain the predicted normalization of the tt, W + jets, and Z + jets background processes. Figures 4, 5 and 6 compare the integrated yields in each CR before and after background-only fits to the CR p miss T distributions. Reasonable agreement is found between the observed and predicted CR yields. In general, the expected and observed p miss T distributions in the CRs also agree. Regions for which the distributions of data and of the initial ("prefit") MC disagree are noted in the text.

Background control region selections
Dileptonic t t Dileptonic tt background in the + jets tt SR consists of events in which only one of the leptons is identified. A dileptonic CR (slA) for the + jets tt + p miss     Table 2) constrains + jets tt background in the 0,1 RTT signal region. This process is constrained in the 2 RTT signal region using the 1 lepton, ≥ 1 b tag control region (hadE). The ≤1 lepton, 0 b tag control regions (hadB, hadC, hadF, hadG) constrain W + jets and Z + jets backgrounds, while the 2 lepton, 0 b tag control region (hadD) provides an additional constraint on the Z + jets background. The lower panels show the ratios of observed to fitted background yields. In both panels, the statistical uncertainties of the data are indicated as vertical error bars and the fit uncertainties as hatched bands. Prefit yields and the ratios of prefit to fitted background expectations are shown as dashed magenta histograms the lepton is not identified. Control regions for + jets tt (hadA, hadE) are defined by selecting events with exactly one identified lepton with p T > 30 GeV, and by requiring M T < 160 GeV in order to avoid overlaps with the SR of the + jets channel. All other requirements used to define the hadronic SRs are applied, and the CR is split into 0,1 RTT and 2 RTT categories.
The dileptonic tt CRs for the bb + p miss T search (described above) provide stringent constraints on tt backgrounds in the corresponding SRs. Additional constraints on tt background in this channel are provided through four singlelepton CRs (bbA, bbB, bbF, and bbG). A single-electron (muon) CR for the 1 b tag SR requires exactly one electron (muon) with p T > 30 GeV. The lepton must satisfy tight isolation and identification criteria. The M T observable calculated from the lepton momenta and p miss T must satisfy 50 < M T < 160 GeV. Except for the requirement on min φ( − → p jet i T , − → p miss T ), each of the selection criteria for the 1 b tag signal category must also be satisfied. Analogous CRs for the 2 b tag signal category are formed by applying the corresponding signal selection criteria. As in the dileptonic tt CRs for the bb + p miss T searches, the lepton is removed from the p miss T calculation. W + jets A W + jets CR for the + jets tt + p miss T search (slB) is created by requiring zero b tags. The M T > 160 GeV requirement from the + jets signal selection is maintained, however, the cuts on M W T2 and min φ( − → p Control regions enriched in both W + jets and Z + jets (hadB, hadF) are formed for the all-hadronic tt + p miss T categories by modifying the SR selections to require zero b tags. In addition, dedicated W+jets CRs (hadC, hadG) are defined by requiring the presence of an isolated, identified lepton with p T > 30 GeV and M T < 160 GeV. The W/Z + jets and W+jets CRs are both categorized using the number of RTTs, as in the corresponding SRs. The prefit yields and p miss T distributions in the hadB and hadC regions are observed to differ from those of data. The discrepancy is due to a mismodeling of hadronic activity in the simulation, which leads to an overestimation of the selection efficiency for the Z+jets and W+jets processes. Reasonable agreement is achieved through the fit, as is shown in Figs. 7 and 5.
The W + jets process contributes the second-largest background in the 1 b tag SR of the bb + p miss T channel. This background is constrained via the single-lepton CRs (bbA, bbB, bbF, bbG) of the bb + p miss T channel, which were introduced previously in the context of constraints on + jets tt backgrounds.
Z + jets The Z (νν) + jets process is a significant source of background in the all-hadronic tt + p miss T SRs. This background is partially controlled via the W/Z + jets CRs (hadB, hadF) described previously. An additional constraint is derived from a distinct Z ( ) + jets CR (hadD), in which two oppositely-charged, same-flavor leptons are required to   Fig. 6 Observed data, and prefit and fitted background-only event yields in the control regions associated with the bb + p miss T signal region with 1 b tag (upper) and with 2 b tags (lower). The 1 lepton, ≥ 1 b control regions (bbA, bbB, bbF and bbG in Table 2) are used to constrain W + jets and tt backgrounds in the bb + p miss T signal regions. The dileptonic control regions (bbC-bbE, bbH-bbJ) are used to constrain Z + jets and tt backgrounds. The lower panels show the ratio of observed to fitted background yields. In both panels, the statistical uncertainties of the data are indicated as vertical error bars and the fit uncertainties as hatched bands. Prefit yields and the ratios of prefit to fitted background expectations are shown as dashed magenta histograms pass tight isolation and identification requirements. The mass of the lepton pair must fall between 60 and 120 GeV. A prediction for the p miss T distribution in the hadronic SRs is obtained by subtracting the lepton momenta in the p miss T cal-  Fig. 7 Observed data, and prefit and fitted background-only p miss T distributions in two control regions (hadB and hadC in Table 2) for the 0,1 RTT hadronic tt + p miss T signal region with 0 leptons (upper) and with 1 lepton (lower) and 0 b tags. The 0 lepton control region is used to constrain W + jets and Z + jets backgrounds. The 1 lepton CR provides an additional constraint on W + jets background. The last bin contains overflow events. The lower panels show the ratios of observed data to fitted background yields. In both panels, the statistical uncertainties of the data are indicated as vertical error bars and the fit uncertainties are indicated as hatched bands. Prefit yields and the ratios of prefit to fitted background expectations are shown as dashed magenta histograms culation. The Z ( ) + jets CR is not categorized in the number of RTTs because of the negligible yields obtained with two RTT tags. The selections for jets and p miss T used in the  Figure 8 demonstrates that the leptonsubtracted p miss T distribution observed in the Z ( ) + jets CR of the all-hadronic channel is not well described by the prefit expectation. Agreement substantially improves following the fit.
The Z (νν) + jets process is also a significant background in the bb + p miss T SRs. This background is constrained with four distinct CRs: bbC, bbD, bbH, and bbI. The Z (ee) and Z (μμ) CRs require two electrons and two muons with p T > 30 GeV, respectively. The isolation and identification criteria applied on the leadingp T lepton are identical to those used in the W + jets CRs for the bb + p miss T channel. The subleading lepton is required to satisfy a looser set of isolation and identification criteria, as in the dileptonic CRs. The leptons must be consistent with the decay of a Z boson; oppositecharge, same-flavor requirements are imposed, and the leptons must satisfy a constraint on the dilepton mass of 70 < m < 110 GeV. As in the W + jets and dileptonic tt CRs, events must also satisfy all but the min φ( − → p jet i T , − → p miss T ) selection criteria of the corresponding 1 b tag or 2 b tag signal category. As in the Z + jets CR for all-hadronic tt channel, lepton momenta are subtracted in the p miss T calculation to approximate the distribution of p miss T from Z (νν) + jets expected in the bb + p miss T SRs.

Signal extraction
A potential DM signal could be revealed as an excess of events relative to SM expectations in a region of high p miss T . The shape of the observed p miss T distribution provides additional information that is used in this analysis to improve the sensitivity of the search. A potential signal is searched for via simultaneous template fits to the p miss The fits are performed using the RooStats statistical software package [59]. The effects of uncertainties in the normalizations and in the p miss T shapes of signal and background processes are represented as nuisance parameters. Uncertainties that only affect normalization are modeled using nuisance parameters with log-normal probability densities. Uncertainties that affect the shape of the p miss T distribution, which may also include an overall normalization effect, are incorporated using a template "morphing" technique. These treatments, as well as the approach used to account for MC statistical uncertainties on template predictions, follow the procedures described in Ref. [60].
Within each search channel, additional unconstrained nuisance parameters scale the normalization of each dominant background process (tt, W + jets, and Z + jets) across the SRs and CRs. For example, a single parameter is associated with the contribution of the + jets tt process in the allhadronic tt + p miss T SRs and CRs. A separate parameter is associated with the + jets tt background in the bb + p miss T SRs and CRs. These nuisance parameters allow the data in the background-enriched CRs to constrain the background estimates in the SRs to which they correspond. Because separate nuisance parameters are used for each search channel, a given normalization parameter cannot affect background predictions in unassociated search channels. The yields and p miss T shapes of subdominant backgrounds vary in the fit only through the constrained nuisance parameters. Signal yields in the SRs and associated CRs are scaled simultaneously by signal strength parameters (μ), defined as the ratio of the signal cross section to the theoretical cross section, μ = σ/σ TH . The μ parameters scale signal normalization coherently across regions, and thus account for signal contamination in the CRs.
Signal extraction is performed for the individual search channels as well as for their combination. The separate fits to the individual signal and associated CRs provide independent estimates of bb + χ χ and tt + χ χ contributions in each channel. In this fitting scenario, separate signal strength parameters are used for each of the search channels. The bb + χ χ process is considered as a potential signal in the 1 b tag and 2 b tag regions of the bb + p miss T channel. The tt + χ χ process is searched for in all SRs of the bb + p miss T and tt + p miss T channels separately. The contribution of the bb + χ χ process in the all-hadronic tt + p miss T channel is negligible due to the jet multiplicity requirement. An inclusive fit to all signal and CRs is also performed. This fit uses a single signal strength parameter to extract the combined contribution of tt + χ χ and bb + χ χ in data. Additional details on the per-channel and combined fits are provided in Sect. 7. Table 3 summarizes the uncertainties considered in the signal extraction fits. The procedures used to evaluate the uncertainties are described later in this section. Normalization uncertainties are expressed relative to the predicted central values of the corresponding nuisance parameters. These uncertainties are used to specify the widths of the associated lognormal probability densities. The integrated luminosity, b tagging efficiency, p miss T trigger efficiency, pileup, and multijet/single t background normalization uncertainties are taken to be fully correlated across SRs and CRs. Shape uncertainties are expressed in Table 3 as the change in the prefit yields of the lowest and highest p miss T bins resulting from a variation of the corresponding nuisance by ± 1 standard deviation (s.d.). These uncertainties are propagated to the fit by using the full p miss T spectra obtained from ±1 s.d. variations of the corresponding nuisance parameters [60]. The PDF and jet energy scale shape uncertainties are taken to be fully correlated across SRs and CRs. In general, the uncertainty estimation is performed in the same way for signal and background processes; however, the uncertainty from missing higher-order corrections for signal processes, which is approximately 30% at LO in QCD, is not considered to facilitate a comparison with other CMS DM results.

Systematic uncertainties
The following sources of uncertainty correspond to constrained normalization nuisance parameters in the fit: -Integrated luminosity An uncertainty of 2.7% is used for the integrated luminosity of the data sample [61].    Prefit yields for DM produced via a pseudoscalar mediator with mass m a = 50 GeV and a scalar mediator with mass m φ = 100 GeV are also shown. Mediator couplings are set to g q = g χ = 1, and a DM particle of mass m χ = 1 GeV is assumed. Uncertainties include both statistical and systematic components -Pileup modeling Systematic uncertainties due to pileup modeling are taken into account by varying the total inelastic cross section used to calculate the data pileup distributions by ± 5%. Normalization differences in the range of 0.2-1.4% result from reweighting the simulation accordingly. -W/Z + heavy-flavor fraction The uncertainty in the fraction of W/Z + heavy-flavor jets is assigned to account for the usage of CRs dominated by light-flavor jets in constraining the prediction of W + jets and Z + jets in SRs that require b tags. The flavor fractions for the W + jets and Z + jets processes are allowed to vary independently within 20% [62][63][64][65]. -Drell-Yan background: The uncertainties in the datadriven Drell-Yan background estimates for the dileptonic channels are 64% (ee) and 43% (μμ). These uncertainties are dominated by the statistical uncertainties in quantities used to extrapolate yields from a region near the Z boson mass to regions away from it. Again, these relatively large uncertainties have little effect on the sensitivity of the search. -Multijet background normalization Uncertainties of 50-100% (depending on the SR) are applied in the normalization of multijet backgrounds to cover tail effects that are not well modeled by the simulation.
-Misidentified-lepton background The sources of uncertainty in the misidentified-lepton background for the dileptonic search stem from the uncertainty in the measured misidentification rate, and from the statistical uncertainty of the single-lepton control sample to which the rate is applied. The uncertainties per channel are 200% (ee), 48% (eμ), 30% (μμ), and are dominated by the statistical uncertainty associated with the singlelepton control sample. Because the misidentified lepton background is small, these relatively large uncertainties do not significantly degrade the sensitivity of the search. -RTT efficiency Jet energy scale and resolution uncertainties are propagated to the RTT efficiency scale factors by using modified shape templates in the efficiency extraction fit. A systematic uncertainty due to the choice of parton showering scheme is estimated by comparing the efficiencies obtained with default and alternative p miss T templates. The default simulation is showered using Pythia8.205, which implements dipole-based parton showering. The alternative templates are derived from simulated events that are showered with Herwig [66], which uses an angular-ordered shower model. Overall, statistical plus systematic uncertainties of 6, 3, and 3% are assigned for the hadronic tag, hadronic mistag, and nonhadronic mistag scale factors, respectively. These cor- tors [31]. The corresponding normalization uncertainty ranges from 2.2 to 12%. -Lepton identification and trigger efficiency: The uncertainty in lepton identification and triggering efficiency is measured with samples of Z bosons decaying to dielectrons and dimuons [34]. The corresponding normalization uncertainty ranges from 2 to 4%.  The following sources of uncertainty correspond to constrained p miss T shape nuisance parameters in the fit: -PDF uncertainties Uncertainties due to the choice of PDFs are estimated by reweighting the samples with the ensemble of PDF replicas provided by NNPDF3.0 [67].  The standard deviation of the reweighted p miss T shapes is used as an estimate of the uncertainty. -Jet energy scale Reconstructed jet four-momenta in the simulation are simultaneously varied according to the uncertainty in the jet energy scale [29]. Jet energy scale uncertainties are coherently propagated to all observables including p miss T . -Top quark p T reweighting Differential measurements of top quark pair production show that the measured p T spectrum of top quarks is softer than that of simulation. Scale factors to cover this effect have been derived in previous CMS measurements [68] and are applied to all simulated SM tt samples by default. The uncertainty in the top quark p T spectrum is estimated from a comparison with the spectrum obtained without reweighting. -Higher-order QCD corrections The uncertainties due to missing higher-order QCD corrections in the LO samples are estimated by generating alternative event samples in which the factorization and renormalization scale parameters (μ F , μ R ) are simultaneously increased or decreased by a factor of two. These uncertainties are correlated across the bins of the p miss T distribution. Uncertainties in the NLO K-factors applied to W + jets and Z + jets simulation are determined by recalculating the K-factor with μ F and μ R independently varied by a factor of two up or down.
-EWK corrections Uncertainties in the K-factors applied to W + jets and Z + jets simulation from missing higherorder EWK corrections are estimated by taking the difference in results obtained with and without the EWK correction applied. -Simulation statistics: Shape uncertainties due to the limited sizes of the simulated signal and background samples are included via the method of Barlow and Beeston [60,69]. This approach allows each bin of the p miss T distributions to independently fluctuate according to Poisson statistics.

Results and interpretation
Separate signal strength parameters are first determined from fits to each of the bb + p miss T and tt + p miss T channels. These fits use the predicted cross sections and p miss T shapes from the LHC DMF signal models with g q = g χ = 1. The fits result in independent upper limits on signal yields for the bb + χ χ and tt + χ χ processes, which are reported in Sect. 7.1.
Next, all SRs and CRs are simultaneously fit under the hypothesis of combined tt + χ χ and bb + χ χ contributions. In this case, a single signal strength parameter is used, which results in a combined best fit estimate of the tt + χ χ and bb + χ χ signal yields. Again, cross section predictions for tt + χ χ and bb + χ χ assume g q = g χ = 1. Results from this fit are reported in Sect. 7.2.
The most interesting DM scenarios to explore at the LHC involve on-shell mediator decays to χ χ , which corresponds to m φ/a > 2m χ . Kinematic variables and cross sections are independent of m χ in this regime [21]. The m χ < 10 GeV region is of particular interest because of the strong phenomenological and theoretical motivations for low-mass DM [70] and the relative strength of collider experiments in this mass range [71]. For these reasons, the DM mass has been fixed to m χ = 1 GeV in all signal extraction fits. The results obtained with m χ = 1 GeV are valid for other values of m χ < m φ/a /2 provided they are not too near the kinematic threshold. Table 4 provides the background yields in the SRs obtained from background-only fits to the bb + p miss T and individual tt + p miss T search channels. Relative nuisance parameter shifts -defined as (p fit − p prefit )/σ p , where p represents the parameter value and σ p its fit uncertainty -do not indicate any particular tension in these fits. The largest shifts correspond to the nuisance parameters for the EWK correction for the W + jets and Z + jets processes in the bb + p miss T channel (+0.8), to the μ F , μ R scale uncertainty in the tt process in the + jets tt + p miss  Fig. 11 The ratio (μ) of 95% CL upper limits on the bb + χχ and tt + χχ cross sections to simplified model expectations. The limits are obtained from fits to the individual bb + p miss T and tt + p miss T search channels for the hypothesis of a scalar mediator (upper) or a pseudoscalar mediator (lower). A fermionic DM particle with a mass of 1 GeV is assumed in both panels. Mediator couplings correspond to g q = g χ = 1  [24,54] The fitted background-only p miss T distributions of the individual search channels are assessed using the likelihood ratio for the saturated model, which provides a generalization of the χ 2 goodness-of-fit test [72,73]. Pseudodata are generated from the fitted MC yields to determine the distribution of the likelihood ratio. The p-values obtained are larger than 0.5 for each channel except for the all-hadronic tt + p miss T channel, for which a low p-value of 0.01 is determined. This value appears to result from the scatter in the 0,1 RTT CRs. No significant excess in the individual search channels is observed.

Individual search results
Upper limits are set on the bb+χ χ and tt +χ χ production cross sections. The limits are calculated using a modified frequentist approach (CLs) with a test statistic based on the profile likelihood in the asymptotic approximation [74][75][76]. For each signal hypothesis, 95% confidence level (CL) upper limits on the signal strength parameter μ are determined. Tables 5 and 6 list the expected limits on μ obtained for various signal hypotheses. Figure 11 shows the expected and observed limits on μ as a function of the mediator mass for m χ = 1 GeV.
The all-hadronic and + jets tt + p miss T channels provide the highest sensitivity to the tt + χ χ process for all mediator masses considered. Expected limits on the tt + χ χ process from the bb + p miss T channel are comparable with those of the dileptonic tt + p miss T channel. The only relevant search channel for the bb + χ χ process is bb + p miss T , from which observed upper limits of μ ≥ 26 are obtained for the pseudoscalar mediator hypothesis (see Table 6). The relatively weak sensitivity of the bb + p miss T channel in the search is due, in part, to the specific signal model considered; the performance of this channel would improve in models in which the mediator couplings to up-type quarks are suppressed.
In all search channels, the expected sensitivity to lowmass scalar mediators is better than that for low-mass pseudoscalars. This reflects the higher predicted cross section for the low-mass scalar, which is approximately 40 times larger than that of the pseudoscalar for a mediator mass of 10 GeV [50]. Scalar and pseudoscalar cross sections become comparable at mediator masses of around 200 GeV and above. The expected scalar limits therefore rise quickly with increasing mass, while the limits for the pseudoscalar mediator change less, as can be seen from Tables 5 and 6.

Combined search results
Signal region yields obtained from a simultaneous backgroundonly fit of all of the search channels are similar to those listed in Table 4. Fitted p miss T distributions in the eight SRs are nearly indistinguishable from those of Figs. 9 and 10. The nuisance parameter shifts in the combined fit are consistent with those of the individual channel fits, while the fit uncertainty in the b tagging efficiency nuisance parameter becomes more tightly constrained. The p value of the saturated likelihood goodness-of-fit test is 0.11, which indicates no significant deviation with respect to background predictions.
A simultaneous signal+background fit is performed using all SRs and CRs, and 95% CL upper limits are set on the cross section ratio μ for DM produced in association with heavyflavor quark pairs. Table 7 provides limits obtained for the scalar and pseudoscalar mediator hypotheses. These limits are presented graphically in Fig. 12. The combination of tt + p miss T and bb + p miss T search channels enhances sensitivity to both the scalar and the pseudoscalar mediator scenarios.
Signal cross sections may be scaled to larger values of g q and g χ using the relationship given in Ref. [21]. This simple scaling approximation is valid as long as the mediator width remains below 20% of its mass. With g q = g χ = 1.5, the relative width of the 500 GeV scalar (pseudoscalar) mediator is 14% (18%). The relative width decreases with decreasing mediator mass. For coupling values of g q = g χ = 1.5, the p miss T distributions of the various mediator hypotheses are also unchanged with respect to those obtained with g q = g χ = 1, thus the limits of Fig. 5 may be scaled accordingly [21]. Assuming coupling values of g q = g χ = 1.5, the observed (expected) 95% CL exclusions are m φ < 124 (105) GeV for a scalar mediator, and m a < 128 (76) GeV for a pseudoscalar mediator. The ratios (μ) of the 95% CL upper limits on the combined tt +χχ and bb+χχ cross section to simplified model expectations. The limits are obtained from combined fits to the tt + p miss T and bb + p miss T signal and background control regions for the hypothesis of a scalar mediator (upper) and a pseudoscalar mediator (lower). A fermionic DM particle with a mass of 1 GeV is assumed in both panels. Mediator couplings correspond to g q = g χ = 1

Summary
A search for an excess of events with large missing transverse momentum ( p miss T ) produced in association with a pair of heavy-flavor quarks has been performed with a sample of proton-proton interaction data at a center-of-mass energy of 13 TeV. The data correspond to an integrated luminosity of 2.2 fb −1 collected with the CMS detector at the CERN LHC. The analysis explores bb + p miss T and the dileptonic, +jets, and all-hadronic tt + p miss T final states. A resolved top quark tagger is used to categorize events in the all-hadronic channel. No significant deviation from the standard model background prediction is observed. Results are interpreted in terms of dark matter (DM) production, and constraints are placed on the parameter space of simplified models with scalar and pseudoscalar mediators. The DM search channels are considered both individually and, for the first time, in combination. The combined search excludes production cross sections larger than 1.5 or 1.8 times the values predicted for a 10 GeV scalar mediator or a 10 GeV pseudoscalar mediator, respectively, for couplings of g q = g χ = 1. The limits presented are the first achieved on simplified models of dark matter produced in association with heavy-flavor quark pairs. pado de Asturias; the Thalis and Aristeia programs cofinanced by EU-ESF and the Greek NSRF; the Rachadapisek Sompot Fund for Postdoctoral Fellowship, Chulalongkorn University and the Chulalongkorn Academic into Its 2nd Century Project Advancement Project (Thailand); and the Welch Foundation, contract C-1845.
Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecomm ons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.