1 Introduction

The Higgs boson (H) [1,2,3,4,5,6] of mass 125\(\,\text {Ge}\hspace{-.08em}\text {V}\) was discovered by the ATLAS and CMS Collaborations in 2012 [7,8,9]. Since then its properties, including its coupling to other standard model (SM) particles, have been extensively studied using proton-proton (\({\textrm{pp}}\)) collision data from the CERN LHC collected at \({\sqrt{s}=7}\), 8, and 13\(\,\text {Te}\hspace{-.08em}\text {V}\) with the ATLAS [10] and CMS [11] detectors. Properties of the Higgs boson can be exploited to probe for signs of behaviour beyond the SM (BSM). In the SM, the decay of the Higgs boson to an invisible final state (\({\textrm{H}} \rightarrow \text {inv}\)) is only possible via \({\textrm{H}} \rightarrow {\textrm{Z}} {\textrm{Z}} ^* \rightarrow 4{{\upnu }} \), with a branching fraction of 0.1% [12]. Several BSM theories predict a larger branching fraction to invisible final states, \({\mathcal {B}({\textrm{H}} \rightarrow \text {inv})}\)  [13,14,15,16], namely in Ref. [17] and references therein. For example, in a scenario where the Higgs boson connects the SM and dark matter (DM) sectors [18,19,20,21,22,23], \({\mathcal {B}({\textrm{H}} \rightarrow \text {inv})}\) is enhanced as the Higgs boson can decay to a pair of DM particles of mass \(m_{\text {DM}}<m_{{\textrm{H}}}/2\).

Direct searches for \({\textrm{H}} \rightarrow \text {inv}\) have been performed by the ATLAS [24,25,26,27,28,29] and CMS [30,31,32,33,34,35,36] Collaborations using data collected during Run 1 (2011–2012) and Run 2 (2015–2018). These target channels in which the Higgs boson is produced via vector boson fusion (VBF), gluon-gluon fusion (ggH), and in association with either a vector boson (VH, where V stands for either a W or Z boson) or with a t \(\overline{{{{\textrm{t}}}}}\) quark pair (\({{\textrm{t}} {}{\overline{{{{\textrm{t}}}}}}} {\textrm{H}} \)). The current most stringent constraint on \({\mathcal {B}({\textrm{H}} \rightarrow \text {inv})}\) set by the CMS experiment is via the VBF channel using Run 1 and Run 2 data, which reports a 95% confidence level (\(\text {CL}\)) upper limit of 0.18 (0.10 expected) [36].

In this paper, a search for an invisibly decaying Higgs boson, produced in association with a t \(\overline{{{{\textrm{t}}}}}\) quark pair or a V boson, where the associated particles decay to a fully hadronic final state, is reported. Representative leading order (LO) Feynman diagrams for \({{\textrm{t}} {}{\overline{{{{\textrm{t}}}}}}} {\textrm{H}} \) and VH are presented in Fig. 1. The search in the VH channel looks only at topologies in which the presence of the V boson is inferred from well separated decay products, complementing the previous VH search with merged decay products arising from boosted V bosons [35]. The search uses LHC \({\textrm{pp}}\) collision data collected during the years 2016–2018, corresponding to a total integrated luminosity of 138\(\,\text {fb}^{-1}\) at \(\sqrt{s}=13\,\text {Te}\hspace{-.08em}\text {V} \). This is the first time that these final states have been used by the CMS experiment to search for the \({\textrm{H}} \rightarrow \text {inv}\) process using data from 2016–2018.

The missing transverse momentum, \({\vec p}_{\textrm{T}}^{\hspace{1.0pt}\text {miss}}\), is the transverse component of the negative vector sum of all reconstructed particle momenta in an event, and has a magnitude \(p_{\textrm{T}} ^\text {miss}\). There are two main sources of background events resulting in \({\vec p}_{\textrm{T}}^{\hspace{1.0pt}\text {miss}}\) measurement. The first is events with invisible Z boson decays and visible jets (\({\textrm{Z}} \rightarrow \text {inv}\)). The second is referred to as the lost lepton background, \(\ell _{\text {lost}}\), where \(\ell \) stands for either an e or \(\upmu \). This includes events from \({{\textrm{t}} {}{\overline{{{{\textrm{t}}}}}}} + \text {jets}\) and \({\textrm{W}} + \text {jets}\) processes where one or more leptons are misreconstructed, excluded by the phase space selection, or fall outside the detector acceptance. Control regions (CRs) enriched in these background sources, requiring either one lepton, one photon, or two same-flavour opposite-sign leptons, are used to constrain these backgrounds from data. The hadronic recoil is defined as the vectorial sum of the \({\vec p}_{\textrm{T}}^{\hspace{1.0pt}\text {miss}}\) and the \(p_{\textrm{T}}\) of any selected charged lepton(s) or photon in an event, and its magnitude is used as the discriminating variable to separate the \({\textrm{H}} \rightarrow \text {inv}\) signal from backgrounds. The 95% \(\text {CL}\) upper limit on \({\mathcal {B}({\textrm{H}} \rightarrow \text {inv})}\) is extracted from a fit to the hadronic recoil distribution of selected events, performed across the signal regions (SRs) and CRs. In the SRs, the hadronic recoil is equivalent to the \(p_{\textrm{T}} ^\text {miss}\), while in the CRs it effectively measures the \(p_{\textrm{T}}\) of the V boson or photon. The exclusion of leptons and photons ensures good correspondence between SRs and CRs.

Fig. 1
figure 1

Representative LO Feynman diagrams for the SM Higgs boson production channels \({{\textrm{t}} {}{\overline{{{{\textrm{t}}}}}}} {\textrm{H}} \) and VH

This paper is organised as follows: Section 2 is a brief description of the CMS detector. The simulated samples used in this analysis are summarised in Sect. 3. Section 4 describes the event reconstruction and object definitions used in this analysis, while the event selection and event categorisation are detailed in Sect. 5. The data CRs used for estimating the SM backgrounds are introduced in Sect. 6. Section 7 describes the statistical procedure used to constrain the backgrounds and extract the signal. The results of the search are presented in Sect. 8. The results of combining this search with other CMS searches for invisibly decaying Higgs bosons are described in Sect. 9, and the results are summarised in Sect. 10.

2 The CMS detector

The CMS apparatus is a multipurpose, nearly hermetic detector, designed to trigger on [37, 38] and identify electrons, muons, photons, and (charged and neutral) hadrons [39,40,41]. The central feature of the CMS apparatus is a superconducting solenoid, of \(6~\textrm{m}\) internal diameter. Within the field volume are the silicon pixel and strip tracker, the crystal electromagnetic calorimeter (ECAL), and the brass-scintillator hadron calorimeter (HCAL). Muons are measured in gas-ionisation chambers embedded in the steel flux-return yoke of the magnet. Besides the barrel and endcap detectors, CMS has extensive forward calorimetry, performed on high \(\eta \) objects in the HCAL forward calorimeter, which is located \(11.2~\textrm{m}\) from the interaction region along the beam axis. A global “particle-flow” (PF) algorithm [42] aims to reconstruct all individual particles in an event, combining information provided by all subdetectors. The reconstructed particles are used to build \(\uptau \) leptons, jets, and \(p_{\textrm{T}} ^\text {miss}\)  [43,44,45].

The first level of the CMS trigger system, composed of custom hardware processors, uses information from the calorimeters and muon detectors to select the most interesting events, at a rate of roughly \(100~\textrm{kHz}\). The high-level trigger (HLT) processor farm performs event reconstruction similar to that of the full CMS reconstruction, but optimised for speed. This decreases the event rate from around \(100\text {\,kHz} \) to around \(1\text {\,kHz} \), before data storage [38].

The procedures for calculating the integrated luminosity recorded by the CMS detector for each data-taking year are documented in Refs. [46,47,48] for 2016–2018, respectively.

A more detailed description of the CMS experiment can be found in Ref. [49].

3 Simulated samples

Monte Carlo (MC) simulated events are used to model signal and background contributions in all analysis regions, except for quantum chromodynamics (QCD) multijet production processes, which are estimated from data using a dedicated control sample and simulation-based transfer factors. The method for estimating QCD multijet production is detailed in Sect. 6.2. In all cases, MC samples are produced using either powheg version 1.0 or higher [50] or MadGraph 5_amc@nlo version 2.4.2 or higher [51] matrix element (ME) generators. The ME is encoded with the maximum amount of information available for a hard scattering event. The parton-level simulation provided by the ME generators is interfaced with pythia version 8 [52] to model the shower and hadronisation of partons in the initial and final states, along with the underlying event description, using the tune CUETP8M1 (CP5) when simulating events for the 2016 (2017 and 2018) data-taking periods [53]. The propagation of all final state particles through the CMS detector is simulated using the Geant4  [54] toolkit. Samples for 2016 make use of the NNPDF3.0 LO or next-to-LO (NLO) parton distribution functions (PDFs) [55], whereas samples for the years 2017 and 2018 use the NNPDF3.1 next-to-NLO (NNLO) PDFs.

Processes featuring \({\textrm{H}} \rightarrow \text {inv}\) occurring in \({{\textrm{t}} {}{\overline{{{{\textrm{t}}}}}}} {\textrm{H}} \), VH, VBF, and ggH channels are modelled by powheg version 2.0 [56,57,58,59] at NLO in QCD. These samples require the SM Higgs boson to decay to four neutrinos (\({\textrm{H}} \rightarrow {\textrm{Z}} {\textrm{Z}} ^* \rightarrow 4{{\upnu }} \)) resulting in \({\mathcal {B}({\textrm{H}} \rightarrow \text {inv})}\) \(=1\). The cross sections are appropriately normalised to the corresponding SM predictions computed at NLO (\({{\textrm{t}} {}{\overline{{{{\textrm{t}}}}}}} {\textrm{H}} \)), NNLO (VH, VBF), and next-to-NNLO (ggH) accuracy in QCD, and to NLO accuracy in electroweak (EW) corrections [60]. Background ZH processes with the Higgs boson decaying to \({\textrm{b}} \overline{{\textrm{b}}}\) and the associated Z boson decaying to \({\textrm{b}} \overline{{\textrm{b}}}\), \({\ell \ell }\), and \({\textrm{q}} {\overline{{{{\textrm{q}}}}}}\) (where q represents a light or charm quark) are generated at NLO using MadGraph 5_amc@nlo with the FxFx [61] matching scheme for 2016 samples, and with powheg version 2.0 [62] for 2017 and 2018 samples.

The \({\textrm{V}} + \text {jets}\) processes are generated at LO in QCD using MadGraph 5_amc@nlo with up to four partons in the final state using the MLM [63] matching scheme between hard scatters and parton showers. These processes are generated in bins of hadronic transverse energy, \(H_{\textrm{T}}\), which is the magnitude of the \({\vec p}_{\textrm{T}}\) sum of all jets reconstructed at generator level. The LO simulation of \({\textrm{V}} + \text {jets}\) processes is corrected to account for missing higher-order diagrams with K-factors derived from MadGraph 5_amc@nlo-generated NLO QCD \({\textrm{V}} + \text {jets}\) processes with up to two partons. These K-factors are extracted as a function of boson \(p_{\textrm{T}}\) and the \(p_{\textrm{T}}\) of the leading jet in the event. These K-factors are extracted as a function of boson \(p_{\textrm{T}}\) and the \(p_{\textrm{T}}\) of the leading jet in the event, and typically vary between 0.5 and 1.5 depending on the boson \(p_{\textrm{T}}\).

The \({\upgamma } + \text {jets}\) processes are generated at NLO in QCD with MadGraph 5_amc@nlo, using a binning based on the \(p_{\textrm{T}}\) of the photon. The binning scheme for this sample is defined at the ME level to increase the statistical precision in the phase space regions probed by this analysis.

Processes including t-channel single t quarks, and t \(\overline{{{{\textrm{t}}}}}\) pairs with up to two additional partons in ME computation are generated at NLO with powheg version 2.0 [64, 65]. Single t quarks produced in the s channel are modelled using MadGraph 5_amc@nlo, and also in the tW channel using powheg version 1.0 [66]. The t quark \(p_{\textrm{T}}\) spectrum in t \(\overline{{{{\textrm{t}}}}}\) processes is corrected to match the spectrum obtained from the NNLO QCD \(+\) NLO EW simulation, following Ref. [67]. Rare \({{{\textrm{t}} {}{\overline{{{{\textrm{t}}}}}}} {\textrm{X}}}\) \(+\) jets backgrounds cover processes where t \(\overline{{{{\textrm{t}}}}}\) is produced in association with a boson X (\(\upgamma \), V, or a visibly decaying H), generated at NLO. The \({{{\textrm{t}} {}{\overline{{{{\textrm{t}}}}}}} {\upgamma }}\) \(+\) jets, \({{{\textrm{t}} {}{\overline{{{{\textrm{t}}}}}}} {\textrm{W}}}\) \(+\) jets, and \({{{\textrm{t}} {}{\overline{{{{\textrm{t}}}}}}} {\textrm{Z}}}\) \(+\) jets samples are generated using MadGraph 5_amc@nlo, with subsequent decays generated using MadSpin [68] to account for spin correlations in the former two cases. The \({{\textrm{t}} {}{\overline{{{{\textrm{t}}}}}}} {\textrm{H}} \)  \(+\) jets sample, where the H decays to visible states, is generated using powheg.

Diboson \({\textrm{ZZ}}\) and \({\textrm{WZ}}\) production processes are generated at LO using pythia, while the \({\textrm{WW}}\) process is simulated at NLO in QCD using the powheg version 2.0 [69]. The QCD multijet samples are generated at LO using MadGraph 5_amc@nlo in exclusive ranges of \(H_{\textrm{T}}\) in order to increase the statistical precision in the phase-space probed by this analysis.

4 Event reconstruction

During LHC runs, each beam crossing results in several \({\textrm{pp}}\) collisions in the detector. Additional \({\textrm{pp}}\) interactions within the same or nearby bunch crossing, known as pileup, make PF object reconstruction more challenging. The reduction of the effect of pileup relies on mitigation techniques [70] that filter energy deposits associated with pileup vertices and remove objects not associated with the primary interaction vertex (PV). The PV is the vertex associated with the hardest scattering in the event, according to tracking information, as described in Ref. [71]. All simulated samples from Section 3 are reweighted to match the pileup distribution observed in data. In the SR, the final state is required to contain jets, a sizeable hadronic recoil, and no isolated leptons or photons. Candidate leptons and photons are selected with \({p_{\textrm{T}} >10\,\text {Ge}\hspace{-.08em}\text {V}}\) and pseudorapidity \({|\eta | < 2.4}\) for muons [40], \({p_{\textrm{T}} >10\,\text {Ge}\hspace{-.08em}\text {V}}\) and either \({|\eta | < 1.44}\) or \({1.57< |\eta | < 2.5}\) for electrons [39], \({p_{\textrm{T}} >20\,\text {Ge}\hspace{-.08em}\text {V}}\) and \({|\eta | < 2.3}\) for hadronically decaying tau leptons [43], and \({p_{\textrm{T}} >15\,\text {Ge}\hspace{-.08em}\text {V}}\) and either \({|\eta | < 1.44}\) or \({1.57< |\eta | < 2.5}\) for photons [39]. These selection criteria are optimised to reject background contributions, mainly from QCD processes. Other selection criteria depend on the isolation of the lepton or photon from hadronic interactions in the detector within a cone of small (tight isolation) or large (loose isolation) radius. Loose identification and isolation criteria are used to veto candidate events in the SR that contain leptons or photons. The veto efficiencies are > 99, \(\simeq \) 95, and \(\simeq \) 90% for loose muons, electrons, and photons, respectively. The SR background contributions are estimated using \({\upmu } + \text {jets}\), \({\textrm{e}} + \text {jets}\), \({\upmu \upmu }+\) jets, ee \(+\) jets, and \({\upgamma } + \text {jets}\) CRs. Tight and loose identification and isolation criteria are used to select and count muons, electrons, and photons in the CRs, enhancing the purity at little expense to the efficiency. These achieve typical selection efficiencies of \(\simeq \) 95, 70, and 70 (\(\simeq \) 98, 95 and 90)%, for tight (loose) muons, electrons, and photons, respectively.

Jets are reconstructed by clustering all PF candidates originating from the PV with the anti-\(k_{\textrm{T}}\) jet clustering algorithm [72, 73], using a distance parameter \({R=0.4}\) (AK4). Jet momentum is determined as the vectorial sum of all particle momenta in the jet, and is found from simulation to be, on average, within 5–10% of the true momentum over the whole \(p_{\textrm{T}}\) spectrum and detector acceptance. Charged-hadron subtraction [44] is then applied to remove charged particles from pileup vertices [74]. To ensure the measured jet energy matches that of the particle level jets, jet energy corrections (JEC) derived from simulation as functions of \(p_{\textrm{T}}\) and \(\eta \) are applied. Further corrections are applied due to residual discrepancies in the jet energy scale (JES) between data and simulated samples [44]. Additionally, each jet must pass selection criteria to remove jets adversely affected by instrumentation or reconstruction failure. The jet energy resolution (JER) in simulated samples is smeared to match the measured resolution, which is typically 15–20% at 30\(\,\text {Ge}\hspace{-.08em}\text {V}\), 10% at 100\(\,\text {Ge}\hspace{-.08em}\text {V}\), and 5% at 1\(\,\text {Te}\hspace{-.08em}\text {V}\)  [44]. The AK4 jets are required to have \({p_{\textrm{T}} >30\,\text {Ge}\hspace{-.08em}\text {V}}\) and \({|\eta |<5.0}\), and those with loose leptons and photons located within a cone of \({\Delta R<0.4}\) of the jet direction are removed.

The AK4 jets that originate from the hadronisation of a bottom quark (b-tagged jets) are identified using the DeepCSV algorithm, which correctly identifies b jets with \({p_{\textrm{T}} >20\,\text {Ge}\hspace{-.08em}\text {V}}\) with a probability of 80% and has a charm or light jet mistag probability of 10% [75]. Simulated events containing b jets are corrected to be in agreement with the data by deriving corrections from data control samples that contain b jets.

Pileup effects are mitigated at the reconstructed particle level using the pileup per particle identification algorithm (PUPPI) [76, 77] by defining a local shape variable that can discriminate between particles originating from the PV and from pileup. Charged particles originating from pileup are discarded. For neutral particles, a local shape variable is computed based on the information from charged particles in their vicinity that originate from the PV within the tracker acceptance, and information from both charged and neutral particles outside this acceptance. The momenta of neutral particles are then rescaled based on the probability that they originated from the PV as deduced from the local shape variable [76].

When a high \(p_{\textrm{T}}\) t quark or V boson decays hadronically, a large set of collimated particles cross the detector. These can be clustered within a single jet of radius \(R=0.8\) (AK8) using the anti-\(k_{\textrm{T}}\) algorithm. In order to reduce pileup effects, PUPPI PF candidates are used to seed the AK8 jet finder. The main feature that distinguishes hadronically decaying t quarks or V bosons from the quark or gluon fragmentation is the jet mass. To improve the resolution, the modified mass-drop tagger algorithm [78,79,80] (also known as the soft-drop algorithm, SD) with the angular exponent \({\beta =0}\), soft cutoff threshold \({z_{\text {cut}}<0.1}\), and characteristic radius \({R_{0}=0.8}\) [81] is applied to each AK8 jet to remove soft and wide-angle radiation. In addition, a deep neural network (DNN) classifier called the DeepAK8 [82] algorithm is employed by assigning a set of numerical scores to each reconstructed AK8 jet corresponding to the probabilities that it originates from particular final states of V boson decays, for example \({{\textrm{Z}} \rightarrow \textrm{b}\bar{\textrm{b}}}\), \({{\textrm{Z}} \rightarrow {\textrm{q}} {\overline{{{{\textrm{q}}}}}}}\), \({{\textrm{W}} \rightarrow {\textrm{cs}}}\), rather than from QCD multijet processes. For this analysis, reconstructed AK8 jets originating from t quarks (W bosons) are selected by requiring \({p_{\textrm{T}} >400~(200)\,\text {Ge}\hspace{-.08em}\text {V}}\), SD mass \(m_{\text {SD}}\) between 120 and 210 (65 and 120)\(\,\text {Ge}\hspace{-.08em}\text {V}\), and a DeepAK8 probability score for t quarks (W bosons) larger than between 72.5 and 83.4 (91.8 and 92.5)% depending on the year of data-taking. The resulting t quark (W boson) tagging efficiency at the \({p_{\textrm{T}} >400~(200)\,\text {Ge}\hspace{-.08em}\text {V}}\) threshold limit is estimated from simulation as 28 (25)% with a 1% mistag rate from QCD jets. Simulated events containing AK8 jets are corrected to agree with the data using data-derived correction factors, and dedicated JEC are also applied [82].

The calculation of energy sums such as the hadronic recoil, \({\vec p}_{\textrm{T}}^{\hspace{1.0pt}\text {miss}}\), and \(\vec {H}_{\text {T}}^{\text {miss}}\), which is the negative \({\vec p}_{\textrm{T}}\) sum of jets reconstructed at the HLT level with a \(p_{\textrm{T}} ^\text {miss}\) threshold of 20\(\,\text {Ge}\hspace{-.08em}\text {V}\) applied, are based on AK4 jets, therefore JEC are propagated through the use of the \({\vec p}_{\textrm{T}}\)-corrected jets.

5 Event selection and categorisation

In this analysis the signal is extracted from a combined fit to the hadronic recoil distribution of events in SRs and CRs as defined for the \({{\textrm{t}} {}{\overline{{{{\textrm{t}}}}}}} {\textrm{H}} \) and VH categories. The CRs are used to estimate the contributions of different SM processes in each SR. Where possible, the CRs have kinematic requirements identical to the SR, and leptons or a photon are used in the CR definition, but otherwise ignored in the calculation of event observables. The \({\textrm{e}} + \text {jets}\) and \({\upmu } + \text {jets}\) CRs, enriched in \({\textrm{W}} + \text {jets}\) and t quark background processes, are used to derive corrections to \(\ell _{\text {lost}}\) contributions predicted by simulation. The ee \(+\) jets, \({\upmu \upmu }\,+\) jets, and, in the case of the VH category, \({\upgamma } + \text {jets}\) CR samples are used to derive corrections to the expected contribution from \({\textrm{Z}} + \text {jets}\) production, where the Z boson decays to a pair of neutrinos. A QCD multijet enriched CR (hadronic sideband) is also used to estimate hadronic backgrounds in the SR.

5.1 Trigger requirements

Events of interest are collected via a suite of triggers that are applied to variables calculated using PF candidates reconstructed at the level of the HLT. The trigger requirements vary amongst analysis regions and data-taking periods. Events in the SR, hadronic sideband, and muon CRs are collected using HLT selection criteria on \(p_{\textrm{T}} ^\text {miss}\) and the missing \(H_{\textrm{T}}\), \(H_\textrm{T}^\text {miss}\), which is the magnitude of \(\vec {H}_{\text {T}}^{\text {miss}}\). Muons are not considered in the calculation of PF \(p_{\textrm{T}} ^\text {miss}\) and PF \(H_\textrm{T}^\text {miss}\) to allow the same trigger to be used in the SR and the muon CRs, with a typical efficiency of >90% for \(p_{\textrm{T}} ^\text {miss}\) \(>250\,\text {Ge}\hspace{-.08em}\text {V} \). The use of the combined \(p_{\textrm{T}} ^\text {miss}\) and \(H_\textrm{T}^\text {miss}\) triggers in the muon CRs instead of single-muon triggers corresponds more closely to the selection in the SR and minimises selection biases. Trigger thresholds increase with time due to the increase in instantaneous luminosity during Run 2. In 2016, the \(p_{\textrm{T}} ^\text {miss}\) and \(H_\textrm{T}^\text {miss}\) thresholds vary between 90 and 120\(\,\text {Ge}\hspace{-.08em}\text {V}\). In 2017 and 2018, these thresholds are 120\(\,\text {Ge}\hspace{-.08em}\text {V}\). During data-taking in 2017, additional corrections were applied to account for the effect of ECAL endcap noise at high \(|\eta |\) on PF \(p_{\textrm{T}} ^\text {miss}\) measurements. Additionally, for 2016 and 2017 data-taking periods, there was an inefficiency arising from a gradual shift in the timing of the ECAL trigger inputs in the region \({|\eta |>2.0}\) [37]. This resulted in events containing an electron or photon (jet) with \({p_{\textrm{T}} >50~(100)\,\text {Ge}\hspace{-.08em}\text {V}}\) having an efficiency loss of up to 20%, depending on \(p_{\textrm{T}}\) and \(\eta \). Correction factors for this trigger inefficiency are obtained from 2016 and 2017 data and applied to simulation samples as a function of \(\eta \).

Events in the \({\textrm{e}} + \text {jets}\) and ee \(+\) jets CRs from the 2016, 2017, and 2018 data sets are required to pass a tight (loose) single-electron trigger with \(p_{\textrm{T}}\) thresholds of 27, 35, and 32 (105, 115, and 115)\(\,\text {Ge}\hspace{-.08em}\text {V}\), respectively. The low-threshold single-electron triggers require the electron candidate to pass a tight isolation condition, while the high-threshold trigger imposes a looser selection on the isolation to improve the efficiency at high \(p_{\textrm{T}}\). Photon events are required to pass a single-photon trigger with a \(p_{\textrm{T}}\) threshold of 175 (200)\(\,\text {Ge}\hspace{-.08em}\text {V}\) without any isolation condition for the 2016 (2017 and 2018) data sets. Simulated electron or photon events are accepted if they pass exactly one of the above trigger requirements, and the efficiency of this selection is corrected with data-derived efficiency correction factors.

5.2 Offline selection

In order to select events with a large amount of jet activity and sizeable hadronic recoil, a further offline selection is applied to all regions. To improve the purity of the signal, large missing energy is desirable, therefore events require the hadronic recoil, \(H_\textrm{T}^\text {miss}\), and \(H_{\textrm{T}}\) to be greater than 200\(\,\text {Ge}\hspace{-.08em}\text {V}\). Furthermore, the largest \(p_{\textrm{T}}\) of an AK4 jet in an event, \(\vec {p}^{~\text {j}}_\textrm{T,1}\), is required to be greater than 80\(\,\text {Ge}\hspace{-.08em}\text {V}\). To ensure consistency amongst different estimators of the hadronic recoil, the recoil as calculated from PF candidates in \(p_{\textrm{T}} ^\text {miss}\) and from PF jets in \(H_\textrm{T}^\text {miss}\) must satisfy \({H_\textrm{T}^\text {miss}/\text {recoil}<1.2}\) and azimuthal separation \({|\Delta \phi (\overrightarrow{\text {recoil}}, \vec {H}_{\text {T}}^{\text {miss}}) |<0.5}\). To further improve the quality of events, a selection is made on \(p_{\textrm{T},\text {track}}^{\text {miss}}\), which is equivalent to \(p_{\textrm{T}} ^\text {miss}\) but calculated using only charged PF particles, and therefore is expected to be well-aligned with the hadronic recoil direction. Requirements of \({p_{\textrm{T},\text {track}}^{\text {miss}} >60\,\text {Ge}\hspace{-.08em}\text {V}}\) and azimuthal separation \({|\Delta \phi (\overrightarrow{\text {recoil}}, \vec {p}_{\textrm{T},\text {track}}^{\text {miss}}) |<1}\) are applied in the SR and hadronic sideband. The kinematic selection for all regions is optimised according to the Asimov significance between signal (S) and background (B) yields assuming a background systematic uncertainty \(\Delta \text {B}\) of 5% or 10% [83]. The peaks of the distribution for a given variable corresponds to its selection threshold.

In order to facilitate the combination of this analysis with the results from other \({\textrm{H}} \rightarrow \text {inv}\) searches, additional selections are introduced to reduce the potential event overlap. A veto is implemented to ensure orthogonality with the VBF phase space, through a veto on events with leading (subleading) AK4 jets with \({|\eta _{1} |~(|\eta _{2} |)>2.4}\), and an inversion of the kinematic selection employed by the VBF \({\textrm{H}} \rightarrow \text {inv}\) analysis [36]. This removes events containing two AK4 jets with \(\vec {p}^{~\text {j}}_\textrm{T,1} >80\,\text {Ge}\hspace{-.08em}\text {V} \) and the subleading jet \(p_{\textrm{T}}\), \(\vec {p}^{~\text {j}}_\textrm{T,2}\), to be greater than 40\(\,\text {Ge}\hspace{-.08em}\text {V}\), where the jets are from opposite detector hemispheres (\({\eta _{1}\eta _{2} < 0}\)), have a large \(m_{\text {jj}}\) (>200\(\,\text {Ge}\hspace{-.08em}\text {V}\)), small azimuthal separation (\({\Delta \phi _{\text {jj}} < 1.5}\)), and a large \(\eta \) gap (\({|\eta _{m_{\text {jj}}} | > 1.0}\)). Moreover, orthogonality to leptonic \({{\textrm{t}} {}{\overline{{{{\textrm{t}}}}}}} {\textrm{H}} \) decays is ensured in the single-lepton CRs by requiring the transverse mass of the combined single-lepton and hadronic recoil system, defined as

$$\begin{aligned} m_\textrm{T}^{\ell } = \sqrt{2 p^{\ell }_{\textrm{T}} (\text {recoil}) [1 - \cos {(\phi (\vec {p}^{\ell }_{\textrm{T}}) - \phi (\overrightarrow{\text {recoil}}))}]}, \end{aligned}$$
(1)

to be lower than 110\(\,\text {Ge}\hspace{-.08em}\text {V}\). Orthogonality between leptonic \({{\textrm{t}} {}{\overline{{{{\textrm{t}}}}}}} {\textrm{H}} \) decays in the dilepton CRs is ensured by requiring the invariant mass of the charged lepton pair, \(m_{\ell \ell }\), to be lower than 120\(\,\text {Ge}\hspace{-.08em}\text {V}\) in these CRs. Selecting on the invariant masses of lepton pairs also suppresses the \({{\textrm{t}} {}{\overline{{{{\textrm{t}}}}}}} {\textrm{H}} \) signal contamination in the CRs. Overlap between the ggH/boosted VH \({\textrm{H}} \rightarrow \text {inv}\) analysis and the resolved VH category of this analysis is rendered negligible by explicitly removing events from the low-purity boosted VH category defined in Ref. [35] if they contain exactly two AK4 jets with an invariant mass, \(m_{\text {jj}}\), forming a dijet candidate with \(65< m_{\text {jj}} < 120\,\text {Ge}\hspace{-.08em}\text {V} \). No corresponding selection is necessary for the resolved VH category as a result, while there is negligible change to the sensitivity of the boosted VH category.

During significant periods of data-taking in 2018, the HCAL portion corresponding to the region \(-1.57<\phi <-0.87\), \(-3.0<\eta <-1.39\) was not functional. Events from 2018 with \(-1.8<\phi (\overrightarrow{\text {recoil}})<-0.6\) are vetoed if they contain jets within the affected region, which removes \(\approx \)65% of the total data from the affected region. To ensure good correspondence between data and simulation, the simulation is reweighted to account for the efficiency loss. A summary of the offline requirements are provided in Table 1.

Table 1 Offline selection applied to all categories and regions in this analysis to improve signal purity and reduce overlap with the phase space of other \({\textrm{H}} \rightarrow \text {inv}\) searches

5.3 Signal regions

The search focuses on three types of hadronic final states: those with boosted t quarks and/or boosted W bosons reconstructed with dedicated merged jet algorithms; those with one or more b jets and no boosted t quark or W boson, targetting the bulk of hadronic \({{\textrm{t}} {}{\overline{{{{\textrm{t}}}}}}} {\textrm{H}} \) events; and those with two resolved jets with the \(m_{\text {jj}}\) compatible with that of a W or Z boson. The latter complements the boosted VH channel analysed in Ref. [35].

Events are categorised into boosted and resolved \({{\textrm{t}} {}{\overline{{{{\textrm{t}}}}}}} {\textrm{H}} \), and resolved VH topologies. The \({{\textrm{t}} {}{\overline{{{{\textrm{t}}}}}}} {\textrm{H}} \) category requires that at least five AK4 jets and one b jet are present. The boosted \({{\textrm{t}} {}{\overline{{{{\textrm{t}}}}}}} {\textrm{H}} \) topology requires that at least one AK8 jet is reconstructed and either t- or W-tagged, and is subcategorised by the AK8 jet and b jet multiplicities. Events without such t- or W-tagged jets are categorised as belonging to a resolved \({{\textrm{t}} {}{\overline{{{{\textrm{t}}}}}}} {\textrm{H}} \) topology, with further selections on the leading AK4 jet (leading or subleading b jet) \({\vec p}_{\textrm{T}}\) and the hadronic recoil, \(|\Delta \phi (\overrightarrow{\text {recoil}}, \vec {p}^{~\text {j}}_\textrm{T,1}) |\) (\(|\Delta \phi (\overrightarrow{\text {recoil}}, \vec {p}^{~\text {b}}_\textrm{T,1}) |\) or \(|\Delta \phi (\overrightarrow{\text {recoil}}, \vec {p}^{~\text {b}}_\textrm{T,2}) |\)) applied to discriminate between \({{\textrm{t}} {}{\overline{{{{\textrm{t}}}}}}} {\textrm{H}} \) and \({{\textrm{t}} {}{\overline{{{{\textrm{t}}}}}}} + \text {jets}\) processes. Finally, the remaining events are allocated to the resolved VH topology category if they have exactly two AK4 jets with \(m_{\text {jj}}\) between 65 and 120\(\,\text {Ge}\hspace{-.08em}\text {V}\), compatible with a W or Z boson decay. The resolved VH subcategories are separated according to the b jet multiplicity. Subcategories are also defined based on \(\vec {p}^{~\text {j}}_\textrm{T,2}\) to suppress QCD multijet background. The subcategory definitions are summarised in Table 2. The intended outcome of this categorisation is a set of event samples with high purity for a given production mode, and minimal background contamination or signal cross-contamination.

Table 2 Categorisation of the \({{\textrm{t}} {}{\overline{{{{\textrm{t}}}}}}} {\textrm{H}} \) and VH production modes in the analysis. No additional selections are applied to the boosted \({{\textrm{t}} {}{\overline{{{{\textrm{t}}}}}}} {\textrm{H}} \) subcategories

A requirement on \(|\Delta \phi _\text {min}({{\vec p}_{\textrm{T}}^{\hspace{1.0pt}\text {miss}}, \vec {p}_\textrm{T,1234})} |\), defined as the minimum azimuthal separation between the hadronic recoil and the momentum direction of any of the four highest \(p_{\textrm{T}}\) jets, of \({>}0.5\) is applied to suppress QCD multijet events where the hadronic recoil is aligned with a jet. A parameter \(\tilde{\omega }_{\text {min}}\) is designed to suppress events where missing energy is the result of a jet \(p_{\textrm{T}}\) mismeasurement, and is especially effective in categories with no b jets. For the ith jet in the event, \(\omega _{i}\) is defined as \(\arctan {(H_{\textrm{T}, \text {min}}^{\text {miss}} / p_{\textrm{T},i})}\), where \(p_{\textrm{T},i}\) is the \(p_{\textrm{T}}\) of jet i, and \(H_{\textrm{T}, \text {min}}^{\text {miss}}\) is the minimum value of \(H_\textrm{T}^\text {miss}\) that can be obtained by changing the value of \(p_{\textrm{T},i}\). The value of \(\omega _{i}\) minimised over i is \(\tilde{\omega }_{\text {min}}\). A detailed derivation of this variable is given in Ref. [84]. QCD multijet events in the SR are further suppressed by requiring \(\tilde{\omega }_{\text {min}} > 0.3\). Requirements to suppress QCD events are applied in the SR only for \({{\textrm{t}} {}{\overline{{{{\textrm{t}}}}}}} {\textrm{H}} \) categories, and to both SR and CRs in the VH categories in order to ensure good correspondence amongst the regions. The selections applied to \(\tilde{\omega }_{\text {min}}\) and \(|\Delta \phi _\text {min}({{\vec p}_{\textrm{T}}^{\hspace{1.0pt}\text {miss}}, \vec {p}_\textrm{T,1234})} |\) are not applied in the CRs used for background estimation of the \({{\textrm{t}} {}{\overline{{{{\textrm{t}}}}}}} {\textrm{H}} \) categories, where the hadronic recoil does not stem from jet mismeasurement. This is to increase event counts in the CRs, particularly in the boosted \({{\textrm{t}} {}{\overline{{{{\textrm{t}}}}}}} {\textrm{H}} \) categories.

The hadronic recoil in \({{\textrm{t}} {}{\overline{{{{\textrm{t}}}}}}} {\textrm{H}} \) production is closely aligned with the direction of the Higgs boson typically. In t \(\overline{{{{\textrm{t}}}}}\) events, the \({\vec p}_{\textrm{T}}^{\hspace{1.0pt}\text {miss}}\) is usually parallel or antiparallel to the direction of the leading b jet, as the t quarks are produced back-to-back. Therefore, the angles between the direction of the hadronic recoil and the leading or subleading jet or b jet \({\vec p}_{\textrm{T}}\) directions provide additional features for t \(\overline{{{{\textrm{t}}}}}\) background suppression in the resolved \({{\textrm{t}} {}{\overline{{{{\textrm{t}}}}}}} {\textrm{H}} \) categories. The angular variables \(|\Delta \phi (\overrightarrow{\text {recoil}}, \vec {p}^{~\text {j}}_\textrm{T,1}) |\), \(|\Delta \phi (\overrightarrow{\text {recoil}}, \vec {p}^{~\text {b}}_\textrm{T,1}) |\), and \(|\Delta \phi (\overrightarrow{\text {recoil}},\vec {p}^{~\text {b}}_\textrm{T,2}) |\) are the most sensitive discriminators between \({{\textrm{t}} {}{\overline{{{{\textrm{t}}}}}}} {\textrm{H}} \) and t \(\overline{{{{\textrm{t}}}}}\). The selection based on these angular variables has been optimised by maximising the combined expected sensitivity of the \({{\textrm{t}} {}{\overline{{{{\textrm{t}}}}}}} {\textrm{H}} \) analysis and is summarised in Table 2.

6 Control regions and background estimation

The analysis makes use of the \({\upmu } + \text {jets}\) and \({\textrm{e}} + \text {jets}\) CRs to estimate \(\ell _{\text {lost}}\) background contributions, which are mainly from \({{\textrm{t}} {}{\overline{{{{\textrm{t}}}}}}} + \text {jets}\), single t quark, and \({\textrm{W}} + \text {jets}\) events. The background contributions from \({\textrm{Z}} \rightarrow \text {inv}\), which include \({\textrm{ZZ}}\), \({{{\textrm{t}} {}{\overline{{{{\textrm{t}}}}}}} {\textrm{Z}}}\), and Drell-Yan (DY) contributions, are estimated from the \({\upmu \upmu }\,+\) jets, ee \(+\) jets, and \({\upgamma } + \text {jets}\) CRs. Hadronic backgrounds in the SR such as QCD multijet contributions are estimated using a transfer factor method applied to a QCD enriched sideband CR.

6.1 Estimation of \(\ell _{\text {lost}}\) and \({\textrm{Z}} \rightarrow \text {inv}\) backgrounds

The \({\upmu } + \text {jets}\) (\({\textrm{e}} + \text {jets}\)) CR is defined by requiring exactly one tightly-isolated muon (electron) with \(p_{\textrm{T}} >20 (40)\,\text {Ge}\hspace{-.08em}\text {V} \). Both CRs require \({50< m_\textrm{T}^{\ell } < 110\,\text {Ge}\hspace{-.08em}\text {V}}\). The single-lepton CRs are used to constrain the \(\ell _{\text {lost}}\) background, which is the main source of background in the \({{\textrm{t}} {}{\overline{{{{\textrm{t}}}}}}} {\textrm{H}} \) and VH  2j2b categories. In the \({{\textrm{t}} {}{\overline{{{{\textrm{t}}}}}}} {\textrm{H}} \) category, the \(\ell _{\text {lost}}\) contribution arises mainly from t \(\overline{{{{\textrm{t}}}}}\), single t quark, and \({{{\textrm{t}} {}{\overline{{{{\textrm{t}}}}}}} {\textrm{V}}}\) processes, while in the VH category it is from W  \(+\) jet events.

In the \({\upmu \upmu }\,+\) jets (ee \(+\) jets) CR, one tightly-isolated muon (electron) with \({p_{\textrm{T}} >20~(40)\,\text {Ge}\hspace{-.08em}\text {V}}\), and one loose muon (electron) with the opposite charge and \(p_{\textrm{T}} >10 (10)\,\text {Ge}\hspace{-.08em}\text {V} \) are required with invariant mass, \(m_{\upmu \upmu }\) (\(m_{\textrm{ee}}\)), compatible with a Z boson. For the \({{\textrm{t}} {}{\overline{{{{\textrm{t}}}}}}} {\textrm{H}} \) (VH) category, the invariant mass is required to be between 75 and 105 (60 and 120)\(\,\text {Ge}\hspace{-.08em}\text {V}\). The processes \({\textrm{Z}} \rightarrow {{\upnu }} {{{{\upnu }}}} \) and \({{\textrm{Z}} \rightarrow \ell \ell }\) are kinematically nearly identical, largely due to lepton universality, hence the dilepton regions can be used to constrain the \({\textrm{Z}} \rightarrow \text {inv}\) background and minimise theoretical uncertainties. This is important for the \({\textrm{Z}} \rightarrow \text {inv}\) background, which dominates the VH category and contributes to the \({{\textrm{t}} {}{\overline{{{{\textrm{t}}}}}}} {\textrm{H}} \) category especially at high hadronic recoil. In the \({{\textrm{t}} {}{\overline{{{{\textrm{t}}}}}}} {\textrm{H}} \) category, events are selected for which \({\Delta \phi (\overrightarrow{\text {recoil}}, \vec {p}_{\textrm{T},\text {track}}^{\text {miss}})>\pi /2}\), which reduces the \({{\textrm{t}} {}{\overline{{{{\textrm{t}}}}}}} + \text {jets}\) background and favours DY production in the dilepton CRs.

The \({\upgamma } + \text {jets}\) CR is used for background estimation in the VH category only, and requires exactly one loose photon with \({p_{\textrm{T}} >230\,\text {Ge}\hspace{-.08em}\text {V}}\). This region is used to constrain the \({\textrm{Z}} \rightarrow \text {inv}\) background as the event kinematics and topologies are similar for \({\textrm{Z}} + \text {jets}\) and \({\upgamma } + \text {jets}\) events, improving the sensitivity to the VH signal primarily at high hadronic recoil compared to the dilepton CRs because of the larger number of events.

Photons can usually be discriminated from other sources of ECAL deposits using the properties of the deposits themselves, such as isolation in ECAL and HCAL, or the shape of the electromagnetic showers. However, occasionally other particles will be incorrectly identified as photons, for example where a jet is misidentified as a photon in QCD multijet events. In order to estimate the contribution from misidentified photons in the \({\upgamma } + \text {jets}\) CR, a purity measurement is performed. The purity is defined as the fraction of reconstructed photon candidates that correspond to genuine isolated photons originating from the PV in the event. The photon purity is measured in data based on the lateral width \(\sigma _{\eta \eta } \) [85], which parametrises the shape of the energy deposit associated with the photon in the ECAL. The characteristic \(\sigma _{\eta \eta } \) distribution from genuine photons peaks at \(\sigma _{\eta \eta } <1\), while the distribution due to misidentified photons possesses a less pronounced peak with a much broader decline for \(\sigma _{\eta \eta } >1\). A template fit to the \(\sigma _{\eta \eta } \) distribution is performed, where for genuine photons simulated \({\upgamma } + \text {jets}\) events are used to build the signal templates, while for misidentified photons a data sample enriched in misidentified photon events is obtained by inverting the isolation requirements in the \({\upgamma } + \text {jets}\) CR. The purity is defined as the fraction of genuine photons extracted from the fit that pass the \(\sigma _{\eta \eta } \) selection. The photon purity is measured separately in bins of \(p_{\textrm{T}}^{{\upgamma }}\) and for each data-taking period and varies between 1.5 and 4.5%. The contamination is the fraction of misidentified photons in the \({\upgamma } + \text {jets}\) CR, and is estimated at around 4% for \({p_{\textrm{T}}^{{\upgamma }}>200\,\text {Ge}\hspace{-.08em}\text {V}}\). The QCD multijet contribution in the \({\upgamma } + \text {jets}\) CR is then estimated by weighting events in data for each \(p_{\textrm{T}}^{{\upgamma }}\) bin by the corresponding contamination. A 25% systematic uncertainty is attributed to the QCD multijet background normalisation, and is estimated by performing the procedure for different \(\sigma _{\eta \eta } \) binning in the template fit, which accounts for any mismodelling of the simulated \(\sigma _{\eta \eta } \) distribution. The statistical uncertainty in the photon purity estimate in each \(p_{\textrm{T}}^{{\upgamma }}\) bin is found to be much smaller than the systematic one. The full requirements for the analysis CRs are shown in Table 3.

Table 3 Summary of all CR requirements, excluding selections suppressing the QCD multijet background, and excluding the requirement of \({\Delta \phi (\overrightarrow{\text {recoil}},\vec {p}_{\textrm{T},\text {track}}^{\text {miss}})>\pi /2}\) applied to the \({{\textrm{t}} {}{\overline{{{{\textrm{t}}}}}}} {\textrm{H}} \) category in the dilepton CRs. No mass requirements are imposed in the \({\upgamma } + \text {jets}\)

6.2 Residual backgrounds from QCD multijet production

The event selection aims to reduce background contributions from QCD multijet production as much as possible by requiring \({|\Delta \phi _\text {min}({{\vec p}_{\textrm{T}}^{\hspace{1.0pt}\text {miss}}, \vec {p}_\textrm{T,1234})} | >0.5}\) and \({\tilde{\omega }_{\text {min}} >0.3}\), although a QCD multijet background enriched sideband is used to estimate any remaining background contribution with the help of a transfer factor between sideband and SR, which is derived from simulation. The sideband is defined with an identical selection to that of the SR, but with an inversion on the requirements on \(|\Delta \phi _\text {min}({{\vec p}_{\textrm{T}}^{\hspace{1.0pt}\text {miss}}, \vec {p}_\textrm{T,1234})} |\) and \(\tilde{\omega }_{\text {min}}\), such that \(|\Delta \phi _\text {min}({{\vec p}_{\textrm{T}}^{\hspace{1.0pt}\text {miss}}, \vec {p}_\textrm{T,1234})} | <0.5\) and more stringently \(\tilde{\omega }_{\text {min}} < 0.2\). The criteria for \(\tilde{\omega }_{\text {min}}\) is determined by optimising the sideband to be as QCD multijet-enriched as possible while ensuring the SR has negligible QCD multijet background. For the VH category, the \(m_{\text {jj}}\) requirement is also inverted in order to have the sideband sufficiently populated.

The SRs in both the \({{\textrm{t}} {}{\overline{{{{\textrm{t}}}}}}} {\textrm{H}} \) and VH categories suffer from limited simulated QCD multijet event counts, so it is not possible to reliably define a transfer factor for each SR bin in individual subcategories. Within the statistical precision of the simulated QCD multijet samples, the shape of the hadronic recoil and relative population of the \({{\textrm{t}} {}{\overline{{{{\textrm{t}}}}}}} {\textrm{H}} \) subcategories are observed not to depend on \(\tilde{\omega }_{\text {min}}\) and \(|\Delta \phi _\text {min}({{\vec p}_{\textrm{T}}^{\hspace{1.0pt}\text {miss}}, \vec {p}_\textrm{T,1234})} |\). Therefore, the expected QCD sideband yields are integrated over all \({{\textrm{t}} {}{\overline{{{{\textrm{t}}}}}}} {\textrm{H}} \) subcategories and hadronic recoil intervals, and over hadronic recoil intervals for each VH category, in the sideband and SR, to construct the transfer factors. The resulting hadronic recoil distributions are used to predict the relative QCD multijet background in each subcategory and hadronic recoil interval.

The estimated QCD multijet background yield in the \({{\textrm{t}} {}{\overline{{{{\textrm{t}}}}}}} {\textrm{H}} \) SR for subcategory i and hadronic recoil interval j, \(N^{\text {QCD, SR}_{{{\textrm{t}} {}{\overline{{{{\textrm{t}}}}}}} {\textrm{H}} }}_{i,j}\), is given by

$$\begin{aligned} N^{\text {QCD, SR}_{{{\textrm{t}} {}{\overline{{{{\textrm{t}}}}}}} {\textrm{H}} }}_{i,j}= & {} \sum _p \sum _q \Bigg (N^{\text {data, CR}_{{{\textrm{t}} {}{\overline{{{{\textrm{t}}}}}}} {\textrm{H}} }}_{p,q}- N^{\text {EW, CR}_{{{\textrm{t}} {}{\overline{{{{\textrm{t}}}}}}} {\textrm{H}} }}_{p,q}\Bigg ) \text {TF}^{{{\textrm{t}} {}{\overline{{{{\textrm{t}}}}}}} {\textrm{H}} }_{\text {QCD}} f^{{{\textrm{t}} {}{\overline{{{{\textrm{t}}}}}}} {\textrm{H}} }_{c_{i}} f^{{{\textrm{t}} {}{\overline{{{{\textrm{t}}}}}}} {\textrm{H}} }_{m_{j}}, \quad \end{aligned}$$
(2)

where EW refers to processes that are not QCD multijet, summation indices p and q are the subcategory and hadronic recoil bins, respectively, \(\text {TF}^{{{\textrm{t}} {}{\overline{{{{\textrm{t}}}}}}} {\textrm{H}} }_{\text {QCD}}\) is the QCD multijet simulation transfer factor defined as the ratio between the expected QCD multijet background contribution in the SR and the sideband, and \(f^{{{\textrm{t}} {}{\overline{{{{\textrm{t}}}}}}} {\textrm{H}} }_{c_{i}}\) and \(f^{{{\textrm{t}} {}{\overline{{{{\textrm{t}}}}}}} {\textrm{H}} }_{m_{j}}\) are the fractions of simulated QCD multijet events in each subcategory and hadronic recoil bin, respectively.

In the VH category, the sideband regions are defined for each subcategory, as the number of simulated QCD multijet events is sufficient to derive the hadronic recoil fractions \(f_{m_{j}}\) separately for each subcategory. The method is otherwise analogous to that of \({{\textrm{t}} {}{\overline{{{{\textrm{t}}}}}}} {\textrm{H}} \), given by Eq. 2.

The results of the QCD prediction aggregated over data sets from the 2016–2018 period are found to be small in comparison to background contributions from \(\ell _{\text {lost}}\) and \({\textrm{Z}} \rightarrow \text {inv}\) processes. In addition to the statistical uncertainties, a 100% systematic uncertainty is assigned to the predicted background yields from QCD multijet production. The actual uncertainty in the QCD prediction is measured at around 50%, derived by calculating the QCD contribution in the entire \({{\textrm{t}} {}{\overline{{{{\textrm{t}}}}}}} {\textrm{H}} \) category for a signal-depleted validation region analogous to the SR but requiring \(0.2<\tilde{\omega }_{\text {min}} <0.3\) and \({|\Delta \phi _\text {min}({{\vec p}_{\textrm{T}}^{\hspace{1.0pt}\text {miss}}, \vec {p}_\textrm{T,1234})} |}>0.5\), and comparing the estimate to data. It is inflated to 100% to be more conservative when handling the individual \({{\textrm{t}} {}{\overline{{{{\textrm{t}}}}}}} {\textrm{H}} \) subcategories that are limited by event counts at larger hadronic recoil, which was found to have negligible impact on the final fit.

7 Statistical interpretation

A maximum likelihood fit method is used to obtain an upper limit on \({\mathcal {B}({\textrm{H}} \rightarrow \text {inv})}\). The fit is performed simultaneously across each year, region, category, and hadronic recoil interval, with systematic uncertainties acting as nuisance parameters in the fit correlated to varying degrees across year and category.

7.1 Likelihood model

The limits on \({\mathcal {B}({\textrm{H}} \rightarrow \text {inv})}\) are extracted via a simultaneous binned maximum likelihood fit to the hadronic recoil distributions obtained in the SR and CRs. The likelihood can be written as

$$\begin{aligned} \mathcal {L}=\mathcal {L}_{\text {SR}}~\mathcal {L}_{{\upmu }}~\mathcal {L}_{{\textrm{e}}} ~\mathcal {L}_{{\upmu } {\upmu }}~\mathcal {L}_{\textrm{ee}}~\mathcal {L}_{{\upgamma }}, \end{aligned}$$
(3)

where \(\mathcal {L}_{\text {SR}}\) is the likelihood function for the SR (boosted \({{\textrm{t}} {}{\overline{{{{\textrm{t}}}}}}} {\textrm{H}} \), resolved \({{\textrm{t}} {}{\overline{{{{\textrm{t}}}}}}} {\textrm{H}} \), VH), and \(\mathcal {L}_{{\upmu }}\), \(\mathcal {L}_{{\textrm{e}}}\), \(\mathcal {L}_{{\upmu } {\upmu }}\), \(\mathcal {L}_{{\textrm{ee}}}\), and \(\mathcal {L}_{{\upgamma }}\) designate the likelihood functions for the \({\upmu } + \text {jets}\), \({\textrm{e}} + \text {jets}\), \({\upmu \upmu }+\) jets, ee \(+\) jets, and \({\upgamma } + \text {jets}\) CRs, respectively. The likelihood function for the SR is defined as

$$\begin{aligned} \mathcal {L}_\text {SR}=\prod _{\text {cat}=i}^{n_{\text {cat}}} \prod _{\text {recoil}=j(i)}^{n_{\xi (i)}}\text {Poisson}\left( n_\text {obs}^{i,j}\mid n^{i,j}_\text {pred}\right) , \end{aligned}$$
(4)

with

$$\begin{aligned} \begin{aligned} n^{i,j}_\text {pred}&= \hat{\mu } s^{i,j} \rho _{s}^{i,j}\\&\quad + b^{i,j}_{\ell _{\text {lost}}} I^{i,j} \rho _{\ell _{\text {lost}}}^{i,j}\\&\quad + b^{i,j}_{{\textrm{Z}} \rightarrow \text {inv}} L^{i,j} \rho _{{\textrm{Z}} \rightarrow \text {inv}}^{i,j}\\&\quad + b^{i,j}_\text {QCD} \rho ^{i,j}_\text {QCD}, \end{aligned} \end{aligned}$$
(5)

where the symbols are defined in Table 4. The signal strength, \(\hat{\mu }\), is interpreted as the maximum likelihood estimator for \({\mathcal {B}({\textrm{H}} \rightarrow \text {inv})}\), where the signal prediction assumes that \({\mathcal {B}({\textrm{H}} \rightarrow \text {inv})} =1\). The fit also includes additional free parameters \(I^{i,j}\) and \(L^{i,j}\), which depend on category i, hadronic recoil bin j, and the number of recoil bins in each category \(n_{\xi (i)}\). The first of these parameters, \(I^{i,j}\), simultaneously scales the normalisation of the \(\ell _{\text {lost}}\) background in the SR and the sum of the \({\textrm{W}} + \text {jets}\), t \(\overline{{{{\textrm{t}}}}}\)  \(+\) jets, and single t quark backgrounds, \(\text {X}^{i,j}_{{\textrm{t}},{\textrm{W}}}\), in the \({\upmu } + \text {jets}\) and \({\textrm{e}} + \text {jets}\) CRs. The second of these parameters, \(L^{i,j}\), simultaneously scales the normalisation of the \({\textrm{Z}} \rightarrow \text {inv}\) background in the SR (Z(\({{{\upnu }} {{{{\upnu }}}}}\)\(+\) jets and \({{{\textrm{t}} {}{\overline{{{{\textrm{t}}}}}}} {\textrm{Z}} ({{\upnu }} {{{{\upnu }}}})}\)) and the sum of the \(\upgamma \)  \(+\) jets, DY \(+\) jets, \({{{\textrm{t}} {}{\overline{{{{\textrm{t}}}}}}} {\textrm{Z}}}\) \(+\) jets, and multiboson backgrounds, \(\text {X}^{i,j}_{{\textrm{Z}}/{\upgamma }}\), in the \({\upmu \upmu } +\) jets, ee \(+\) jets, and \({\upgamma } + \text {jets}\) CRs.

Table 4 Meaning of the symbols used in Eqs. 4 and 5 that define the likelihood function

The likelihood for the \({\upmu } + \text {jets}\) and \({\textrm{e}} + \text {jets}\) CRs is given by

(6)

and for the \({\upmu \upmu }\,+\) jets, ee \(+\) jets, and \({\upgamma } + \text {jets}\) CRs is given by

$$\begin{aligned}{} & {} \mathcal {L}_{\upmu \upmu ,{\textrm{ee}},{\upgamma }}= \prod _{\text {cat}=i}^{n_\text {cat}}\prod _{\text {recoil}=j(i)}^{n_{\xi (i)}} \nonumber \\{} & {} \quad \times \text {Poisson}\left( n_\text {obs}^{i,j}\mid \text {X}^{i,j}_{{\textrm{Z}}/{\upgamma }}L^{i,j}\rho _{{\textrm{Z}}/{\upgamma }}^{i,j} + \text {X}^{i,j}_\text {other} \rho _\text {other}^{i,j}\right) , \end{aligned}$$
(7)

where X\(^{i,j}\) is the sum of background yields, and \(\rho ^{i,j}\) refers to the associated systematic uncertainty.

Because of the low event counts in the dilepton CRs, the subcategory yields are summed into the boosted and resolved \({{\textrm{t}} {}{\overline{{{{\textrm{t}}}}}}} {\textrm{H}} \) categories. For the boosted \({{\textrm{t}} {}{\overline{{{{\textrm{t}}}}}}} {\textrm{H}} \) category, the \({\upmu \upmu }+\) jets and ee \(+\) jets CR yields are summed together to form a single \(\ell \ell + \text {jets}\) CR. Furthermore, in the boosted and resolved \({{\textrm{t}} {}{\overline{{{{\textrm{t}}}}}}} {\textrm{H}} \) subcategories, \(I^{i,j}\) are shared across subcategories, therefore i takes only two values corresponding to the boosted and resolved \({{\textrm{t}} {}{\overline{{{{\textrm{t}}}}}}} {\textrm{H}} \) classes.

7.2 Systematic uncertainties

The model on which the maximum likelihood fit is based is inclusive of experimental and theoretical uncertainties. These are modelled as nuisance parameters, which are typically constrained by a template fit where there is a dependence on the hadronic recoil distribution, but are otherwise constrained by a log-normal distribution for those that affect the overall normalisation of a given process.

Theoretical uncertainties related to the PDF parameters and missing higher order corrections in the QCD and EW perturbative expansions are estimated by following the procedure outlined in Ref. [60] for \({{\textrm{t}} {}{\overline{{{{\textrm{t}}}}}}} {\textrm{H}} \) and VH processes, and in Ref. [86] for \({\textrm{V}} + \text {jets}\) and \({\upgamma } + \text {jets}\) processes. Systematic uncertainties related to the PDF, and the renormalisation and factorisation scales, are treated as independent nuisance parameters but are correlated across years in the fit.

A photon normalisation uncertainty of 40% is included in the \({\upgamma } + \text {jets}\) CR, to cover uncertainties in the translation between the \({\upgamma } + \text {jets}\) and \({{\textrm{Z}} \rightarrow \ell \ell }\) yields, and is only correlated between 2017 and 2018 samples given the \({\upgamma } + \text {jets}\) sample for 2016 is generated with a different tune.

Data-derived correction factors are applied to simulated events containing b, t, and W jets, and therefore the systematic uncertainties due to the limited precision in these corrections are propagated to the simulated samples. These are referred to as tagging uncertainties, and also account for the uncertainties in the tagging efficiencies and misidentification probabilities. The tagging methods and uncertainty propagation are consistent between years, and therefore are correlated across years in the fit.

The uncertainty in the combined PF \(p_{\textrm{T}} ^\text {miss}\) and \(H_\textrm{T}^\text {miss}\) trigger efficiency is computed using the \({\upmu } + \text {jets}\) and \({\upmu \upmu } +\) jets CRs. These are independent of the \(p_{\textrm{T}} ^\text {miss}\) and \(H_\textrm{T}^\text {miss}\) data sets in the SR, ensuring an unbiased measurement of the uncertainty. This uncertainty is measured at 2%, and is applied independently in each year of data-taking due to variations in the trigger performance. The same uncertainty is measured in the electron and photon trigger efficiency, and is similarly uncorrelated between years. An additional trigger inefficiency uncertainty due to the mistiming of ECAL trigger inputs detailed in Sect. 5.1 is applied to the data-taking years 2016 and 2017.

The uncertainty in the integrated luminosity varies between 1.2–2.5% depending on the data-taking year, with an overall uncertainty of 1.6% for the 2016–2018 period [46,47,48]. The uncertainty is applied with correlated and uncorrelated components across years.

The uncertainties considered in the analysis are presented in Table 5 with the pre-fit ranges corresponding to the maximum and minimum deviations of the event yields from their nominal values across each region, year of data-taking, category, recoil bin, and all SM background processes, when the respective systematic uncertainty is changed within ±1 standard deviation. Systematic uncertainties not specified above are typically assumed to be uncorrelated from year to year when performing the fit. Those for which the source of the systematic uncertainty is identical for each year are treated as correlated. All systematic uncertainties are correlated across regions.

The overall experimental uncertainty is found to be dominated by W tagging for the \({{\textrm{t}} {}{\overline{{{{\textrm{t}}}}}}} {\textrm{H}} \) and b tagging for the VH categories in the SR. The lepton and photon candidate efficiencies for identification, isolation, and reconstruction, and uncertainties in the JER, JES, and trigger efficiencies also make significant contributions. The theoretical uncertainty is dominated by variations in the renormalisation scale, factorisation scale, and PDF for \({\textrm{V}} + \text {jets}\) processes, although these are particularly sensitive to the high exclusive jet multiplicity characterising the \({{\textrm{t}} {}{\overline{{{{\textrm{t}}}}}}} {\textrm{H}} \) and VH categories.

Table 5 The ranges corresponding to the maximum and minimum deviations of the event yields from their nominal values, provided where applicable across each region, year of data-taking, category, recoil bin, and all SM background processes, when the respective systematic uncertainty is changed within ±1 standard deviation

8 Results

The hadronic recoil distributions across all \({{\textrm{t}} {}{\overline{{{{\textrm{t}}}}}}} {\textrm{H}} \) and VH subcategories are shown in Figs. 2, 3, 4 and 5. The predicted background yield from the fit to the CRs only is shown with the result of a fit including the data in the SR. The agreement between the data and simulation is presented below each distribution, with the uncertainty in the predicted background uncertainty (Bkg. unc.) accounting for both systematic and simulated statistical contributions. Figures 2 (3) shows the \({\upmu } + \text {jets}\) (\({\textrm{e}} + \text {jets}\)) CR yields for the \({{\textrm{t}} {}{\overline{{{{\textrm{t}}}}}}} {\textrm{H}} \) and VH categories, respectively, aggregated over 2016–2018. In these CRs, \(\ell _{\text {lost}}\) background from t \(\overline{{{{\textrm{t}}}}}\), \({\textrm{W}} \rightarrow \ell {{\upnu }} \), and single t quark production dominates, with smaller contributions from multiboson and \({{{\textrm{t}} {}{\overline{{{{\textrm{t}}}}}}} {\textrm{X}}}\) processes. The \({\upmu \upmu }+\) jets, ee \(+\) jets, \(\ell \ell + \text {jets}\) (only for \({{\textrm{t}} {}{\overline{{{{\textrm{t}}}}}}} {\textrm{H}} \)), and \({\upgamma } + \text {jets}\) (only for VH) CR distributions used for the prediction of backgrounds stemming from \({\textrm{Z}} \rightarrow \text {inv}\) decays are shown in Fig. 4 for 2016–2018. In addition, the total SM background prediction in the SR, consisting of \(\ell _{\text {lost}}\), \({\textrm{Z}} \rightarrow \text {inv}\), and QCD backgrounds, is shown for the \({{\textrm{t}} {}{\overline{{{{\textrm{t}}}}}}} {\textrm{H}} \) and VH category in Fig. 5. The SR distributions contain all the Higgs boson production modes in the fitted \({\mathcal {B}({\textrm{H}} \rightarrow \text {inv})}\) signal, including the ggH and VBF contamination in the \({{\textrm{t}} {}{\overline{{{{\textrm{t}}}}}}} {\textrm{H}} \) and VH categories, with the prevalence of the ggH process due to its high production cross section. The post-fit event yields for each subcategory and recoil bin in the SR are tabulated in Table 6. For these results, a fit assuming \({\mathcal {B}({\textrm{H}} \rightarrow \text {inv})}\) \(=0\) such that only SM background contributions are considered (B-only) is performed simultaneously using only the CRs (CR only), which are independent of the SR, or across both SR and CRs (CR\(+\)SR). A fit across all regions, including signal and background contributions (S\(+\)B fit), is also performed, in which the signal contribution is weighted by the best-fit signal strength, \({\mathcal {B}({\textrm{H}} \rightarrow \text {inv})}\). In all cases, uncertainties are inclusive of statistical and systematic contributions.

Fig. 2
figure 2

Distributions of hadronic recoil in the \({{\textrm{t}} {}{\overline{{{{\textrm{t}}}}}}} {\textrm{H}} \) (upper plot) and VH (lower plot) categories for the \({\upmu } + \text {jets}\) CR. The black histogram shows the total background (bkg.) prediction from a CR only, B-only fit, while the red histogram shows the yields from a CR + SR S + B fit. The uncertainty in the predicted background (Bkg. unc.) accounts for both systematic and simulated statistical contributions

Fig. 3
figure 3

Distributions of hadronic recoil in the \({{\textrm{t}} {}{\overline{{{{\textrm{t}}}}}}} {\textrm{H}} \) (upper plot) and VH (lower plot) categories for the \({\textrm{e}} + \text {jets}\) CR. The black histogram shows the total background (bkg.) prediction from a CR only, B-only fit, while the red histogram shows the yields from a CR + SR S + B fit. The uncertainty in the predicted background (Bkg. unc.) accounts for both systematic and simulated statistical contributions

Fig. 4
figure 4

Distributions of hadronic recoil in the \({{\textrm{t}} {}{\overline{{{{\textrm{t}}}}}}} {\textrm{H}} \) category for the \({\upmu \upmu } +\) jets, ee \(+\) jets, and \(\ell \ell + \text {jets}\) CRs (upper plot), and the VH category for the \({\upmu \upmu }+\) jets, ee \(+\) jets, and \({\upgamma } + \text {jets}\) CRs (lower plot). The black histogram shows the total background (bkg.) prediction from a CR only, B-only fit, while the red histogram shows the yields from a CR + SR S + B fit. The uncertainty in the predicted background (Bkg. unc.) accounts for both systematic and simulated statistical contributions

Fig. 5
figure 5

Distributions of hadronic recoil in the \({{\textrm{t}} {}{\overline{{{{\textrm{t}}}}}}} {\textrm{H}} \) (upper plot) and VH (lower plot) categories for the SR, showing the signal contributions from \({{\textrm{t}} {}{\overline{{{{\textrm{t}}}}}}} {\textrm{H}} \), VH, ggH, and VBF weighted by \({\mathcal {B}({\textrm{H}} \rightarrow \text {inv})} =0.07\). The black histogram shows the total background (bkg.) prediction from a CR only, B-only fit, while the red histogram shows the yields from a CR + SR S + B fit. The uncertainty in the predicted background (Bkg. unc.) accounts for both systematic and simulated statistical contributions

Table 6 Total post-fit yields in the SRs in each recoil bin and analysis category obtained by summing the contributions from the individual data-taking periods. B-only fits are performed for either CR\(+\)SR or CR only cases. The extracted signal yields from an S\(+\)B fit are also reported, where the signal strength is weighted by \({{\mathcal {B}({\textrm{H}} \rightarrow \text {inv})} =0.07}\)
Fig. 6
figure 6

Left: observed and expected limits at 95% \(\text {CL}\) for the \({{\textrm{t}} {}{\overline{{{{\textrm{t}}}}}}} {\textrm{H}} \) and VH categories using 2016–2018 data. Right: the profile likelihood scan corresponding to observed and expected (where \({\mathcal {B}({\textrm{H}} \rightarrow \text {inv})} =0\)) limits in the fit to the \({{\textrm{t}} {}{\overline{{{{\textrm{t}}}}}}} {\textrm{H}} \) and VH categories

The best-fit value for \(\hat{\mu }\) and corresponding 68 and 95% \(\text {CL}\) confidence intervals are extracted following the procedure outlined in Refs. [87, 88]. The computing of upper limits adheres to the CLs criterion [89, 90] under the asymptotic approximation [83]. The upper limits on \({\mathcal {B}({\textrm{H}} \rightarrow \text {inv})}\) as extracted from the likelihood fit presented in Sect. 7.1 are found to be 0.43 (0.52 expected) and 0.74 (0.53 expected) at 95% \(\text {CL}\) for the \({{\textrm{t}} {}{\overline{{{{\textrm{t}}}}}}} {\textrm{H}} \) and VH categories, respectively, with a combined upper limit of 0.54 (0.39 expected). These results are shown in Fig. 6 together with the observed and expected profile likelihood distribution. The expected distribution assumes \({\mathcal {B}({\textrm{H}} \rightarrow \text {inv})} =0\). The results are compatible with the background expectation. The best-fit \({\mathcal {B}({\textrm{H}} \rightarrow \text {inv})}\) for the \({{\textrm{t}} {}{\overline{{{{\textrm{t}}}}}}} {\textrm{H}} \) and VH categories is \(\hat{\mu }=0.07_{-0.10}^{+0.10}(\text {stat.})\) \(_{-0.17}^{+0.18}(\text {syst.})\) (\(0.00_{-0.10}^{+0.10}(\text {stat.})\) \(_{-0.16}^{+0.17}(\text {syst.})\) expected), where the pre-fit normalisation assumes that \({\mathcal {B}({\textrm{H}} \rightarrow \text {inv})}\) \(=1\). The systematic uncertainty with the largest impact on the \({\mathcal {B}({\textrm{H}} \rightarrow \text {inv})}\) measurement for the \({{\textrm{t}} {}{\overline{{{{\textrm{t}}}}}}} {\textrm{H}} \) and VH categories using 2016–2018 data are those associated with the JES, while the statistical uncertainty contributes significantly to the overall uncertainty on \({\mathcal {B}({\textrm{H}} \rightarrow \text {inv})}\). The breakdown of the impacts into uncertainty groups are presented in Table 7, together with the expectation assuming \({\mathcal {B}({\textrm{H}} \rightarrow \text {inv})}\) \(=0\). The best-fit estimate for the \({{\textrm{t}} {}{\overline{{{{\textrm{t}}}}}}} {\textrm{H}} \) (VH) category is \(\hat{\mu }=-0.16_{-0.26}^{+0.26}\) \((0.00_{-0.25}^{+0.26})\) (\(\hat{\mu }=0.28_{-0.27}^{+0.27}\) \((0.00_{-0.26}^{+0.27})\)).

9 Combined \({\textrm{H}} \rightarrow \text {inv}\) limits

A variety of production modes of the Higgs boson can be used for searches for \({\textrm{H}} \rightarrow \text {inv}\) decays. A combination of the results of this analysis, analyses covering the years 2016–2018, and earlier published CMS combination results using Run 1 (years 2011–2012) and 2015 data [30] at \(\sqrt{s}=7,~8,~\)and 13\(\,\text {Te}\hspace{-.08em}\text {V}\), detailed in Table 8, is performed by means of a combined likelihood fit in which systematic uncertainties are correlated across search regions where appropriate. Unless explicitly specified below, parameters of the individual likelihood functions are treated as independent.

Table 7 The observed and expected impacts on \({\mathcal {B}({\textrm{H}} \rightarrow \text {inv})}\) for different groups of uncertainties, where the expected results are produced with \({\mathcal {B}({\textrm{H}} \rightarrow \text {inv})}\) \(=0\)
Table 8 Data sets and their respective integrated luminosities used for each production mode across Run 1 and Run 2. For some data-taking periods, no \({\textrm{H}} \rightarrow \text {inv}\) search have been performed for the given production mode, and are not included in the combination

For the \({{\textrm{t}} {}{\overline{{{{\textrm{t}}}}}}} {\textrm{H}} \) analysis with fully leptonic final states, a reinterpretation of the supersymmetry searches in the semileptonic and dileptonic t \(\overline{{{{\textrm{t}}}}}\) decay channels in Refs. [31, 32] in the context of the t \(\overline{{{{\textrm{t}}}}}\)  \(+\) DM model studied in Ref. [33] has been performed. Another leptonic channel included in this combination is from the \({{\textrm{Z}} (\ell \ell ){\textrm{H}}}\) analysis [34] using 2016–2018 data.

Analyses with hadronic final states partially overlap in their phase space selection, and this must be accounted for in the statistical combination. Those affected by overlap are the VBF analysis [36], the analysis targetting hadronic ggH and boosted VH final states [35], and the resolved VH channel described in this paper.

To remove the overlap between the VBF analysis and ggH/boosted VH analysis, events are considered for rejection in the ggH/boosted VH analysis if they have at least two AK4 jets each with \(|\eta | < 4.7\). Specifically, an inversion of the VBF kinematic selection is applied similarly to the \({{\textrm{t}} {}{\overline{{{{\textrm{t}}}}}}} {\textrm{H}} \) and resolved VH analysis as described in Sect. 5.2. These requirements mirror the selection used to enhance the characteristic VBF phase space in Ref. [36], with negligible effect on the sensitivity of the ggH/boosted VH analysis to \({\mathcal {B}({\textrm{H}} \rightarrow \text {inv})}\).

The overlap between the ggH/boosted VH analysis and the VH 2j0b category of this analysis is driven by the low-purity VH category of the boosted analysis. By removing events from the low-purity boosted VH category that contain exactly two AK4 jets forming a dijet candidate with \({65<m_{\text {jj}} <120\,\text {Ge}\hspace{-.08em}\text {V}}\), there is negligible reduction in the exclusion sensitivity of that analysis. The overlap meanwhile is reduced from 30-40% in the CR phase spaces to about 1%.

The uncertainties in the overall cross section for the signal processes are treated as correlated amongst analysis channels, and amongst data sets with the same centre-of-mass energy. The uncertainties related to missing higher-order corrections, as well as PDF variations, are obtained from Ref. [60]. In some of the channels, additional uncertainty contributions relating to signal acceptance modelling are considered. These are treated as uncorrelated amongst the different analysis channels.

The main sources of theoretical modelling uncertainties in the background estimate vary for the different analysis channels. The analyses preferentially select different phase space regions, and employ different assumptions for the modelling of theoretical uncertainties in transfer factors amongst different analysis regions. The resulting uncertainties are therefore treated as uncorrelated.

Significant correlations appear in the treatment of experimental uncertainties. The determination of the integrated luminosity estimate is affected by a number of sources of uncertainty, which are assumed to be correlated amongst all channels, and partially correlated amongst data sets. Some of the analysis channels share trigger requirements, and the uncertainties in the efficiencies of these common triggers are assumed to be correlated amongst channels and uncorrelated amongst data sets. Furthermore, analysis channels often share criteria used for identifying b-tagged jets, as well as the hadronic decay products of tau leptons. The uncertainties in the efficiencies of these identification criteria are assumed to be correlated amongst channels using the same criteria in the same data set. Finally, uncertainties in the calibration of the JER and JES are treated as correlated amongst this analysis, the VBF, and the ggH/boosted VH channels. All other experimental uncertainties are assumed to be uncorrelated amongst channels. For earlier analyses using Run 1 and 2015 data, the correlation scheme established in Ref. [30] is used.

Exclusion limits on \({\mathcal {B}({\textrm{H}} \rightarrow \text {inv})}\) are calculated assuming SM production cross sections. The 2016–2018 data yields an overall limit of 0.16 (0.09 expected). If the Run 1 and 2015 data-taking periods are included, values larger than 0.15 (0.08 expected) are excluded at 95% \(\text {CL}\). This value is dominated by the VBF channel, which yields a limit for \({\mathcal {B}({\textrm{H}} \rightarrow \text {inv})}\) of 0.18 (0.10 expected). The limits for Run 1 and Run 2, separated by the Higgs boson production mode as tagged by the input analyses, are presented in Fig. 7. The integrated luminosities of the Run 1 and Run 2 data sets [30, 33,34,35,36] are described in Table 8. The final combination represents an improvement in sensitivity of approximately 20% relative to the most sensitive single channel (VBF).

Maximum likelihood fits to the individual production channels are performed, as well as to the combination of all channels. The dependence of the profile negative log-likelihood functions on the signal strength parameter \(\hat{\mu }\) is shown in Fig. 7 (right). The best-fit values of \(\hat{\mu }\) for the individual production channels are compatible with one another and with the combined value of \({0.08^{+0.04}_{-0.04}}\), and the observed signal strength is compatible with the absence of a \({\textrm{H}} \rightarrow \text {inv}\) signal within two standard deviations. A breakdown of the best-fit values of \(\hat{\mu }\) for each channel are presented in Table 9. A saturated goodness-of-fit test is performed using the final combined likelihood function [93], yielding a probability of 12% that the S\(+\)B model is consistent with the observed results from the CMS experiment. Tabulated yields and fit results are provided in HEPData [94].

Table 9 The observed best-fit estimates of \({\mathcal {B}({\textrm{H}} \rightarrow \text {inv})}\), for each analysis channel in the combination, and the 95% \(\text {CL}\) observed and expected (exp) upper limits on \({\mathcal {B}({\textrm{H}} \rightarrow \text {inv})}\)
Fig. 7
figure 7

Top: exclusion limits at 95% \(\text {CL}\) on \({\mathcal {B}({\textrm{H}} \rightarrow \text {inv})}\). The results are shown separately for each Higgs boson production mode as tagged by the input analyses for Run 1 and Run 2, as well as combined across modes. Bottom: scan of the profile negative log-likelihood as a function of \({\mathcal {B}({\textrm{H}} \rightarrow \text {inv})}\) broken down by the Higgs boson production mode as tagged by the input analyses for Run 1 and Run 2

The upper limit on \({\mathcal {B}({\textrm{H}} \rightarrow \text {inv})}\) is interpreted in the context of a set of Higgs portal models of DM interactions, where a stable weakly interacting massive particle (WIMP), such as a singlet scalar, fermion, or vector, has a substantial coupling to a Higgs boson of mass 125\(\,\text {Ge}\hspace{-.08em}\text {V}\)  [19, 20]. The interaction of a WIMP with an atomic nucleus can occur via the exchange of a Higgs boson, and the resulting nuclear recoil is measured to obtain an upper bound on the spin-independent DM-nucleon scattering cross section, \(\sigma ^{\text {SI}}_{\text {{DM{-}nucleon}}}\). An effective field theory (EFT) approach is considered for scalar and fermionic WIMPs, while in the vectorial case two UV-complete DM models are considered, given the EFT appraoch violates unitarity [23, 95]. The vector-spin WIMP model (Vector DM\(^{\text {UV-comp}}\)) described in Ref. [20], and its radiative portal analogue (Vector DM\(^{\text {radiative}}_{m_{2}}\)) introduced in Ref. [23] for dark Higgs boson masses \(m_2 = 65\) and \(100\,\text {Ge}\hspace{-.08em}\text {V} \), and with a mixing angle between the SM and dark Higgs bosons \(\theta = 0.2\), are presented. The results are compared to direct-detection searches, where in these experiments it is assumed DM particles interact with atomic nuclei. Direct-detection limits are reported by the XENON1T-Migdal [96], DarkSide-50 [97], Panda-X 4T [98], and LUX-ZEPLIN [99] experiments. Upper limits on \(\sigma ^{\text {SI}}_{\text {{DM{-}nucleon}}}\) for DM masses ranging from \(0.1\,\text {Ge}\hspace{-.08em}\text {V} \) to \(m_{{\textrm{H}}}/2\) are presented in Fig. 8 at the 90% \(\text {CL}\) using the full CMS data set. The uncertainties in \(\sigma ^{\text {SI}}_{\text {{DM{-}nucleon}}}\) are obtained from the extrema of a coupling parametrisation factor as derived from lattice theory [19, 100, 101]. Results of the Higgs portal interpretation and direct-detection comparison are also provided in HEPData [94].

Fig. 8
figure 8

Upper limits on \(\sigma ^{\text {SI}}_{\text {{DM{-}nucleon}}}\) as a function of DM candidate mass \(m_{\text {DM}}\). Results are presented for a fermion (red) and scalar (yellow) DM candidate. In addition, a vector DM candidate is studied using two UV-complete approaches, the first denoted Vector DM\(^{\text {UV-complete}}\) [20] (burgundy), and the second a radiative portal version denoted Vector DM\(^{\text {radiative}}_{m_{2}}\) [23] (orange) with a dark Higgs boson mass of \(m_2 = 65\) and \(100\,\text {Ge}\hspace{-.08em}\text {V} \). Uncertainties are derived from Refs. [19, 100, 101]. Results are compared to direct-detection searches from XENON1T-Migdal [96], DarkSide-50 [97], PandaX-4T [98], and LUX-ZEPLIN [99]

The sensitivity of the Run 1 and Run 2 combination depends on the cross sections assumed for the different Higgs boson production modes: VBF, VH, ggH, and \({{\textrm{t}} {}{\overline{{{{\textrm{t}}}}}}} {\textrm{H}} \). Cross sections can be parameterised by the coupling strength of the Higgs boson to V bosons and fermions. The cross sections can be directly scaled by coupling strength modifiers \(\kappa _{\text {V}}\) and \(\kappa _{\text {F}}\) to investigate BSM scenarios [102]. In this context, the observed 95% \(\text {CL}\) upper limits on \({\mathcal {B}({\textrm{H}} \rightarrow \text {inv})}\) are evaluated as a function of \(\kappa _{\text {V}}\) and \(\kappa _{\text {F}}\) and shown in Fig. 9. Best estimates of \(\kappa _{\text {V}}\) and \(\kappa _{\text {F}}\) from CMS [11] are presented with the 68 and 95% \(\text {CL}\) contours. For the best estimate of \(\kappa _{\text {V}}\) and \(\kappa _{\text {F}}\) by CMS, the 95% \(\text {CL}\) limit on \({\mathcal {B}({\textrm{H}} \rightarrow \text {inv})}\) is found to be 0.15 and varies between 0.13 and 0.17 inside the 95% \(\text {CL}\) contour.

Fig. 9
figure 9

Observed 95% \(\text {CL}\) upper limit on \({\mathcal {B}({\textrm{H}} \rightarrow \text {inv})}\) as a function of coupling strength modifiers, \(\kappa _{\text {V}}\) and \(\kappa _{\text {F}}\), for a Higgs boson of mass 125\(\,\text {Ge}\hspace{-.08em}\text {V}\). Best estimates for \(\kappa _{\text {V}}\) and \(\kappa _{\text {F}}\) from Ref. [11] are shown as a black cross, together with 68 and 95% \(\text {CL}\) contours

10 Summary

The results of a search for invisible decays of the Higgs boson produced in association with a top-antitop quark pair (\({{\textrm{t}} {}{\overline{{{{\textrm{t}}}}}}} {\textrm{H}} \)) or a vector boson (VH, where V stands for either a W or Z boson), which decays to a fully hadronic final state, are presented. The analysis is based on proton-proton collision data collected at \(\sqrt{s}=13\,\text {Te}\hspace{-.08em}\text {V} \) during the 2016–2018 data-taking period by the CMS experiment at the LHC, corresponding to an integrated luminosity of 138\(\,\text {fb}^{-1}\). The \({{\textrm{t}} {}{\overline{{{{\textrm{t}}}}}}} {\textrm{H}} \) production mechanism is investigated using final states containing b jets, or boosted t quarks or W bosons. The VH production channel focuses on resolving a dijet pair with an invariant mass that is compatible with that of a W or Z boson. No significant excess of events is observed above the predicted SM background. A 95% confidence level upper limit of 0.54 (0.39 expected) is set on the branching fraction of the decay of the Higgs boson to an invisible final state, \({\mathcal {B}({\textrm{H}} \rightarrow \text {inv})}\), assuming SM production cross sections.

The results are combined with previous \({\mathcal {B}({\textrm{H}} \rightarrow \text {inv})}\) searches carried out at \(\sqrt{s}=7\), 8, and 13\(\,\text {Te}\hspace{-.08em}\text {V}\) in complementary production modes. The combined 95% confidence level upper limit on \({\mathcal {B}({\textrm{H}} \rightarrow \text {inv})}\) of 0.15 (0.08 expected) is obtained using Run 1 (2011–2012) and Run 2 (2015–2018) data. The combination represents an improvement in sensitivity of 20% relative to the most sensitive single channel. The results are interpreted in the context of a set of Higgs portal models of dark matter interactions for dark matter masses in the range \(0.1\,\text {Ge}\hspace{-.08em}\text {V} \) and \(m_{{\textrm{H}}}/2\). Model-dependent exclusion limits are found to complement direct-detection experiments for light mass dark matter candidates.