1 Introduction

Supersymmetric (SUSY) [1,2,3,4,5,6] extensions to the Standard Model (SM) have the potential to resolve the SM gauge hierarchy problem, explain the origin of dark matter [7, 8], and lead to a grand unified theory of nature [9,10,11,12]. Partner particles, called sparticles, differ from their SM counterparts by half a unit of spin. The scalar partners of quarks and leptons are squarks (\(\tilde{q}\)) and sleptons (\(\tilde{\ell }\)) respectively. Gluinos (\(\tilde{g}\)) are the fermionic partners of gluons. Charginos (\(\tilde{\chi }_i^\pm \)) and neutralinos (\(\tilde{\chi }_i^0\)) are the mass eigenstates of the mixing between the SUSY partners of the Higgs (higgsinos) and electroweak bosons, where i runs from the lightest to heaviest mass. If R-parity is conserved [13], then the lightest SUSY particle (LSP) is stable and a good candidate for dark matter.

This paper reports on searches for electroweak and strong production of sparticles in events with exactly two same-flavour (SF) opposite-sign (OS) electrons or muons, jets, and missing transverse momentum (\(\vec {p}_{\text {T}}^{\text {miss}}\), with magnitude \(E_{\text {T}}^{\text {miss}}\)). The full Run 2 dataset of 13  proton–proton collisions collected by the ATLAS detector [14] at the Large Hadron Collider (LHC) [15] is used, corresponding to an integrated luminosity of 139 fb\(^{-1}\). The same-flavour lepton final state is used in order to make use of the dilepton system’s invariant mass (\(m_{\ell \ell }\)) as a discriminant to search for events in models with leptonic Z boson decays or models where the \(m_{\ell \ell }\) distribution has a kinematic endpoint. Two searches targeting electroweak production of sparticles and one targeting strong production are considered. The first consists of two signal regions (SRs), using recursive-jigsaw reconstruction (RJR) variables [16] targeting electroweak production, which check whether previously observed excesses of 2.0\(\sigma \) and 1.4\(\sigma \) above the SM expectations in the 36 fb\(^{-1}\) 13  dataset collected during 2015–2016 [17] persist with more data. This is referred to as the RJR search, and only model-independent upper limits are presented for it. The same 36 fb\(^{-1}\) 13  analysis also included three-lepton regions which had 3.0\(\sigma \) and 2.1\(\sigma \) excesses above the Standard Model expectations, which were not observed with more data [18]. This previous search, and this update, used RJR variables designed to target electroweak production of SUSY particles. The second targets electroweak (EWK) production of chargino–neutralino pairs decaying to W and Z bosons along with two \(\tilde{\chi }_1^0\) neutralinos, and also includes a new search inspired by gauge-mediated SUSY breaking (GMSB) [19,20,21] targeting the pair production of higgsino next-to-lightest SUSY particles (NLSPs) decaying into a ZZ or Zh pair and gravitino LSPs. This is referred to as the EWK search, and it follows a methodology similar to that used in the two-lepton channel in a previous search [22] using the 36 fb\(^{-1}\) 13  dataset, but with optimizations for the full Run 2 dataset and a new region targeting off-shell Z boson decays. The third, the Strong search, targets the production of gluino or squark pairs that produce lepton pairs from \(\tilde{\chi }^0_2 \rightarrow \ell ^+\ell ^- \tilde{\chi }^0_1\) decays and follows a methodology similar to that in a previous search [23] based on 36 fb\(^{-1}\) of 13  data, also with optimizations for the full Run 2 dataset. These updated EWK and Strong searches benefit from a larger dataset and an optimization of the analysis, generally resulting in tighter selection requirements for signal-like events. The EWK search now includes additional binning of the SRs, further improving sensitivity to the considered signal models.

The EWK and Strong searches interpret the results with simplified models, described in the next section, by performing separate model-dependent profile likelihood fits [24] in their respective regions. All three searches also report upper limits on possible beyond-the-SM (BSM) event yields from model-independent fits to single-bin regions, where the BSM signal is assumed to only populate the SR.

The EWK search presented here extends the sensitivity to GMSB models with \(\tilde{\chi }_1^0\) masses in the 400–500  range, between the limits from ATLAS searches in a four-lepton final state [25] and an all-hadronic final state [26]. It also reaches higher \(\tilde{\chi }_1^0\) masses around chargino masses of 600  for the C1N2 model described in Table 1, between the limits from ATLAS searches in a three-lepton final state [18] and the same electroweak all-hadronic search. The Strong search has a gluino mass sensitivity similar to that of an ATLAS search in a single-lepton final state [27]. The zero-lepton ATLAS search [28] has sensitivity to gluinos a few hundred  heavier than in both of these searches, but the Strong search presented here has sensitivity to higher \(\tilde{\chi }_1^0\) masses. A strong all-hadronic search also has sensitivity to squarks a few hundred  heavier than in the Strong search. Unlike the EWK searches targeting the same model with different SM boson decays, the decay chains of the gluinos and squarks differ between the various strong searches. Thus each analysis complements each other by testing different assumptions of the SUSY particle spectra.

Similar searches have been performed by the CMS Collaboration with the full Run 2 dataset. For electroweak production, limits on chargino–neutralino production (C1N2 model in Table 1) were presented in Ref. [29] and on the GMSB model in Ref. [30], albeit with a different final state requiring same-charge leptons or more than two leptons. For strong production, limits on similar models but with different parameterizations were presented in Ref. [29].

This paper is organized with the common aspects of the searches preceding sections with the details specific to each search. Section 2 describes the SUSY signal models targeted in this paper. Section 3 describes the ATLAS detector. Section 4 describes the data and simulated samples used to guide the analysis strategy and estimate background and signal yields. Section 5 describes the event reconstruction and criteria used to identify the physics objects used in the searches. Section 6 describes the event selections that define the various search regions in each search. Section 7 describes the background estimation for each search. Section 8 describes the uncertainties. Sections 9 and 10 present the results and interpretations of the results, respectively. The conclusions are presented in Sect. 11.

2 SUSY signal models

Simplified models [31,32,33] inspired by SUSY are used to guide the search strategy and interpret the results. Two classes of models that contain production of weakly interacting or strongly interacting SUSY particles are used. Each model is scanned over a two-dimensional space, varying the masses or decay branching ratios of sparticles. Table 1 summarizes the simplified models considered for analysis. As mentioned in Sect. 1, electrons or muons are required to be in the final state, including those from leptonic decays of \(\tau \)-leptons. However, these will often fail SR requirements related to boson mass compatibility.

Table 1 Summary of the simplified signal model topologies used in this paper. Here x and y denote the xy plane across which the signal model masses, or branching fraction B, are varied to construct the signal grid. A dash (−) signifies the column is irrelevant for that model. For the gluino–slepton model, the masses of the superpartners of the left-handed leptons are given by \([m(\tilde{\chi }_2^0)+m(\tilde{\chi }_1^0)]/2\), while the superpartners of the right-handed leptons are decoupled. The gluino and squark decays have equal branching fractions for \(q=u,d,c,s,b\)
Fig. 1
figure 1

Example decay topologies for the (top row) two electroweak SUSY models and (bottom row) three strong SUSY models considered in the analysis. Model (a) shows the process \(\tilde{\chi }_{2}^{0} \tilde{\chi }_{1}^{\pm } \rightarrow W^{(*)}(jj)Z^{(*)}(\ell \ell )+E_{\text {T}}^{\text {miss}} \) and model (b) shows a gauge-mediated SUSY breaking model with a higgsino next-to-lightest SUSY particle and gravitino lightest SUSY particle. For the strong-production models, the left two decay topologies involve gluino pair production, with the gluinos following an effective three-body decay for \(\tilde{g} \rightarrow q \bar{q} \tilde{\chi }_2^0\), with \(\tilde{\chi }_{2}^{0} \rightarrow \tilde{\ell }^{\mp }\ell ^{\pm } / \tilde{\nu }\nu \) for the gluino–slepton model (c) and \(\tilde{\chi }_2^0\rightarrow Z^{(*)} \tilde{\chi }_1^0\) in the gluino–\(Z^{(*)}\) model (d). The diagram (e) illustrates the squark–\(Z^{(*)}\) model, where squarks are pair-produced, followed by the decay \(\tilde{q}\rightarrow q \tilde{\chi }_2^0\), with \(\tilde{\chi }_2^0\rightarrow Z^{(*)} \tilde{\chi }_1^0\)

Two models of electroweak sparticle production are considered. The first is the production of a chargino (\(\tilde{\chi }_1^\pm \)) and the second-lightest neutralino (\(\tilde{\chi }_2^0\)), henceforth the C1N2 model, which decay via a W boson and Z boson respectively into LSPs, \(\tilde{\chi }_1^0\). The \(\tilde{\chi }_1^\pm \) and \(\tilde{\chi }_2^0\) are assumed to have equal masses and always decay into a W and Z boson respectively. A diagram of this model is shown in Fig. 1a. The second is the pair production of higgsino neutralinos (\(\tilde{\chi }_1^0\)), which decay into a Higgs or Z boson and a nearly massless gravitino \((\tilde{G})\). This model is inspired by GMSB [19,20,21] and referred to as the GMSB model. A diagram of this model is shown in Fig. 1b. The branching ratio of the \(\tilde{\chi }_1^0\) to a Higgs boson, alternatively a Z boson, is varied from 0 to 100%.

Three models of strong sparticle pair production are considered. The choice of mass parameterization, namely setting intermediate particles halfway between the parent and child, enhances the topological differences between these simplified models and others with fewer intermediate particles in the decay chains [34] or very small mass differences between particles. Changing this assumption matters when interpreting the particle spectrum of a new signal, but does not affect the sensitivity of the analysis to generic kinematic endpoint features. In all three models, the gluino (or squark) and \(\tilde{\chi }_1^0\) masses are varied to produce a two-dimensional grid of signal models for interpretation. The gluino and squark decays have equal branching fractions for \(q=u,d,c,s,b\). The first strong-production model, shown in Fig. 1c, is referred to as the gluino–slepton model, and assumes that the sleptons are lighter than the \(\tilde{\chi }_2^0\). The gluino decays via a \(\tilde{\chi }_2^0\), which subsequently decays via a slepton or sneutrino \((\tilde{\nu })\) to the LSP (\(\tilde{\chi }_1^0\)). The two decay channels, \(\tilde{\chi }_{2}^{0} \rightarrow \tilde{\ell }^{\mp }\ell ^{\pm }\) with \(\tilde{\ell } \rightarrow \ell \tilde{\chi }_{1}^{0}\) or \(\tilde{\chi }_{2}^{0} \rightarrow \tilde{\nu }\nu \) with \(\tilde{\nu } \rightarrow \nu \tilde{\chi }_{1}^{0}\), have equal probability. Only the superpartners of the left-handed leptons are allowed in the decays and they are taken to be mass degenerate, with a mass equal to the average of the \(\tilde{\chi }_2^0\) and \(\tilde{\chi }_1^0\) masses. The superpartners of the right-handed leptons are decoupled. In this model, a kinematic endpoint in the dilepton invariant mass forms near half of the difference between the gluino and \(\tilde{\chi }_1^0\) masses, given by \(m_{\ell \ell } = \sqrt{ (m^2_{\tilde{\chi }_2^0}-m^2_{\tilde{\ell }})(m^2_{\tilde{\ell }}-m^2_{\tilde{\chi }_1^0}) / m^2_{\tilde{\ell }}}\) [35]. This endpoint serves as an unambiguous signature of new physics that probes a wide variety of signal models, which can produce excesses over the entire \(m_{\ell \ell }\) spectrum.

The second model, shown in Fig. 1d, is referred to as the gluino–\(Z^{(*)}\) model, where the \(\tilde{\chi }_2^0\) from the gluino decay then decays as \(\tilde{\chi }_{2}^{0} \rightarrow Z^{(*)}\tilde{\chi }_{1}^{0}\). The \(\tilde{\chi }_2^0\) mass is set to the average of the gluino and \(\tilde{\chi }_1^0\) masses.

The final model, shown in Fig. 1e, is referred to as the squark–\(Z^{(*)}\) model, where the squark decays through a \(\tilde{\chi }_2^0\) to a Z boson and the LSP (\(\tilde{\chi }_1^0\)). Similarly to the gluino–\(Z^{(*)}\) model, the \(\tilde{\chi }_2^0\) mass is set to the average of the \(\tilde{q}\) and \(\tilde{\chi }_1^0\) masses. Both of these models with decays to a Z boson result in a kinematic endpoint in the dilepton invariant mass if the mass-splitting between the \(\tilde{\chi }_2^0\) and \(\tilde{\chi }_1^0\) is smaller than the Z boson mass. If the mass-splitting is larger than the Z boson mass, the models result in a peak in \(m_{\ell \ell }\) at the Z boson mass. The superpartners of \(q=u,d,c,s,b\) are all set to the same mass, with the superpartner of the t-quark decoupled.

3 ATLAS detector

The ATLAS detector [14] is a multipurpose particle detector with a forward–backward symmetric cylindrical geometry and a near \(4\pi \) coverage in solid angle.Footnote 1 It consists of an inner tracking detector surrounded by a thin superconducting solenoid providing a 2T axial magnetic field, electromagnetic and hadronic calorimeters, and a muon spectrometer. The inner tracking detector covers the pseudorapidity range \(|\eta | < 2.5\). It consists of silicon pixel, silicon microstrip, and transition radiation tracking detectors. An additional layer of silicon pixels, the insertable B-layer [36, 37], was installed before Run 2. Lead/liquid-argon (LAr) sampling calorimeters provide electromagnetic (EM) energy measurements with high granularity. A steel/scintillator-tile hadron calorimeter covers the central pseudorapidity range (\(|\eta | < 1.7\)). The endcap and forward regions are instrumented with LAr calorimeters for both the EM and hadronic energy measurements up to \(|\eta | = 4.9\). The muon spectrometer surrounds the calorimeters and is based on three large superconducting air-core toroidal magnets with eight coils each. The field integral of the toroids ranges between 2.0 and 6.0  Tm across most of the detector. The muon spectrometer includes a system of precision chambers for tracking and fast detectors for triggering. A two-level trigger system is used to select events. The first-level trigger is implemented in hardware and uses a subset of the detector information to accept events at a rate below 100  kHz. This is followed by a software-based trigger that reduces the accepted event rate to 1  kHz on average depending on the data-taking conditions. An extensive software suite [38] is used in the reconstruction and analysis of real and simulated data, in detector operations, and in the trigger and data acquisition systems of the experiment.

4 Data and simulated samples

The LHC pp collision data used in this analysis were collected by the ATLAS detector during 2015–2018 at a centre-of-mass collision energy of 13 . After imposing requirements for beam and detector conditions and data quality [39], the dataset corresponds to an integrated luminosity of 139 fb\(^{-1}\). The uncertainty in the combined 2015–2018 integrated luminosity is 1.7% [40], obtained using the LUCID-2 detector [41] for the primary luminosity measurements.

Data events were collected using dilepton triggers with \(p_{\text {T}}\) thresholds of 7–26  varying with lepton flavour and data-taking period [42, 43]. In 2017 and 2018, the asymmetric electron–muon trigger had thresholds of 26 and 8  respectively, where 26  is above the lepton \(p_{\text {T}}\) requirement used in the analysis. The \(p_{\text {T}}\) range below the trigger threshold is covered by an asymmetric muon–electron trigger with thresholds of 24 and 7  respectively. The use of the electron–muon trigger compensates for trigger inefficiencies due to the first-level muon trigger as it is seeded only by the first-level electromagnetic trigger. The rest of the triggers used have thresholds of at most 24 . Typical efficiencies for the muon part of a dilepton trigger requiring a single muon, including a Level-1 accept, is between 75 and 85% depending on the muon \(p_{\text {T}}\) and \(\eta \). For electrons, the typical efficiency for a single part of a dilepton trigger is between 85 and 97% depending on the electron \(p_{\text {T}}\) and \(\eta \).

Simulated event samples are used to help estimate the SM backgrounds, validate the analysis techniques, optimize the event selection, and provide predictions of the SUSY signal processes. The majority of the SM process samples were generated with [44] or with  [45,46,47] and [48] for the simulation of the parton shower (PS), hadronization, and underlying event. The details of the matrix element (ME) generator, PS and parameter values (tune), parton distribution function (PDF) choice, and cross-section for the SM processes are listed in Table 2. The \(t\bar{t}\) and Wt processes are referred to as ‘Top’ events. The VV processes, where \(V=W\) or Z, are referred to as ‘Diboson’ events. The remaining smaller backgrounds, except \(Z/\gamma ^*\text {\,+\,jets}\), are referred to as ‘Other’ events. The Other events category also includes the fake and non-prompt (FNP) lepton background estimated from data as described in Sect. 7.

Table 2 Simulated background event samples used in this analysis with the corresponding matrix element and parton shower generators, cross-section order in perturbative QCD (pQCD) used to normalize the event yield, underlying-event tune and PDF set. V represents either a W or Z boson

The signal samples were generated using MG5_aMC@NLO 2.2.3 [71] interfaced to with the A14 tune for the modelling of the PS, hadronization and underlying event. The ME calculation was performed at tree level and includes the emission of up to two additional partons. The PDF set used for event generation was NNPDF2.3lo. The ME–PS matching used the CKKW-L prescription, with a matching scale set to a quarter of the mass of the initial SUSY particle.

For the C1N2 and GMSB signal models, cross-sections are calculated to next-to-leading order in the strong coupling constant, adding the resummation of soft gluon emission at next-to-leading-logarithm accuracy (NLO+NLL) [78,79,80,81,82]. The nominal cross-section and its uncertainty are taken from an envelope of cross-section predictions using different PDF sets and factorization and renormalization scales, as described in Ref. [83].

For the gluino and squark signal models, cross-sections are calculated to approximate next-to-next-to-leading order in the strong coupling constant, adding the resummation of soft gluon emission at next-to-next-to-leading-logarithm accuracy (approximate NNLO+NNLL) [84,85,86,87,88,89,90,91]. The nominal cross-section and its uncertainty are derived using the PDF4LHC15\(\_{\textrm{mc}}\) PDF set, following the recommendations of Ref. [92].

The SM background Monte Carlo (MC) samples were passed through a full simulation of the ATLAS detector [93] using Geant4 [94]. A fast simulation [93] was used for the signal MC samples; it relies on a parameterization of the lateral and longitudinal shower shapes of single particles in the calorimeters, and on Geant4 elsewhere. The effect of multiple pp interactions per bunch crossing (pile-up) and detector-response effects due to interactions in neighbouring bunch crossings were included by overlaying the simulated hard-scattering events with additional inelastic pp collision events generated with .

5 Analysis object identification and selection

Leptons and jets selected for analysis are categorized as ‘baseline’ or ‘signal’ objects by using various quality and kinematic requirements. Baseline objects are used in the computation of the missing transverse momentum (\(\vec {p}_{\text {T}}^{\text {miss}}\)) and its magnitude (\(E_{\text {T}}^{\text {miss}}\)), to resolve ambiguities between closely spaced analysis objects, and to ensure orthogonality to other analyses. The signal objects used in the analysis selection are required to pass more stringent requirements.

Electron candidates are reconstructed using energy clusters in the electromagnetic calorimeter matched to inner-detector tracks. Baseline electrons are required to have \(p_{\text {T}} >4.5\) , satisfy the ‘loose likelihood’ criteria described in Ref. [95], and reside within the region \(|\eta |<2.47\). Additionally, the baseline-electron tracks must pass within \(|z_0\sin \theta | = 0.5\) mm of the primary vertex, defined as the vertex with the largest sum of track \(p_{\text {T}}^{2}\), where \(z_0\) is the longitudinal impact parameter with respect to the primary vertex. Signal electrons are required to satisfy the ‘medium likelihood’ criteria of Ref. [95] and to have \(p_{\text {T}} >25\) . The transverse-plane distance of closest approach of the signal electron to the beamline, divided by the corresponding uncertainty, must satisfy \(|d_0/\sigma _{d_0}|<5\). These electrons must also be isolated from other objects in the event according to a \(p_{\text {T}}\)-dependent isolation requirement based on calorimeter (\(\text {E}_\text {T}^{\text {cone20}}/p_{\text {T}} <0.06\)) and tracking information (\(p_{\text {T}} ^{\text {varcone20}}/p_{\text {T}} < 0.06\)). In RJR search regions with more than two leptons (only used for validation regions (VRs) and control regions (CRs)), the electron \(p_{\text {T}} \) requirement is lowered to 20  for the third and fourth electrons. Measured in \(Z\rightarrow ee\) events, the ‘medium’ electron identification efficiency increases from 75% at a \(p_{\text {T}}\) of 20  to 91% at 100 . The isolation requirement is approximately 70% efficient for ‘medium likelihood’ electrons at a \(p_{\text {T}}\) of 20 , rising to above 98% at 60  and beyond [95].

Baseline muons are reconstructed from either inner-detector tracks matched to muon track segments in the muon spectrometer or combined tracks formed in the inner detector and muon spectrometer. They are required to satisfy the ‘medium’ selection criteria described in Ref. [96], \(p_{\text {T}}>3\) , and \(|\eta |<2.7\). For consistency with the previous analysis, the RJR search limits the muon acceptance to \(|\eta |<2.4\). Similarly to electrons, the muon must pass within \(|z_0\sin \theta | = 0.5\) mm of the primary vertex. Signal muons are required to be isolated with \(\text {E}_\text {T}^{\text {topocone20}}/p_{\text {T}} <0.15\) and \(p_{\text {T}} ^{\text {varcone30}}/p_{\text {T}} < 0.04\), and they must have \(|d_0/\sigma _{d_0}|<3\), \(|\eta |<2.6\), and \(p_{\text {T}} >25\) . In RJR search regions with more than two leptons (only used for VRs and CRs), the muon \(p_{\text {T}} \) requirement is lowered to 20  for the third and fourth muon. Measured in \(Z\rightarrow \mu \mu \) events, the ‘medium’ muon identification efficiency exceeds 98% in regions with muon spectrometer coverage [96]. The isolation requirement efficiency increases from approximately 85% at a \(p_{\text {T}}\) of 20  to more than 99% at 60  and above. Proximity to a jet, which can occur in the Strong search, lowers the muon isolation efficiency by up to 10%.

Jets are reconstructed from topological clusters of energy [97] in the calorimeter using the anti-\(k_{t}\) algorithm [98, 99] with a radius parameter of 0.4 by making use of utilities within the FastJet package [100]. The reconstructed jets are then calibrated to parton level by the application of a jet energy scale (JES) derived from 13  data and simulation [101]. A residual correction applied to jets in data is based on studies of the \(p_{\text {T}} \) balance between jets and well-calibrated objects in the MC simulation and data. Baseline jets are defined as jet candidates that have \(p_{\text {T}} >20\)  and reside within the region \(|\eta |<2.8\). Additional track-based criteria designed to select jets from the hard scatter and reject those originating from pile-up are applied to jets with \(p_{\text {T}} <120\)  and \(|\eta |<2.5\). These are imposed by using the ‘medium’ working point of the jet vertex tagger (JVT) described in Refs. [102, 103]. For jets with in the region \(|\eta |>2.5\), outside of the inner-detector acceptance, the tighter working point of forward JVT, \(\textrm{fJVT}<0.4\) as defined in Ref. [104], uses shape and topological information to suppress jets originating from pile-up. Signal jets are further required to have \(p_{\text {T}} >30\) . To be consistent with the previous analysis, the RJR search limits the jet acceptance to \(|\eta |<2.4\). Finally, events containing a baseline jet that does not pass jet quality requirements are vetoed in order to remove events impacted by detector noise and non-collision backgrounds [105, 106].

The MV2C10 boosted decision tree algorithm [107] identifies jets containing b-hadrons by using quantities such as the impact parameters of associated tracks and the positions of any good reconstructed secondary vertices. A selection that provides 77% efficiency for tagging jets from b-quarks in simulated \(t\bar{t}\) events is used. These tagged jets are called b-jets. The corresponding rejection factors against jets originating from c-quarks and light quarks in the same sample for this selection are 5 and 110 respectively [107, 108].

Photon candidates, used only in the computation of \(\vec {p}_{\text {T}}^{\text {miss}}\), are reconstructed using energy clusters in the electromagnetic calorimeter without matching inner-detector tracks, or with tracks consistent with having originated from a photon conversion vertex. Photons are required to satisfy the ‘tight’ selection criteria described in Ref. [95], have \(p_{\text {T}} >25~\), and reside within the region \(|\eta |< 2.37\), excluding the transition region (\(1.37<|\eta |<1.52\)) between the barrel and endcaps of the electromagnetic calorimeter.

To avoid the duplication of analysis objects and resolve ambiguities, an overlap removal procedure is applied to baseline jets and signal leptons in the following order. Electron candidates that share an inner-detector track with a baseline muon are rejected to remove electrons originating from photons radiated from muons. Any baseline jet within \(\Delta R=0.2\) of a baseline electron is removed. Any electron that lies within of a baseline jet is removed in order to suppress electrons from heavy-flavour decays. If a baseline muon either resides within \(\Delta R=0.2\) of, or has a track associated with, a remaining baseline jet, that jet is removed. Muons are removed in favour of baseline jets with the same \(p_{\text {T}}\)-dependent \(\Delta R\) requirement as electrons. For the purpose of computing \(\vec {p}_{\text {T}}^{\text {miss}}\), photon candidates sharing their calorimeter cluster with an electron are removed.

The \(\vec {p}_{\text {T}}^{\text {miss}}\) is defined as the negative vector sum of the transverse momenta of all baseline electrons, muons, jets, and photons [109]. Low-momentum contributions from particle tracks originating from the primary vertex that are not associated with reconstructed analysis objects are also included in the calculation of \(\vec {p}_{\text {T}}^{\text {miss}}\).

All MC samples have corrections applied to take into account small differences between data and MC simulation in the identification, reconstruction and trigger efficiencies. The \(p_{\text {T}} \) values of leptons in MC samples are smeared to match the momentum resolution in data.

6 Event selection

The searches are carried out in SRs designed to be sensitive to heavy new particles inspired by SUSY. Auxiliary measurements are performed in CRs, which are orthogonal to their associated SRs, designed to be enriched in a particular background process and have low contamination from signal processes. To validate the background estimation in the SRs, VRs are defined to be similar, but orthogonal, to the SRs and CRs. The VRs typically have signal contaminations of less than 10% near the exclusion limits, but in a few regions can be larger, close to 20%.

All three searches share some common event requirements. Events entering the SRs must have exactly two OS SF signal leptons (electrons or muons) with \(p_{\text {T}} >25\) , without any additional baseline leptons. This requirement rejects events with a third, low-momentum lepton, such as from WZ production. This also means that for signal models which may result in more than two leptons, those decay modes are suppressed in favour of hadronic decays of bosons or additional neutrinos in the final state. The majority of the search regions require events to have at least two jets with \(p_{\text {T}} >30\) . Analysis objects are ordered by decreasing \(p_{\text {T}}\) for further selections. Sections 6.1 and 6.2 describe variables used to construct the regions in the searches. The RJR, EWK, and Strong search-specific selections are described in Sects. 6.3, 6.4, and 6.5 respectively. The regions defined in the searches, in the same order, have a suffix -RJR, -EWK, or -STR. Except in the case of the RJR search, which replicates the selection of the previous search, the event selections are optimized for the different final states depending on the type of sparticles and sparticle masses in the targeted models.

6.1 Additional event selection variables

Signal models with large hadronic activity are targeted by placing additional requirements on the quantity \(H_{\text {T}}\), defined as the scalar sum of the \(p_{\text {T}} \) values of all signal jets. For the purpose of rejecting \(t\bar{t}\) background events, the \(m_{\textrm{T2}}\) [110, 111] variable is used, defined as an extension of the transverse mass \(m_{\text {T}}\) for the case of two missing particles:

$$\begin{aligned}{} & {} m_{\text {T}}^2\left( \,\vec {p}_{\text {T},a},\vec {p}_{\text {T}}^{\text {miss}}\right) = 2 \times \left( p_{\text {T},a} \times E_{\text {T}}^{\text {miss}}- \vec {p}_{\text {T} a} \cdot \vec {p}_{\text {T}}^{\text {miss}} \right) , \end{aligned}$$
(1)
$$\begin{aligned}{} & {} m_{\textrm{T2}}^2 = \min _{\vec {x}_{\text {T,1}}+\vec {x}_{\text {T,2}}=\vec {p}_{\text {T}}^{\text {miss}}} \left[ \max \{ m_{\text {T}}^2\left( \,\vec {p}_{\text {T} 1},\vec {x}_{\text {T,1}}\right) , m_{\text {T}}^2\left( \,\vec {p}_{\text {T} 2},\vec {x}_{\text {T,2}}\right) \} \right] , \nonumber \\ \end{aligned}$$
(2)

where \(\vec {p}_{\text {T}, a}\) is the transverse-momentum vector of the highest-\(p_{\text {T}}\) (\(a=1\)) or second-highest-\(p_{\text {T}}\) (\(a=2\)) lepton, and \(\vec {x}_{\text {T,b}}\) (\(b=1,2\)) are two vectors representing the possible momenta of the invisible particles that minimize the \(m_{\textrm{T2}}\) in the event. The masses of the invisible particles are set to zero in the calculation. The \(m_{\textrm{T2}}\) variable tends to have an endpoint near the mass of the parent particle, e.g. the W boson mass for \(t\bar{t}\) events when considering the leptons and \(E_{\text {T}}^{\text {miss}}\). Thus it can be used to separate signal models with larger mass-splittings from typical \(t\bar{t}\) events.

The \(E_{\text {T}}^{\text {miss}}\) significance \(\mathcal {S}(E_{\text {T}}^{\text {miss}})\) quantifies how consistent the \(E_{\text {T}}^{\text {miss}}\) is with mismeasurements of objects in events without any genuine source of \(E_{\text {T}}^{\text {miss}}\). It is used to select events where it is more likely that invisible particles are contributing to the \(E_{\text {T}}^{\text {miss}}\). The \(E_{\text {T}}^{\text {miss}}\) significance is defined [112] as

$$\begin{aligned} \mathcal {S}^2 = \frac{|E_{\text {T}}^{\text {miss}} |^2}{\sigma _\text {L}^2 (1-\rho _\text {LT}^2)}, \end{aligned}$$

where \(\sigma _\text {L}^2\) is the total variance in the longitudinal direction along \(\vec {p}_{\text {T}}^{\text {miss}}\), with the resolutions of all of the objects taken into account, and \(\rho _\text {LT}\) is the correlation between the longitudinal and transverse resolutions of the objects.

6.2 Recursive-jigsaw reconstruction

The RJR technique [16] is a method for decomposing measured properties event by event to provide a basis of kinematic variables. This is achieved by approximating the rest frames of intermediate particle states in each event. This reconstructed view of the event gives rise to a natural basis of kinematic observables, calculated by evaluating the momentum and energy of different objects in these rest frames. Backgrounds are reduced by testing whether each event exhibits the anticipated properties of the imposed decay tree under investigation while only applying minimal selection criteria to visible-object momenta and missing momenta. The RJR technique is described in detail in Refs. [16, 113, 114], and has been used in previous ATLAS searches [115,116,117].

Fig. 2
figure 2

a The standard decay tree applied to pair-produced (PP) sparticles (parent objects), P\(_{\text {a,b}}\), decaying to visible states V\(_{\text {a,b}}\) and invisible states I\(_{\text {a,b}}\). b Decay tree for the C1N2 model with decays to the \(2\ell \) + 2-jets final state. c The decay tree for compressed scenarios with an initial-state radiation jet (ISR). The signal sparticle system S recoils against the jet-radiation system ISR. CM centre-of-mass frame

Electrons, muons, jets, and \(\vec {p}_{\text {T}}^{\text {miss}}\) are used as input to the RJR algorithm. A decay tree for the \(2\ell \) + 2j final state in this search, shown in Fig. 2b, is constructed following the canonical process shown in Fig. 2a. These decay trees are motivated by the pair production of sparticles in R-parity-conserving models. Each event is evaluated as if two sparticles (labelled PP) were produced, assigned to two hemispheres (P\(_{\text {a}}\) and P\(_{\text {b}}\)), and subsequently decay into the particles observed in the detector, with V denoting the visible objects and I the invisible objects. The benchmark signal models probed in this search give rise to signal events with at least two weakly interacting particles associated with two systems of invisible particles (shown in green). Both of the leptons must be assigned to either V\(_{a}\) or V\(_{b}\), not split across both. The jets must also be assigned to either of the two visible frames.

After partitioning the visible objects, the remaining unmeasured quantities related to the invisible particles (masses, longitudinal momenta, and contribution to \(\vec {p}_{\text {T}}^{\text {miss}}\)) are associated with the two collections of invisible particles \(\text {I}_{\text {a/b}}\). The RJR algorithm determines values for the unmeasured particles’ four-momenta, while requiring the masses of the invisible particles to be positive. In cases with multiple solutions, the solution with the smallest invariant mass of the visible system is chosen [16]. Once all measured and unmeasured momenta are defined, a set of variables can be constructed, such as multi-object invariant masses and angles between objects. The primary energy-scale-sensitive observables used in the search presented here are a suite of variables denoted by H. As shown in Eq. (3), the H variables are constructed using different combinations of object momenta, including contributions from the invisible particles’ four-momenta, and are not necessarily evaluated in the lab frame, nor only in the transverse plane. The H variables are labelled with a superscript F and two subscripts n and m:

$$\begin{aligned} H_{n,m}^{\text {F}} = \sum _{i=1}^{n} |\vec {p}_{\text {vis},\ i}^{\text {~F}}| + \sum _{j=1}^{m} |\vec {p}_{\text {inv},\ j}^{\text {~F}}|. \end{aligned}$$
(3)

The F represents the rest frame in which the momenta are evaluated. In this analysis, this may be the lab frame, the proxy for the sparticle–sparticle frame PP, or the proxy for the rest frame of an individual sparticle, P. The subscripts n and m represent the number of visible and invisible momentum vectors considered, respectively. For events with fewer than n visible objects, the sum only runs over the available momenta. Only the leading \(n-n_{\ell }\) jets are considered, where \(n_{\ell }\) is the number of reconstructed leptons in the event. An additional subscript ‘T’ denotes a transverse version of the variable, where the transverse plane is defined in a frame F as follows: the Lorentz transformation relating F to the lab frame is decomposed into a boost along the beam axis, followed by a subsequent transverse boost. The transverse plane is defined to be perpendicular to the longitudinal boost. In this analysis, it is the plane transverse to the beamline.

The following variables are used in the definition of the SRs. The value of n depends on the number of visible objects; for the \(2\ell \) + 2-jets final state, \(n=4\).

  • \(H_{4,1}^{~\text {PP}}\) is a scale variable as described above that behaves similarly to the effective mass, \(m_{\textrm{eff}}\) (defined as the scalar sum of the transverse momenta of the visible objects and \(E_{\text {T}}^{\text {miss}} \)), used in previous ATLAS SUSY searches.

  • \(H_{1,1}^{~\text {PP}}/H_{4,1}^{~\text {PP}}\) provides additional information when testing the balance between the two scale variables. This provides excellent discrimination against unbalanced events where the large scale is dominated by a particular object’s \(p_{\text {T}}\) or by large \(E_{\text {T}}^{\text {miss}} \). It behaves similarly to \(E_{\text {T}}^{\text {miss}}/m_{\textrm{eff}}\) and is used to reduce the \(Z/\gamma ^*\text {\,+\,jets}\) background in cases where one high-\(p_{\text {T}}\) jet dominates.

  • \(p_{\text {T}~\text {PP}}^{\text {lab}}/(p^{\text {lab}}_{\text {T}~\text {PP}} + H_{\text {T}~4,1}^{~\text {PP}})\) compares the magnitude of the vector sum of the transverse momenta of all objects assigned to the PP system in the lab frame (\(p^{\text {lab}}_{\text {T}~\text {PP}}\)) with the overall transverse scale variable considered. This quantity tests for a significant boost in the transverse direction. For signal events this quantity peaks sharply towards zero while for background processes the distribution is broader. It is a test of how much a given process resembles the imposed PP system in the decay tree.

  • \(\text {min}(H^{\text {P}_{\text {a}}}_{1,1},H^{\text {P}_{\text {b}}}_{1,1})/\text {min}(H^{\text {P}_{\text {a}}}_{2,1},H^{\text {P}_{\text {b}}}_{2,1})\) compares the scale due to one visible object and \(E_{\text {T}}^{\text {miss}}\) (\(H^{\text {P}_{\text {a}}}_{1,1}\) and \(H^{\text {P}_{\text {b}}}_{1,1}\) in their respective production frames) with the scale due to two visible objects (\(H^{\text {P}_{\text {a}}}_{2,1}\) and \(H^{\text {P}_{\text {b}}}_{2,1}\)). The numerator and denominator are each defined by finding the smaller of the values of these quantities. This variable tests against a single object taking a large portion of the hemisphere momentum. It is particularly useful in discriminating against \(Z/\gamma ^*\text {\,+\,jets}\) events.

  • \(\Delta \phi _{\text {V}}^{\text {P}}\) is the azimuthal opening angle between the visible system V in frame P and the direction of the boost from the PP frame to the P frame. Standard Model backgrounds from diboson, top and \(Z/\gamma ^*\text {\,+\,jets}\) processes peak towards zero and \(\pi \) due to their topologies not obeying the imposed decay tree while signal events tend to have a flat distribution in this variable.

For selections involving three charged leptons, the W boson transverse mass, \(m_{\text {T}}^{W}\), is used. It is derived as in Eq. (2) from \(\vec {p}_{\text {T}}^{\text {miss}}\) and the transverse momentum of the charged lepton not associated with the Z boson. The three-lepton regions are only used for CRs and VRs.

For sparticle spectra with smaller mass-splittings and lower intrinsic \(E_{\text {T}}^{\text {miss}}\), it can be useful to define a decay tree with a partially resolved sparticle system recoiling against a high-\(p_{\text {T}}\) initial-state radiation (ISR) jet, instead of the fully resolved decay tree described above. This is shown Fig. 2c. This tree is simpler and attempts to identify visible (V) and invisible (I) systems that are the result of an intermediate state corresponding to the system of sparticles and their decay products (S). Since the \(E_{\text {T}}^{\text {miss}}\) is used to identify which jets come from ISR, a transverse view of the reconstructed event is used which ignores the longitudinal momentum of the jets and leptons, as described in Ref. [113]. The reference frames appearing in the decay tree shown in Fig. 2c, such as the centre-of-mass (CM) frame of the whole interaction, are approximations in this transverse projection. The variables which are derived from this process leverage the relationship between the ISR and the total S system:

  • \(p_{\mathrm {T\ ISR}}^{~\text {CM}}\) is the magnitude of the vector-summed transverse momenta of all jets assigned to the ISR system.

  • \(p_{\mathrm {T\ I}}^{~\text {CM}}\) is the magnitude of the vector-summed transverse momenta of the invisible system, which behaves similarly to \(E_{\text {T}}^{\text {miss}} \).

  • \(p_{\textrm{T}}^{~\text {CM}}\) is the magnitude of the vector-summed transverse momenta of the CM system.

  • \(R_{\text {ISR}} \equiv \vec {p}_{\text {I}}^{~\text {CM}}\cdot \hat{p}_{\textrm{T}~\text {S}}^{~\text {CM}}/p_{\textrm{T}~\text {S}}^{~\text {CM}}\) serves as an estimate of \(m_{\tilde{\chi }^{0}_{1}}/m_{\tilde{\chi }^{0}_{2}/\tilde{\chi }^{\pm }_{1}}\). This corresponds to the fraction of the momentum of the S system that is carried by its invisible system I, with momentum \(\vec {p}_{\text {I}}^{~\text {CM}}\) in the CM frame. As \(p_{\textrm{T}~\text {S}}^{~\text {CM}}\) grows it becomes increasingly hard for backgrounds to possess a large value in this ratio – a feature exhibited by compressed signals [113].

  • \(N_{\text {jet}}^{\text {S}}\) is the number of jets assigned to the signal system S.

  • \(N_{\text {jet}}^{~\text {ISR}}\) is the number of jets assigned to the ISR system.

  • \(\Delta \phi _{~\text {ISR},\text {I}}^{~\text {CM}}\) is the azimuthal opening angle between the ISR system and the invisible system in the CM frame.

  • \(m_\text {T}^Z\) is the transverse mass of the dilepton pair assigned to the signal system.

  • \(m_\text {T}^J\) is the transverse mass of the jet system assigned to the signal system.

6.3 Recursive-jigsaw reconstruction search selections

The RJR search is a follow-up to the \(36.1~\textrm{fb}^{-1}\) search [17] which observed 1.4\(\sigma \) and 2.0\(\sigma \) excesses in regions named SR\(2\ell \)-Low-RJR and SR\(2\ell \)-ISR-RJR respectively. Thus the selection is kept the same and not optimized for the full Run 2 dataset. These SRs were designed to target C1N2 models with \(\tilde{\chi }_1^\pm \)\(\tilde{\chi }_1^0\) mass-splittings of approximately . The selections begin with the common selections defined in Sect. 6, and include requirements on the lepton multiplicity (\(n_{\textrm{leptons}}\)), the jet multiplicity (\(n_{\textrm{jets}}\)), the b-tagged jet multiplicity (\(n_{b\text {-tag}}\)), the transverse momenta of the leading \((p^{\ell _{1}}_\textrm{T},p^{j_{1}}_\textrm{T})\) and subleading \((p^{\ell _{2}}_\textrm{T},p^{j_{2}}_\textrm{T})\) leptons and jets, as well as the invariant masses of the dilepton \(m_{\ell \ell }\) and dijet \(m_{jj}\) systems. The selections used to define SR\(2\ell \)-Low-RJR, and the CRs and VRs associated with it, are summarized in Tables 3 and 4. Selections which enforce orthogonality between the CRs and VRs are in boldface. CRs are defined to extract data-driven normalization factors for the main background processes: diboson and \(t\bar{t}\) + Wt. Details of the background estimation are described in Sect. 7.1.

Most of the regions are defined to have exactly two OS SF leptons with transverse momentum greater than , and a dilepton invariant mass consistent with originating from a Z boson. The diboson CR (CR2\(\ell \)-VV-RJR) is an exception and requires three or four leptons, which ensures orthogonality to the SRs while also enriching the sample with diboson events. The OS SF dilepton pair with an invariant mass closest to the Z boson mass is chosen as the Z boson candidate. For the purpose of RJR calculations, the third and fourth leptons are treated as invisible objects contributing to \(\vec {p}_{\text {T}}^{\text {miss}}\). An additional requirement on \(m_{\text {T}}^{W}\) is applied, which ensures orthogonality to the \(3\ell \) searches using the \(36.1~\textrm{fb}^{-1}\) dataset. Both the top CR (CR\(2\ell \)-\(\textrm{Top}\)-RJR) and VR (VR\(2\ell \)-\(\textrm{Top}\)-RJR) require a b-tagged jet. Inverting the \(m_{\ell \ell }\) requirement makes the VR orthogonal to the CR. The SRs require \(m_{jj}\) to be consistent with a W boson, whereas VR\(2\ell \)-VV-RJR selects events outside the W boson mass window. The min \(\Delta \phi (\varvec{j}_{1,2},\vec {p}_{\text {T}}^{\text {miss}})\) variable corresponds to the azimuthal angle between the jets and \(\vec {p}_{\text {T}}^{\text {miss}}\) and is applied to suppress \(Z/\gamma ^*\text {\,+\,jets}\) contributions to SR\(2\ell \)-Low-RJR. VR\(2\ell \)-VV is the only region with an \(H_{1,1}^{\textrm{PP}}\) requirement, and it suppresses the \(Z/\gamma ^*\text {\,+\,jets}\) contribution.

SR\(2\ell \)-ISR-RJR requires three or four jets, which makes it orthogonal to SR\(2\ell \)-Low-RJR. All CRs and VRs for SR\(2\ell \)-ISR-RJR require at least one jet to be assigned to the ISR system (\(N_{J}^{\textrm{ISR}}\)), and at least two to the signal system (\(N_{J}^{\textrm{S}}\)). The assignment of the jets is determined by the configuration that minimizes the mass of both S and ISR systems. Both CR\(2\ell \)-ISR-VV-RJR and VR\(2\ell \)-ISR-VV-RJR require three or four leptons. To increase the number of events in VR\(2\ell \)-ISR-VV-RJR, the transverse momentum requirement for jets is relaxed to 20 , compared to 30  in other regions. The ISR regions are further defined by a series of requirements based on the variables from the ISR decay tree, described in Sect. 6.2. These requirements are listed in Tables 5 and 6. Selections that enforce orthogonality to the SR are in boldface. SR\(2\ell \)-ISR-RJR requires a highly energetic ISR jet system which recoils against the signal system in the CM frame. In VR\(2\ell \)-ISR-VV-RJR the \(m_\text {T}^Z\) requirement is inverted in order to be orthogonal to CR\(2\ell \)-ISR-VV-RJR. The top CR (CR\(2\ell \)-ISR-Top-RJR) and VR (VR\(2\ell \)-ISR-Top-RJR) both require a b-tagged jet and have broader \(m_\text {T}^Z\) and \(m_\text {T}^J\) requirements. These regions are orthogonal due to the inversion of the \(p_\textrm{T}^{~\textrm{CM}}\) requirement.

Table 3 Preselection criteria for the standard-decay-tree \(2\ell \) SR (SR\(2\ell \)-Low-RJR) and the associated CRs and VRs. Selections which enforce orthogonality between the CRs and VRs are in boldface
Table 4 Selection criteria for the standard-decay-tree \(2\ell \) SR (SR\(2\ell \)-Low-RJR) and the associated CRs and VRs
Table 5 Preselection criteria for the ISR-decay-tree \(2\ell \) SR (SR\(2\ell \)-ISR-RJR) and the associated CRs and VRs. Selections which enforce orthogonality between the CRs and VRs are in boldface
Table 6 Selection criteria for the ISR-decay-tree \(2\ell \) SR (SR\(2\ell \)-ISR-RJR) and the associated CRs and VRs. Selections which enforce orthogonality between the CRs and VRs are in boldface

6.4 Electroweak search selections

The EWK search uses 13 orthogonal SRs designed to cover different regions of the C1N2 and GMSB models’ parameter spaces. In addition to the use of new kinematic variables, the strategy from the 36 fb\(^{-1}\) search [22] is extended by optimizing binned SRs to maximize the model-dependent search sensitivity. The SRs labelled -OffShell, -Low, -Int, and -High target increasing NLSP–LSP mass-splittings. The SR labelled -\(\ell \ell bb\) targets the GMSB model with either a Higgs or Z boson decaying into two b-quarks. The selections defining each region in the search are summarized in Tables 7, 8 and 9, along with the control and validation regions used for the estimation and validation of the SM backgrounds. The acceptance times efficiency for several example signal models are listed in Table 10.

Lepton pair and jet pair mass windows are used to select events with a ZW, Zh, or ZZ topology. In all regions, except for the OffShell regions, the dilepton invariant mass is required to be on the Z boson mass peak, and the mass of the jet system is required to be consistent with a hadronically decaying boson. Most regions require the jet system mass to be around the W or Z boson mass, \(60<m_X<110\) , where \(m_X\) refers to the dijet system for all regions except for SR-1J-High-EWK, where it is the single-jet mass. The \(\ell \ell bb\) region expands this window to \(60<m_{bb}<150\)  in order to additionally account for Higgs boson decays in the GMSB model.

When performing model-dependent fits, SR-High-EWK uses a two-dimensional binning in \(\mathcal {S}(E_{\text {T}}^{\text {miss}})\) with boundaries \((18,21,\infty )\) and \(\Delta R_{jj}\) with boundaries [0, 0.8, 1.6), labelled as \(\Delta R_{X}\) in Tables 7 and 8. The variable \(\Delta R_{X}\) is sensitive to signal events where the leptons or jets are expected to be closer together due to the boost of the decay system, which is increased by the large mass splittings targeted by these regions. SR-Int-EWK and SR-Low-EWK are binned only in \(\mathcal {S}(E_{\text {T}}^{\text {miss}})\), with boundaries (12, 15, 18) and (6, 9, 12) respectively. SR-OffShell-EWK is split into ranges of 12–40 and 40–71  in \(m_{\ell \ell }\). The SR binning was optimized by checking the performance of a few variables, e.g. binning in jet \(p_{\text {T}}\) did not perform as well as \(\Delta R_{jj}\), and binning the main discriminant, \(\mathcal {S}(E_{\text {T}}^{\text {miss}})\), so that at least one background event is expected in each bin. All remaining SRs are treated as single bins. For all of the SRs, VRs are defined using the jet system mass sidebands or by inverting criteria for kinematic variables defining the SRs. The criteria ensuring orthogonality to the corresponding SRs are highlighted in boldface in Tables 7, 8 and 9. CRs are defined in order to extract data-driven normalization factors for the main background processes: diboson, \(t\bar{t}\), \(Z/\gamma ^*\text {\,+\,jets}\), and low-mass off-shell \(Z/\gamma ^*\text {\,+\,jets}\). These are discussed further in Sect. 7.2.

Table 7 Overview of the High and Intermediate signal, control, and validations regions of the electroweak search. The main requirements that distinguish the control and validation regions from the signal regions are highlighted in boldface. The bracketed requirements denote the binning used in model-dependent fits, as described in the text
Table 8 Overview of the Low signal, control, and validations regions of the electroweak search. The main requirements that distinguish the control and validation regions from the signal regions are highlighted in boldface. The bracketed requirements denote the binning used in model-dependent fits, as described in the text
Table 9 Overview of the OffShell signal, control, and validations regions of the electroweak search. The main requirements that distinguish the control and validation regions from the signal regions are highlighted in boldface. The bracketed requirements denote the binning used in model-dependent fits, as described in the text

6.5 Strong search selections

The SRs targeting production of gluinos and squarks start with the common selection described at the beginning of Sect. 6. The four overlapping SRs designed for a kinematic endpoint, or ‘edge’ feature, are binned in the dilepton invariant mass, \(m_{\ell \ell }\). They are named SRC-STR, SRLow-STR, SRMed-STR, and SRHigh-STR in order of sensitivity to gluino–\(\tilde{\chi }_1^0\) mass splittings from those that are compressed, or small, to those that are large. SRs designed for an excess of events near the Z boson mass, ‘on-\(Z\)’, are a single bin in the window \(81<m_{\ell \ell }<101\) . They share the same naming as the edge SRs, with the addition of a ‘Z’ in the name. The selection used for each region in the analysis is summarized in Table 11. The bin boundaries in \(m_{\ell \ell }\) used for interpretations were chosen such that there is a finer division in the mass region targeted by each SR, while keeping enough events per bin for the background estimates and a bin around the Z mass where possible. The boundaries in units of are as follows:

  • SRC-STR: 12, 31, 46, 61, 71, 81, 101, 201;

  • SRLow-STR: 12, 41, 61, 81, 101, 141, 201, 301, 501;

  • SRMed-STR: 12, 81, 101, 201, 301, 601;

  • SRHigh-STR: 12, 101, 301, 1001.

The acceptance times efficiency for the simulated simplified models in the various regions depends on the model and region. SRHigh-STR has an acceptance times efficiency of 10% for the gluino-slepton model with and . SRC-STR has an acceptance times efficiency of 0.02% for the gluino-\(Z^{(*)}\) model with and . This is very small due to the compressed mass-splittings between sparticles, resulting in low-momentum decay products, and the Z boson branching fraction to leptons. SRZMed-STR has an acceptance times efficiency of 1% for the squark-\(Z^{(*)}\) model with and .

As described before, all of the SRs require exactly two OS SF signal leptons with \(p_{\text {T}} >25\) , without any additional baseline leptons. This also applies to the Strong search CRs and VRs, except for VR3L-STR, which requires exactly three signal leptons. All signal models studied are expected to have several quarks in the final state, so events are further required to have at least two jets with \(p_{\text {T}} >30\) . Since the SR selection requires exactly two leptons, it is likely that events passing the selection from models with Z bosons contain both a leptonically and hadronically decaying Z boson. Thus the on-\(Z\) SRs require at least four jets with \(p_{\text {T}} >30\)  to take this into account. All regions require \(m_{\ell \ell }>12\)  in order to reduce contributions from low-mass resonances. Several variables, such as \(E_{\text {T}}^{\text {miss}}\) and \(H_{\text {T}}\), are used to isolate SUSY-like events from the background. Most regions require the angular separation in \(\phi \) between the \(\vec {p}_{\text {T}}^{\text {miss}}\) and the two leading jets, \(\Delta \phi (\varvec{j}_{1,2},\vec {p}_{\text {T}}^{\text {miss}})\), to be greater than 0.4 to remove events with \(E_{\text {T}}^{\text {miss}}\) arising from jet mismeasurements. CRs for \(Z/\gamma ^*\text {\,+\,jets}\) are defined, identified with a ‘-Z’, by inverting this requirement, adding an \(m_{\ell \ell }\) window of 81–101 , and otherwise keeping the selections the same. CRs for flavour-symmetric (FS) processes, such as \(t\bar{t}\) and WW, are defined (labelled with ‘-FS’) by requiring the leptons to be different-flavour (DF). VRs are defined for each edge SR, below their \(E_{\text {T}}^{\text {miss}}\) requirement, by \(150<E_{\text {T}}^{\text {miss}} <250\) . Additional VRs targeting WZ and FNP leptons are defined by requiring three leptons and same-charge (SS) leptons respectively; the \(E_{\text {T}}^{\text {miss}}\) requirement in these regions is lowered in order to increase the number of events.

Table 10 Acceptance times efficiency in the summed EWK signal regions for example C1N2 and GMSB signal models

The requirements chosen for each SR are based on an optimization performed with a few test points with different mass-splittings between the \(\tilde{g}\) or \(\tilde{q}\) and the \(\tilde{\chi }_1^0\) from each model. One of the main changes with larger mass-splittings is the increase in jet activity. The requirement on \(H_{\text {T}}\) varies from \(>250\)  in the more compressed regions to \(>800\)  in the regions targeting large splittings. The lower bound on the dilepton system’s momentum, \(p_{\text {T}}^{\ell \ell }>40\) , was imposed to allow use of the \(\gamma \) + jets process to estimate the \(Z/\gamma ^*\text {\,+\,jets}\) background, but this method was not used due to poor modelling. After all of the other requirements this removes less than one expected event from each SR. The more compressed signal models tend to have small values of \(p_{\text {T}}^{\ell \ell }\), so an upper bound is imposed to reduce the \(Z/\gamma ^*\text {\,+\,jets}\) contribution. The requirement is relaxed for SRs targeting larger mass-splittings. The \(E_{\text {T}}^{\text {miss}}\) requirement removes a large portion of the SM backgrounds and is set to \(>250\)  for the C and Low regions and \(>300\)  for the Med and High regions. Requirements on \(\mathcal {S}(E_{\text {T}}^{\text {miss}})\) and \(m_{\textrm{T2}}\) are used to reduce the contribution from \(t\bar{t}\) in the signal regions.

Table 11 Overview of all of the signal, control, and validation regions of the Strong search. The flavour of the lepton pair is denoted by ‘SF’ for same-flavour and ‘DF’ for different-flavour. All regions require exactly two opposite-charge signal leptons with \(p_{\text {T}} >25\) , with the exception of VR3L which requires exactly three signal leptons (3 L). The main requirements that distinguish the control and validation regions from the signal regions are highlighted in boldface. The validation regions have an equivalent set of CRs with the lepton flavour and \(\Delta \phi (\varvec{j}_{1,2},\vec {p}_{\text {T}}^{\text {miss}})\) flipped, but are omitted here for brevity

7 Background estimation

This section describes the methods used to estimate the contributions from SM processes to each of the search regions. An overview of how the various processes are estimated in each search is shown in Table 12. Several processes are estimated using data-driven methods in order to estimate both the yield and distribution from data. Depending on the search and SR, some processes are estimated using orthogonal CRs to normalize the yield to data while taking the shape from MC simulation. The choice of background estimation method depends on the dominant backgrounds, expected yields, and whether the shape of the background is important. For the RJR search, the background estimation methods are repeated from the 36 fb\(^{-1}\) analysis. The use of b-tagging in the EWK search requirements allows for a \(t\bar{t}\) CR with inverted b-jet requirements. While the Strong search is inclusive in b-jets and modelling the shape of the \(m_{\ell \ell }\) spectrum is important, thus a method that also estimates the shape of the backgrounds from data is used. Finally, processes with smaller expected yields such as triboson production (VVV), Higgs production, and rare top processes (‘Other Top’) such as \(t\bar{t}\) + V production are estimated directly from MC simulation. The common estimate of FNP leptons is described, and then methods specific to each search are described in Sects. 7.17.3.

Table 12 Breakdown of background estimation methods for the three searches. ‘MC’ processes are estimated directly from a Monte Carlo (MC) simulated sample, ‘CR’ processes are MC samples normalized to data in a control region (CR), and ‘DD’ processes are estimated via a data-driven method where both the shape and normalization are obtained from data. Entries with multiple categories depend on the specific region or process, e.g. \(t\bar{t}\) and Wt

All three searches estimate events with fake, misidentified and non-prompt leptons using the matrix method [118]. This method inverts a system of equations relating the number of observed baseline and signal leptons to the estimated number of real and FNP leptons via measured real- and FNP-lepton efficiencies. These events come mostly from semileptonic \(t\bar{t}\), \(W\rightarrow \ell \nu \), and single top (s- and t-channel) decays, which enter the dilepton channel when one hadron, photon, or non-prompt lepton from a heavy-flavour decay is misidentified as a signal lepton. In most SRs, the contribution from this background is less than 5%. However, in a few regions it is up to 20% of the background yield.

A control sample is constructed for each region in the searches with the same selection but removing the requirement of two signal leptons, requiring only two baseline leptons instead. The number of leptons that pass or fail the stricter signal lepton requirements is counted. A system of equations is constructed with the measured efficiencies for real and FNP leptons to pass these requirements and then inverted to solve for the number of FNP leptons passing the signal lepton requirement. In the case of a single-lepton selection, the number of FNP-lepton events in a given region would be estimated according to:

$$\begin{aligned} N_{\text {pass}}^{\text {FNP}} = \frac{N_{\text {fail}} - (1/\epsilon ^{\text {real}} - 1) \times N_{\text {pass}} }{1/\epsilon ^{\text {FNP}} - 1/\epsilon ^{\text {real}}}, \end{aligned}$$

where \(\epsilon ^{\text {real}}\) is the efficiency for a real, prompt lepton to pass the signal lepton requirements, and \(\epsilon ^{\text {FNP}}\) is the same for FNP leptons. The real-lepton efficiency is obtained from MC simulation which has been corrected to match the efficiency found in data and uses the MC simulation particle-level information to select prompt leptons. The FNP-lepton efficiency is measured using a tag-and-probe method. It is measured separately for misidentified leptons originating from light-flavour jets, heavy-flavour jets, and from the conversion of photons to electrons. The heavy-flavour and photon-conversion efficiencies are measured with data in regions with a b-jet or \(\mu \mu e\) events on the Z boson mass peak respectively. The light-flavour efficiency is measured in SS events with MC simulation since it is difficult to construct a region pure in FNP leptons from light-flavour jets. The three sources are combined according to their expected relative contributions to the search regions, measured using MC simulation with selections similar to those for each search.

7.1 Recursive-jigsaw reconstruction search backgrounds

In the RJR search, the diboson and top backgrounds are normalized in CRs, the \(Z/\gamma ^*\text {\,+\,jets}\) background is estimated using the so-called ABCD method, the FNP-lepton background is estimated with the data-driven approach previously described, and the remaining backgrounds are estimated via MC simulation.

The ABCD method requires two SR selections that are uncorrelated with respect to the \(Z/\gamma ^*\text {\,+\,jets}\) background. These selections are used to construct three CRs: A, B, and D (region C is the SR), which are adjacent to the SR. Table 13 lists the requirements on these variables and their adjacency to the SRs for both cases. As the selections are uncorrelated, there is a linear relation between the regions, \(N_\text {SR} \approx N_\text {D}\cdot N_\text {A}/N_\text {B}\), where N is the fitted \(Z/\gamma ^*\text {\,+\,jets}\) estimate in each region. This provides a background estimate in each of the SRs.

For SR\(2\ell \)-Low-RJR, the CRs are defined with \(m_{jj}\) and \(H_{1,1}^{\textrm{PP}}/H_{4,1}^{\textrm{PP}}\), and for SR\(2\ell \)-ISR-RJR, \(m_\text {T}^{J}\) and \(p_{\mathrm {T\ I}}^{~\text {CM}}\) are used to define the CRs. For \(\textrm{A}_{\textrm{Low}}\), \(\textrm{B}_{\textrm{Low}}\), and \(\textrm{D}_{\textrm{Low}}\), the \(Z/\gamma ^*\text {\,+\,jets}\) purities are 77%, 98%, and 96% respectively. For \(\textrm{A}_{\textrm{ISR}}\), \(\textrm{B}_{\textrm{ISR}}\), \(\textrm{D}_{\textrm{ISR}}\), the \(Z/\gamma ^*\text {\,+\,jets}\) purities are 36%, 85%, and 91% respectively. The ABCD regions are included in the simultaneous fits. A cross-check \(Z/\gamma ^*\text {\,+\,jets}\) estimate obtained by fitting the sidebands in \(m_{jj}\), i.e. outside of the W boson mass window, produces a compatible yield, but the ABCD method results in a lower overall uncertainty.

Table 13 The selection criteria for the regions used in the ABCD \(Z/\gamma ^*\text {\,+\,jets}\) background estimate for the RJR search. The left three columns coincide with the Standard Region, and the right three columns with the ISR Region

The top and diboson backgrounds are normalized to data in the CRs defined in Sect. 6.3 with a separate simultaneous fit to each corresponding SR. The \(t\bar{t}\) and Wt processes share the same normalization factors of \(1.03\pm 0.08\) and \(0.96\pm 0.19\) in SR\(2\ell \)-Low-RJR and SR\(2\ell \)-ISR-RJR respectively. The normalization factors derived for the diboson backgrounds are \(1.09\pm 0.10\) and \(0.96\pm 0.13\) in the same regions. The yields in these CRs, along with the corresponding VRs, are listed in Tables 14 and 15 for the Low and ISR regions respectively. Distributions in the Low and ISR top and diboson VRs are shown in Fig. 3. The yields and distributions for \(Z/\gamma ^*\text {\,+\,jets}\) in the tables and figure are from MC simulation because the ABCD method is only constructed for the SRs. The expected and observed distributions agree well in the VRs.

Table 14 Breakdown of expected and observed yields in the diboson (VV) and top CRs and VRs for the standard-decay-tree regions associated with SR\(2\ell \)-Low-RJR after a simultaneous fit to the control regions. The yields for \(Z/\gamma ^*\text {\,+\,jets}\) are from MC simulation. The uncertainties include both the statistical and systematic sources
Table 15 Breakdown of expected an observed yields in the diboson (VV) and top CRs and VRs for the ISR-decay-tree regions associated with SR\(2\ell \)-ISR-RJR after a simultaneous fit to the control regions. The yields for \(Z/\gamma ^*\text {\,+\,jets}\) are from MC simulation. The uncertainties include both the statistical and systematic sources
Fig. 3
figure 3

Distributions of RJR variables in VR\(2\ell \)-VV-RJR (top-left), VR\(2\ell \)-Top-RJR (top-right), VR\(2\ell \)-ISR-VV-RJR (bottom-left), and VR\(2\ell \)-ISR-Top-RJR (bottom-right) from the RJR search. The background estimate models the data well in all regions after a simultaneous fit of the control regions. The standard and ISR regions are fit separately. The hatched band includes both the systematic and statistical uncertainties. The last bin contains the overflow

7.2 Electroweak search backgrounds

The dominant backgrounds in the EWK SRs are WZ, ZZ, and \(t\bar{t}\) production. Additionally, \(Z/\gamma ^*\text {\,+\,jets}\) is an important background for the Low and OffShell regions. Data-driven normalization factors for these backgrounds are extracted using a simultaneous likelihood fit to data in the SRs and CRs that are designed to be enriched in each background. CR-tt-EWK targets \(t\bar{t}\) production and requires at least one b-jet and intermediate \(\mathcal {S}(E_{\text {T}}^{\text {miss}})\in [ 9,12 ]\) to ensure orthogonality to the SRs; unlike the RJR search, the resulting normalization factor is not applied to Wt, which is instead normalized to the theoretical cross-section. CR-VZ-EWK targets WZ/ZZ production and uses the sideband of the SR-Int-EWK \(m_{jj}\) distribution, relaxing the \(p_\text {T}^{j_1}\) requirement. A common diboson normalization factor is applied to the SRs. To account for different kinematics, two separate normalization factors are defined for \(Z/\gamma ^*\text {\,+\,jets}\). CR-Z-EWK targets on-shell Z + jets production (\(m_{\ell \ell }> 71\) , which is the lowest edge of the Z mass window) while CR-DY-EWK targets low-mass \(Z/\gamma ^*\text {\,+\,jets}\) production (\(12<m_{\ell \ell }<71\) ). The resulting Z + jets normalization factor is applied everywhere, except for the OffShell regions, where the low-mass \(Z/\gamma ^*\text {\,+\,jets}\) normalization factor is applied. The definitions of the CRs are provided in Tables 7, 8 and 9. The resulting normalization factors are \(0.93\pm 0.09\), \(0.84\pm 0.08\), \(1.21\pm 0.14\), and \(0.86\pm 0.25\) for \(t\bar{t}\), diboson, Z + jets, and \(Z/\gamma ^*\text {\,+\,jets}\) respectively.

To validate the normalization and modelling of the SM predictions, eight VRs are defined. For the High SRs, two VRs use the sideband of the \(m_{j_1}\) or \(m_{jj}\) distribution, while VR-High-R-EWK provides additional validation of the diboson background modelling inside the \(m_{jj}\) mass window by inverting the \(\Delta R_{jj} \) requirement. VR-\(\ell \ell bb\)-EWK validates the top background in a \(\mathcal {S}(E_{\text {T}}^{\text {miss}})\) window of 12–18 below the SR-\(\ell \ell bb\)-EWK threshold. Distributions of \(\mathcal {S}(E_{\text {T}}^{\text {miss}})\) in the High and \(\ell \ell bb\) VRs, with the normalizations from the simultaneous fit applied, are shown in Fig. 4. Good agreement between the data and the background prediction is observed.

VR-Int-EWK has the same requirements as SR-Int-EWK but with the requirement on \(p_\text {T}^{j_1}\) inverted and is dominated by the diboson processes. For the Low SRs, two VRs are defined. VR-Low-EWK requires a larger value of \(\Delta R_{\ell \ell }\) than SR-Int-EWK, and VR-Low-2-EWK inverts the \(m_{jj}\) window. For the OffShell regions, VR-OffShell-EWK maintains orthogonality by using an \(m_{\textrm{T2}}\) window below that in the SR. Distributions of \(\mathcal {S}(E_{\text {T}}^{\text {miss}})\) in the Int, Low, and OffShell VRs are shown in Fig. 5 with the normalization from the simultaneous fit applied. Good agreement is observed in all VRs.

Fig. 4
figure 4

Distributions of \(\mathcal {S}(E_{\text {T}}^{\text {miss}})\) in VR-High-Sideband-EWK (top-left), VR-High-R-EWK (top-right), VR-1J-High-EWK (bottom-left), and VR-\(\ell \ell bb\)-EWK (bottom-right) from the EWK search after a simultaneous fit of the control regions. The hatched band includes both the systematic and statistical uncertainties. The last bin includes the overflow

Fig. 5
figure 5

Distributions of \(\mathcal {S}(E_{\text {T}}^{\text {miss}})\) in VR-Int-EWK (top-left), VR-Low-EWK (top-right), VR-Low-2-EWK (bottom-left), and VR-OffShell-EWK (bottom-right) from the EWK search after a simultaneous fit of the control regions. The hatched band includes both the systematic and statistical uncertainties. The last bin includes the overflow

7.3 Strong search backgrounds

For the Strong search, the \(Z/\gamma ^*\text {\,+\,jets}\) background is normalized in CRs, the FS backgrounds are estimated from \(e\mu \) data, the FNP-lepton background is estimated in a data-driven way as previously described, and the remaining backgrounds are fully estimated via MC simulation. FS processes include those with independent leptonic decays such that the expected number of \(e\mu \) events is the same as the number ee + \(\mu \mu \) events, e.g. \(t\bar{t}\), WW, and \(Z\rightarrow \tau \tau \). VRs are constructed with an \(E_{\text {T}}^{\text {miss}}\) requirement, , below the SR requirement in order to validate the background estimation. VRs with SS instead of OS leptons and with three instead of two leptons are also constructed in order to validate the FNP-lepton and WZ/ZZ backgrounds respectively.

The \(Z/\gamma ^*\text {\,+\,jets}\) background, with decays to ee or \(\mu \mu \), mainly enter the SRs due to \(E_{\text {T}}^{\text {miss}}\) from the mismeasurement of jets or from contributions by neutrinos in heavy-flavour decays. The requirement of \(\Delta \phi (\varvec{j}_{1,2},\vec {p}_{\text {T}}^{\text {miss}})>0.4\) removes \(Z/\gamma ^*\text {\,+\,jets}\) events where the \(\vec {p}_{\text {T}}^{\text {miss}}\) is aligned with a jet, as is the case for mismeasured jets. A CR is constructed for each SR or VR by inverting this requirement, and roughly half of the yield in these CRs is from \(Z/\gamma ^*\text {\,+\,jets}\) events. The normalization of the \(Z/\gamma ^*\text {\,+\,jets}\) MC prediction is extracted with a simultaneous likelihood fit to the CR and respective SR or VR. The resulting factors are within one standard deviation of one for every region except SRC-STR, where the small MC simulation prediction is pulled up by approximately 1.5 standard deviations, and VRLow-STR, where the MC simulation prediction is pulled down by approximately 1.3 standard deviations.

The FS method makes use of a CR for each SF region, with the lepton flavour requirement changed to select \(e\mu \) data events. Differences between the efficiencies to select events with electrons and events with muons are used to adjust the \(e\mu \) data events to more accurately predict ee + \(\mu \mu \) events. For the on-\(Z\) SRs, the \(e\mu \) data yield is small. In order to decrease the statistical uncertainty of the FS prediction, the \(m_{\ell \ell }\) window used to collect \(e\mu \) events is widened to 61–121 . The yield is then scaled down by the ratio of the yields in the narrower and wider \(m_{\ell \ell }\) windows obtained from MC simulation. The estimated ee + \(\mu \mu \) yield \((N^\text {est})\) is obtained as:

$$\begin{aligned} N^\text {est} = \frac{1}{2} \cdot \left[ \sum ^{N_{e\mu }^\text {data}}_{i} \Bigl ( k_{e}(p_{\text {T}} ^{i,\mu }, \eta ^{i,\mu }) + k_{\mu }(p_{\text {T}} ^{i,e}, \eta ^{i,e}) \Bigr ) \cdot \alpha (p_{\text {T}} ^{i,\ell _{1}}, \eta ^{i,\ell _{1}}) \nonumber \right. \nonumber \\ \left. - \sum ^{N_{e\mu }^\text {MC}}_{i} \Bigl ( k_{e}(p_{\text {T}} ^{i,\mu }, \eta ^{i,\mu }) + k_{\mu }(p_{\text {T}} ^{i,e}, \eta ^{i,e}) \Bigr ) \cdot \alpha (p_{\text {T}} ^{i,\ell _{1}}, \eta ^{i,\ell _{1}}) \right] , \end{aligned}$$
(4)

where \(N_{e\mu }^\text {data}\) is the number of data events observed in a given CR. Events from non-FS processes, e.g. WZ/ZZ, are subtracted from the \(e\mu \) data events using MC simulation. This is the second term in Eq. (4), where \(N_{e\mu }^\text {MC}\) is the number of events from non-FS processes in MC simulation in the CR. The factor \(\alpha (p_{\text {T}} ^i, \eta ^i)\) accounts for the different trigger efficiencies for \(ee/\mu \mu \) and \(e\mu \) events, and \(k_{e}(p_{\text {T}} ^i, \eta ^i)\) and \(k_{\mu }(p_{\text {T}} ^i, \eta ^i)\) are the electron and muon selection efficiency factors for the kinematics of the lepton being replaced in event i. For example, if an event with a muon and electron is used to predict a \(\mu \mu \) event, the kinematics of the electron are used to evaluate \(k_\mu \). The trigger and selection efficiency correction factors are derived from data events in an inclusive region with two signal leptons and at least two signal jets, according to:

$$\begin{aligned} k_{e}(p_{\text {T}}, \eta ) = \sqrt{\frac{N_{ee}^{\text {meas}}(p_{\text {T}}, \eta )}{N_{\mu \mu }^{\text {meas}}(p_{\text {T}}, \eta )}}, \\ k_{\mu }(p_{\text {T}}, \eta ) = \sqrt{\frac{N_{\mu \mu }^{\text {meas}}(p_{\text {T}}, \eta )}{N_{ee}^{\text {meas}}(p_{\text {T}}, \eta )}}, \\ \alpha (p_{\text {T}}, \eta ) = \frac{\sqrt{\epsilon ^\text {trig}_{ee}(p_{\text {T}} ^{\ell _1},\eta ^{\ell _1})\times \epsilon ^\text {trig}_{\mu \mu }(p_{\text {T}} ^{\ell _1},\eta ^{\ell _1})}}{\epsilon ^\text {trig}_{e\mu }(p_{\text {T}} ^{\ell _1},\eta ^{\ell _1})}, \end{aligned}$$

where \(\epsilon ^\text {trig}_{ee/\mu \mu /e\mu }\) is the trigger efficiency as a function of the leading-lepton (\(\ell _1\)) kinematics and \(N_{ee}^{\text {meas}}\) \((N_{\mu \mu }^{\text {meas}})\) is the number of ee \((\mu \mu )\) data events in the inclusive region mentioned above. The factors \(k_{e}(p_{\text {T}}, \eta )\) and \(k_{\mu }(p_{\text {T}}, \eta )\) are calculated separately for leading and subleading leptons. The correction factors are typically within 20% of unity, except in the region \(|\eta |<0.1\), where they deviate by up to 40% from unity because of a lack of coverage by the muon spectrometer.

Comparisons of the \(m_{\ell \ell }\) distribution between the background estimate and data in the VRs corresponding to the four edge SRs are shown in Fig. 6. Good agreement is observed in each of the regions. Since the on-\(Z\) SRs require at least four jets, their equivalent VRs correspond to the high-\(n_{\textrm{jets}}\) regions in the distribution of the number of jets for each VR shown in Fig. 7. Good agreement is seen in each VR. In order to validate the MC simulation of WZ/ZZ production, a three-lepton VR is constructed by selecting events with three signal leptons and applying requirements similar to those in the SRs, as shown near the bottom of Table 11. Figure 8 shows the \(m_{\ell \ell }\) distribution in this VR3L-STR with a 13% theory uncertainty assigned to the WZ/ZZ background. Good agreement within the statistical uncertainty is observed.

Fig. 6
figure 6

Observed and expected dilepton mass distributions in VRC-STR (top-left), VRLow-STR (top-right), VRMed-STR (bottom-left), and VRHigh-STR (bottom-right). Each validation region is fit separately with the corresponding control region. All statistical and systematic uncertainties are included in the hatched band. The entries are normalized to the bin width, and the last bin is the overflow

Fig. 7
figure 7

Observed and expected jet multiplicity in VRLow-STR (top-left), VRMed-STR (top-right), and VRHigh-STR (bottom) after a fit performed on the \(m_{\ell \ell }\) distribution and corresponding control region. All statistical and systematic uncertainties are included in the hatched band. The last bin contains the overflow

Fig. 8
figure 8

Observed and expected dilepton mass distributions in VR3L-STR without a fit to the data. The ‘Other’ category includes the negligible contributions from \(t\bar{t}\) and \(Z/\gamma ^*\text {\,+\,jets}\) processes. The hatched band contains the statistical uncertainty and the theoretical systematic uncertainties of the WZ/ZZ prediction, which are the dominant sources of uncertainty in VR3L-STR. No fit is performed. The last bin contains the overflow

8 Systematic uncertainties

Theoretical and experimental uncertainties are taken into account for the signal and background models. For processes that are not normalized to data, a 1.7% uncertainty is assigned to the integrated luminosity [40, 41]. The estimated efficiencies from MC simulation to trigger on, reconstruct, and measure the objects required for this analysis are all subject to uncertainties to account for differences between data and simulation. For the jet energy scale, uncertainties due to the jet flavour composition, pile-up, and the jet and event kinematics [103] are considered. The jet energy resolution (JER) is also subject to additional uncertainties to account for differences between data and simulation [101]. Uncertainties in the lepton energy scales, resolutions, identification efficiencies, isolation efficiencies, reconstruction efficiencies, and trigger efficiencies are also considered. The uncertainties associated with the objects used to compute the \(\vec {p}_{\text {T}}^{\text {miss}}\) are propagated through the computation, and additional uncertainties in the scale and resolution of the contribution from low-momentum tracks not associated with the primary objects are also included [119]. These experimental uncertainties are correlated between regions and processes, including signal models. The largest experimental uncertainties are related to the measurement of jets and typically result in an uncertainty in the total background of a few percent, but reach 10–20% in a couple of search regions. All of the uncertainties quoted below are for the expected yield of the process in analysis regions.

Uncertainties in the predicted cross-sections and modelling of background processes predicted via MC simulation are considered. If the background process is normalized to data, the total cross-section uncertainty is not applied, but instead an uncertainty for the extrapolation from the CR to the SR is evaluated. For background processes making large contributions to regions in the searches, acceptance uncertainties due to various choices, such as the PDF set and QCD scales, are applied. For small-background processes, these are ignored and only the total cross-section uncertainty is applied. In a few cases, negligibly small uncertainties are ignored in order to simplify the statistical evaluation of the level of data-to-background agreement. As an example, uncertainties due to the assumed value of the strong coupling, \(\alpha _{\text {s}} \), are ignored as they are found to be negligible. These uncertainties from theoretical predictions are correlated between regions for a given process, but uncorrelated between processes.

For the RJR and EWK searches, uncertainties in the \(t\bar{t}\) ME are evaluated by a comparison of the default Powheg generator and MG5_aMC@NLO. The effects of uncertainties in the PS are evaluated by comparing the default showering with showering [120]. Uncertainties due to the amount of initial- and final-state radiation are evaluated by varying the relevant parameters in the generator. The effects of uncertainties in the QCD renormalization and factorization scales are evaluated by representing these scale uncertainties by three nuisance parameters in the fit: two nuisance parameters to independently vary each scale upwards and downwards by a factor of two, and one nuisance parameters for a correlated variation of both. Finally, uncertainties due to the choice of PDF are evaluated following the recommendations of Ref. [121]. The largest \(t\bar{t}\) uncertainties in the RJR search regions SR\(2\ell \)-Low-RJR and SR\(2\ell \)-ISR-RJR are due to the uncertainty in the PS, 20% and 30% respectively. The uncertainty due to the amount of initial-state radiation is also large, 20% in SR\(2\ell \)-ISR-RJR. The largest \(t\bar{t}\) uncertainties in the EWK SRs tend to be from the ME comparison and range from 20% to 60%. The PS and radiation uncertainties are of similar size.

For the diboson processes, uncertainties from the choice of PDF set and QCD scales are evaluated in the same way as for \(t\bar{t}\). Additional uncertainties are evaluated by varying the resummation scale upwards and downwards by a factor of two and by varying the merging scale between 15 and 30 . Finally, an alternative PS recoil scheme in Sherpa is evaluated, and the difference from the nominal sample is taken as the one standard deviation variation. For the Strong search, where the diboson processes are not normalized to data, a 6% uncertainty [62] is applied to the total cross-section. The uncertainty due to the choice of various scales is generally the largest one for the diboson backgrounds and varies from 10 to 30% depending on the region.

For the EWK and Strong searches, the \(Z/\gamma ^*\text {\,+\,jets}\) processes are also subject to the same uncertainties from the choice of PDF, QCD scales, resummation scale, and merging scale as the diboson processes. An additional uncertainty from the ME calculation and is evaluated by comparing the nominal Sherpa sample with an alternative MG5_aMC@NLO sample. Uncertainties in the \(Z/\gamma ^*\text {\,+\,jets}\) background in the EWK search regions range from 10 to 20% from the choice of scale and up to 60% from the ME comparison in regions with at least one predicted \(Z/\gamma ^*\text {\,+\,jets}\) event. The largest uncertainty in the \(Z/\gamma ^*\text {\,+\,jets}\) background in the Strong search regions is typically from the ME comparison, and ranges from 20% to 60% in regions with at least one predicted \(Z/\gamma ^*\text {\,+\,jets}\) event.

In the SR-\(\ell \ell bb\) region of the EWK search a large fraction of the background events are from \(t\bar{t}\) + Z. For \(t\bar{t}\) + Z in the EWK search regions, uncertainties from the ME and PS are evaluated by comparisons with a Sherpa sample and a MG5_aMC@NLO +  sample respectively. The same renormalization scale, factorization scale, and PDF uncertainties described above are also evaluated. The largest uncertainty is from the ME comparison and ranges from 10% to 50%, depending on the region. In addition, a 6% uncertainty on the Wt background is applied.

Uncertainties in the cross-sections of smaller backgrounds estimated directly from MC simulation are applied as follows. A conservative uncertainty of 10% is applied to the entire Higgs boson sample, based on the \(t\bar{t} H\) cross-section uncertainty in Ref. [68]. The Other Top sample is predominantly \(t\bar{t}\) + V, so a cross-section uncertainty of 13%, also from Ref. [68], is applied to this background category. An uncertainty of 32% is applied to the VVV background, based on comparisons between Sherpa and VBFNLO [62].

For data-driven background estimates, uncertainties in the prediction are derived from the limited number of events used in the prediction and by varying the assumptions used. In cases where CRs are used to normalize the yield of a process, theoretical uncertainties are evaluated, as described above, for the acceptance of the CR relative to the SR.

Uncertainties in the matrix-method estimate of the FNP leptons arise from the size of the samples used and the composition of the FNP-lepton sources, and the total uncertainty of the estimate is obtained by combining uncertainties from several sources. The statistical uncertainties arise from the limited number of events in regions used to derive the various efficiencies and are propagated to the final estimate. The variations of the lepton efficiency corrections are propagated to the light-flavour efficiencies because MC simulation is used in the derivation. For heavy-flavour efficiencies, an uncertainty in the prompt-lepton contamination from MC simulation is taken into account. Finally, an uncertainty in the estimated proportions of light-flavour jets, heavy-flavour jets, and photon conversions contributing to the processes producing FNP leptons is included. This is computed by varying the composition, using MC simulation, of the regions with selections similar to various analysis regions used to construct the weights. In cases where the number of events in a region is not sufficient to predict the FNP lepton contribution with the matrix method, the central value and uncertainty are taken from the larger of two computations. The first computation is the mean of the weights in the matrix method with the root mean square of these weights as the uncertainty. This would be the average prediction if there were one event for the matrix method. The second computation is the FNP-lepton yield predicted by MC simulation with a 100% uncertainty. These extra computations of the FNP-lepton background are only used in the EWK search, where the search regions have small yields.

For the flavour-symmetric background estimate in the Strong search regions, the uncertainties due to a limited number of events in the DF regions and the regions used to derive the correction factors are propagated to the final estimate. An additional uncertainty of 10% is applied to the estimate, based on closure tests. The closure tests were performed on both data and MC events in the region used to derive the weights, with \(E_{\text {T}}^{\text {miss}}\) restricted to be between 100 and 200 , and on MC events in the SRs. Finally, for the on-\(Z\) regions where the \(m_{\ell \ell }\) window used to collect DF events is expanded, an additional uncertainty is applied to account for the limited number of events in the MC simulation used to evaluate this extrapolation.

Uncertainties are applied to the estimate of \(Z/\gamma ^*\text {\,+\,jets}\) with the ABCD method in the RJR search to account for the limited number of events in the regions used to extract the estimate and the theoretical and experimental uncertainties, discussed above, in the non-\(Z/\gamma ^*\text {\,+\,jets}\) MC simulation backgrounds. The correlations between the ABCD variables are found to be very small and the related systematic uncertainties are treated as negligible.

For signal models with the production of squarks or gluinos, the nominal cross-section and its uncertainty due to scale variations and the choice of PDF are determined as described in Ref. [84]. For signal models containing the direct production of electroweakinos, the cross-section and its uncertainty are taken from Ref. [82]. The uncertainties in the acceptance of signal model events, from scale variations and the PS tuning and radiation, are estimated and summed in quadrature.

The most important uncertainties in the total background estimate across the various searches are related to the theoretical prediction of the largest backgrounds (e.g. diboson and \(t\bar{t}\) production), experimental uncertainties related to jets, and the number of simulated MC events for the estimation of SM processes. The most important uncertainties in the RJR search SRs (SR\(2\ell \)-Low-RJR / SR\(2\ell \)-ISR-RJR), relative to the total expected background, are from the normalization of \(Z/\gamma ^*\text {\,+\,jets}\) via the ABCD background estimate (19% / 28%), the size of the simulated samples (7%), and the matching scale for the diboson processes (5% / 6%). The largest jet-related experimental uncertainty is a component of the JER (1%) in SR\(2\ell \)-Low-RJR, and the b-jet tagging efficiency (2%) in SR\(2\ell \)-ISR-RJR. The sizes of the uncertainties in the total background, grouped into categories after fits to the SRs, are summarized in Fig. 9a.

The most important uncertainties in the EWK search SRs depend on the region. The largest uncertainties in the total background in the SR-High-EWK regions arise from the size of the simulated samples (4–23%), the prediction of the fake and non-prompt leptons (3–17%), and in SR-1J-High-EWK the uncertainty in the jet mass measurement (37%). The largest jet-related experimental uncertainties in regions that do not use the jet mass are 2–3%. In SR-Int_a-EWK, the dominant uncertainty is from the amount of final-state radiation in the simulation of \(t\bar{t}\) production (16%). In SR-Int_b-EWK, the dominant uncertainty is related to the JER, with several components in the 4–6% range. In the SR-Low-EWK regions, the largest uncertainties are from the size of the simulated samples (6–35%), the JER (6–18%), and the choice of generator for the \(Z/\gamma ^*\text {\,+\,jets}\) ME (13–23%). In the SR-OffShell-EWK and SR-\(\ell \ell bb\)-EWK regions, the largest uncertainties arise from the size of the simulated samples (7–12%), the choice of generator for the \(Z/\gamma ^*\text {\,+\,jets}\) ME (4–10%), and the JER (4–11%). Additionally, in SR-\(\ell \ell bb\)-EWK, the uncertainties from the \(t\bar{t}\) PS and ME calculation are 12% for each. The sizes of the uncertainties in the total background, grouped into categories after a fit to the regions, are summarized in Fig. 9b.

The most important uncertainties in the Strong search SRs are those from the various scales used in the simulation of diboson production and the PS recoil scheme, typically 3–6% in the edge SRs and up to 18% in the on-Z SRs; the non-closure of the flavour-symmetric background estimate (2–6%) in the edge SRs; the ME generator for \(Z/\gamma ^*\text {\,+\,jets}\), up to 7% in SRZ-High-STR; and the size of the simulated samples (2–11%). The largest jet-related experimental uncertainties are 1–5% from uncertainties in the parton origin, or flavour, of the jet, depending on the region. The sizes of the uncertainties in the total background, grouped into categories after fits to the SRs, are summarized in Fig. 9c.

Fig. 9
figure 9

Breakdown of the uncertainties in each of the (a) RJR, (b) EWK, and (c) strong searches into categories. The total fractional uncertainty of the background is shown as a shaded histogram. The experimental uncertainties include those from object reconstruction and identification efficiencies, object resolutions and scales, pile-up distribution, and trigger efficiencies. The background sample statistical uncertainty includes the uncertainty in the number of Monte Carlo and data-driven background events. The data-driven background uncertainty includes additional uncertainties, e.g. the uncertainty in the FNP-lepton estimate. The theory uncertainty includes any theory uncertainties in background estimates. The background normalization category includes the uncertainty associated with the normalization factors derived from CRs

9 Results

The data are compared with SM predictions using the profile likelihood method [24]. The HistFitter framework [122], built on RooFit, RooStats, HistFactory, and ROOT [123,124,125,126], is used to produce the presented statistical results. Model-independent upper limits on the number of events \((S^{95})\) at 95% CL that could be attributed to BSM processes are evaluated using the CL\(_\text {s}\) prescription [127]. Yields are reported after a fit to the background-only model, i.e. the zero signal-strength model. Compatibility tests are reported for the observed data and the background-only hypothesis (\(p(s=0)\)) and for the observed data and the hypothesis with the signal strength at the 95% CL limit (CL\(_\text {b}\)).

9.1 Recursive-jigsaw reconstruction search results

A breakdown of the observed and expected yields in the RJR search SRs is shown in Table 16. The background yields result from a simultaneous profile likelihood fit of the CRs, excluding the SRs themselves. The fits for SR\(2\ell \)-Low-RJR and SR\(2\ell \)-ISR-RJR are performed separately and are not combined. The background estimate agrees with the observed data in both cases. Figure 10 summarizes the event yields in the CRs, VRs, and SRs of the RJR search. The asymptotic approximation [24] is used to set model-independent upper limits on the number of events, or cross-section, which could arise from BSM processes in the two SRs. These are presented in Table 17. The asymptotic approximation was validated against pseudo-experiments.

Table 16 Breakdown of expected and observed yields in the two recursive-jigsaw reconstruction signal regions after a simultaneous fit of the CRs. The two sets of regions are fit separately. The uncertainties include both the statistical and systematic sources
Fig. 10
figure 10

The observed and expected yields in the control regions, validation regions, and signal regions of the RJR search after a simultaneous fit of the background-only model to control regions. The standard regions are shown to the left, and the ISR regions are shown to the right. These two sets of regions are fit separately. The hatched band includes the statistical and systematic uncertainties of the background prediction in each region. The significance of the difference between the observed data and the expected yield in each region is shown in the lower panel using the profile likelihood method of Ref. [128]; the colours black, grey, and red separate the CRs, VRs, and SRs, respectively. For the cases where the expected yield is larger than the data, a negative significance is shown. The relevant backgrounds are normalized to data in the CR by the fit and the significances are therefore zero

Table 17 Model-independent upper limits on the observed visible cross-section in the two recursive-jigsaw reconstruction signal regions, derived using the asymptotic approximation. Left to right: background-only model post-fit total expected background, with the combined statistical and systematic uncertainties; observed data; 95% CL upper limits on the visible cross-section (\(\langle A\epsilon \sigma \rangle _{\text {obs}}^{95}\)) and on the number of signal events (\(S_{\text {obs}}^{95}\) ). The sixth column (\(S_{\text {exp}}^{95}\)) shows the expected 95% CL upper limit on the number of signal events, given the expected number (and \(\pm 1\sigma \) excursions of the expectation) of background events. The last two columns indicate the confidence level of the background-only hypothesis (CL\(_\text {b}\)) and discovery p-value with the corresponding Gaussian significance (\(Z(s=0)\)). CL\(_\text {b}\) provides a measure of compatibility of the observed data with the signal strength hypothesis at the 95% CL limit relative to fluctuations of the background, and \(p(s=0)\) provides a measure of compatibility of the observed data with the background-only hypothesis relative to fluctuations of the background. The p-value is capped at 0.5
Table 18 Breakdown of expected and observed yields in the electroweak search High and \(\ell \ell bb\) signal regions after a simultaneous fit to the signal regions and control regions. All statistical and systematic uncertainties are included
Table 19 Breakdown of expected and observed yields in the electroweak search Int, Low, and OffShell signal regions after a simultaneous fit to the signal regions and control regions. All statistical and systematic uncertainties are included
Fig. 11
figure 11

Observed and expected distributions in five EWK search regions after a simultaneous fit to the signal regions and control regions. In the top row, left-to-right, are \(\mathcal {S}(E_{\text {T}}^{\text {miss}})\) in SR-High_8-EWK and \(m_{bb}\) in SR-\(\ell \ell bb\)-EWK. In the middle row, left-to-right, are \(\mathcal {S}(E_{\text {T}}^{\text {miss}})\) in SR-Int-EWK and \(\mathcal {S}(E_{\text {T}}^{\text {miss}})\) in SR-Low-EWK. In the bottom row is \(m_{\ell \ell }\) in SR-OffShell-EWK. Overlaid are example C1N2 and GMSB signal models, where the numbers in the brackets indicate the masses, in , of the \(\tilde{\chi }_1^\pm \) and \(\tilde{\chi }_2^0\) or the mass of the \(\tilde{\chi }_1^0\) and branching ratio to the Higgs boson respectively. All statistical and systematic uncertainties are included in the hatched bands. The last bin includes the overflow

Fig. 12
figure 12

The observed and expected yields in the control regions (left), validation regions (middle), and signal regions (right) of the EWK search after a simultaneous fit to the signal regions and control regions. The hatched band includes the statistical and systematic uncertainties of the background prediction in each region. The significance of the difference between the observed data and the expected yield in each region is shown in the lower panel using the profile likelihood method of Ref. [128]; the colours black, grey, and red separate the CRs, VRs, and SRs, respectively. For the cases where the expected yield is larger than the data, a negative significance is shown

9.2 Electroweak search results

The yields in the High, \(\ell \ell bb\), Int, Low, and OffShell SRs of the EWK search are listed in Tables 18 and 19. Each SR in the tables is split by the binning described in Sect. 6.4. For SR-High-EWK the 8 or 16 suffix denotes the \(\Delta R_{jj}\) ranges of 0–0.8 and 0.8–1.6 respectively. Similarly, the a and b suffixes denote the \(\mathcal {S}(E_{\text {T}}^{\text {miss}})\) ranges of 18–21 and \(>21\) respectively. For SR-Int-EWK and SR-Low-EWK, the a and b suffixes denote the lower and upper ranges of \(\mathcal {S}(E_{\text {T}}^{\text {miss}})\) in the binning (12, 15, 18) and (6, 9, 12) respectively. For SR-OffShell-EWK, the a and b suffixes denote the lower and upper \(m_{\ell \ell }\) ranges of 12–40 and 40–70 . The results are extracted after a simultaneous profile likelihood fit to the four CRs and all of the SRs. SR-High-EWK regions have small deficits in the observed data compared to the expected background yield. No systematic trends are observed in the VRs around these regions, VR-High-Sideband-EWK and VR-High-R-EWK. Figure 11 shows distributions in five regions, plotting the variable used for the binning denoted by a and b above. Example signal distributions are overlaid from models near the edge of sensitivity for this search. Figure 12 summarizes all of the CRs, VRs, and SRs of the EWK search.

In order to perform model-independent tests, the five SR categories are merged into single discovery region (DR) bins. DR-High-EWK is SR-High_8-EWK merged over its \(\mathcal {S}(E_{\text {T}}^{\text {miss}})\) bins, and DR-Low-EWK is SR-Low-EWK merged over its \(\mathcal {S}(E_{\text {T}}^{\text {miss}})\) bins. DR-High-EWK has the largest deficit of data compared to the expected background, resulting in a local significance of \(-2.8\sigma \). The resulting upper limits, based on pseudo-experiments, on the number of possible BSM events in each region are summarized in Table 20.

Table 20 Model-independent upper limits on the observed visible cross-section in the five electroweak search discovery regions, derived using pseudo-experiments. Left to right: background-only model post-fit total expected background, with the combined statistical and systematic uncertainties; observed data; 95% CL upper limits on the visible cross-section (\(\langle A\epsilon \sigma \rangle _{\text {obs}}^{95}\)) and on the number of signal events (\(S_{\text {obs}}^{95}\) ). The sixth column (\(S_{\text {exp}}^{95}\)) shows the expected 95% CL upper limit on the number of signal events, given the expected number (and \(\pm 1\sigma \) excursions of the expectation) of background events. The last two columns indicate the confidence level of the background-only hypothesis (CL\(_\text {b}\)) and discovery p-value with the corresponding Gaussian significance (\(Z(s=0)\)). CL\(_\text {b}\) provides a measure of compatibility of the observed data with the signal strength hypothesis at the 95% CL limit relative to fluctuations of the background, and \(p(s=0)\) measures compatibility of the observed data with the background-only hypothesis relative to fluctuations of the background. The p-value is capped at 0.5
Table 21 Breakdown of expected and observed yields in the four edge signal regions, integrated over the \(m_{\ell \ell }\) distribution after a separate simultaneous fit to each signal region and control region pair. The uncertainties include both the statistical and systematic sources
Table 22 Breakdown of expected and observed yields in the three on-\(Z\) signal regions after a separate simultaneous fit to each signal region and control region pair. The uncertainties include both the statistical and systematic sources
Fig. 13
figure 13

Observed and expected dilepton mass distributions in SRC-STR (top-left), SRLow-STR (top-right), SRMed-STR (bottom-left), and SRHigh-STR (bottom-right), with the binning used for interpretations after a separate simultaneous fit to each signal region and control region pair. The red dashed lines are example signal models overlaid on the figure. All statistical and systematic uncertainties are included in the hatched bands. The last bins are the overflow

Fig. 14
figure 14

The observed and expected yields in the control regions (left), validation regions (middle), and signal regions (right) of the Strong search after a separate simultaneous fit to each signal region and control region pair. The hatched band includes the statistical and systematic uncertainties of the background prediction in each region. The significance of the difference between the observed data and the expected yield in each region is shown in the lower panel using the profile likelihood method of Ref. [128]; the colours black, grey, and red separate the CRs, VRs, and SRs, respectively. For the cases where the expected yield is larger than the data, a negative significance is shown. The absence of significant yield differences in the control regions is by construction

9.3 Strong search results

The integrated data yields in the edge and on-\(Z\) regions are compared to the expected background yields in Tables 21 and 22 respectively. The \(m_{\ell \ell }\) distributions in the four edge regions are shown in Fig. 13. The yields and distributions are shown after a profile likelihood fit to the SR and its respective CR. The SM expectations agree well with the observed data. Figure 14 summarizes the observed and expected yields in all of the CRs, VRs, and SRs in the Strong search.

Since the signal models may produce a kinematic edge anywhere along the \(m_{\ell \ell }\) distribution, below the Z boson mass for those with a Z boson in the decay, several \(m_{\ell \ell }\) windows are used to search for an excess of data events above the expectation. Twelve windows covering various \(m_{\ell \ell }\) ranges in the edge SRs are considered, along with the three single-bin on-\(Z\) SRs. Results in each of these windows are summarized in Table 23. The data are generally consistent with the background to one standard deviation. The high \(m_{\ell \ell }\) bins of SRLow-STR have a slightly higher local deviation above the expectation. The table also includes model-independent limits on the number of possible BSM events, as described at the beginning of this section, evaluated with pseudo-experiments.

Table 23 Expected and observed yields in 12 \(m_{\ell \ell }\) windows of the edge signal regions and model-independent upper limits derived using pseudo-experiments. The \(m_{\ell \ell }\) range is indicated in the leftmost column of the table, with units of . Left to right: background-only model post-fit total expected background, with the combined statistical and systematic uncertainties; observed data; 95% CL upper limits on the visible cross-section (\(\langle A\epsilon \sigma \rangle _{\text {obs}}^{95}\)) and on the number of signal events (\(S_{\text {obs}}^{95}\) ). The sixth column (\(S_{\text {exp}}^{95}\)) shows the expected 95% CL upper limit on the number of signal events, given the expected number (and \(\pm 1\sigma \) excursions of the expectation) of background events. The last two columns indicate the confidence level of the background-only hypothesis (CL\(_\text {b}\)) and discovery p-value with the corresponding Gaussian significance (\(Z(s=0)\)). CL\(_\text {b}\) provides a measure of compatibility of the observed data with the signal strength hypothesis at the 95% CL limit relative to fluctuations of the background, and \(p(s=0)\) measures compatibility of the observed data with the background-only hypothesis relative to fluctuations of the background. The p-value is capped at 0.5
Fig. 15
figure 15

Expected and observed exclusion contours from the EWK analysis for the C1N2 model (left) and GMSB model (right). The dashed line indicates the expected limits at 95% CL and the surrounding band shows the \(1\sigma \) variation of the expected limit as a consequence of the uncertainties in the background prediction and experimental uncertainties of the signal (\(\pm 1\sigma _\text {exp}\)). The red dotted lines surrounding the observed limit contours indicate the variation resulting from changing the signal cross-section within its uncertainty (\(\pm 1\sigma _\text {theory}^\text {SUSY}\)). The grey-shaded areas indicate observed limits on these models from the two-lepton channels in Ref. [22] and Ref. [129]

Fig. 16
figure 16

Expected and observed exclusion contours derived from the combination of all of the Strong search SRs for the \(\tilde{g}\)\(\tilde{\ell }\) (top-left), \(\tilde{g}\)Z (top-right), and \(\tilde{q}\)–Z (bottom) models. The dashed line indicates the expected limits at 95% CL and the surrounding band shows the \(1\sigma \) variation of the expected limit as a consequence of the uncertainties in the background prediction and experimental uncertainties of the signal (\(\pm 1\sigma _\text {exp}\)). The red dotted lines surrounding the observed limit contours indicate the variation resulting from changing the signal cross-section within its uncertainty (\(\pm 1\sigma _\text {theory}^\text {SUSY}\)). The grey-shaded area indicates the observed limits on these models from Ref. [23]

10 Interpretation

This section contains the model-dependent interpretations of the EWK and Strong searches. The RJR search does not interpret any signal models as it is a model-independent follow-up to a previously observed excess. The exclusion contours are presented on a two-dimensional plane of sparticle masses, or in the case of the GMSB model the branching ratio to the \(\tilde{\chi }_1^0\) to a Higgs boson and gravitino and the mass of the \(\tilde{\chi }_1^0\). They are evaluated at 95% CL using the CL\(_\text {s}\) method. The dashed lines represent the expected exclusion contour given the background estimates and is surrounded by a yellow-shaded region identifying the \(\pm 1\sigma \) variation of the median expected limit. The red solid line denotes the observed exclusion contours. Uncertainties in the signal cross-sections are not directly included in this contour, but instead shown as red dashed lines denoting \(\pm 1\sigma \) variations of the signal cross-section. Grey-shaded areas cover regions excluded by previous similar analyses.

10.1 Electroweak search interpretation

This section presents mass and branching fraction exclusion limits for the C1N2 and GMSB models described in Sect. 2. All of the SRs and CRs are fitted simultaneously in order to test each model.

The left panel of Fig. 15 shows the exclusion contours in the \(m_{\tilde{\chi }_1^\pm ,\tilde{\chi }_2^0}\)\(m_{\tilde{\chi }_1^0}\) plane for the C1N2 model. Masses of the \(\tilde{\chi }_2^0\) and \(\tilde{\chi }_1^\pm \) up to 820  (740  expected) are excluded for \(\tilde{\chi }_1^0\) masses up to 375 . In the region where the Z boson decay is off-shell at , masses of the \(\tilde{\chi }_1^0\) up to 100  are excluded. The observed limit at high \(\tilde{\chi }_2^0\) and \(\tilde{\chi }_1^\pm \) masses is stronger than expected due to the data event deficits observed in SR-High-EWK.

The right panel of Fig. 15 shows the exclusion contours in the \(m_{\tilde{\chi }_1^0}\)\(B(\tilde{\chi }_1^0 \rightarrow h \tilde{G})\) plane. For a branching fraction entirely to Z bosons, masses of the \(\tilde{\chi }_1^0\) up to 900  (800  expected) are excluded. Branching fractions up to 95% (85% expected) of the \(\tilde{\chi }_1^0\) to Higgs bosons are excluded for \(\tilde{\chi }_1^0\) masses around 450 . Sensitivity to models with decays entirely through Higgs bosons is limited since the majority of Higgs boson decays do not produce a dilepton system with a mass around the Z boson mass. The observed limit at moderate to high \(\tilde{\chi }_1^0\) mass is stronger than expected due to the deficits observed in SR-\(\ell \ell bb\)-EWK and SR-High-EWK.

10.2 Strong search interpretation

This section presents mass exclusion limits for the squark- and gluino-initiated SUSY models described in Sect. 2. These model-dependent exclusion limits are computed separately for each SR, and then combined according to which SR has the best expected sensitivity for a particular set of the model parameters. This can result in sharp contour features where the best expected sensitivity switches between SRs with different data-to-background ratios. A shape fit of the binned \(m_{\ell \ell }\) distribution and the \(Z/\gamma ^*\text {\,+\,jets}\) CR is performed for the four edge SRs, shown in Fig. 13. A fit to the single-bin SR and its \(Z/\gamma ^*\text {\,+\,jets}\) CR is performed for three on-\(Z\) regions in Table 22.

The first plot in Fig. 16 shows the exclusion contours in the \(m(\tilde{g})\)\(m(\tilde{\chi }_1^0)\) plane for a simplified model where the gluino decays via sleptons. The limit is derived from only the four edge SRs. SRHigh-STR drives the limit for large splittings between the \(\tilde{g}\) and \(\tilde{\chi }_1^0\) masses. Moving up in \(\tilde{\chi }_1^0\) mass, SRLow-STR takes over around , . In SRLow-STR, the \(m_{\ell \ell }\) bins above a value of 200  have a slight excess, as seen in Table 23, hence the weaker observed exclusion limit in this part of the combined contour. The largest excluded gluino mass is observed (expected) to be 2.25 (2.20) .

The second plot in Fig. 16 shows the exclusion contours in the \(m(\tilde{g})\)\(m(\tilde{\chi }_1^0)\) plane for a simplified model where the gluino decays via a Z boson. The limit is derived using the four edge SRs and three on-\(Z\) SRs. The three kinks in the observed limit, from left to right, are the transition from SRLow-STR to SRZLow-STR, then to SRZMed-STR, and then to SRZHigh-STR. The largest excluded gluino mass is observed (expected) to be 1.95 (1.90) .

The third plot in Fig. 16 shows the exclusion contours in the \(m(\tilde{q})\)\(m(\tilde{\chi }_1^0)\) plane for a simplified model where the squark decays via a Z boson. As above, all of the SRs are included. The three kinks in the observed limit, from left to right, are the same transitions as in the \(m(\tilde{g})\)\(m(\tilde{\chi }_1^0)\) contour. The largest excluded squark mass is observed (expected) to be 1.55 (1.50) .

11 Conclusion

This paper presents searches for new phenomena in final states with exactly two oppositely charged same-flavour leptons, jets, and missing transverse momentum using the 139 fb\(^{-1}\) Run 2 dataset of proton–proton collision data collected by the ATLAS detector at the LHC. The analysis is split into three searches, probing both strong and electroweak production of sparticles. The data are found to be consistent with SM expectations in each, and strong limits are set on possible beyond-the-SM contributions to the signal regions. The RJR search does not see significant excesses above the background expectation. Thus the small excesses in the two-lepton channel of the 36 fb\(^{-1}\) dataset did not persist with more data. The EWK search and Strong search interpret the data with simplified models inspired by SUSY and set exclusion limits on these models, expanding the sensitivity reach compared to previous analyses. The EWK search targets electroweak production of chargino–neutralino pairs and a GMSB model with higgsino NLSPs. Limits up to 820  and 900  are set on the masses of the mass-degenerate chargino/neutralino and higgsino NLSP respectively. This is an improvement on the previous result by roughly 200  in chargino/neutralino masses. The EWK search covers masses and branching ratios of the GMSB model that are between the limits from the ATLAS four-lepton and all-hadronic searches. The Strong search targets strong production of gluino- or squark-pairs decaying via a Z boson or the NLSP to lepton-pairs and jets. Limits up to 2250  and 1550  are set on the masses of the gluino and squarks respectively. Compared to the previous result, the limits on the gluino mass improved by 400  and the limits on squark masses by 300 . Improvements to both the Strong and EWK searches result from the increased size of the dataset and subsequent optimizations of analysis requirements.