1 Introduction

Supersymmetry (SUSY) [19] is an extension to the Standard Model (SM) that introduces supersymmetric particles (sparticles), which differ by half a unit of spin from their SM partners. The squarks (\(\tilde{q}\)) and sleptons (\(\tilde{\ell }\)) are the scalar partners of the quarks and leptons, and the gluinos (\(\tilde{g}\)) are the fermionic partners of the gluons. The charginos (\({\tilde{\chi }}_{i}^{\pm }\) with \(i=1,2\)) and neutralinos (\({\tilde{\chi }}_{i}^{0}\) with \(i=1,2,3,4\)) are the mass eigenstates (ordered from the lightest to the heaviest) formed from the linear superpositions of the SUSY partners of the Higgs and electroweak gauge bosons. SUSY models in which the gluino, higgsino and top squark masses are not much higher than the \({\mathrm {TeV}}\) scale can provide a solution to the SM hierarchy problem [1015].

If strongly interacting sparticles have masses not higher than the \({\mathrm {TeV}}\) scale, they should be produced with observable rates at the Large Hadron Collider (LHC). In the minimal supersymmetric extension of the SM, such particles decay into jets, possibly leptons, and the lightest sparticle (LSP). If the LSP is stable due to R-parity conservation [1519] and only weakly interacting, it escapes detection, leading to missing transverse momentum (\(\mathbf {p}_{\mathrm {T}}^\mathrm {miss}\) and its magnitude \({E_{\mathrm{T}}^{\mathrm{miss}}} \)) in the final state. In this scenario, the LSP is a dark-matter candidate [20, 21].

Leptons may be produced in the cascade decays of squarks and gluinos via several mechanisms. Here two scenarios that always produce leptons (electrons or muons) in same-flavour opposite-sign (SFOS) pairs are considered: the leptonic decay of a Z boson, \(Z\rightarrow \ell ^+\ell ^-\), and the decay \(\tilde{\chi }_{2}^{0}\rightarrow \ell ^+\ell ^-\tilde{\chi }_{1}^{0}\), which includes contributions from \(\tilde{\chi }_{2}^{0} \rightarrow \tilde{\ell }^{\pm (*)}\ell ^\mp \rightarrow \ell ^+\ell ^-\tilde{\chi }_{1}^{0}\) and \(\tilde{\chi }_{2}^{0} \rightarrow Z^*\tilde{\chi }_{1}^{0} \rightarrow \ell ^+\ell ^-\tilde{\chi }_{1}^{0}\). In models with generalised gauge-mediated (GGM) supersymmetry breaking with a gravitino LSP (\(\tilde{G}\)), Z bosons may be produced via the decay \(\tilde{\chi }_{1}^{0} \rightarrow Z \tilde{G}\). Z bosons may also result from the decay \(\tilde{\chi }_{2}^{0} \rightarrow Z \tilde{\chi }_{1}^{0}\), although the GGM interpretation with the decay \(\tilde{\chi }_{1}^{0} \rightarrow Z \tilde{G}\) is the focus of the Z boson final-state channels studied here. The \(\tilde{\chi }_{2}^{0}\) particle may itself be produced in the decays of the squarks or gluinos, e.g. \(\tilde{q} \rightarrow q \tilde{\chi }_{2}^{0}\) and \(\tilde{g} \rightarrow q \bar{q} \tilde{\chi }_{2}^{0}\).

These two SFOS lepton production modes are distinguished by their distributions of dilepton invariant mass (\(m_{\ell \ell }\)). The decay \(Z\rightarrow \ell ^+\ell ^-\) leads to a peak in the \(m_{\ell \ell }\) distribution around the Z boson mass, while the decay \(\tilde{\chi }_{2}^{0}\rightarrow \ell ^+\ell ^-\tilde{\chi }_{1}^{0}\) leads to a rising distribution in \(m_{\ell \ell }\) that terminates at a kinematic endpoint (“edge”) [22], because events with larger \({m_{\ell \ell }}\) values would violate energy conservation in the decay of the \(\tilde{\chi }_{2}^{0}\) particle. In this paper, two searches are performed that separately target these two signatures. A search for events with a SFOS lepton pair consistent with originating from the decay of a Z boson (on-Z search) targets SUSY models with Z boson production. A search for events with a SFOS lepton pair inconsistent with Z boson decay (off-Z search) targets the decay \(\tilde{\chi }_{2}^{0}\rightarrow \ell ^+\ell ^-\tilde{\chi }_{1}^{0}\).

Previous searches for physics beyond the Standard Model (BSM) in the \(Z\,+\,\mathrm {jets}\,+\,{E_{\mathrm{T}}^{\mathrm{miss}}} \) final state have been performed by the CMS Collaboration [23, 24]. Searches for a dilepton mass edge have also been performed by the CMS Collaboration [24, 25]. In the CMS analysis performed with \(\sqrt{s}=8\) \({\mathrm {TeV}}\) data reported in Ref. [24], an excess of events above the SM background with a significance of 2.6 standard deviations was observed.

In this paper, the analysis is performed on the full 2012 ATLAS [26] dataset at a centre-of-mass energy of 8 \({\mathrm {TeV}}\), corresponding to an integrated luminosity of 20.3  fb\(^{-1}\).

2 The ATLAS detector

ATLAS is a multi-purpose detector consisting of a tracking system, electromagnetic and hadronic calorimeters and a muon system. The tracking system comprises an inner detector (ID) immersed in a 2 T axial field supplied by the central solenoid magnet surrounding it. This sub-detector provides position and momentum measurements of charged particles over the pseudorapidityFootnote 1 range \(|\eta |<2.5\). The electromagnetic calorimetry is provided by liquid argon (LAr) sampling calorimeters using lead absorbers, covering the central region (\(|\eta |<3.2\)). Hadronic calorimeters in the barrel region (\(|\eta |<1.7\)) use scintillator tiles with steel absorbers, while the pseudorapidity range \(1.5<|\eta |<4.9\) is covered using LAr technology with copper or tungsten absorbers. The muon spectrometer (MS) has coverage up to \(|\eta |<2.7\) and is built around the three superconducting toroid magnet systems. The MS uses various technologies to provide muon tracking and identification as well as dedicated muon triggering for the range \(|\eta |<2.4\).

The trigger system [27] comprises three levels. The first of these (L1) is a hardware-based trigger that uses only a subset of calorimeter and muon system information. Following this, both the second level (L2) and event filter (EF) triggers, constituting the software-based high-level trigger, include fully reconstructed event information to identify objects. At L2, only the regions of interest in \(\eta \)\(\phi \) identified at L1 are scrutinised, whereas complete event information from all detector sub-systems is available at the EF.

3 Data and Monte Carlo samples

The data used in this analysis were collected by ATLAS during 2012. Following requirements based on beam and detector conditions and data quality, the complete dataset corresponds to an integrated luminosity of 20.3 fb\(^{-1}\), with an associated uncertainty of 2.8 %. The uncertainty is derived following the same methodology as that detailed in Ref. [28].

Dedicated high-transverse-momentum (\({p_{\mathrm{T}}} \)) single-lepton triggers are used in conjunction with the lower-\({p_{\mathrm{T}}} \) dilepton triggers to increase the trigger efficiency at high lepton \({p_{\mathrm{T}}} \). The required leading-lepton \({p_{\mathrm{T}}} \) threshold is 25 \({\mathrm {GeV}}\), whereas the sub-leading lepton threshold can be as low as 10 \({\mathrm {GeV}}\), depending on the lepton \({p_{\mathrm{T}}} \) threshold of the trigger responsible for accepting the event. To provide an estimate of the efficiency for the lepton selections used in these analyses, trigger efficiencies are calculated using \(t\bar{t}\) Monte Carlo (MC) simulated event samples for leptons with \({p_{\mathrm{T}}} >14 {\mathrm {GeV}}\). For events where both leptons are in the barrel (endcaps), the total efficiency of the trigger configuration for a two-lepton selection is approximately 96, 88 and 80 % (91, 92 and 82 %) for ee, \(e\mu \) and \(\mu \mu \) events, respectively. Although the searches in this paper probe only same-flavour final states for evidence of SUSY, the \(e\mu \) channel is used to select control samples in data for background estimation purposes.

Simulated event samples are used to validate the analysis techniques and aid in the estimation of SM backgrounds, as well as to provide predictions for BSM signal processes. The SM background samples [2940] used are listed in Table 1, as are the parton distribution function (PDF) set, underlying-event tune and cross-section calculation order in \(\alpha _{\mathrm{s}}\) used to normalise the event yields for these samples. Samples generated with MadGraph5 1.3.28 [41] are interfaced with Pythia 6.426 [42] to simulate the parton shower. All samples generated using Powheg [4345] use Pythia to simulate the parton shower, with the exception of the diboson samples, which use Pythia8 [46]. Sherpa [47] simulated samples use Sherpa’s own internal parton shower and fragmentation methods, as well as the Sherpa default underlying-event tune [47]. The standard ATLAS underlying-event tune, AUET2 [48], is used for all other samples with the exception of the Powheg+Pythia samples, which use the Perugia2011C [49] tune.

The signal models considered include simplified models and a GGM supersymmetry-breaking model. In the simplified models, squarks and gluinos are directly pair-produced, and these subsequently decay to the LSP via two sets of intermediate particles. The squarks and gluinos decay with equal probability to the next-to-lightest neutralino or the lightest chargino, where the neutralino and chargino are mass-degenerate and have masses taken to be the average of the squark or gluino mass and the LSP mass. The intermediate chargino or neutralino then decays via sleptons (or sneutrinos) to two leptons of the same flavour and the lightest neutralino, which is assumed to be the LSP in these models. Here, the sleptons and sneutrinos are mass-degenerate and have masses taken to be the average of the chargino or neutralino and LSP masses. An example of one such process, \(pp\rightarrow \tilde{g}\tilde{g}\rightarrow (q\bar{q}\tilde{\chi }_{2}^{0})(q\bar{q}\tilde{\chi }_{1}^{\pm }), \tilde{\chi }_{2}^{0}\rightarrow \ell ^{+}\ell ^{-}\tilde{\chi }_{1}^{0},\) \(\tilde{\chi }_{1}^{\pm }\rightarrow \ell ^{\pm }\nu \tilde{\chi }_{1}^{0}\) is illustrated on the left in Fig. 1, where \(\ell =e,\mu ,\tau \) with equal branching fractions for each lepton flavour. The dilepton mass distribution for leptons produced from the \(\tilde{\chi }_{2}^{0}\) in these models is a rising distribution that terminates at a kinematic endpoint, whose value is given by \(m_{\mathrm {max}}\) \(\approx m(\tilde{\chi }_{2}^{0})-m(\tilde{\chi }_{1}^{0}) = 1/2(m(\tilde{g}/\tilde{q})-m(\tilde{\chi }_{1}^{0}))\). Therefore, signal models with small values of \(\Delta m = m(\tilde{g}/\tilde{q})-m(\tilde{\chi }_{1}^{0})\) produce events with small dilepton masses; those with large \(\Delta m\) produce events with large dilepton mass.

For the model involving squark pair production, the left-handed partners of the u, d, c and s quarks have the same mass. The right-handed squarks and the partners of the b and t quarks are decoupled. For the gluino-pair model, an effective three-body decay for \(\tilde{g}\rightarrow q \bar{q} \tilde{\chi }_{1}^{0}\) is used, with equal branching fractions for \(q=u,d,c,s\). Exclusion limits on these models are set based on the squark or gluino mass and the LSP mass, with all sparticles not directly involved in the considered decay chains effectively being decoupled.

In the general gauge mediation models, the gravitino is the LSP and the next-to-lightest SUSY particle (NLSP) is a higgsino-like neutralino. The higgsino mass parameter, \(\mu \), and the gluino mass are free parameters. The U(1) and SU(2) gaugino mass parameters, \(M_{1}\) and \(M_{2}\), are fixed to be 1 TeV, and the masses of all other sparticles are set at \(\sim \)1.5 TeV. In addition, \(\mu \) is set to be positive to make \(\tilde{\chi }^{0}_{1} \rightarrow Z \tilde{G}\) the dominant NLSP decay. The branching fraction for \(\tilde{\chi }^{0}_{1} \rightarrow Z \tilde{G}\) varies with \(\tan \beta \), the ratio of the vacuum expectation value for the two Higgs doublets, and so two different values of \(\tan \beta \) are used. At \(\tan \beta =1.5\), the branching fraction for \(\tilde{\chi }^{0}_{1} \rightarrow Z \tilde{G}\) is large (about 97 %) [50], whereas setting \(\tan \beta =30\) results in a considerable contribution (up to 40 %) from \(\tilde{\chi }^{0}_{1} \rightarrow h \tilde{G}\). In these models, h is the lightest CP-even SUSY Higgs boson, with \(m_{h}=126\) GeV and SM-like branching fractions. The dominant SUSY-particle production mode in these scenarios is the strong production of gluino pairs, which subsequently decay to the LSP via several intermediate particles. An example decay mode is shown in the diagram on the right in Fig. 1. The gravitino mass is set to be sufficiently small such that the NLSP decays are prompt. The decay length \(c\tau _{\mathrm{NLSP}}\) (where \(\tau _{\mathrm{NLSP}}\) is the lifetime of the NLSP) can vary depending on \(\mu \), and is longest at \(\mu =120\) GeV, where it is 2 mm, decreasing to \(c\tau _{\mathrm{NLSP}}<0.1\) mm for \(\mu \ge 150\,{\mathrm {GeV}}\). The finite NLSP lifetime is taken into account in the MC signal acceptance and efficiency determination.

All simplified models are produced using MadGraph5 1.3.33 with the CTEQ6L1 PDF set, interfaced with Pythia 6.426. The scale parameter for MLM matching [51] is set at a quarter of the mass of the lightest strongly produced sparticle in the matrix element. The SUSY mass spectra, gluino branching fractions and the gluino decay width for the GGM scenarios are calculated using Suspect 2.41 [52] and Sdecay 1.3 [53]. The GGM signal samples are generated using Pythia 6.423 with the MRST2007 LO \(^{*}\) [54] PDF set. The underlying event is modelled using the AUET2 tune for all signal samples. Signals are normalised to cross sections calculated at next-to-leading order (NLO) in \(\alpha _{s}\), including the resummation of soft gluon emission at next-to-leading-logarithmic accuracy (NLO + NLL) [5559].

A full ATLAS detector simulation [60] using GEANT4 [61] is performed for most of the SM background MC samples. The signal and remaining SM MC samples use a fast simulation [62], which employs a combination of a parameterisation of the response of the ATLAS electromagnetic and hadronic calorimeters and GEANT4. To simulate the effect of multiple pp interactions occurring during the same (in-time) or a nearby (out-of-time) bunch-crossing, called pile-up, minimum-bias interactions are generated and overlaid on top of the hard-scattering process. These are produced using Pythia8 with the A2 tune [63]. MC-to-data corrections are made to simulated samples to account for small differences in lepton identification and reconstruction efficiencies, and the efficiency and misidentification rate associated with the algorithm used to distinguish jets containing b-hadrons.

Fig. 1
figure 1

Decay topologies for example signal processes. A simplified model involving gluino pair production, with the gluinos following two-step decays via sleptons to neutralino LSPs is shown on the left. The diagram on the right shows a GGM decay mode, where gluinos decay via neutralinos to gravitino LSPs

Table 1 Simulated background event samples used in this analysis with the corresponding generator, cross-section order in \(\alpha _{\mathrm{s}}\) used to normalise the event yield, underlying-event tune and PDF set

4 Physics object identification and selection

Electron candidates are reconstructed using energy clusters in the electromagnetic calorimeter matched to ID tracks. Electrons used in this analysis are assigned either “baseline” or “signal” status. Baseline electrons are required to have transverse energy \(E_{\mathrm{T}}>10\) GeV, satisfy the “medium” criteria described in Ref. [64] and reside within \(|\eta |<2.47\) and not in the range \(1.37<|\eta |<1.52\). Signal electrons are further required to be consistent with the primary vertex and isolated with respect to other objects in the event, with a \({p_{\mathrm{T}}} \)-dependent isolation requirement. The primary vertex is defined as the reconstructed vertex with the highest \(\sum p_{\mathrm{T}}^2\), where the summation includes all particle tracks with \({p_{\mathrm{T}}} >400\) \(\mathrm {MeV}\) associated with a given reconstructed vertex. Signal electrons with \(E_{\mathrm{T}}<25\) GeV must additionally satisfy the more stringent shower shape, track quality and matching requirements of the “tight” selection criteria in Ref. [64]. For electrons with \(E_{\mathrm{T}}<25\) GeV (\(\ge \)25 GeV), the sum of the transverse momenta of all charged-particle tracks with \({p_{\mathrm{T}}} >400\) \(\mathrm {MeV}\) associated with the primary vertex, excluding the electron track, within \(\Delta R = 0.3\) (0.2) surrounding the electron must be less than 16 % (10 %) of the electron \({p_{\mathrm{T}}} \). Electrons with \(E_{\mathrm{T}}<25\) GeV must reside within a distance \(|z_{0}\sin \theta | < 0.4\) mm of the primary vertex along the direction of the beamlineFootnote 2. The significance of the transverse-plane distance of closest approach of the electron to the primary vertex must be \(|d_{0}/\sigma _{d_{0}}|<5\). For electrons with \(E_{\mathrm{T}} \ge 25\) GeV, \(|z_{0}|\) is required to be \(<2\) mm and \(|d_{0}|<1\) mm.

Baseline muons are reconstructed from either ID tracks matched to a muon segment in the muon spectrometer or combined tracks formed both from the ID and muon spectrometer [65]. They are required to be of good quality, as described in Ref. [66], and to satisfy \(p_{\mathrm{T}}>10\) GeV and \(|\eta |<2.4\). Signal muons are further required to be isolated, with the scalar sum of the \({p_{\mathrm{T}}} \) of charged particle tracks associated with the primary vertex, excluding the muon track, within a cone of size \(\Delta R<0.3\) surrounding the muon being less than 12 % of the muon \({p_{\mathrm{T}}} \) for muons with \(p_{\mathrm{T}}<25\) GeV. For muons with \(p_{\mathrm{T}} \ge 25\) GeV, the scalar sum of the \({p_{\mathrm{T}}} \) of charged-particle tracks associated with the primary vertex, excluding the muon track, within \(\Delta R<0.2\) surrounding the muon must be less than 1.8 GeV. Signal muons with \(p_{\mathrm{T}}<25\) GeV must also have \(|z_{0}\sin \theta | \le 1\) mm and \(|d_{0}/\sigma _{d_{0}}|<3\). For the leptons selected by this analysis, the \(d_{0}\) requirement is typically several times less restrictive than the \(|d_{0}/\sigma _{d_{0}}|\) requirement.

Jets are reconstructed from topological clusters in the calorimeter using the anti-\(k_{t}\) algorithm [67] with a distance parameter of 0.4. Each cluster is categorised as being electromagnetic or hadronic in origin according to its shape [68], so as to account for the differing calorimeter response for electrons/photons and hadrons. A cluster-level correction is then applied to electromagnetic and hadronic energy deposits using correction factors derived from both MC simulation and data. Jets are corrected for expected pile-up contributions [69] and further calibrated to account for the calorimeter response with respect to the true jet energy [70, 71]. A small residual correction is applied to the jets in data to account for differences between response in data and MC simulation. Baseline jets are selected with \(p_{\mathrm{T}}>20\) GeV. Events in which these jets do not pass specific jet quality requirements are rejected so as to remove events affected by detector noise and non-collision backgrounds [72, 73]. Signal jets are required to satisfy \(p_{\mathrm{T}}>35\) GeV and \(|\eta |<2.5\). To reduce the impact of jets from pileup to a negligible level, jets with \(p_{\mathrm{T}}<50\) GeV within \(|\eta |<2.4\) are further required to have a jet vertex fraction \(|\mathrm {JVF}|> 0.25\). Here the JVF is the \({p_{\mathrm{T}}} \)-weighted fraction of tracks matched to the jet that are associated with the primary vertex [74], with jets without any associated tracks being assigned \(\text {JVF}=-1\).

The MV1 neural network algorithm [75] identifies jets containing b-hadrons using the impact parameters of associated tracks and any reconstructed secondary vertices. For this analysis, the working point corresponding to a 60 % efficiency for tagging b-jets in simulated \(t\bar{t}\) events is used, resulting in a charm quark rejection factor of approximately 8 and a light quark/gluon jet rejection factor of about 600. To ensure that each physics object is counted only once, an overlap removal procedure is applied. If any two baseline electrons reside within \(\Delta R=0.05\) of one another, the electron with lower \(E_{\mathrm{T}}\) is discarded. Following this, any baseline jets within \(\Delta R=0.2\) of a baseline electron are removed. After this, any baseline electron or muon residing within \(\Delta R=0.4\) of a remaining baseline jet is discarded. Finally, to remove electrons originating from muon bremsstrahlung, any baseline electron within \(\Delta R=0.01\) of any remaining baseline muon is removed from the event.

The \({E_{\mathrm{T}}^{\mathrm{miss}}} \) is defined as the magnitude of the vector sum of the transverse momenta of all photons, electrons, muons, baseline jets and an additional “soft term” [76]. The soft term includes clusters of energy in the calorimeter not associated with any calibrated object, which are corrected for material effects and the non-compensating nature of the calorimeter. Reconstructed photons used in the \({E_{\mathrm{T}}^{\mathrm{miss}}} \) calculation are required to satisfy the “tight” requirements of Ref. [77].

5 Event selection

Events selected for this analysis must have at least five tracks with \(p_{\mathrm{T}}>400\) MeV associated with the primary vertex. Any event containing a baseline muon with \(|z_{0} \sin \theta |>0.2\) mm or \(|d_{0}|>1\) mm is rejected, to remove cosmic-ray events. To reject events with fake \({E_{\mathrm{T}}^{\mathrm{miss}}} \), those containing poorly measured muon candidates, characterised by large uncertainties on the measured momentum, are also removed. If the invariant mass of the two leading leptons in the event is less than 15 GeV the event is vetoed to suppress low-mass particle decays and Drell–Yan production.

Events are required to contain at least two signal leptons (electrons or muons). If more than two signal leptons are present, the two with the largest values of \({p_{\mathrm{T}}} \) are selected. These leptons must pass one of the leptonic triggers, with the two leading leptons being matched, within \(\Delta R<0.15\), to the online trigger objects that triggered the event in the case of the dilepton triggers. For events selected by a single-lepton trigger, one of the two leading leptons must be matched to the online trigger object in the same way. The leading lepton in the event must have \(p_{\mathrm{T}}>25\) GeV and the sub-leading lepton is required to have \(p_{\mathrm{T}}>10\)–14 \({\mathrm {GeV}}\), depending on the \({p_{\mathrm{T}}} \) theshold of the trigger selecting the event. For the off-Z analysis, the sub-leading lepton \(p_{\mathrm{T}}\) threshold is increased to 20 GeV. This is done to improve the accuracy of the method for estimating flavour-symmetric backgrounds, discussed in Sect. 6.2, in events with small dilepton invariant mass. For the same reason, the \(m_{\ell \ell }\) threshold is also raised to 20 GeV in this search channel. The two leading leptons must be oppositely charged, with the signal selection requiring that these be same-flavour (SF) lepton pairs. The different-flavour (DF) channel is also exploited to estimate certain backgrounds, such as that due to \(t\bar{t}\) production. All events are further required to contain at least two signal jets, since this is the minimum expected jet multiplicity for the signal models considered in this analysis.

Three types of region are used in the analysis. Control regions (CRs) are used to constrain the SM backgrounds. These backgrounds, estimated in the CRs, are first extrapolated to the validation regions (VRs) as a cross check and then to the signal regions (SRs), where an excess over the expected background is searched for.

GGM scenarios are the target of the on-Z search, where the \(\tilde{G}\) from \(\tilde{\chi }^{0}_{1} \rightarrow (Z/h)+ \tilde{G}\) decays is expected to result in \({E_{\mathrm{T}}^{\mathrm{miss}}} \). The Z boson mass window used for this search is \(81<m_{\ell \ell }<101\) GeV. To isolate GGM signals with high gluino mass and high jet activity the on-Z SR, SR-Z, is defined using requirements on \({E_{\mathrm{T}}^{\mathrm{miss}}} \) and \(H_{\mathrm{T}} = \sum _i p_{\mathrm{T}}^{\mathrm{jet},i} + p_{\mathrm{T}}^{\mathrm{lepton},1} + p_{\mathrm{T}}^{\mathrm{lepton},2}\), where \({H_{\mathrm{T}}} \) includes all signal jets and the two leading leptons. Since b-jets are often, but not always, expected in GGM decay chains, no requirement is placed on b-tagged jet multiplicity. Dedicated CRs are defined in order to estimate the contribution of various SM backgrounds to the SR. These regions are constructed with selection criteria similar to those of the SR, differing either in mll or MET ranges, or in lepton flavour requirements. A comprehensive discussion of the various methods used to perform these estimates follows in Sect. 6. For the SR and CRs, detailed in Table 2, a further requirement on the azimuthal opening angle between each of the leading two jets and the \({E_{\mathrm{T}}^{\mathrm{miss}}} \) (\(\Delta \phi (\text {jet}_{1,2},{E_{\mathrm{T}}^{\mathrm{miss}}})\)) is introduced to reject events with jet mismeasurements contributing to large fake \({E_{\mathrm{T}}^{\mathrm{miss}}} \). This requirement is applied in the SR and two CRs used in the on-Z search, all of which have high \({E_{\mathrm{T}}^{\mathrm{miss}}} \) and \(H_{\mathrm{T}}\) thresholds, at 225 and 600 GeV, respectively. Additional VRs are defined at lower \({E_{\mathrm{T}}^{\mathrm{miss}}} \) and \(H_{\mathrm{T}}\) to cross-check the SM background estimation methods. These are also sumarised in Table 2. The SR selection results in an acceptance times efficiency of 2–4 %, including leptonic Z branching fractions, for GGM signal models with \(\mu >400\,{\mathrm {GeV}}\).

In the off-Z analysis, a search is performed in the Z boson sidebands. The Z boson mass window vetoed here is larger than that selected in the on-Z analysis (\(m_{\ell \ell } \notin [80,110]\) GeV) to maximise Z boson rejection. An asymmetric window is chosen to improve the suppression of boosted \(Z\rightarrow \mu \mu \) events with muons whose momenta are overestimated, leading to large \({E_{\mathrm{T}}^{\mathrm{miss}}} \). In this search, four SRs are defined by requirements on jet multiplicity, b-tagged jet multiplicity, and \({E_{\mathrm{T}}^{\mathrm{miss}}} \). The SR requirements are optimised for the simplified models of pair production of squarks (requiring at least two jets) and gluinos (requiring at least four jets) discussed in Sect. 3. Two SRs with a b-veto provide the best sensitivity in the simplified models considered here, since the signal b-jet content is lower than that of the dominant \(t\bar{t}\) background. Orthogonal SRs with a requirement of at least one b-tagged jet target other signal models not explicitly considered here, such as those with bottom squarks that are lighter than the other squark flavours. For these four SRs, the requirement \({E_{\mathrm{T}}^{\mathrm{miss}}} >200\,{\mathrm {GeV}}\) is imposed. In addition, one signal region with requirements similar to those used in the CMS search [24] is defined (SR-loose). These SRs and their respective CRs, which have the same jet and \({E_{\mathrm{T}}^{\mathrm{miss}}} \) requirements, but select different \(m_{\ell \ell }\) ranges or lepton flavour combinations, are defined in Table 3.

The most sensitive off-Z SR for the squark-pair (gluino-pair) model is SR-2j-bveto (SR-4j-bveto). Because the value of the \({m_{\ell \ell }}\) kinematic endpoint depends on unknown model parameters, the analysis is performed over multiple \({m_{\ell \ell }}\) ranges for these two SRs. The dilepton mass windows considered for the SR-2j-bveto and SR-4j-bveto regions are presented in Sect. 9. For the combined \(ee+\mu \mu \) channels, the typical signal acceptance times efficiency values for the squark-pair (gluino-pair) model in the SR-2j-bveto (SR-4j-bveto) region are 0.1–10 % (0.1–8 %) over the full dilepton mass range.

The on-Z and off-Z searches are optimised for different signal models and as such are defined with orthogonal SRs. Given the different signatures probed, there are cases where the CR of one search may overlap with the SR of the other. Data events that fall in the off-Z SRs can comprise up to 60 % of the top CR for the on-Z analysis (CRT, defined in Table 2). Data events in SR-Z comprise up to 36 % of the events in the CRs with \(80<{m_{\ell \ell }}<110\,{\mathrm {GeV}}\) that are used to normalise the \({Z+\text {jets}}\) background in the off-Z analysis, but the potential impact on the background prediction is small because the \({Z+\text {jets}}\) contribution is a small fraction of the total background. For the following analysis, each search assumes only signal contamination from the specific signal model they are probing.

Table 2 Overview of all signal, control and validation regions used in the on-Z search. More details are given in the text. The \({E_{\mathrm{T}}^{\mathrm{miss}}} \) significance and the soft-term fraction \(f_{\mathrm{ST}}\) needed in the seed regions for the jet smearing method are defined in Sect. 6.1. The flavour combination of the dilepton pair is denoted as either “SF” for same-flavour or “DF” for different flavour
Table 3 Overview of all signal, control and validation regions used in the off-Z analysis. For SR-loose, events with two jets (at least three jets) are required to satisfy \({E_{\mathrm{T}}^{\mathrm{miss}}} \) \(>\) 150 (100) \({\mathrm {GeV}}\). Further details are the same as in Table 2

6 Background estimation

The dominant background processes in the signal regions, and those that are expected to be most difficult to model using MC simulation, are estimated using data-driven techniques. With SRs defined at large \({E_{\mathrm{T}}^{\mathrm{miss}}} \), any contribution from \({Z/\gamma ^{*}+\text {jets}}\) will be a consequence of artificially high \({E_{\mathrm{T}}^{\mathrm{miss}}} \) in the event due to, for example, jet mismeasurements. This background must be carefully estimated, particularly in the on-Z search, since the peaking \({Z/\gamma ^{*}+\text {jets}}\) background can mimic the signal. This background is expected to constitute, in general, less than 10 % of the total background in the off-Z SRs and have a negligible contribution to SR-Z.

In both the off-Z and on-Z signal regions, the dominant backgrounds come from so-called “flavour-symmetric” processes, where the dileptonic branching fractions to ee, \(\mu \mu \) and \(e\mu \) have a 1:1:2 ratio such that the same-flavour contributions can be estimated using information from the different-flavour contribution. This group of backgrounds is dominated by \(t\bar{t}\) and also includes WW, single top (Wt) and \(Z\rightarrow \tau \tau \) production, and makes up \(\sim \)60 % (\(\sim \) 90 %) of the predicted background in the on-Z (off-Z) SRs.

Diboson backgrounds with real Z boson production, while small in the off-Z regions, contribute up to 25 % of the total background in the on-Z regions. These backgrounds are estimated using MC simulation, as are “rare top” backgrounds, including \(t\bar{t}+W(W)/Z\) (i.e. \(t\bar{t}+W\), \(t\bar{t}+Z\) and \(t\bar{t}+WW\)) and \(t+Z\) processes. All backgrounds that are estimated from MC simulation are subject to carefully assessed theoretical and experimental uncertainties.

Other processes, including those that might be present due to mis-reconstructed jets entering as leptons, can contribute up to 10 % (6 %) in the on-Z (off-Z) SRs. The background estimation techniques followed in the on-Z and off-Z searches are similar, with a few well-motivated exceptions.

6.1 Estimation of the \(Z/\gamma ^{*}+\) jets background

6.1.1 \(Z/\gamma ^{*}+\) jets background in the off-Z search

In the off-Z signal regions, the background from \({Z/\gamma ^{*}+\text {jets}}\) is due to off-shell Z bosons and photons, or to on-shell Z bosons with lepton momenta that are mismeasured. The region with dilepton mass in the range \(80<m_{\ell \ell }<110\) GeV is not considered as a search region. To estimate the contribution from \({Z/\gamma ^{*}+\text {jets}}\) outside of this range, dilepton mass shape templates are derived from \({Z/\gamma ^{*}+\text {jets}}\) MC events. These shape templates are normalised to data in control regions with the same selection as the corresponding signal regions, but with the requirement on \(m_{\ell \ell }\) inverted to \(80<m_{\ell \ell }<110\) GeV, to select a sample enriched in \({Z/\gamma ^{*}+\text {jets}}\) events. These CRs are defined in Table 3.

6.1.2 \(Z/\gamma ^{*}+\) jets background in the on-Z search

The assessment of the peaking background due to \({Z/\gamma ^{*}+\text {jets}}\) in the on-Z signal regions requires careful consideration. The events that populate the signal regions result from mismeasurements of physics objects where, for example, one of the final-state jets has its energy underestimated, resulting in an overestimate of the total \({E_{\mathrm{T}}^{\mathrm{miss}}} \) in the event. Due to the difficulties of modelling instrumental \({E_{\mathrm{T}}^{\mathrm{miss}}} \) in simulation, MC events are not relied upon alone for the estimation of the \({Z/\gamma ^{*}+\text {jets}}\) background. A data-driven technique is used as the nominal method for estimating this background. This technique confirms the expectation from MC simulation that the \(Z+\mathrm {jets}\) background is negligible in the SR.

The primary method used to model the \({Z/\gamma ^{*}+\text {jets}}\) background in SR-Z is the so-called “jet smearing” method, which is described in detail in Ref. [78]. This involves defining a region with \({Z/\gamma ^{*}+\text {jets}}\) events containing well-measured jets (at low \({E_{\mathrm{T}}^{\mathrm{miss}}} \)), known as the “seed” region. The jets in these events are then smeared using functions that describe the detector’s jet \({p_{\mathrm{T}}} \) response and \(\phi \) resolution as a function of jet \({p_{\mathrm{T}}} \), creating a set of pseudo-data events. The jet-smearing method provides an estimate for the contribution from events containing both fake \({E_{\mathrm{T}}^{\mathrm{miss}}} \), from object mismeasurements, and real \({E_{\mathrm{T}}^{\mathrm{miss}}} \), from neutrinos in heavy-flavour quark decays, by using different response functions for light-flavour and b-tagged jets. The response function is measured by comparing generator-level jet \({p_{\mathrm{T}}} \) to reconstructed jet \({p_{\mathrm{T}}} \) in Pythia8 dijet MC events, generated using the CT10 NLO PDF set. This function is then tuned to data, based on a dijet balance analysis in which the \({p_{\mathrm{T}}} \) asymmetry is used to constrain the width of the Gaussian core. The non-Gaussian tails of the response function are corrected based on \(\ge \)3-jet events in data, selected such that the \({E_{\mathrm{T}}^{\mathrm{miss}}} \) in each event points either towards, or in the opposite direction to one of the jets. This ensures that one of the jets is clearly associated with the \({E_{\mathrm{T}}^{\mathrm{miss}}} \), and the jet response can then be described in terms of the \({E_{\mathrm{T}}^{\mathrm{miss}}} \) and reconstructed jet \({p_{\mathrm{T}}} \). This procedure results in a good estimate of the overall jet response.

In order to calculate the \({E_{\mathrm{T}}^{\mathrm{miss}}} \) distribution of the pseudo-data, the \({E_{\mathrm{T}}^{\mathrm{miss}}} \) is recalculated using the new (smeared) jet \({p_{\mathrm{T}}} \) and \(\phi \). The distribution of pseudo-data events is then normalised to data in the low-\({E_{\mathrm{T}}^{\mathrm{miss}}} \) region (10 \(<\) \({E_{\mathrm{T}}^{\mathrm{miss}}} \) \(<\) 50 GeV) of a validation region, denoted VRZ, after the requirement of \(\Delta \phi (\text {jet}_{1,2}, {E_{\mathrm{T}}^{\mathrm{miss}}})>0.4\). This is defined in Table 2 and is designed to be representative of the signal region but at lower \({E_{\mathrm{T}}^{\mathrm{miss}}} \), where the contamination for relevant GGM signal models is expected to be less than 1 %.

The seed region must contain events with topologies similar to those expected in the signal region. To ensure that this is the case, the \({H_{\mathrm{T}}} \) and jet multiplicity requirements applied to the seed region remain the same as in the signal region, while the \({E_{\mathrm{T}}^{\mathrm{miss}}} \) threshold of 225 GeV is removed, as shown in Table 2. Although the seed events should have little to no \({E_{\mathrm{T}}^{\mathrm{miss}}} \), enforcing a direct upper limit on \({E_{\mathrm{T}}^{\mathrm{miss}}} \) can introduce a bias in the jet \({p_{\mathrm{T}}} \) distribution in the seed region compared with the signal region. To avoid this, a requirement on the \({E_{\mathrm{T}}^{\mathrm{miss}}} \)  significance, defined as:

$$\begin{aligned} {E_{\mathrm{T}}^{\mathrm{miss}}} \, \text {sig.} = \frac{{E_{\mathrm{T}}^{\mathrm{miss}}}}{\sqrt{\sum E_{\mathrm{T}}^{\mathrm{jet}} + \sum E_{\mathrm{T}}^{\mathrm{soft}}}} , \end{aligned}$$
(1)

is used in the seed region. Here \(\sum E_{\mathrm{T}}^{\mathrm{jet}}\) and \(\sum E_{\mathrm{T}}^{\mathrm{soft}}\) are the summed \(E_{\mathrm{T}}\) from the baseline jets and the low-energy calorimeter deposits not associated with final-state physics objects, respectively. Placing a requirement on this variable does not produce a shape difference between jet \({p_{\mathrm{T}}} \) distributions in the seed and signal regions, while effectively selecting well-balanced \({Z/\gamma ^{*}+\text {jets}}\) events in the seed region. This requirement is also found to result in no event overlap between the seed region and SR-Z.

In the seed region an additional requirement is placed on the soft-term fraction, \(f_{\mathrm{ST}}\), defined as the fraction of the total \({E_{\mathrm{T}}^{\mathrm{miss}}} \) in an event originating from calorimeter energy deposits not associated with a calibrated lepton or jet (\(f_{\mathrm{ST}} = \sum E_{\mathrm{T}}^{\mathrm{miss, Soft}} / {E_{\mathrm{T}}^{\mathrm{miss}}} \)), to select events with small \(f_{\mathrm{ST}}\). This is useful because events with large values of fake \({E_{\mathrm{T}}^{\mathrm{miss}}} \) tend to have low soft-term fractions (\(f_{\mathrm{ST}}<0.6\)).

The requirements on the \({E_{\mathrm{T}}^{\mathrm{miss}}} \) significance and \(f_{\mathrm{ST}}\) are initially optimised by applying the jet smearing method to \({Z/\gamma ^{*}+\text {jets}}\) MC events and testing the agreement in the \({E_{\mathrm{T}}^{\mathrm{miss}}} \) spectrum between direct and smeared MC events in the VRZ. This closure test is performed using the response function derived from MC simulation.

The \({Z/\gamma ^{*}+\text {jets}}\) background predominantly comes from events where a single jet is grossly mismeasured, since the mismeasurement of additional jets is unlikely, and can lead to smearing that reduces the total \({E_{\mathrm{T}}^{\mathrm{miss}}} \). The requirement on the opening angle in \(\phi \) between either of the leading two jets and the \({E_{\mathrm{T}}^{\mathrm{miss}}} \), \(\Delta \phi (\text {jet}_{1,2},{E_{\mathrm{T}}^{\mathrm{miss}}})>0.4\), strongly suppresses this background. The estimate of the \({Z/\gamma ^{*}+\text {jets}}\) background is performed both with and without this requirement, in order to aid in the interpretation of the results in the SR, as described in Sect. 8. The optimisation of the \({E_{\mathrm{T}}^{\mathrm{miss}}} \) significance and \(f_{\mathrm{ST}}\) requirements are performed separately with and without the requirement, although the optimal values are not found to differ significantly.

The jet smearing method using the data-corrected jet response function is validated in VRZ, comparing smeared pseudo-data to data. The resulting \({E_{\mathrm{T}}^{\mathrm{miss}}} \) distributions show agreement within uncertainties assessed based on varying the response function and the \({E_{\mathrm{T}}^{\mathrm{miss}}} \) significance requirement in the seed region. The \({E_{\mathrm{T}}^{\mathrm{miss}}} \) distribution in VRZ, with the additional requirement \(\Delta \phi (\text {jet}_{1,2},\) \({E_{\mathrm{T}}^{\mathrm{miss}}})>0.4\), is shown in Fig. 2. Here the \({E_{\mathrm{T}}^{\mathrm{miss}}} \) range extends only up to 100 \({\mathrm {GeV}}\), since \(t\bar{t}\) events begin to dominate at higher \({E_{\mathrm{T}}^{\mathrm{miss}}} \) values. The pseudo-data to data agreement in VRZ motivates the final determination of the \({E_{\mathrm{T}}^{\mathrm{miss}}} \) significance requirement used for the seed region (\({E_{\mathrm{T}}^{\mathrm{miss}}}\ \text {sig.}<0.9\)). Backgrounds containing real \({E_{\mathrm{T}}^{\mathrm{miss}}} \), including \(t\bar{t}\) and diboson production, are taken from MC simulation for this check. The chosen values are detailed in Table 2 with a summary of the kinematic requirements imposed on the seed and Z validation region. Extrapolating the jet smearing estimate to the signal regions yields the results detailed in Table 4. The data-driven estimate is compatible with the MC expectation that the \(Z+\mathrm {jets}\) background contributes significantly less than one event in SR-Z.

Fig. 2
figure 2

Distribution of \({E_{\mathrm{T}}^{\mathrm{miss}}} \) in the electron (left) and muon (right) channel in VRZ of the on-Z analysis following the requirement of \(\Delta \phi (\text {jet}_{1,2},{E_{\mathrm{T}}^{\mathrm{miss}}})>0.4\). Here the \({Z/\gamma ^{*}+\text {jets}}\) background (solid blue) is modelled using \({p_{\mathrm{T}}} \)- and \(\phi \)-smeared pseudo-data events. The hatched uncertainty band includes the statistical uncertainty on the simulated event samples and the systematic uncertainty on the jet-smearing estimate due to the jet response function and the seed selection. The backgrounds due to WZ, ZZ or rare top processes, as well as from lepton fakes, are included under “Other Backgrounds”

Table 4 Number of \({Z/\gamma ^{*}+\text {jets}}\) background events estimated in the on-Z signal region (SR-Z) using the jet smearing method. This is compared with the prediction from the Sherpa MC simulation. The quoted uncertainties include those due to statistical and systematic effects (see Sect. 7)

6.2 Estimation of the flavour-symmetric backgrounds

The dominant background in the signal regions is \(t\bar{t}\) production, resulting in two leptons in the final state, with lesser contributors including the production of dibosons (WW), single top quarks (Wt) and Z bosons that decay to \(\tau \) leptons. For these the so-called “flavour-symmetry” method can be used to estimate, in a data-driven way, the contribution from these processes in the same-flavour channels using their measured contribution to the different-flavour channels.

6.2.1 Flavour-symmetric background in the on-Z search

The flavour-symmetry method uses a control region, CR\(e\mu \) in the case of the on-Z search, which is defined to be identical to the signal region, but in the different-flavour \(e\mu \) channel. In CR\(e\mu \), the expected contamination due to GGM signal processes of interest is \(<\)3 %.

The number of data events observed (\(N^{\text {data}}_{e\mu }\)) in this control region is corrected by subtracting the expected contribution from backgrounds that are not flavour symmetric. The background with the largest impact on this correction is that due to fake leptons, with the estimate provided by the matrix method, described in Sect. 6.3, being used in the subtraction. All other contributions, which include WZ, ZZ, tZ and \(t\bar{t}+W(W)/Z\) processes, are taken directly from MC simulation. This corrected number, \(N^{\text {data,corr}}_{e\mu }\), is related to the expected number in the same-flavour channels, \(N^{\text {est}}_{ee/\mu \mu }\), by the following relations:

$$\begin{aligned} N_{ee}^{\mathrm {est}}= & {} \frac{1}{2} N_{e\mu }^{\mathrm {data,corr}} k_{ee} \alpha , \nonumber \\ N_{\mu \mu }^{\mathrm {est}}= & {} \frac{1}{2} N_{e\mu }^{\mathrm {data,corr}} k_{\mu \mu } \alpha , \end{aligned}$$
(2)

where \(k_{ee}\) and \(k_{\mu \mu }\) are electron and muon selection efficiency factors and \(\alpha \) accounts for the different trigger efficiencies for same-flavour and different-flavour dilepton combinations. The selection efficiency factors are calculated using the ratio of dielectron and dimuon events in VRZ according to:

$$\begin{aligned} k_{ee}= & {} \sqrt{\frac{N_{ee}^{\mathrm {data}}\mathrm{(VRZ)}}{N_{\mu \mu }^{\mathrm {data}}\mathrm{(VRZ)}}}, \nonumber \\ k_{\mu \mu }= & {} \sqrt{\frac{N_{\mu \mu }^{\mathrm {data}}\mathrm{(VRZ)}}{N_{ee}^{\mathrm {data}}\mathrm{(VRZ)}}}, \nonumber \\ \alpha= & {} \frac{\sqrt{\epsilon _{\mathrm {trig}}^{ee} \epsilon _{\mathrm {trig}}^{\mu \mu }}}{\epsilon _{\mathrm {trig}}^{e\mu }}, \end{aligned}$$
(3)

where \(\epsilon _{\mathrm {trig}}^{ee}\), \(\epsilon _{\mathrm {trig}}^{\mu \mu }\) and \(\epsilon _{\mathrm {trig}}^{e\mu }\) are the efficiencies of the dielectron, dimuon and electron–muon trigger configurations, respectively, and \(N_{ee(\mu \mu )}^{\mathrm {data}}\mathrm{(VRZ)}\) is the number of ee (\(\mu \mu \)) data events in VRZ. These selection efficiency factors are calculated separately for the cases where both leptons fall within the barrel, both fall within the endcap regions, and for barrel–endcap combinations. This is motivated by the fact that the trigger efficiencies differ in the central and more forward regions of the detector. This estimate is found to be consistent with that resulting from the use of single global k factors, which provides a simpler but less precise estimate. In each case the k factors are close to 1.0, and the \(N_{ee}^{\mathrm {est}}\) or \(N_{\mu \mu }^{\mathrm {est}}\) estimates obtained using k factors from each configuration are consistent with one another to within \(0.2\sigma \).

The flavour-symmetric background estimate was chosen as the nominal method prior to examining the data yields in the signal region, since it relies less heavily on simulation and provides the most precise estimate. This data-driven method is cross-checked using the Z boson mass sidebands (\(m_{\ell \ell } \notin [81, 101]\) GeV) to fit the \(t\bar{t}\) MC events to data in a top control region, CRT. The results are then extrapolated to the signal region in the Z boson mass window, as illustrated in Fig. 3. All other backgrounds estimated using the flavour-symmetry method are taken directly from MC simulation for this cross-check. Here, \({Z/\gamma ^{*}+\text {jets}}\) MC events are used to model the small residual \({Z/\gamma ^{*}+\text {jets}}\) background in the control region, while the jet smearing method provides the estimate in the signal region. The normalisation of the \(t\bar{t}\) sample obtained from the fit is \(0.52 \pm 0.12\) times the nominal MC normalisation, where the uncertainty includes all experimental and theoretical sources of uncertainty as discussed in Sect. 7. This result is compatible with observations from other ATLAS analyses, which indicate that MC simulation tends to overestimate data in regions dominated by \(t\bar{t}\) events accompanied by much jet activity [79, 80]. MC simulation has also been seen to overestimate contributions from \(t\bar{t}\) processes in regions with high \({E_{\mathrm{T}}^{\mathrm{miss}}} \) [81]. In selections with high \({E_{\mathrm{T}}^{\mathrm{miss}}} \)  but including lower \(H_{\mathrm{T}}\), such as those used in the off-Z analysis, this downwards scaling is less dramatic. The results of the cross-check using the Z boson mass sidebands are shown in Table 5, with the sideband fit yielding a prediction slightly higher than, but consistent with, the flavour-symmetry estimate. This test is repeated varying the MC simulation sample used to model the \(t\bar{t}\) background. The nominal Powheg+Pythia \(t\bar{t}\) MC sample is replaced with a sample using Alpgen, and the fit is performed again. The same test is performed using a Powheg \(t\bar{t}\) MC sample that uses Herwig, rather than Pythia, for the parton shower. In all cases the estimates are found to be consistent within \(1 \sigma \). This cross-check using \(t\bar{t}\) MC events is further validated in identical regions with intermediate \({E_{\mathrm{T}}^{\mathrm{miss}}} \) (150 \(<\) \({E_{\mathrm{T}}^{\mathrm{miss}}} \) \(<\) 225 GeV) and slightly looser \({H_{\mathrm{T}}} \) requirements (\({H_{\mathrm{T}}} >500\) GeV), as illustrated in Fig. 3. Here the extrapolation in \(m_{\ell \ell }\) between the sideband region (VRT) and the on-Z region (VRTZ) shows consistent results within approximately \(1\sigma \) between data and the fitted prediction.

Fig. 3
figure 3

Diagram indicating the position in the \({E_{\mathrm{T}}^{\mathrm{miss}}} \) versus dilepton invariant mass plane of SR-Z, the control region CRT, and the two validation regions (VRT and VRTZ) used to validate the sideband fit for the on-Z search. VRT and VRTZ have lower \({H_{\mathrm{T}}} \) thresholds than CRT and SR-Z

Table 5 The number of events for the flavour-symmetric background estimate in the on-Z signal region (SR-Z) using the data-driven method based on data in CR\(e\mu \). This is compared with the prediction for the sum of the flavour-symmetric backgrounds (WW, tW, \(t\bar{t}\) and \(Z \rightarrow \tau \tau \)) from a sideband fit to data in CRT. In each case the combined statistical and systematic uncertainties are indicated

The flavour-symmetry method is also tested in these VRs. An overview of the nominal background predictions, using the flavour-symmetry method, in CRT and these VRs is shown in Fig. 4. This summary includes CRT, VRT, VRTZ and two variations of VRT and VRTZ. The first variation, denoted VRT/VRTZ (high \(H_{\mathrm{T}}\)), shows VRT/ VRTZ with an increased \(H_{\mathrm{T}}\) threshold (\(H_{\mathrm{T}}>600\) GeV), which provides a sample of events very close to the SR. The second variation, denoted VRT/VRTZ (high \({E_{\mathrm{T}}^{\mathrm{miss}}} \)), shows VRT/ VRTZ with the same \({E_{\mathrm{T}}^{\mathrm{miss}}} \) cut as SR-Z, but the requirement \(400<H_{\mathrm{T}}<600\) GeV is added to provide a sample of events very close to the SR. In all cases the data are consistent with the prediction. GGM signal processes near the boundary of the expected excluded region are expected to contribute little to the normalisation regions, with contamination at the level of up to 4 % in CRT and 3 % in VRT. The corresponding contamination in VRTZ is expected to be \(\sim \)10 % across most of the relevant parameter space, increasing to a maximum value of \(\sim \)50 % in the region near \(m(\tilde{g})=700\) \({\mathrm {GeV}}\), \(\mu =200\) \({\mathrm {GeV}}\).

Fig. 4
figure 4

The observed and expected yields in CRT and the VRs in the Z boson mass sidebands (left) and the Z boson mass window (right) regions. The bottom plot shows the difference in standard deviations between the observed and expected yields. The backgrounds due to WZ, ZZ or rare top processes, as well as from lepton fakes, are included under “Other Backgrounds”

6.2.2 Flavour-symmetric background in the off-Z search

The background estimation method of Eq. (2) is extended to allow a prediction of the background dilepton mass shape, which is used explicitly to discriminate signal from background in the off-Z search. In addition to the k and \(\alpha \) correction factors, a third correction factor S(i) is introduced (where i indicates the dilepton mass bin):

$$\begin{aligned} N_{ee}^{\mathrm {est}}(i)= & {} \frac{1}{2} N_{e\mu }^{\mathrm {data,corr}}(i) k_{ee} \alpha S_{ee}(i), \nonumber \\ N_{\mu \mu }^{\mathrm {est}}(i)= & {} \frac{1}{2} N_{e\mu }^{\mathrm {data,corr}}(i) k_{\mu \mu } \alpha S_{\mu \mu }(i). \end{aligned}$$
(4)

These shape correction factors account for different reconstructed dilepton mass shapes in the ee, \(\mu \mu \), and \(e\mu \) channels, which result from two effects. First, the offline selection efficiencies for electrons and muons depend differently on the lepton \({p_{\mathrm{T}}} \) and \(\eta \). For electrons, the offline selection efficiency increases slowly with \({p_{\mathrm{T}}} \), while it has very little dependence on \({p_{\mathrm{T}}} \) for muons. Second, the combinations of single-lepton and dilepton triggers used for the ee, \(\mu \mu \), and \(e\mu \) channels have different efficiencies with respect to the offline selection. In particular, for \(e\mu \) events the trigger efficiency with respect to the offline selection at low \({m_{\ell \ell }}\) is 80 %, which is 10–15 % lower than the trigger efficiencies in the ee and \(\mu \mu \) channels. To correct for these two effects, \(t\bar{t}\) MC simulation is used. The dilepton mass shape in the ee or \(\mu \mu \) channel is compared to that in the \(e\mu \) channel, after scaling the latter by the \(\alpha \)- and k-factor trigger and lepton selection efficiency corrections. The ratio of the dilepton mass distributions, \(N_{ee}({m_{\ell \ell }})/N_{e\mu }({m_{\ell \ell }})\) or \(N_{\mu \mu }({m_{\ell \ell }})/N_{e\mu }({m_{\ell \ell }})\), is fitted with a second-order polynomial, which is then applied as a correction factor, along with \(\alpha \) and k, to the \(e\mu \) distribution in data. These correction factors have an impact on the predicted background yields of approximately a few percent in the ee channel and up to \(\sim \)10–15 % in the \(\mu \mu \) channel, depending on the signal region.

The background estimation methodology is validated in a region with exactly two jets and \(100<{E_{\mathrm{T}}^{\mathrm{miss}}} <150\) \({\mathrm {GeV}}\) (VR-offZ). The flavour-symmetric category contributes more than 95 % of the total background in this region. The dominant systematic uncertainty on the background prediction is the 6 % uncertainty on the trigger efficiency \(\alpha \)-factor. The observed dilepton mass shapes are compared to the SM expectations in Fig. 5, indicating consistency between the data and the expected background yields. The observed yields and expected backgrounds in the below-Z and above-Z regions are presented in Sect. 1. For signal models near the edge of the sensitivity of this analysis, the contamination from signal events in VR-offZ is less than 3 %.

Fig. 5
figure 5

The observed and expected dilepton mass distributions in the electron (left) and muon (right) channel of the validation region (VR-offZ) of the off-Z search. Data (black points) are compared to the sum of expected backgrounds (solid histograms). The vertical dashed lines indicate the \(80<{m_{\ell \ell }}<110\) \({\mathrm {GeV}}\) region, which is used to normalise the \({Z+\text {jets}}\) background. Example signal models (dashed lines) are overlaid, with \(m(\tilde{q})\), \(m(\tilde{\chi }^{0}_{2})/m(\tilde{\chi }^{\pm }_{1})\), \(m(\tilde{\ell })/m(\tilde{\nu })\), and \(m(\tilde{\chi }^{0}_{1})\) of each benchmark point being indicated in the figure legend. The bottom plots show the ratio of the data to expected background. The error bars indicate the statistical uncertainty in data, while the shaded band indicates the total background uncertainty. The last bin contains the overflow

6.3 Fake-lepton contribution

Events from \(W \rightarrow \ell \nu +\)jets, semileptonic \(t\bar{t}\) and single top (s- and t-channel) contribute to the background in the dilepton channels due to “fake” leptons. These include leptons from b-hadron decays, misidentified hadrons or converted photons, and are estimated from data using a matrix method, which is described in detail in Ref. [82]. This method involves creating a control sample using baseline leptons, thereby loosening the lepton isolation and identification requirements and increasing the probability of selecting a fake lepton. For each control or signal region, the relevant requirements are applied to this control sample, and the number of events with leptons that pass or fail the subsequent signal-lepton requirements are counted. Denoting the number of events passing signal lepton requirements by \(N_{\mathrm{pass}}\) and the number failing by \(N_{\mathrm{fail}}\), the number of events containing a fake lepton for a single-lepton selection is given by

$$\begin{aligned} N_{\mathrm{fake}} = \frac{N_{\mathrm{fail}} - (1/\epsilon ^{\mathrm{real}}-1) N_{\mathrm{pass}}}{(1/\epsilon ^{\mathrm{fake}} - 1/\epsilon ^{\mathrm{real}})}, \end{aligned}$$
(5)

where \(\epsilon ^{\mathrm{fake}}\) is the efficiency with which fake leptons passing the baseline lepton selection also pass signal lepton requirements and \(\epsilon ^{\mathrm{real}}\) is the relative identification efficiency (from baseline to signal lepton selection) for real leptons. This principle is expanded to a dilepton sample using a four-by-four matrix to account for the various possible real–fake combinations for the two leading leptons in the event.

The efficiency for fake leptons is estimated in control regions enriched with multi-jet events. Events are selected if they contain at least one baseline lepton, one signal jet with \({p_{\mathrm{T}}} \) \(>60\) GeV and low \({E_{\mathrm{T}}^{\mathrm{miss}}} \) (\(<\)30 GeV). The background due to processes containing prompt leptons, estimated from MC samples, is subtracted from the total data contribution in this region. From the resulting data sample the fraction of events in which the baseline leptons pass signal lepton requirements gives the fake efficiency. This calculation is performed separately for events with b-tagged jets and those without to take into account the various sources from which fake leptons originate. The real-lepton efficiency is estimated using \(Z \rightarrow \ell ^+ \ell ^-\) events in a data sample enriched with leptonically decaying Z bosons. Both the real-lepton and fake-lepton efficiencies are further binned as a function of \({p_{\mathrm{T}}} \) and \(\eta \).

6.4 Estimation of other backgrounds

The remaining background processes, including diboson events with a Z boson decaying to leptons and the \(t\bar{t}+W(W)/Z\) and \(t+Z\) backgrounds, are estimated from MC simulation. In these cases the most accurate theoretical cross sections available are used, as summarised in Table 1. Care is taken to ensure that the flavour-symmetric component of these backgrounds (for events where the two leptons do not originate from the same Z decay) is not double-counted.

7 Systematic uncertainties

Systematic uncertainties have an impact on the predicted signal region yields from the dominant backgrounds, the fake-lepton estimation, and the yields from backgrounds predicted using simulation alone. The expected signal yields are also affected by systematic uncertainties. All sources of systematic uncertainty considered are discussed in the following subsections.

7.1 Experimental uncertainties

The experimental uncertainties arise from the modelling of both the signal processes and backgrounds estimated using MC simulation. Uncertainties associated with the jet energy scale (JES) are assessed using both simulation and in-situ measurements [70, 71]. The JES uncertainty is influenced by the event topology, flavour composition, jet \({p_{\mathrm{T}}} \) and \(\eta \), as well as by the pile-up. The jet energy resolution (JER) is also affected by pile-up, and is estimated using in-situ measurements [83]. An uncertainty associated with the JVF requirement for selected jets is also applied by varying the JVF threshold up (0.28) and down (0.21) with respect to the nominal value of 0.25. This range of variation is chosen based on a comparison of the efficiency of a JVF requirement in dijet events in data and MC simulation.

To distinguish between heavy-flavour-enriched and light-flavour-enriched event samples, b-jet tagging is used. The uncertainties associated with the b-tagging efficiency and the light/charm quark mis-tag rates are measured in \(t\bar{t}\)-enriched samples [84, 85] and dijet samples [86], respectively.

Small uncertainties on the lepton energy scales and momentum resolutions are measured in \(Z \rightarrow \ell ^+\ell ^-\), \(J/\psi \rightarrow \ell ^+\ell ^-\) and \(W \rightarrow \ell ^{\pm }\nu \) event samples [64]. These are propagated to the \({E_{\mathrm{T}}^{\mathrm{miss}}} \) uncertainty, along with the uncertainties due to the JES and JER. An additional uncertainty on the energy scale of topological clusters in the calorimeters not associated with reconstructed objects (the \({E_{\mathrm{T}}^{\mathrm{miss}}} \) soft term) is also applied to the \({E_{\mathrm{T}}^{\mathrm{miss}}} \) calculation.

The trigger efficiency is assigned a 5 % uncertainty following studies comparing the efficiency in simulation to that measured in \(Z \rightarrow \ell ^+ \ell ^-\) events in data.

The data-driven background estimates are subject to uncertainties associated with the methods employed and the limited number of events used in their estimation. The \({Z/\gamma ^{*}+\text {jets}}\) background estimate has an uncertainty to account for differences between pseudo-data and MC events, the choice of seed region definition, the statistical precision of the seed region, and the jet response functions used to create the pseudo-data. Uncertainties in the flavour-symmetric background estimate include those related to the electron and muon selection efficiency factors \(k_{ee}\) and \(k_{\mu \mu }\), the trigger efficiency factor \(\alpha \), and, for the off-Z search only, the dilepton mass shape S(i) reweighting factors. Uncertainties attributed to the subtraction of the non-flavour-symmetric backgrounds, and those due to limited statistical precision in the \(e\mu \) control regions, are also included. Finally, an uncertainty derived from the difference in real-lepton efficiency observed in \(t\bar{t}\) and \(Z \rightarrow \ell ^+ \ell ^-\) events is assigned to the fake-background prediction. An additional uncertainty due to the number of events in the control samples used to derive the real efficiencies and fake rates is assigned to this background, as well as a 20 % uncertainty on the MC background subtraction in the control samples.

7.2 Theoretical uncertainties on background processes

For all backgrounds estimated from MC simulation, the following theoretical uncertainties are considered. The uncertainties due to the choice of factorisation and renormalisation scales are calculated by varying the nominal values by a factor of two. Uncertainties on the PDFs are evaluated following the prescription recommended by PDF4LHC [87]. Total cross-section uncertainties of 22 % [37] and 50 % are applied to \(t\bar{t}\) \(+W\)/Z and \(t\bar{t}\) \(+WW\) sub-processes, respectively. For the \(t\bar{t}\) \(+W\) and \(t\bar{t}\) \(+Z\) sub-processes, an additional uncertainty is evaluated by comparing samples generated with different numbers of partons, to account for the impact of the finite number of partons generated in the nominal samples. For the WZ and ZZ diboson samples, a parton shower uncertainty is estimated by comparing samples showered with Pythia and Herwig+Jimmy [88, 89] and cross-section uncertainties of 5 and 7 % are applied, respectively. These cross-section uncertainties are estimated from variations of the value of the strong coupling constant, the PDF and the generator scales. For the small contribution from \(t+Z\), a 50 % uncertainty is assigned. Finally, a statistical uncertainty derived from the finite size of the MC samples used in the background estimation process is included.

7.3 Dominant uncertainties on the background estimates

The dominant uncertainties in each signal region, along with their values relative to the total background expectation, are summarised in Table 6. In all signal regions the largest uncertainty is that associated with the flavour-symmetric background. The statistical uncertainty on the flavour-symmetric background due to the finite data yields in the \(e\mu \) CRs is 24 % in the on-Z SR. This statistical uncertainty is also the dominant uncertainty for all SRs of the off-Z analysis except for SR-loose, for which the systematic uncertainty on the flavour-symmetric background prediction dominates. In SR-Z the combined MC generator and parton shower modelling uncertainty on the WZ background (7 %), as well as the uncertainty due to the fake-lepton background (14 %), are also important.

Table 6 Overview of the dominant sources of systematic uncertainty on the background estimate in the signal regions. Their relative values with respect to the total background expectation are shown (in %). For the off-Z region, the full dilepton mass range is used, and in all cases the \(ee+\mu \mu \) contributions are considered together

7.4 Theoretical uncertainties on signal processes

Signal cross sections are calculated to next-to-leading order in the strong coupling constant, adding the resummation of soft gluon emission at NLO+NLL accuracy [5559]. The nominal cross section and the uncertainty are taken from an envelope of cross-section predictions using different PDF sets and factorisation and renormalisation scales, as described in Ref. [90]. For the simplified models the uncertainty on the initial-state radiation modelling is important in the case of small mass differences during the cascade decays. MadGraph+Pythia samples are used to assess this uncertainty, with the factorisation and normalisation scale, the MadGraph parameter used for jet matching, the MadGraph parameter used to set the QCD radiation scale and the Pythia parameter responsible for the value of the QCD scale for final-state radiation, each being varied up and down by a factor of two. The resulting uncertainty on the signal acceptance is up to \(\sim \)25 % in regions with small mass differences within the decay chains.

8 Results

For the on-Z search, the resulting background estimates in the signal regions, along with the observed event yields, are displayed in Table 7. The dominant backgrounds are those due to flavour-symmetric and WZ and ZZ diboson processes. In the electron and muon channel combined, \(10.6\pm 3.2\) events are expected and 29 are observed. For each of these regions, a local probability for the background estimate to produce a fluctuation greater than or equal to the excess observed in the data is calculated using pseudo-experiments. When expressed in terms of the number of standard deviations, this value is referred to as the local significance, or simply the significance. These significances are quantified in the last column of Table 11 and correspond to a \(1.7\sigma \) deviation in the muon channel and a \(3.0\sigma \) deviation in the electron channel, with the combined significance, calculated from the sum of the background predictions and observed yields in the muon and electron channels, being \(3.0\sigma \). The uncertainties on the background predictions in the ee and \(\mu \mu \) channels are correlated as they are dominated by the statistical uncertainty of the \(e\mu \) data sample that is used to derive the flavour-symmetric background in both channels. Since this sample is common to both channels, the relative statistical error on the flavour-symmetric background estimation does not decrease when combining the ee and \(\mu \mu \) channels. No excess was reported in the CMS analysis of the \(Z+\mathrm {jets}+{E_{\mathrm{T}}^{\mathrm{miss}}} \) final state based on \(\sqrt{s}=8\,{\mathrm {TeV}}\) data [24]; however, the kinematic requirements used in that search differ from those used in this paper.

Table 7 Results in the on-Z SRs (SR-Z). The flavour symmetric, \({Z/\gamma ^{*}+\text {jets}}\) and fake-lepton background components are all derived using data-driven estimates described in the text. All other backgrounds are taken from MC simulation. The displayed uncertainties include the statistical and systematic uncertainty components combined

Dilepton invariant mass and \({E_{\mathrm{T}}^{\mathrm{miss}}} \) distributions in the electron and muon on-Z SR are shown in Fig. 6, with \(H_{\mathrm{T}}\) and jet multiplicity being shown in Fig. 7. For the SR selection a requirement is imposed to reject events with \(\Delta \phi (\text {jet}_{1,2},{E_{\mathrm{T}}^{\mathrm{miss}}})<0.4\) to further suppress the background from \({Z/\gamma ^{*}+\text {jets}}\) processes with mismeasured jets.

Fig. 6
figure 6

The dilepton mass (top) and \({E_{\mathrm{T}}^{\mathrm{miss}}} \) (bottom) distributions for the electron (left) and muon (right) channel in the on-Z SRs after having applied the requirement \(\Delta \phi (\text {jet}_{1,2},{E_{\mathrm{T}}^{\mathrm{miss}}})>0.4\). All uncertainties are included in the hatched uncertainty band. Two example GGM (\(\tan \beta =1.5\)) signal models are overlaid. For the \({E_{\mathrm{T}}^{\mathrm{miss}}} \) distributions, the last bin contains the overflow. The backgrounds due to WZ, ZZ or rare top processes, as well as from fake leptons, are included under “Other Backgrounds”. The negligible contribution from Z+jets is omitted from these distributions

Fig. 7
figure 7

The \(H_{\mathrm{T}}\) (top) and jet multiplicity (bottom) distributions for the electron (left) and muon (right) channel in the on-Z SRs after having applied the requirement \(\Delta \phi (\text {jet}_{1,2},{E_{\mathrm{T}}^{\mathrm{miss}}})>0.4\). All uncertainties are included in the hatched uncertainty band. Two example GGM (\(\tan \beta =1.5\)) signal models are overlaid. For the \(H_{\mathrm{T}}\) distributions, the last bin contains the overflow. The backgrounds due to WZ, ZZ or rare top processes, as well as from fake leptons, are included under “Other Backgrounds”. The negligible contribution from Z+jets is omitted from these distributions

In Fig. 8, the distribution of events in the on-Z SR as a function of \(\Delta \phi (\text {jet}_{1,2},\) \({E_{\mathrm{T}}^{\mathrm{miss}}})\) (before this requirement is applied) is shown. In these figures the shapes of the flavour-symmetric and \({Z/\gamma ^{*}+\text {jets}}\) backgrounds are derived using MC simulation and the normalisation is taken according to the data driven estimate.

Fig. 8
figure 8

The distribution of the \(\Delta \phi \) between the leading jet and \({E_{\mathrm{T}}^{\mathrm{miss}}} \) (top) and the sub-leading jet and \({E_{\mathrm{T}}^{\mathrm{miss}}} \) (bottom) for the electron (left) and muon (right) channel in the on-Z SRs before having applied the requirement \(\Delta \phi (\text {jet}_{1,2},{E_{\mathrm{T}}^{\mathrm{miss}}})>0.4\). All uncertainties are included in the hatched uncertainty band. Two example GGM (\(\tan \beta =1.5\)) signal models are overlaid. The backgrounds due to WZ, ZZ or rare top processes, as well as from fake leptons, are included under “Other Backgrounds”

For the off-Z search, the dilepton mass distributions in the five SRs are presented in Figs. 9 and 10, and summarised in Fig. 11. The expected backgrounds and observed yields in the below-Z and above-Z regions for SR-2j-bveto, SR-4j-bveto, and SR-loose are presented in Tables 8, 9, and 10, respectively. Corresponding results for SR-2j-btag and SR-4j-btag are presented in Sect. 1. The data are consistent with the expected SM backgrounds in all regions. In the SR-loose region with \(20<{m_{\ell \ell }}<70\) \({\mathrm {GeV}}\), similar to the region in which the CMS Collaboration observed a 2.6\(\sigma \) excess, 1133 events are observed, compared to an expectation of \(1190\pm 40\pm 70\) events.

Fig. 9
figure 9

The observed and expected dilepton mass distributions in the off-Z SR-loose (top), SR-2j-bveto (middle), and SR-4j-bveto (bottom). The vertical dashed lines indicate the \(80<{m_{\ell \ell }}<110\) \({\mathrm {GeV}}\) region, which is used to normalise the \({Z+\text {jets}}\) background and is thus not treated as a search region. Example signal models (dashed lines) are overlaid, with \(m(\tilde{q})/m(\tilde{g})\), \(m(\tilde{\chi }^{0}_{2})/m(\tilde{\chi }^{\pm }_{1})\), \(m(\tilde{\ell })/m(\tilde{\nu })\), and \(m(\tilde{\chi }^{0}_{1})\) of each benchmark point being indicated in the figure legend. The last bin contains the overflow. All uncertainties are included in the hatched uncertainty band

Fig. 10
figure 10

The observed and expected dilepton mass distributions in the SR-2j-btag (top) and SR-4j-btag (bottom) signal regions of the off-Z search. The vertical dashed lines indicate the \(80<{m_{\ell \ell }}<110\) \({\mathrm {GeV}}\) region, which is used to normalise the \({Z+\text {jets}}\) background and is thus not treated as a search region. Example signal models of squark- or gluino-pair production (dashed lines) are overlaid, with \(m(\tilde{g})\), \(m(\tilde{\chi }^{0}_{2})/m(\tilde{\chi }^{\pm }_{1})\), \(m(\tilde{\ell })/m(\tilde{\nu })\), and \(m(\tilde{\chi }^{0}_{1})\) of each benchmark point being indicated in the figure legend. The last bin contains the overflow. All uncertainties are included in the hatched uncertainty band

Fig. 11
figure 11

The observed and expected yields in the below-Z (left) and above-Z (right) dilepton mass regions, for the VR and five SRs of the off-Z search. Here below-Z is \(20<{m_{\ell \ell }}<70\) \({\mathrm {GeV}}\) for VR-offZ and SR-loose and otherwise \(20<{m_{\ell \ell }}<80\) \({\mathrm {GeV}}\); above-Z is \({m_{\ell \ell }}>110\) \({\mathrm {GeV}}\). The bottom plot shows the difference in standard deviations between the observed and expected yields. Results are shown for the ee and \(\mu \mu \) channels as well as for the sum

Table 8 Results in the off-Z search region SR-2j-bveto, in the below-Z range (\(20<{m_{\ell \ell }}<80\) \({\mathrm {GeV}}\), top) and above-Z range (\({m_{\ell \ell }}>110\) \({\mathrm {GeV}}\), bottom). The flavour symmetric, \({Z/\gamma ^{*}+\text {jets}}\) and fake lepton background components are all derived using data-driven estimates described in the text. All other backgrounds are taken from MC simulation. The first uncertainty is statistical and the second is systematic
Table 9 Results in the off-Z search region SR-4j-bveto, in the below-Z range (\(20<{m_{\ell \ell }}<80\) \({\mathrm {GeV}}\), top) and above-Z range (\({m_{\ell \ell }}>110\) \({\mathrm {GeV}}\), bottom). Details are the same as in Table 8
Table 10 Results in the off-Z search region SR-loose, in the below-Z range (\(20<{m_{\ell \ell }}<70\) \({\mathrm {GeV}}\), top) and above-Z range (\({m_{\ell \ell }}>110\) \({\mathrm {GeV}}\), bottom). Details are the same as in Table 8

9 Interpretation of results

In this section, exclusion limits are shown for the SUSY models described in Sect. 3. The asymptotic \(CL_{\mathrm{S}}\) prescription [91], implemented in the HistFitter program [92], is used to determine upper limits at 95 % confidence level (CL). All signal and background uncertainties are taken into account using a Gaussian model of nuisance parameter integration. All uncertainties except that on the signal cross section are included in the limit-setting configuration. The impact of varying the signal cross sections by their uncertainties is indicated separately. Numbers quoted in the text are evaluated from the observed exclusion limit based on the nominal signal cross section minus its \(1\sigma \) theoretical uncertainty.

For the on-Z analysis, the data exceeds the background expectations in the ee (\(\mu \mu \)) channel with a significance of 3.0 (1.7) standard deviations. Exclusion limits in specific models allow us to illustrate which regions of the model parameter space are affected by the observed excess, by comparing the expected and observed limits. The results in SR-Z ee and SR-Z \(\mu \mu \) (Table 7) are considered simultaneously. The signal contamination in CR\(e\mu \) is found to be at the \(\sim \)1 % level, and is therefore neglected in this procedure. The expected and observed exclusion contours, in the plane of \(\mu \) versus \(m(\tilde{g})\) for the GGM model, are shown in Fig. 12. The \(\pm 1\sigma _{\mathrm {exp}}\) and \(\pm 2\sigma _{\mathrm {exp}}\) experimental uncertainty bands indicate the impact on the expected limit of all uncertainties considered on the background processes. The \(\pm 1 \sigma _{\mathrm {theory}}^{\mathrm {SUSY}}\) uncertainty lines around the observed limit illustrate the change in the observed limit as the nominal signal cross section is scaled up and down by the theoretical cross-section uncertainty. Given the observed excess of events with respect to the SM prediction, the observed limits are weaker than expected. In the case of the \(\tan \beta =1.5\) exclusion contour, the on-Z analysis is able to exclude gluino masses up to 850 \({\mathrm {GeV}}\) for \(\mu >450\) GeV, whereas gluino masses of up to 820 \({\mathrm {GeV}}\) are excluded for the \(\tan \beta =30\) model for \(\mu >600\) GeV. The lower exclusion reach for the \(\tan \beta =30\) models is due to the fact that the branching fraction for \(\tilde{\chi }^0_{1} \rightarrow Z\tilde{G}\) is significantly smaller at \(\tan \beta =30\) than at \(\tan \beta =1.5\).

Fig. 12
figure 12

The 95 % CL exclusion limit from the on-Z combined same-flavour channels in the \(\mu \) versus \(m(\tilde{g})\) plane in the GGM model with \(\tan \beta =1.5\) (top) and \(\tan \beta =30\) (bottom). The dark blue dashed line indicates the expected limits at 95 % CL and the green (yellow) bands show the \(\pm 1 \sigma \) (\(\pm 2 \sigma \)) variation on the expected limit as a consequence of the experimental and theoretical uncertainties on the background prediction. The observed limits are shown by the solid red lines, with the dotted red lines indicating the limit obtained upon varying the signal cross section by \(\pm 1 \sigma \). The region below the grey line has the gluino mass less than the lightest neutralino mass and is hence not considered. The value of the lightest neutralino mass is indicated by the x-axis inset

For the off-Z search, the limits for the squark-pair (gluino-pair) model are based on the results of SR-2j-bveto (SR-4j-bveto). The yields in the combined ee+\(\mu \mu \) channels are used. Signal contamination in the \(e\mu \) control region used for the flavour-symmetry method is taken into account by subtracting the expected increase in the background prediction from the signal yields. For each point in the signal model parameter space, limits on the signal strength are calculated using a “sliding window” approach. The binning in SR-2j-bveto (SR-4j-bveto) defines 45 (21) possible dilepton mass windows to use for the squark-pair (gluino-pair) model interpretation, of which the ten (nine) windows with the best expected sensitivity are selected. For each point in the signal model parameter space, the dilepton mass window with the best expected limit on the signal strength is selected. The excluded regions in the squark-LSP and gluino-LSP mass planes are shown in Fig. 13. The analysis probes squarks with masses up to 780 GeV, and gluinos with masses up to 1170 GeV.

Fig. 13
figure 13

Excluded region in the (top) squark-LSP mass plane using the SR-2j-bveto results and (bottom) gluino-LSP mass plane using the SR-4j-bveto results. The observed, expected, and \(\pm 1\sigma \) expected exclusion contours are indicated. The observed limits obtained upon varying the signal cross section by \(\pm 1 \sigma \) are also indicated. The region to the left of the diagonal dashed line has the squark mass less than the LSP mass and is hence not considered. Three signal benchmark points are shown, with their SUSY particle masses indicated in parentheses

The signal regions in these analyses are also used to place upper limits on the allowed number of BSM events (\(N_{\mathrm{BSM}}\)) in each region. The observed (\(S_\mathrm{obs}^{95}\)) and expected (\(S_\mathrm{exp}^{95}\)) 95 % CL upper limits are also derived using the \(CL_{\mathrm{S}}\) procedure. These upper limits on \(N_{\mathrm{BSM}}\) can be interpreted as upper limits on the visible BSM cross section (\(\langle \epsilon {\sigma }\rangle _{\mathrm{obs}}^{95}\)) by normalising \(N_{\mathrm{BSM}}\) by the total integrated luminosity. Here \(\langle \epsilon {\sigma }\rangle _{\mathrm{obs}}^{95}\) is defined as the product of the signal production cross section, acceptance and reconstruction efficiency. The results are obtained using asymptotic formulae [93] in the case of the off-Z numbers. For SR-Z, with a considerably smaller sample size, pseudo-experiments are used. These numbers are presented in Table 11 for the on-Z search. Model-independent upper limits on the visible BSM cross section in the below-Z and above-Z ranges of the five signal regions in the off-Z search are presented in Tables 12 and 13, respectively. Limits for the most sensitive dilepton mass windows of SR-2j-bveto and SR-4j-bveto used for the squark- and gluino-pair model interpretations, respectively, are presented in Tables 14 and 15. These tables also present the confidence level observed for the background-only hypothesis \(CL_{B}\), and the one-sided discovery p-value, \(p(s=0)\), which is the probability that the event yield obtained in a single hypothetical background-only experiment (signal, \(s=0\)) is greater than that observed in this dataset. The \(p(s=0)\) value is truncated at 0.5.

Table 11 From left to right: 95 % CL upper limits on the visible cross section (\(\langle \epsilon {\sigma }\rangle _{\mathrm{obs}}^{95}\)) and on the number of signal events (\(S_{\mathrm{obs}}^{95}\)); the expected 95 % CL upper limit on the number of signal events is denoted by \(S_{\mathrm{exp}}^{95}\) and is derived from the expected number of background events (and the \(\pm 1\sigma \) uncertainty on the expectation); two-sided \(CL_{B}\) value, which is the confidence level observed for the background-only hypothesis; the discovery p-value for 0 signal strength s (\(p(s = 0)\)), and the Gaussian significance for the on-Z search
Table 12 Summary of model-independent upper limits for the five signal regions, in the below-Z region (\(20<{m_{\ell \ell }}<70\) \({\mathrm {GeV}}\) for SR-loose, \(20<{m_{\ell \ell }}<80\) \({\mathrm {GeV}}\) for all other signal regions), in the combined \(ee+\mu \mu \) and individual ee and \(\mu \mu \) channels. Left to right: the observed yield (\(N_\mathrm {data}\)), total expected background (\(N_\mathrm {bkg}\)), 95 % CL upper limits on the visible cross section (\(\langle \epsilon \sigma \rangle _{\mathrm{obs}}^{95}\)) and on the number of signal events (\(S_{\mathrm{obs}}^{95}\) ). The fifth column (\(S_{\mathrm{exp}}^{95}\)) shows the 95 % CL upper limit on the number of signal events, given the expected number (and \(\pm 1\sigma \) excursions on the expectation) of background events. The last two columns indicate the \(CL_{B}\) value, i.e. the confidence level observed for the background-only hypothesis, and the discovery p-value (\(p(s = 0)\)). For an observed number of events lower than expected, the discovery p-value has been truncated at 0.5
Table 13 Summary of model-independent upper limits for the five signal regions, in the above-Z (\({m_{\ell \ell }}>110\) \({\mathrm {GeV}}\)) dilepton mass range, in the combined \(ee+\mu \mu \) and individual ee and \(\mu \mu \) channels. Details are the same as in Table 12
Table 14 Summary of model-independent upper limits for SR-2j-bveto, in the combined \(ee+\mu \mu \) and individual ee and \(\mu \mu \) channels, for the ten dilepton mass windows used for the squark-pair interpretation. Details are the same as in Table 12. The last three columns indicate the expected signal yield for three squark-pair model benchmark points; the first (second) number indicates the squark (LSP) mass. The signal yield in square brackets indicates the best selected dilepton mass window for the given benchmark point
Table 15 Summary of model-independent upper limits for SR-4j-bveto, in the combined \(ee+\mu \mu \) and individual ee and \(\mu \mu \) channels, for the nine dilepton mass windows used for the gluino-pair interpretation. Details are the same as in Table 12. The last three columns indicate the expected signal yield for three gluino-pair model benchmark points; the first (second) number indicates the gluino (LSP) mass. The signal yield in square brackets indicates the best selected dilepton mass window for the given benchmark point

10 Summary

This paper presents results of two searches for supersymmetric particles in events with two SFOS leptons, jets, and \({E_{\mathrm{T}}^{\mathrm{miss}}} \), using 20.3 fb\(^{-1}\) of 8 TeV pp collisions recorded by the ATLAS detector at the LHC. The first search targets events with a lepton pair with invariant mass consistent with that of the Z boson and hence probes models in which the lepton pair is produced from the decay \(Z\rightarrow \ell \ell \). In this search \(6.4\pm 2.2\) (\(4.2\pm 1.6\)) events from SM processes are expected in the \(\mu \mu \) (ee) SR-Z, as predicted using almost exclusively data-driven methods. The background estimates for the major and most difficult-to-model backgrounds are cross-checked using MC simulation normalised in data control regions, providing further confidence in the SR prediction. Following this assessment of the expected background contribution to the SR the number of events in data is higher than anticipated, with 13 observed in SR-Z \(\mu \mu \) and 16 in SR-Z ee. This corresponding significances are 1.7 standard deviations in the muon channel and 3.0 standard deviations in the electron channel. These results are interpreted in a supersymmetric model of general gauge mediation, and probe gluino masses up to 900 GeV. The second search targets events with a lepton pair with invariant mass inconsistent with Z boson decay, and probes models with the decay chain \(\tilde{\chi }_{2}^{0}\rightarrow \ell ^+\ell ^-\tilde{\chi }_{1}^{0}\). In this case the data are found to be consistent with the expected SM backgrounds. No evidence for an excess is observed in the region similar to that in which CMS reported a 2.6\(\sigma \) excess [24]. The results are interpreted in simplified models with squark- and gluino-pair production, and probe squark (gluino) masses up to about 780 (1170) GeV.