1 Introduction

This paper reports results from an updated and improved search for new physics processes in proton–proton collisions at a center-of-mass energy of 7 TeV, focusing on the signature with a single isolated lepton (electron or muon), multiple energetic jets, and large missing momentum transverse to the beam direction (). The data sample was collected by the Compact Muon Solenoid (CMS) experiment during 2011 at the Large Hadron Collider (LHC) and corresponds to an integrated luminosity of 4.98 fb−1, roughly one hundred times larger than the sample used for our previous search [1].

The signature is prominent in models based on supersymmetry (SUSY) [27]. In R-parity-conserving models [8], SUSY particles are produced in pairs, and their decay chains end with the lightest supersymmetric particle (LSP). In some scenarios, the LSP is a neutralino (\(\widetilde{\chi}^{0}\)), a heavy, electrically neutral, weakly interacting particle with the properties of a dark-matter candidate [9]. The presence of two such LSPs in each SUSY event typically leads to a large missing transverse momentum, depending on the details of the SUSY mass spectrum. The isolated lepton indicates a weak decay of a heavy particle, such as a W boson or a chargino (\(\widetilde{\chi}^{\pm}\)). Multiple jets can be produced in complex decay chains of SUSY particles. This signature arises in many SUSY models, including the constrained minimal supersymmetric extension of the standard model (CMSSM) [10, 11], and in simplified models [1215], which are based on simplified mass spectra and decays of new particles. Both of these frameworks are used to interpret the results. Searches in this or similar channels have been reported by CMS [1, 16] and ATLAS [1719].

Searches for SUSY particles are complicated by the presence of standard model (SM) backgrounds that can share many of the features of signal events. In the single-lepton final state, backgrounds arise primarily from the production of \(\mathrm{t}\overline{\mathrm{t}}\) and W+jets events, with smaller contributions from Z+jets, single-top quark production, and QCD multijet events. In the event topology studied here, a large observed value of in a standard model event is usually genuine, resulting from the production of one or more high-momentum neutrinos. A smaller contribution to events in the high- tail in this search can arise from the mismeasurement of jets in high cross section processes such as QCD multijet events. To determine the contributions from these backgrounds, we use methods that are primarily based on control samples in data, sometimes in conjunction with specific information from simulated event samples or from additional measurements that provide constraints on the background processes.

Three complementary methods are used to analyze the data, providing valuable cross-checks and probing different signal regions. The Lepton Spectrum (LS) method was used in the CMS single-lepton [1] and opposite-sign dilepton [20] SUSY searches performed using the 2010 data sample. It uses the observed lepton transverse momentum (p T) spectrum and other control samples to predict the distribution associated with the dominant SM backgrounds. This method is sensitive to SUSY models in which the distribution is decoupled from the lepton p T spectrum, as is the case when two undetected LSPs produce a large missing transverse momentum. The Lepton-Projection Variable (L P) method uses the L P variable, which was developed for the CMS measurement of the W polarization in W+jets events [21]. This variable, described in Sect. 6, is correlated with the helicity angle of the lepton in the W-boson rest frame. Both the L P and the LS methods take advantage of well-understood properties of the W polarization in \(\mathrm{t}\overline{\mathrm{t}}\) and W+jets events for the background determination. The methods are complementary in that they rely on significantly different approaches to determining the backgrounds, based on different kinematic variables and different signal regions. The ANN method uses an artificial neural network discriminant built from several kinematic quantities. The ANN discriminant is then used in conjunction with to define signal and sideband regions, from which the background yield is determined. A key variable in the ANN is M T, an approximate invariant mass of the system comprising the lepton and the , computed with the momentum components transverse to the beam direction. Background events usually have M T<M(W), where M(W) is the W boson mass, because the observed is associated with the neutrino from \(\mathrm{W}\to\ell\bar{\nu}\) decay.

This paper is organized as follows. Sections 2 and 3 describe the CMS detector and the event samples. The event preselection requirements that are common to all methods are discussed in Sect. 4. Sections 5, 6, and 7 describe the LS, L P, and ANN methods, respectively, for obtaining SM background estimates from control samples in data. The observed yields in data are compared with the background estimate obtained for each method. Systematic uncertainties are described in Sect. 8. Finally, the results, interpretation, and conclusions of the analysis are presented in Sects. 9 and 10.

2 The CMS detector

The CMS detector, described in detail in Ref. [22], is a multipurpose apparatus designed to study high-p T physics processes in proton–proton collisions, as well as a broad range of phenomena in heavy-ion collisions. The central element of CMS is a 3.8 T superconducting solenoid, 13 m in length and 6 m in diameter. Within the magnet are (in order of increasing distance from the beam pipe) high-precision silicon pixel and silicon strip detectors for charged particle tracking; a lead–tungstate crystal electromagnetic calorimeter for measurements of photons, electrons, and the electromagnetic component of jets; and a hadron calorimeter, constructed from scintillating tiles and brass absorbers, for jet energy measurements. Beyond the magnet is the muon system, comprising drift tube, cathode strip, and resistive-plate detectors interleaved with steel absorbers. Most of the detector systems are divided into subsystems that cover the central (barrel) and forward (endcap) regions. The first level of the CMS trigger consists of custom hardware processors that use information from the calorimeter and the muon system to select up to 100 kHz of the most interesting events. These events are then analyzed in the High Level Trigger (HLT) processor farm, which uses information from all CMS detector systems to reduce the event rate to about 300 Hz.

In describing the angular distribution of particles and the acceptance of the detector, we frequently make use of the pseudorapidity, η=−ln[tan(θ/2)], where the polar angle θ of the particle’s momentum vector is measured with respect to the z axis of the CMS coordinate system. The z axis points along the direction of the counterclockwise-moving proton beam; the azimuthal angle ϕ is measured in a plane perpendicular to this axis. The separation between two momentum vectors in ηϕ space is characterized by the quantity \(\Delta R = \sqrt{(\Delta\eta)^{2}+ (\Delta\phi)^{2}}\), which is approximately invariant under Lorentz boosts along the z axis.

3 Data and simulated event samples

The data samples used in the analysis were selected using triggers based on , lepton p T, and the transverse momenta (\(p_{\mathrm{T}} ^{j}\)) of the observed jets j. The overall level of jet activity was measured with the quantity \(H_{\mathrm{T}}^{\mathrm{trigger}}=\sum_{j} p_{\mathrm{T}}^{j}\), the scalar sum of jet transverse momenta satisfying \(p_{\mathrm{T}}^{j}>40~\mathrm{GeV}\). The missing transverse momentum was computed in the trigger using particle-flow algorithms [23, 24]. To maintain an acceptable trigger rate, the thresholds on , lepton p T, and \(H_{\mathrm{T}} ^{\mathrm{trigger}}\), were raised as the LHC luminosity increased over the course of the data collection period. The highest thresholds applied in the muon trigger selection were , muon p T>15 GeV, and \(H_{\mathrm{T}}^{\mathrm{trigger}}>300~\mathrm{GeV}\). For electron triggers, the highest thresholds applied were , electron p T>15 GeV and \(H_{\mathrm{T}}^{\mathrm{trigger}}> 250~\mathrm{GeV}\); a loose electron isolation requirement was also applied to help control the rate. The offline analysis requirements for both muon and electron events are more restrictive than those used in the trigger.

The analysis procedures are designed using simulated event samples. Except for certain scans of the SUSY parameter space discussed later, the detector simulation is performed using the Geant4 package [25]. A variety of Monte Carlo (MC) event generators are used to model the backgrounds. The QCD multijet samples are generated with the pythia 6.4.22 [26] MC generator with tune Z2 [27]. The dominant background, \(\mathrm{t}\overline{\mathrm{t}}\), is studied with a sample generated using MadGraph 5.1.1.0 [28]. The W+jets and Z+jets processes are also simulated with MadGraph. Single-top (s-channel, t-channel, and tW) production is simulated with powheg [29]. To model the effect of multiple pp interactions per beam crossing (pileup), simulated events are generated with a nominal distribution of multiple vertices, then reweighted to match the distribution of the number of collision vertices per bunch crossing as measured in data.

Event samples for SUSY benchmark models are generated with pythia. As example CMSSM scenarios, we use LM3 and LM6, which are among the standard benchmarks [30] used in CMS. The CMSSM benchmarks are described by the universal scalar mass parameter m 0, the universal gaugino mass parameter m 1/2, the universal trilinear soft-SUSY-breaking parameter A 0, the ratio of the two Higgs-doublet vacuum expectation values tanβ, and the sign of the Higgs mixing parameter μ. The LM3 (LM6) benchmark is described by m 0=330 GeV (85 GeV), m 1/2=240 GeV (400 GeV), A 0=0 GeV (0 GeV), tanβ=20 (10), and μ>0 (0). For LM3, the masses of the gluino and squarks are very similar (≈600 GeV), except for \(m(\widetilde{\mathrm{t}})\approx440~\mathrm{GeV}\), while the mass of the LSP is \(m(\widetilde{\chi}^{0}_{1}) = 94~\mathrm{GeV}\). The LM6 spectrum is heavier, with \(m(\widetilde{\mathrm{g}})\approx930~\mathrm{GeV}\), \(m(\widetilde {\mathrm{q}})\approx800~\mathrm{GeV}\), \(m(\widetilde{\mathrm{t}})\approx650~\mathrm{GeV}\), and \(m(\widetilde{\chi }^{0}_{1})\approx 160~\mathrm{GeV}\). The next-to-leading-order (NLO) cross sections for these models are approximately 4.8 pb (LM3), and 0.4 pb (LM6).

The ANN method uses the LM0 model [30] to train the neural network. Because of its large cross section (54.9 pb at NLO), LM0 has already been excluded [1], but its kinematic distributions still provide a reasonably generic description of SUSY behavior with respect to the variables used in the neural network. The parameters for LM0 are m 0=200 GeV, m 1/2=160 GeV, A 0=−400 GeV, tanβ=10, and μ>0.

The results are interpreted in two ways: (i) as constraints on CMSSM parameter space and (ii) as constraints on cross sections for event topologies described in the framework of simplified models. In both cases, a large number of simulated event samples are required to scan over the relevant space of model parameters. For this reason, the scans are performed with the CMS fast simulation package [31], which reduces the time associated with the detector simulation.

Both the LS and L P background determination methods rely on knowledge of the W-boson polarization in W+jets and in \(\mathrm{t}\overline {\mathrm{t}}\) events. The polarization effects are well modeled in simulated event samples, which are used in conjunction with control samples in data. The angular distribution of the (positively) charged lepton in the W+ rest frame can be written as:

(1)

where f +1, f −1, and f 0 denote the polarization fractions associated with the W-boson helicities +1, −1, and 0, respectively. The angle \(\theta^{*}_{\ell}\) is the polar angle of the charged lepton in the W+ rest frame, measured with respect to a z axis that is aligned with the momentum direction of the W+ in the top-quark rest frame. The polarization fractions thus determine the angular distribution of the lepton in the W rest frame and, together with the Lorentz boosts, control the p T distributions of the lepton and the neutrino in the laboratory frame.

The W polarization fractions in top-quark decays have been calculated [32] with QCD corrections to next-to-next-to-leading order (NNLO), and the polarization is predominantly longitudinal. For t→bW+ these fractions are f 0=0.687±0.005, f −1=0.311±0.005, and f +1=0.0017±0.0001. These precise calculations reduce the uncertainties associated with the W polarization in \(\mathrm{t}\overline {\mathrm{t}}\) events to a low level. The theoretical values are consistent with measurements from ATLAS [33], which obtained f 0=0.67±0.03±0.06, f −1=0.32±0.02±0.03, and f +1=0.01±0.01±0.04, expressed for the W+ polarizations.

The W polarization in W+jets events exhibits a more complex behavior than that in \(\mathrm{t}\overline{\mathrm{t}}\) production. Both CMS [21] and ATLAS [34] have reported measurements of these effects, which are consistent with alpgen [35] and MadGraph [28] simulations predicting that the W+ and W bosons are both predominantly left-handed in W+jets events at high p T. An NLO QCD calculation [36] has demonstrated that the predicted polarization fractions are stable with respect to QCD corrections. As discussed in later sections, this detailed knowledge of the W-boson polarization provides key information for measuring the SM backgrounds using control samples in data.

4 Event preselection

Table 1 summarizes the main variables and requirements used in the event preselection, which is designed to be simple and robust. Except where noted, a common set of preselection requirements is used by each of the three analysis methods. Events are required to have at least one good reconstructed primary vertex, at least three jets (L P method and ANN method) or four jets (LS method), and exactly one isolated muon or exactly one isolated electron. These basic requirements select an event sample that is dominated by genuine, single-lepton events from SM processes.

Table 1 Main preselection requirements. The term lepton designates either an electron or a muon. Definitions of the quantities and further details are given in the text

The primary vertex must satisfy a set of quality requirements, including |z PV|<24 cm and ρ PV<2 cm, where z PV and ρ PV are the longitudinal and transverse distances of the primary vertex with respect to the nominal interaction point in the CMS detector.

Jets are reconstructed offline using the anti-k T clustering algorithm [37] with a distance parameter of 0.5. The particle four-vectors reconstructed by the CMS particle-flow algorithm [23, 24], are used as inputs to the jet clustering algorithm. The particle-flow algorithm combines information from all CMS sub-detectors to provide a complete list of long-lived particles in the event. Corrections based on simulation are applied to the jet energies to establish a uniform response across the detector and a first approximation to the absolute energy scale [38]. Additional jet energy corrections are applied to the data using measurements of energy balance in dijet and photon+jet control samples in data. These additional corrections take into account residual differences between the jet energy scale in data and simulation. The effect of pileup was significant during much of the data-taking period. Extra energy clustered into jets due to pileup is taken into account with an event-by-event correction to the jet momentum four-vectors. Jet candidates are required to satisfy quality criteria that suppress noise and spurious energy deposits in the calorimeters. The performance of jet reconstruction and the corrections are described in Ref. [38]. In this analysis, reconstructed jets are required to satisfy p T>40 GeV and |η|<2.4. The vector is defined as the negative of the vector sum of the transverse momenta of all the particles reconstructed and identified by the particle-flow algorithm.

In the muon channel, the preselection requires a single muon candidate [39] satisfying p T(μ)>20 GeV and |η|<2.1. Several requirements are imposed on the elements that form the muon candidate. The reconstructed track must satisfy quality criteria related to the number of hits in the pixel, strip, and muon detectors, and it must have an impact parameter d 0 in the transverse plane with respect to the beam spot satisfying |d 0|<0.02 cm and an impact parameter d z with respect to the primary vertex along the z direction satisfying |d z |<1.0 cm.

To suppress background in which the muon originates from a semileptonic decay of a hadron containing a bottom or charm quark, we require that the muon candidate be spatially isolated from other energy in the event. A cone of size ΔR=0.3 is constructed around the initial muon momentum direction in ηϕ space. The muon combined isolation variable, I comb=∑ΔR<0.3(E T+p T), is defined as the sum of the transverse energy E T (as measured in the electromagnetic and hadron calorimeters) and the transverse momentum p T (as measured in the silicon tracker) of all reconstructed objects within this cone, excluding the muon. This quantity is used to compute the combined isolation relative to the muon transverse momentum, \(I^{\mathrm{comb}}_{\mathrm{rel}}=I^{\mathrm{comb}}/p_{\mathrm{T}}(\mu)\), which is required to satisfy \(I^{\mathrm{comb}}_{\mathrm{rel}}<0.1\).

Electron candidates [40] are reconstructed by matching energy clusters in the ECAL with tracks in the silicon tracking system. Candidates must satisfy p T>20 GeV and |η|<2.4, excluding the barrel-endcap transition region (1.442<|η|<1.56). Quality and photon-conversion rejection requirements are also imposed. The relative isolation variable, defined in a manner similar to that in the muon channel, must satisfy \(I^{\mathrm{comb}}_{\mathrm{rel}}<0.07\) in the barrel region and \(I^{\mathrm{comb}}_{\mathrm{rel}}<0.06\) in the endcaps. The requirements on d 0 and d z are the same as those used in the muon channel.

The preselection requirements have a large effect on the sample composition. The lepton isolation requirement is critical for the rejection of QCD multijet processes, which have very large cross sections but are reduced to a low level by the isolation and the other preselection requirements. While many lepton candidates are produced in the semileptonic decays of hadrons containing b or c quarks, from π and K decays in flight, and from misidentification of hadrons, the vast majority of these candidates are either within or near hadronic jets. The background from W+jets events (primarily from W→eν or W→μν, but also W→τν) is initially also very large. This contribution is heavily suppressed by the three- or four-jet requirement. Depending on the particular signal region, either \(\mathrm{t}\overline{\mathrm{t}}\) or W+jets production emerges as the largest contribution to the background in the sample of events with moderate to large .

Events with a second isolated-lepton candidate satisfying the criteria listed in Table 1 are vetoed. This requirement not only suppresses SM background, but also minimizes the statistical overlap between the event sample used in this search and those used in multilepton searches. However, \(\mathrm{t}\overline{\mathrm{t}}\) events with dileptons can still be present, and this contribution must be determined, particularly because the presence of two neutrinos in the decay chains can result in large values of . The background involving W→τν decays, both from \(\mathrm{t}\overline {\mathrm{t}}\) events and from direct W production, must also be determined. To help suppress the dilepton background, the requirements on the veto leptons are somewhat looser than those on the signal lepton. For both muons and electrons, the p T threshold is p T>15 GeV, the isolation requirement is \(I^{\mathrm{comb}}_{\mathrm{rel}}<0.15\), and the impact parameter requirement is |d 0|<0.1 cm (the d z requirement is kept the same as for the signal lepton). In addition, some of the quality requirements for both the muon and electron are loosened.

Further event selection requirements are used in the individual background estimation methods described in Sects. 5, 6, and 7. The methods use the quantity H T, which is defined as the scalar sum of the transverse momenta of particle-flow jets j with \(p_{\mathrm{T}}^{j}>40~\mathrm{GeV}\) and |η j|<2.4,

(2)

The three background determination methods presented in the following three sections use different approaches to estimating the SM backgrounds using control samples in data. In Sect. 9, we compare the results of the different methods and make some observations about their features.

5 Lepton Spectrum method

5.1 Overview of the Lepton Spectrum method

This section describes the Lepton Spectrum (LS) method, which is named for the technique used to determine the dominant background source: genuine, single-lepton processes. Such processes account for about 75 % of the total SM background in the signal regions and arise primarily from \(\mathrm{t}\overline{\mathrm{t}}\), single-top, and W+jets events. Their contribution to the distribution is estimated by exploiting the fact that, when the lepton is produced in W-boson decay, the distribution is fundamentally related to the lepton p T spectrum, unlike the for many SUSY models. A more detailed description of the Lepton Spectrum method is given in the references [1, 41].

Non-single-lepton backgrounds are also determined using control samples in the data. Such events arise mainly from (i) \(\mathrm{t}\overline{\mathrm{t}}\) dilepton events, in which zero, one, or both of the leptons is a τ and (ii) \(\mathrm{t}\overline{\mathrm{t}}\) and W+jets events with a single τ→(μ,e) decay. Background from QCD multijet events is expected from simulation to be very small. However, the uncertainties in such simulations are difficult to quantify, because the QCD multijet background in the phase space relevant to this analysis arises from extreme tails of processes with very large cross sections. We therefore use control samples in data to measure the QCD multijet background. Simulated event samples are used for the determination of the Z+jets contribution, which is estimated with sufficient precision to be below one event for most of the signal regions.

The signal regions are defined with three thresholds in H T (H T≥500 GeV, H T≥750 GeV, and H T≥1000 GeV) and four bins in (, , , and ).

5.2 Estimation of single-lepton backgrounds

The physical foundation of the Lepton Spectrum method is that, when the lepton and neutrino are produced together in two-body W decay (either in \(\mathrm{t}\overline{\mathrm{t}}\) or in W+jets events), the lepton p T spectrum is directly related to the spectrum. The lepton and the neutrino share a common Lorentz boost from the W rest frame to the laboratory frame. As a consequence, the lepton spectrum reflects the p T distribution of the W, regardless of whether the lepton was produced in a top-quark decay or in a W+jets event. With suitable corrections, discussed below, the lepton p T spectrum can therefore be used to predict the spectrum for SM single-lepton backgrounds.

The distribution in many SUSY models is dominated by the presence of two LSPs. In contrast to the SM backgrounds, the and lepton p T distributions in SUSY processes are therefore nearly decoupled. The distribution for such models extends to far higher values than the lepton spectrum. Figure 1 shows the relationship between the lepton-p T and distributions in the laboratory frame for two simulated event samples: (i) the predicted SM mixture of \(\mathrm{t}\overline{\mathrm{t}}\) and W+jets events and (ii) the SUSY LM6 benchmark model. When taken from data, the upper-left region in Fig. 1 (top) provides the key control sample of high-p T leptons from SM processes. This region typically has very little contamination from SUSY events, which populate the high- region but have relatively low lepton p T values.

Fig. 1
figure 1

Distributions of muon p T vs.  in the μ channel for simulated \(\mathrm{t}\overline{\mathrm{t}}\) and W+jets events (top) and for the LM6 SUSY benchmark model (bottom)

The lepton p T spectrum is measured with a muon control sample defined by the preselection criteria and the H T requirements. Unlike the signal region, no requirement is applied, because even a modest one () would bias the high end of the lepton p T spectrum, which is critical for making the background prediction. Only muon events are used as a control sample, because the QCD multijet background is significant in the low- region of the electron sample. The number of events that are common to both the control sample and the signal region is small. For example, the overlap as measured in simulated \(\mathrm{t}\overline{\mathrm{t}}\) events is 3.6 % for H T≥750 GeV, , and p T≥250 GeV. Because no requirement is placed on the muon control sample, a small amount of QCD background remains and must be measured and subtracted. The scaling from the muon to the electron samples is obtained by fitting their ratio in the data over the range , with systematic uncertainties evaluated by varying the fit range. The resulting correction factor is N(e)/N(μ)=0.88±0.03±0.03, where the uncertainties are statistical and systematic, respectively.

To use the lepton spectrum to predict the spectrum in single-lepton SM background processes, three main issues must be understood: (i) the effect of the W-boson polarization in both \(\mathrm{t}\overline{\mathrm{t}}\) and W+jets events, (ii) the effect of the applied lepton p T threshold, and (iii) the difference between the experimental resolutions on the measurements of lepton p T and .

The status of theoretical and experimental knowledge of W-boson polarization in \(\mathrm{t}\overline{\mathrm{t}}\) and in W+jets events is discussed in Sect. 3. The helicity zero polarization state results in a forward–backward symmetric angular distribution of the lepton and the neutrino in the W rest frame (with respect to the W momentum direction), leading to identical lepton and neutrino spectra in the laboratory frame. In contrast, the helicity ±1 states result in angular asymmetries that lead to somewhat different lepton and neutrino p T spectra in the laboratory frame. These effects are taken into account by applying correction factors obtained from simulation to the measured lepton spectrum, with uncertainties as described in Sect. 8.

The second key issue in the Lepton Spectrum method is the effect of the threshold (p T>20 GeV) applied to the leptons in both the signal and control samples. Because of the anticorrelation between the lepton p T and the arising from non-zero W-boson helicity states, the threshold requirement removes SM background events in the high- signal region but not the events in the control sample with high-p T muons that are used to predict the high tail of the spectrum. For the \(\mathrm{t}\overline{\mathrm{t}}\) background, this effect partially compensates for the bias from the W polarization. For W+jets events, in contrast, the polarization effects for W+ and W approximately cancel, but the lepton p T threshold shifts the predicted yield upward. Correction factors from simulation are used to account for these effects (as well as for polarization effects), which are well defined and understood.

Finally, the resolution on the reconstructed is poorer than that for the lepton p T, so the spectrum is somewhat broadened with respect to the prediction from the lepton spectrum. We measure resolution functions in the data using QCD multijet events obtained with a set of single-jet triggers spanning the range from E T≥30 GeV to E T≥370 GeV. These resolution functions, or templates, quantify the resolution as a function of the number of jets and the H T of the event. These templates are used to smear the measured lepton momenta. Because the templates are taken from data, they include not only the intrinsic detector resolutions, but also acceptance effects. The overall effect of the smearing is modest, changing the background prediction by 5–15 %, depending on the threshold applied.

The raw background predictions for the single-lepton background are corrected to account for the effects described above, as well as for the small contamination of the single-lepton control sample arising from dilepton and single-τ events with high-p T leptons. These backgrounds are measured separately, as described below. The overall correction factor is defined such that the single-lepton prediction in a given signal region in simulation matches the yield from single-lepton processes.

The predicted single-lepton background yield varies from about 150 events for the signal region with and H T≥500 GeV to about 3 events for the region with and H T≥1000 GeV. These predictions, as well as the expectations from simulation, are presented in Tables 2, 3, and 4 and discussed in more detail in Sect. 5.4.

Table 2 Event yields for the Lepton Spectrum method for H T≥500 GeV. The upper part of the table gives the background predictions that are based on simulated (MC) event samples and the yield for the SUSY signal points LM3 and LM6. The lower part gives the backgrounds predicted using control samples in the data (data-driven prediction). The actual yield observed in data is given at the bottom, with the separate muon and electron yields given in parentheses (N μ ,N e) after the total yield. The uncertainties on the background predictions are statistical and systematic. The MC yields are not used in setting limits and are included only for reference. The uncertainties on the MC yields are statistical only
Table 3 Event yields for the Lepton Spectrum method for H T≥750 GeV. Further details are given in the Table 2 caption
Table 4 Event yields for the Lepton Spectrum method for H T>1000 GeV. Further details are given in the Table 2 caption

5.3 Estimation of non-single-lepton backgrounds

The non-single-lepton backgrounds include dilepton events in several categories, events with W→τν followed by τ decays (in either \(\mathrm{t}\overline{\mathrm{t}}\) or W+jets events), and QCD multijet processes. These subdominant backgrounds are estimated using control samples in data, in conjunction with information from simulation. The contribution from Drell-Yan and Z+jets is very small and is estimated directly from simulation.

Dilepton background events (including the τ as one of the leptons) contain at least two neutrinos, so these events can be important in the tails of the distributions. These backgrounds are divided into the following categories: (i) 2 events with one lost or ignored lepton (=e,μ), (ii) +τ events with τ→hadrons, and (iii) +τ events with τ→lepton. A lost lepton is one that is either not reconstructed or is out of the detector acceptance. An ignored lepton is one that is reconstructed but fails either the lepton-identification requirements (including isolation) or the p T threshold requirement.

To estimate the background from dilepton events with lost or ignored leptons, we compute the ratio of the combined yield of dilepton events in the ee, eμ, and μμ channels in data to the corresponding combined yield in simulated event samples. This ratio, which is 0.91±0.07 for H T≥500 GeV, 0.93±0.15 for H T≥750 GeV, and 0.87±0.37 for H T≥1000 GeV, is used to rescale the distribution of dilepton events that appear in the signal region in simulation. (Events within 20 GeV of the nominal Z mass are excluded in the e+e and μ + μ channels.) This approach is used because the dilepton control sample in data is small, and using it to obtain the shapes of distributions would result in large statistical uncertainties. For all bins above 250 GeV, the predicted yield from this background contribution is less than 6 events, and for all bins above 350 GeV, the yield is at or below 1 event. The distribution associated with the reconstructed dilepton events in data is well described by the simulation.

Dilepton events can also involve τ decays, either τ→ hadrons or τ. The distributions in the dilepton events in data, when suitably modified to reflect the presence of a leptonic or hadronic τ decay, provide an accurate description of the shape of the distribution of these backgrounds. Thus, to estimate the shape from the τ→ hadrons background, we effectively replace a lepton in a reconstructed dilepton event with a hadronic τ jet. Both hadronic and leptonic τ response functions are used, providing a probability distribution for a τ to produce a jet or a lepton with a given fraction p T(jet)/p T(τ) or p T()/p T(τ). These response functions, obtained from simulation, are computed in bins of p T(τ). This procedure can change the total number of jets above threshold in the event, as well as other properties such as H T and , which are recalculated. Simulated event samples are used to determine, for each of these processes i, the ratio \(r_{i}=N_{\mathrm{feed}}^{i}/N_{\mathrm{control}}\) of the number of events observed in the single-lepton channel to the number of events in the control sample, as a function of . This procedure effectively normalizes all such contributions to the control samples in data. For all bins above 250 GeV, the number of dilepton events with a τ→hadrons decay is predicted to be about 7 events or less and is much smaller in the higher bins. The number of dilepton events with a τ decay is predicted to be less than 3 events for all bins above 250 GeV and is much smaller in the higher bins.

Estimates for the τ single-lepton backgrounds from \(\mathrm{t}\overline{\mathrm{t}}\) and W+jets processes are based on a procedure similar to that used for the dilepton backgrounds, but in this case the single-lepton sample itself is used as the control sample. The distribution obtained by applying the τ response function to the data is rescaled by a ratio from simulation that gives the yield of τ background events divided by the yield of events in the single-lepton control sample, as a function of . The number of background events from the single τ contribution falls from 33 for H T≥500 GeV and to 1.1 event for H T≥500 GeV and .

The background predictions in data are shown in Fig. 2, where the expectation based on simulation is shown for comparison. The total predicted dilepton plus single τ background yield ranges from about 50 events for H T≥500 GeV and to about 1 event for H T≥1 TeV and . All of these predictions, as well as the expectations from simulation, are presented in Tables 2, 3, and 4, which are discussed in more detail in Sect. 5.4.

Fig. 2
figure 2

Predictions for dilepton and τ backgrounds after requiring H T≥750 GeV: control samples in data (green points with error bars) vs. MC predictions (black solid histogram) for (top) dilepton background and (bottom) τ background. The MC prediction has been scaled to the integral of the data prediction (Color figure online)

Background from QCD multijet events is suppressed to a level well below 1 event in nearly all signal regions, as shown in Tables 2, 3, and 4. The QCD multijet background is determined by first defining a control sample with small missing transverse momentum () and with a lepton impact parameter relative to the beam spot |d 0|>0.02 cm. These requirements select a sample with little contamination from other SM processes such as \(\mathrm{t}\overline{\mathrm{t}}\) and W+jets processes. Using this control sample, we measure the shape of the distribution in the combined relative isolation variable, \(I_{\mathrm{rel}}^{\mathrm{comb}}\) (see Sect. 4). The shape of this distribution has very little correlation to or to the lepton impact parameter (d 0), and so can be applied in the high- signal regions. For each signal region in the data, we determine the background at low values of \(I_{\mathrm{rel}}^{\mathrm{comb}}\) by first scaling the measured QCD multijet background shape in the relative isolation variable to the high-\(I_{\mathrm{rel}}^{\mathrm{comb}}\) sideband of the signal region. The shape is then used to extrapolate the yield to the low-\(I_{\mathrm{rel}}^{\mathrm{comb}}\) signal region. In the high- signal regions, some non-QCD SM background can be present at high \(I_{\mathrm{rel}}^{\mathrm{comb}}\), where the QCD background shape is normalized. We therefore subtract the estimated background from \(\mathrm{t}\overline{\mathrm{t}}\), W+jets, and Z+jets from this region. These yields are taken from simulation, with systematic uncertainties determined from a comparison with a control region in the data.

5.4 Results from the Lepton Spectrum method

Tables 2, 3, and 4 compare the background yields predicted from the control samples in data with the yields obtained directly from simulation for H T≥500 GeV, H T≥750 GeV, and H T≥1000 GeV, respectively. We observe that the single-lepton background is the dominant contribution in all regions. The various sources of uncertainties associated with these background determinations are discussed in Sect. 9. Finally, the yields observed in the signal regions in the data, which are listed at the bottom of each table, are consistent with the total background predictions based on the control samples. Thus, we observe no evidence for any excess of events in the data above the SM contributions.

Figure 3 shows the distributions in data for the combined muon and electron channels, with all of the selection requirements, except that on itself. The distributions are shown for H T≥500 GeV, H T≥750 GeV, and H T≥1000 GeV, on both linear and logarithmic scales. The predicted distribution (green-bar histogram) is a sum over three sources: single-lepton backgrounds (from \(\mathrm{t}\overline{\mathrm{t}}\), single-top, and W+jets events), dilepton background from \(\mathrm{t}\overline{\mathrm{t}}\), and single-τ events (from both \(\mathrm{t}\overline{\mathrm{t}}\) and W+jets processes). The vertical span of the green bar corresponds to the statistical uncertainty on the background prediction. (The systematic uncertainties are computed in wider bins used for setting the limits and are given in Tables 2, 3, and 4.) In each signal region, the blue histogram shows the contribution from the dilepton and single-τ backgrounds only. It is evident that the single-lepton background is dominant in all cases. The distributions for the SUSY benchmark models LM3 and LM6 are overlaid (not summed) for comparison. Systematic uncertainties and the interpretation are presented in Sect. 9.

Fig. 3
figure 3

Lepton Spectrum method: observed distributions in data (filled points with error bars) compared with predicted distributions (green bars) in the combined electron and muon channels, on linear (left) and logarithmic (right) scales. Three different H T thresholds are applied: H T≥500 GeV (upper row), H T≥750 GeV (middle row), and H T≥1000 GeV (lower row) (Color figure online)

6 Lepton Projection method

6.1 Overview of the Lepton Projection method

The Lepton-Projection (LP) method uses the difference between SM and SUSY processes in the correlation of the lepton transverse momentum and the missing transverse momentum. As previously discussed, in the SM processes the corresponds to the neutrino in the decay of the W boson, either in W+jets or in \(\mathrm{t}\overline{\mathrm{t}}\) events. The kinematics of W decays are dictated by the V–A nature of the W coupling to leptons and the helicity of the W boson, as discussed in Sect. 3. Since W bosons that are produced with high transverse momentum in W+jets events exhibit a sizable left-handed polarization, there is a significant asymmetry in the p T spectra of the neutrino and charged lepton. A smaller asymmetry is expected in W bosons from t quark (\(\bar{\mathrm{t}}\) antiquark) decays, which yield W bosons which are predominantly longitudinally polarized with smaller left-handed (right-handed) components for W+ (W).

We have measured the fraction of the helicity states of the W boson using an angular analysis of leptonic W decays [21]. Since the total momentum of the W boson in these decays, and therefore its center-of-mass frame, cannot be accurately determined because the momentum of the neutrino along the beam axis cannot be measured, an observable that depends only on transverse quantities is used. A variable that is highly correlated with the cosine of the polar angle (with respect to the W boson flight direction) in the center-of-mass frame of the W boson is the “lepton projection variable”:

$$ L_{\mathrm{P}}= \frac{\boldsymbol{p}_{\mathrm{T}}(\ell) \cdot\boldsymbol{p}_{\mathrm{T}}({\mathrm{W}})}{|\boldsymbol{p}_{\mathrm{T}} ({\mathrm{W}})|^2}, $$
(3)

where p T() is the transverse momentum of the charged lepton and p T(W) is the transverse momentum of the W boson. The latter quantity is obtained from the vector sum of the charged lepton transverse momentum and the missing transverse momentum in the event.

Since SUSY decay chains result in large values of , and often result in relatively low values of the lepton momentum as well, the L P distribution for SUSY events tends to peak near zero, whereas W+jets and \(\mathrm{t}\overline{\mathrm{t}}\) yield a broad range of L P values. This behavior is illustrated in Fig. 4, which compares the L P distribution from both SM processes and from two representative SUSY benchmark points (LM3 and LM6).

Fig. 4
figure 4

Distribution of L P in SUSY and standard model processes from simulation. Top: all distributions are normalized to the integrated luminosity. The different contributions from SM processes are shown, whereas for SUSY two benchmark points, LM3 and LM6, are displayed. Bottom: the same distributions normalized to unity. The SM distribution is the sum of all the individual SM processes shown in the top pane. The quantity is discussed in the text

In the L P method, two regions in L P are defined: the region with L P<0.15 is used as the signal region; the region with L P>0.3 is used as the control region, i.e., a sample that is depleted in the signal expected and is instead dominated by SM processes. These regions are selected using simulated event samples of W+jets, Z+jets, and \(\mathrm{t}\overline{\mathrm{t}}\), that are collectively referred to as electroweak (EWK) processes in what follows, as well as with simulated SUSY events with SUSY particle masses near the region currently under exploration.

6.2 Background estimation in the L P method

The key ingredient of the analysis is the estimate of the number of events in the signal region from the SM processes. We define a translation factor,

$$ R_\mathrm{CS} = \frac{N_\mathrm{MC}(L_{\mathrm{P}}<0.15)}{N_\mathrm {MC}(L_{\mathrm{P}}>0.3)}, $$
(4)

which is the ratio of the number of events in the signal and control regions for the EWK processes. The translation factor is obtained from MC simulation of the EWK processes, and the uncertainties on this factor are included in the systematic uncertainty of the background estimate. In the case of muons, where the background from QCD multijets is negligible, the total number of events predicted from SM processes in the signal region, \(N_{\mathrm{SM}}^{\mathrm{pred}}(L_{\mathrm{P}}<0.15)\), can be determined directly from the number of events observed in the data in the control region, N data(L P>0.3):

$$ N_\mathrm{SM}^\mathrm{pred}(L_{\mathrm{P}}<0.15) = R_\mathrm{CS} \cdot N_\mathrm{data}(L_{\mathrm{P}}>0.3). $$
(5)

In the case of the electrons, the presence of events from QCD multijet processes necessitates an independent evaluation of this background prior to the application of the translation factor for EWK processes.

The number of events estimated with this method is then compared to the number of events observed in the data in the signal region, N data(L P<0.15), for indications of an excess of events over the SM expectation. The analysis is performed in different regions of the event mass scale. To characterize the latter without affecting the correlation of the charged lepton and the neutrino in SM events, the scalar sum of the lepton transverse momentum and the missing transverse momentum, \(S_{\mathrm{T}}^{\mathrm{lep}}\), is used: . For W decays, \(S_{\mathrm{T}}^{\mathrm{lep}}\approx p_{\mathrm{T}}({\mathrm{W}})\) at large values of p T(W).

In order to make the search optimization less dependent on the unknown energy scale of a new physics signal, the analysis is performed in disjoint ranges of \(S_{\mathrm{T}}^{\mathrm{lep}}\) and the results in these ranges are combined. In addition, the selection is also binned in a second dimension, the H T variable, defined in Eq. (2).

As indicated in Table 1, the event selection used in this analysis is slightly different from the corresponding one in the LS analysis. To increase the sensitivity to SUSY decays, this analysis requires three or more jets. While this results in a significant increase in W+jets events, the additional SM background is mostly concentrated in the control region in L P.

The event yields in the muon and electron channels, as predicted from simulation, are shown in Table 5. As discussed previously, the dominant backgrounds to the lepton plus jets and signature arise from the production and decay of W+jets and \(\mathrm{t}\overline{\mathrm{t}}\). The production of single W bosons in association with jets, and with large transverse momenta, is in general the larger of the two, especially at lower jet multiplicities. The majority of the \(\mathrm{t}\overline{\mathrm{t}}\) background arises from semi-leptonic \(\mathrm{t}\overline{\mathrm{t}}\) decays, with fully leptonic \(\mathrm{t}\overline{\mathrm{t}}\) decays in which a lepton is either ignored or not reconstructed contributing about 20 % of the total \(\mathrm{t}\overline{\mathrm{t}}\) background.

Table 5 Expected event yields in the signal region (L P<0.15) from simulation. These yields are for H T>500 GeV. These MC values are only listed for illustration purposes

A source of background, which is not listed in Table 5, stems from QCD multijet events in which a jet is misreconstructed as a lepton. The simulation indicates that the magnitude of this background is small in the control region and negligible in the signal region. Nevertheless, since the uncertainties in simulating these backgrounds can be significant, we use control data samples to estimate the background in the muon and electron channels.

To estimate the background from QCD multijets in the muon final state, we use the relative combined isolation, \(I_{\mathrm{comb}}^{\mathrm{rel}}\), of the muon. Multijet events are expected to populate the region at high values of \(I_{\mathrm{comb}}^{\mathrm{rel}}\), whereas muons from SUSY decays are isolated and thus have low values of \(I_{\mathrm{comb}}^{\mathrm{rel}}\). We employ an additional control data sample, which is specially selected to be enriched in QCD multijets, to determine the ratio of multijet events at low values of the relative isolation. Using this ratio and the number of multijet events expected in the control region of the sample passing the preselection requirements, we estimate the background from multijet events in the signal region to be always smaller than 1 % of the EWK backgrounds. This level of background is negligible and is thus ignored in what follows.

The main sources of electrons in QCD multijet events are misidentified jets and photon conversions. This background is expected to be more substantial than the corresponding one in the muon sample, and its estimate exhibits a large dependence on the details of the simulation. For this reason, we estimate this background from control samples in data. The method relies on the inversion of one or more of the electron identification requirements in order to obtain a sample of anti-selected events, which is dominated by jets misidentified as electrons. We find that the inversion of the requirements on the spatial matching of the calorimeter cluster and the charged-particle track in pseudorapidity and azimuth leaves the relative fraction of the different background sources in QCD multijets unchanged. Moreover, to increase the number of events in this control sample, the requirements on d 0 and d z are removed, while the isolation requirement is loosened. These changes to the event selection have a negligible effect on the L P distribution in the data. In the simulated event samples, it is found that the L P distribution from the control sample events provides a good description of the corresponding distribution from QCD background passing all selection requirements.

The L P distribution obtained with this control sample is used as a template to fit, along with the L P distribution from EWK processes, the L P distribution in the data. In this fit, the EWK template is taken from simulation. This approach, which provides a template obtained from data for the QCD contamination, was applied in the measurement of the polarization of high-p T W bosons [21]. The fit is performed in the control region (L P>0.3), where the possible presence of signal is highly suppressed. The numbers of QCD and EWK events obtained by the fit are used to estimate the total SM contamination in the signal region (L P<0.15). The method for estimating the number of SM events expected in the signal region is applied in each range of \(S_{\mathrm{T}}^{\mathrm{lep}}\) and H T.

The method for estimating the SM expectation in the signal region is checked using two different control samples, where both the fit and signal regions have a negligible expected SUSY yield. The first sample is defined as all events satisfying the preselection requirements but confined to low values of \(S_{\mathrm{T}}^{\mathrm{lep}}\): \(150<S_{\mathrm{T}}^{\mathrm{lep}}<250~\mathrm{GeV}\). The method described above is employed to predict the number of events expected in the signal region for both muons and electrons. This prediction is found to be fully consistent with the number of events observed in the data in the signal region. The results of the fits and the yields of QCD and EWK events in the region of low \(S_{\mathrm{T}}^{\mathrm{lep}}\) (<250 GeV) are displayed in Fig. 5 for the electron and muon samples. As can be seen in Fig. 5, the QCD contamination in the signal region, L P<0.15, is negligible, as expected, since low values of L P favor events with low-p T leptons and high . The second sample, used only for events with muons, is collected with a separate trigger without any requirements on H T or . The muon transverse momentum threshold is raised to p T(μ)>35 GeV, while the H T threshold is lowered from 500 GeV to 200 GeV and the jet multiplicity requirement is reversed, to be fewer than three jets. Given these requirements on H T and on the jet multiplicity, this control sample is dominated by SM processes. It is found that the estimated background agrees well with the number of events seen in the signal region L P<0.15.

Fig. 5
figure 5

Fit results on data for \(150 < S_{\mathrm{T}}^{\mathrm{lep}}<250~\mathrm{GeV}\), in the muon (top) and electron (bottom) search samples. The fit is performed in the control region (L P>0.3) and the result is extrapolated into the signal region (L P<0.15)

6.3 Results of the L P method

The L P distributions in three ranges of \(S_{\mathrm{T}}^{\mathrm{lep}}\), are displayed in Fig. 6 for muons (top) and electrons (bottom). Tables 6 and 7 list the numbers of events observed and the number of events expected from all SM processes as presented above, in the signal region, for the muon and electron channels, respectively. The predictions, along with the numbers of events observed in each range of \(S_{\mathrm{T}}^{\mathrm{lep}}\) and H T, are also displayed graphically in Fig. 8 for muons and in Fig. 7 for electrons. The uncertainties quoted in Table 7 correspond to the statistical uncertainty of the fit, while the predictions displayed in Fig. 7 include the total statistical and systematic uncertainty.

Fig. 6
figure 6

Data and fit results for the predictions for the L P distribution, for events in the search sample, in different \(S_{\mathrm{T}}^{\mathrm{lep}}\) regions. Top plots for the muon channel; bottom plots for the electron channel. Left: \(250<S_{\mathrm{T}}^{\mathrm{lep}}<350~\mathrm{GeV}\), center: \(350<S_{\mathrm{T}}^{\mathrm{lep}}<450~\mathrm{GeV}\), and right: \(S_{\mathrm{T}}^{\mathrm{lep}}>450~\mathrm{GeV}\)

Fig. 7
figure 7

Comparison of the number of events observed in the data and the expectations from the background estimation methods for the electron channel, in the different \(S_{\mathrm{T}}^{\mathrm{lep}}\) bins. Left: 500<H T<750 GeV; Center: 750<H T<1000 GeV; Right: H T>1000 GeV. The error bars indicate the statistical uncertainty of the data only, while the green band indicates the total statistical and systematic uncertainty on the background estimate (Color figure online)

Fig. 8
figure 8

Comparison of the number of events observed in the data and the expectations from the background estimation methods for the muon channel, in the different \(S_{\mathrm{T}}^{\mathrm{lep}}\) bins. Left: 500<H T<750 GeV; Center: 750<H T<1000 GeV; Right: H T>1000 GeV. The error bars indicate the statistical uncertainty of the data only, while the green band indicates the total statistical and systematic uncertainty on the background estimate (Color figure online)

Table 6 Event yields in data and MC simulation for the muon sample. The results in the columns labeled “Total MC” are listed for reference only. The corresponding uncertainties statistical only
Table 7 Event yields in data and predictions of the numbers of EWK and QCD events for the electron sample in bins of H T. The sum of predicted EWK events and predicted QCD events in the control region is constrained to be equal to the total number of data events. The background estimate used in comparing to the yields in the data is the result of the procedure described earlier and is listed in the row labeled “SM estimate”. The uncertainties for the QCD and EWK background estimates are statistical only. The uncertainties shown for the SM estimate are first the statistical uncertainty from the control region fit and second all other systematic uncertainties

All estimates of the total contribution expected from SM processes in the various bins in (\(S_{\mathrm{T}}^{\mathrm{lep}},H_{\mathrm{T}}\)) are consistent with the numbers of events observed in the data, with no visible excess from a potential SUSY signal. The result is interpreted as a limit in SUSY parameter space in the context of the CMSSM in Sect. 9.

7 The Artificial Neural Network method

7.1 Overview of the method

The Artificial Neural Network (ANN) method uses a multi-variate analysis to combine several event characteristics, other than , into a single variable that distinguishes signal from background. Signal events then preferentially populate a signal region in the two-dimensional plane of the ANN output (z ANN) and , and the sidebands in this plane provide an estimate of the residual background.

Four input variables drive the ANN. The first two are n jets, the number of jets with p T>40 GeV, and H T, the scalar sum of the p T of each jet with p T>40 GeV. The SUSY signal typically has heavy particles decaying via complex cascades, and as such, is likely to produce more jets and larger H T than SM backgrounds. The third variable is Δϕ(j1,j2), the angle between the two leading p T jets in the transverse plane, which makes use of the greater likelihood that the two highest p T jets are produced back-to-back in SM than in SUSY events. The final variable is M T, the transverse mass of the lepton and system. In \(\mathrm{t}\overline{\mathrm{t}}\) and W+jets events, the lepton and generally arise from the decay of a W boson, and as a result, M T peaks near the W boson mass, with larger values arising only when there are additional neutrinos from τ or semileptonic decays. By contrast, in SUSY events, M T tends to be greater than the W mass because of due to undetected LSPs.

Figure 9 shows the distributions of these variables for simulated SM and SUSY events. The most powerful input variable is M T; n jets and H T also have considerable discriminating power. The Δϕ(j1,j2) variable is weaker, but it still improves the sensitivity of the search. Lepton p T also discriminates between the SM and SUSY, but it is not included in the ANN because its strong correlation with in the SM would spoil the background estimate. Additional variables either do little to improve sensitivity or introduce a correlation between z ANN and . The input variables have similar distributions in the muon and electron channels, so we choose to train the ANN on the two channels combined, and use the same ANN for both. In general, the SM simulation describes the data adequately apart from a possible small structure near 130 GeV in the M T distribution. Reweighting the simulation to match the M T distribution in data does not affect the results of the analysis.

Fig. 9
figure 9

The distributions of n jets, H T, Δϕ, and M T for data (solid circles), simulated SM (stacked shaded histograms), LM3 (open circles), and LM6 (open triangles) events after preselection. The small plot beneath each distribution shows the ratio of data to simulated SM yields. The muon and electron channels are combined

The ANN infrastructure uses standard Root utilities [42]. During training, weights are determined that minimize the root-mean-square deviation of background events from zero and signal events from unity. For the SUSY parameter space under study, our sensitivity depends only mildly on the details of the signal sample that trains the ANN. Specifically, for LM points 0 through 13 [30], the sensitivity is comparable (less than 30 % variation) whether the ANN is trained on LM0, LM6 or LM9, even though these three training samples have rather different characteristics. We select LM0 for training because it gives the best overall performance. The SM simulation provides the background sample.

Figure 10 compares the distributions of z ANN for data and SM simulation for all events surviving the preselection. The two distributions are consistent within the uncertainties. The SM contribution is concentrated at small values of z ANN, while the LM3 and LM6 SUSY distributions, which are also shown, extend to high values of z ANN where the SM is suppressed.

Fig. 10
figure 10

The z ANN distribution of the data (solid circles) and simulated SM (stacked shaded histograms), LM3 (open circles), and LM6 (open triangles) events, after preselection. The small plot beneath shows the ratio of data to simulated SM yields

We define two signal regions in the two-dimensional and z ANN plane. One region, referred to as the “low-” signal region, has z ANN>0.4 and , while the other, the “high-” signal region, has the same z ANN range, but . The high- signal region minimizes the probability that the expected background fluctuates up to a LM6 signal when signal contamination is taken into account. We observe 10 events in the low- signal region and 1 event in the high- signal region.

7.2 Background estimation using the ANN sidebands

The sidebands in the two dimensional plane of and z ANN provide a strategy for estimating the background. The signal and sideband regions are shown in Fig. 11 and are denoted A, B, C, and D for the low- signal region and A, B′, C, and D′ for the high- signal region. The choice of boundaries for the sideband regions balances the competing needs of statistics and insensitivity to signal contamination against preserving similar event compositions in the signal and sideband regions.

Fig. 11
figure 11

The yields of simulated SM (left) and LM6 (right) events in the versus z ANN plane. The regions D and D′ are the low- and high- signal regions. The sideband regions are also indicated

The predicted yield in region D is given by

$$ N_{\mathrm{D},\mathrm{pred}} = \frac{ N_{\mathrm{B}} \times N_{\mathrm{C}} }{ N_{\mathrm{A}} }, $$
(6)

where N i is the yield in region i, and the predicted yield in region D′ is defined similarly. This procedure is equivalent to using the distribution of the z ANN sideband regions (A, B, and B′) as a template for the distribution of events with high z ANN (C, D and D′), normalized using the yields in regions A and C. We test this estimation procedure using SM simulation: Fig. 12 (top) shows that the distributions for low and high z ANN are similar.

Fig. 12
figure 12

The distributions of events in the z ANN signal region (solid circles) and sideband (green bars) for simulated SM (top) and data (bottom) events. The distributions are normalized in the sideband, (regions A and C for the two distributions respectively). The rightmost histogram bin includes overflow. The small lower plots show the ratio of normalized sideband to signal yields (Color figure online)

If a signal is present, it enters primarily in the signal regions D and D′, but there are also significant contributions relative to the SM in regions B and B′, somewhat increasing the predicted backgrounds in D and D′. This effect is accounted for in the final results.

Table 8 summarizes the event yields in the sideband subtraction regions for the various components of the SM background. The W+jets and \(\mathrm{t}\overline{\mathrm{t}}\) dominate in all the regions, though their relative proportion varies. The W+jets events are most important at low z ANN since M T, which largely drives z ANN, tends to peak near the W-boson mass. Because the W bosons (and hence their daughters) can be highly boosted, these events extend to very high values of . As seen in Fig. 10, \(\mathrm{t}\overline{\mathrm{t}}\) events are more likely to have high values of z ANN than are W+jets events; this is because of the presence of dilepton \(\mathrm{t}\overline{\mathrm{t}}\) events, in which both W bosons (from the top quark pair) decay leptonically, but only one lepton is identified (dilepton ()), giving large M T. There is also a small contribution from events in which the lepton comes from the decay of a τ produced from a top quark decay, with the other top quark decaying either leptonically (dilepton (τ)) or hadronically (single τ). The remaining small backgrounds come from single-top-quark, QCD multijet and Z+jets events.

Table 8 Event yields for the sideband (SB) and signal regions used in the ANN method. The uncertainties listed are statistical only

There are too few events in the simulated QCD multijet and Z+jets samples to populate the high regions (B, B′, D and D′). For the results quoted in Table 8 for QCD multijet and Z+jets events, we employ an extrapolation technique based on loosening the z ANN and requirements. The extrapolated numbers for all the regions are consistent with those obtained from the simulated samples. The simulated yields in the sideband and signal regions indicate that QCD multijet and Z+jets events are negligible.

The total SM simulation yields agree well with data in all regions, suggesting that the data share the main features described above. The z ANN and distributions are shown in Fig. 13.

Fig. 13
figure 13

Distributions of for (a) 0.2<z ANN<0.4 and (bz ANN>0.4, and distributions of z ANN for (c and (d. The samples shown are data (solid circles), simulated SM (stacked shaded histograms), LM3 (open circles), and LM6 (open triangles) events. The small plot beneath each distribution shows the ratio of data to simulated SM yields

7.3 Results of the ANN method

Figure 12 (top) shows the results of applying the background estimation method to the SM simulation. We find that the method correctly predicts the background within a factor of \(\kappa= \mathrm{D}^{\prime}/ \mathrm{D}^{\prime}_{\mathrm{pred}}\) of 0.82±0.12 (stat.) in the low- signal region and 0.69±0.16 (stat.) in the high- signal region. The modest deviation from unity results from a correlation between z ANN and that arises because the W+jets background, which extends to large values, dominates in the z ANN sideband (because it tends to have M T near the W mass), whereas dileptonic \(\mathrm{t}\overline{\mathrm{t}}\) events, with their somewhat softer spectrum, dominate in the z ANN signal region.

Figure 12 (bottom) shows the distributions of the data in the high and low z ANN regions, after normalizing in the region (A and C). Because the SM simulation appears to describe the data well, with, for example, consistent exponential decay constants describing the distributions in the ANN sidebands, we choose to scale the background prediction of the data by κ. The uncertainty in the background from the relative cross sections of SM processes and other effects is quantified in Sect. 8. In the low- signal region, we expect 9.5±2.2 (stat.) events, and in the high- signal region 0.7±0.5 (stat.) events. The observed yields are 10 and 1 events, respectively, consistent with the background prediction.

8 Systematic uncertainties

Systematic uncertainties affect both the background estimates and the signal efficiencies. The sources of systematic uncertainty in the background predictions vary among the three methods, both because the final event selections differ and because the background estimation methods themselves differ. The systematic uncertainties stem from lack of perfect knowledge of the detector response and from uncertainties in the properties of the SM backgrounds. Common uncertainties for all methods are described in Sect. 8.1, while details that are specific to each method are given in Sects. 8.2, 8.3, and 8.4 for the LS, L P, and ANN methods, respectively. Tables 9, 10, and 11 list the main uncertainties associated with each method. The systematic uncertainties affecting the signal efficiency and luminosity, which are largely common to all methods, are described in Sect. 8.5.

Table 9 Sources of systematic uncertainties for the LS method and their effects on the background prediction in bins of . The full list of systematic uncertainties is given for H T>750 GeV, and the total uncertainties are shown for H T>500 GeV and H T>1000 GeV. Each uncertainty is expressed as a change in the ratio of the predicted to the true number of events (evaluated with simulation). Uncertainties associated with the dilepton and QCD backgrounds are discussed in the text. The total uncertainty is the individual uncertainties summed in quadrature
Table 10 Sources of systematic uncertainties for the L P method and their effects on the background prediction in bins of \(S_{\mathrm{T}}^{\mathrm{lep}}\) for the muon and electron channels. The full list of systematic uncertainties are given for the range 500<H T<750 GeV, and the total uncertainties are shown for the two ranges 750<H T<1000 GeV and H T>1000 GeV. The total uncertainty is the individual uncertainties summed in quadrature
Table 11 Sources of systematic uncertainties for the ANN method and their effects on the background prediction in bins of . The total uncertainty is the individual uncertainties summed in quadrature

8.1 Common uncertainties in the background predictions

The jet energy scale (JES) and its effect on in the event can affect the H T and distributions and can also lead to differences between the lepton p T spectrum and spectrum. To understand the effects of energy-scale variations, we vary the jet energy scale as a function of p T and η by amounts determined in independent studies of jet energy scale uncertainties [38], and corresponding to 2 GeV or less for jets with p T>40 GeV, and then recompute H T and . We also vary the energy scale of “unclustered” calorimeter deposits by 10 % to take into account energy not clustered into jets (this effect is very small).

The uncertainty in the lepton efficiency accounts for differences between data and simulation and uncertainties in the trigger efficiencies. The lepton efficiencies are studied using a sample of lepton pairs with invariant mass close to the Z peak, in which one lepton satisfies tight selection criteria, and the second, reconstructed with relaxed criteria, serves as a probe of the tighter reconstruction and isolation requirements (“tag-and-probe” method [43]). Discrepancies between the data and simulation for electrons are maximal at low p T (10 % effect at around 20 GeV), and we reweight events as a function of lepton p T to quantify the effect. The total lepton efficiency in data is described by simulation with an accuracy of 3 %. Studies of the trigger that separately determine the efficiencies of the \(H_{\mathrm{T}}^{\mathrm{trigger}}\), , and lepton requirements show that the lepton inefficiencies dominate, and amount to 2 % to 3 % for leptons that are reconstructed successfully offline. Muon p T scale uncertainties are obtained from the study of the q/p T (transverse curvature with sign given by the electric charge q) distribution of muons in Z events in data. By comparing the q/p T distribution of positive and negative muons it is possible to quantify the amount of bias in the measurement of q/p T.

The relative amount of \(\mathrm{t}\overline{\mathrm{t}}\) and W+jets background affects each analysis method through corrections obtained from simulation. The contributions from \(\mathrm{t}\overline{\mathrm{t}}\) and W+jet have not been specifically measured in the narrow region of phase space studied in this analysis and their relative contribution must be evaluated. The \(\mathrm{t}\overline{\mathrm{t}}\) cross section is validated using an algorithm based on the reconstructed top-quark masses for both the hadronic and the leptonic top-quark decays. The uncertainty in the \(\mathrm{t}\overline{\mathrm{t}}\) cross section is determined by comparing yields in data and simulation after a selection based on top mass variables. The W+jets cross section is validated by comparing event yields between data and simulation in Z+jets events in a dedicated dilepton event selection with similar kinematics. We assign an uncertainty to the W+jets cross section based on the agreement of the data and simulation in the Z+jets sample. Using the uncertainties obtained for the \(\mathrm{t}\overline{\mathrm{t}}\) and W+jets cross sections, we probe different relative contributions of \(\mathrm{t}\overline{\mathrm{t}}\) and W+jets events in our sample and the effect on our background predictions.

Uncertainties in the polarization fraction for the W boson, either in \(\mathrm{t}\overline{\mathrm{t}}\) or W+jets events, must be taken into account. For the W polarization in \(\mathrm{t}\overline{\mathrm{t}}\) events, the theoretical uncertainties are very small (see Sect. 3) and have negligible effect on the background predictions. The W polarization in W+jets events, which is described in more detail in Sect. 3, is more complicated than in \(\mathrm{t}\overline{\mathrm{t}}\) production. In this case, we consider the effect of conservative variations of the helicity fractions in bins of W-boson p T and η with respect to the theoretical NLO calculations [36].

For the dilepton \(\mathrm{t}\overline{\mathrm{t}}\) background, \(\mathrm{t}\overline{\mathrm{t}}\) (ℓℓ), the uncertainties are evaluated somewhat differently for the different methods. In the L P and ANN methods this background is evaluated together with the same control sample as for the main single-lepton background prediction. Uncertainties in the prediction can arise from finite detector acceptance, inefficient lepton identification, and cross section uncertainties. In the LS method the dilepton \(\mathrm{t}\overline{\mathrm{t}}\) background is not predicted using the single-lepton background prediction and separate control samples must be used. Thus the uncertainties for the dilepton \(\mathrm{t}\overline{\mathrm{t}}\) background are estimated separately and described in the next section.

The small residual QCD multijet background is probed by inverting the requirement on \(I_{\mathrm{rel}}^{\mathrm{comb}}\) or the electron selection criteria to obtain QCD dominated control samples. Contamination from \(\mathrm{t}\overline{\mathrm{t}}\) and W+jet events in these control samples must be considered and the uncertainties on their cross sections are the dominant uncertainty for these methods.

The Z+jets contribution to the signal regions is very small and uncertainties on this background prediction come from lepton efficiency and cross section uncertainties. In addition, for the LS method there is a small Z+jets contamination to the single-lepton control sample, which must be subtracted, and lepton efficiency and cross-section uncertainties are considered for this as well.

8.2 Lepton Spectrum method background prediction uncertainty

For the LS method, the systematic uncertainties for each of the different background predictions from control samples in data (1 , dilepton, 1 τ, QCD, and Z+jets) are included in Tables 2, 3, and 4. To determine the systematic uncertainties for the largest source of background, 1- events (arising from \(\mathrm{t}\overline{\mathrm{t}}\), W+jets, and single-top processes), we evaluate deviations for the -dependent correction factor, which is determined from simulation and applied to the 1- background prediction (see Sect. 5.2). Table 9 gives a breakdown of the contributions of the systematic uncertainties for the 1- prediction in bins of and for H T>750 GeV. The uncertainties in the 1- prediction for the H T>500 GeV and H T>1 TeV signal regions are similar to those listed in Table 9. The largest source of uncertainty arises from the potential difference in the muon p T and the scales, because the muon p T spectrum is used to predict the spectrum. The statistical uncertainties in the correction factors (denoted as K-factors in Table 9) for the 1- method are slightly smaller than the combined systematic uncertainty of the correction factor. Table 9 does not include an uncertainty from jet resolution effects because this is taken into account by the smearing of the lepton p T spectra by QCD multijet templates (described in Sect. 5.2). For the purposes of setting limits, the total systematic uncertainty in the 1- background prediction is treated as correlated across all bins in .

Tables 2, 3, and 4 also list the non-single-lepton backgrounds, which account for about 25 % of the total, with a relative uncertainty of 5–10 % in the lowest- bin and about 30 % in the highest- bin. For the dilepton prediction of lost and ignored leptons (described in Sect. 5.3) the main sources of systematic uncertainty arise from the lepton reconstruction and identification efficiencies and the top-quark p T spectrum. The uncertainties on the lepton efficiencies are described in Sect. 8.1, and the uncertainty associated with the top-quark p T spectrum is determined from varying the fraction of events in the tail of this distribution in simulation in a manner consistent with the uncertainty in this tail as observed in data. This uncertainty is then propagated through the background determination procedure.

8.3 Lepton Projection method background prediction uncertainty

For the L P method, the estimate of the total number of events expected from SM processes in the signal region, \(\mathrm{N}_{\mathrm{SM}}^{\mathrm{pred}}(L_{\mathrm{P}}<0.15)\), relies on the knowledge of the translation factor, R CS, as well as the number of events observed in the control region, subtracted for the QCD background, Ndata(L P>0.3). There are, therefore, two sources of uncertainty in this estimate: uncertainties in the number of events from EWK processes in the control region and uncertainty in R CS. The relative change on the predicted background from each source of systematic uncertainty is listed in Table 10 for both muons and electrons. The largest uncertainty for high \(S_{\mathrm{T}}^{\mathrm{lep}}\) bins is the statistical uncertainty in the data in the control region. The second largest uncertainty comes from the JES uncertainty. The effect from the JES uncertainty is larger in the electron channel, since the JES affects also the shape of the L P distribution used in the fit of the control region. The uncertainty in the resolution of the measurement of the hadronic energy recoiling against the lepton and is evaluated conservatively by smearing the total recoil energy in simulation by an additional 7.5 % along the direction of the recoil and by 3.75 % in the direction orthogonal to the recoil. This decreases the resolution more than 10 % for the high recoils (above 250 GeV) of the signal region and thus covers the difference between data and simulation.

8.4 ANN method background prediction uncertainty

For the ANN method, the systematic uncertainty in the background prediction is dominated by the statistics of the simulation, which probes for bias in the background estimation. Another important uncertainty comes from the p T spectrum of the W boson in W+jets events, since it affects the distribution of these events, which preferentially populate the z ANN sideband. To assess the impact, we reweight the p T spectrum of W boson events, using the differences in the p T spectra of Z bosons in data and simulation as a guide. This uncertainty is driven by the statistics of the Z+jets sample. The relative proportions of W+jets and \(\mathrm{t}\overline{\mathrm{t}}\) events differ in the z ANN signal and sideband regions so the background prediction depends on their relative cross sections. Those \(\mathrm{t}\overline{\mathrm{t}}\) events with two leptons in the final state, only one of which is observed, have large and are the source of most SM events in the signal region. In addition to the \(\mathrm{t}\overline{\mathrm{t}}\) cross section, this background depends on lepton acceptance and identification inefficiencies. Additional sources of systematic uncertainty are the hadronic and leptonic energy scales. Table 11 summarizes these uncertainties.

8.5 Signal efficiency and other multiplicative uncertainties

The systematic uncertainty in the signal yield arises from the uncertainty in the signal efficiency. In general, this uncertainty is correlated across or \(S_{\mathrm{T}}^{\mathrm{lep}}\) bins. The JES component of the signal efficiency uncertainty is computed separately for each model point in CMSSM and simplified model parameter space and is correlated with the JES uncertainty in the single-lepton background prediction. The systematic uncertainties in the signal efficiency associated with lepton reconstruction and the trigger amount to 3 %. The uncertainty in the integrated luminosity is 2.2 % [44]. The systematic uncertainty in the signal efficiency, not including the JES component, is 6 % for each of the analyses.

9 Results and interpretation

The LS, L P, and ANN methods each yield SM background predictions that are consistent with the number of events observed in data. We therefore proceed to set exclusion limits on SUSY model parameters. All limits are computed using the modified-frequentist CLs method [45] with a one-sided profile likelihood test statistic. To interpret the absence of an observed signal, three complementary approaches are used.

9.1 Constraints on CMSSM parameter space

First, we scan over models in the CMSSM and determine whether the number of events predicted at each model point in parameter space can be excluded by the measurements. This procedure relies on the fact that the CMSSM parameter space can be described with just five parameters, and we fix three of them to commonly used values (A 0=0 GeV, μ>0, tanβ=10). Each model point has a complete SUSY particle spectrum and a well defined cross section, which typically involves several production subprocesses. The CMSSM simulated samples are initially generated using leading-order cross sections. At each point in CMSSM parameter space, the predicted yields for each production subprocess (e.g., \(\mathrm {g}\mathrm{g}\to \widetilde{\mathrm{g}}\widetilde{\mathrm{g}}\)) are corrected using the NLO cross sections discussed in Ref. [46]. Using the observed yield in data and the predicted background, we determine whether the CMSSM yield for the particular model point can be excluded at the 95 % confidence level (CL). This procedure is complicated by the fact that the control regions in data could potentially be contaminated by signal events. This effect is taken into account for each model by removing the expected contribution to the predicted background arising from signal contamination of the control regions.

Figures 14, 15, and 16 show the CMSSM exclusion region [47] for the three background estimation methods, evaluated in the m 1/2 vs. m 0 plane, with the values of the remaining CMSSM parameters fixed at tanβ=10, A 0=0 GeV, and μ>0. Figure 17 displays all of the results together. The excluded regions are below the plotted curves, corresponding to SUSY particle masses below certain values. For reference, the plots display curves of constant gluino and squark masses. The lines of constant gluino mass are approximately horizontal with \(m(\widetilde{\mathrm{g}})\approx 2.5~m_{1/2}\). Lines of constant squark mass are strongly curved in the m 1/2 vs. m 0 plane. At low m 0, the analyses exclude gluinos with masses up to about 1.3 TeV, but the sensitivity falls with increasing m 0. To determine the one standard deviation (σ) theoretical uncertainty on the observed limit, the signal yields are recomputed after changing each of the process-dependent SUSY production cross sections at each model point by ±1σ of their uncertainty arising from the parton distribution functions and renormalization and factorization scales [46].

Fig. 14
figure 14

LS method: exclusion region in CMSSM parameter space for the H T>750 GeV selection

Fig. 15
figure 15

L P method: exclusion region in CMSSM parameter space for all H T bins combined

Fig. 16
figure 16

ANN method: exclusion region in CMSSM parameter space

Fig. 17
figure 17

Exclusion region for the LS, L P, and ANN methods in CMSSM parameter space. Results from the low- and high- signal regions are combined

9.2 Constraints on simplified model parameter space

The second approach to interpretation is based on the use of simplified models [14, 15], which provide a more generic description of new physics signatures. Such models do not include a full SUSY particle spectrum, but instead include only the states needed to describe a particular set of decay chains of interest. Rather than excluding a model, the procedure is to calculate cross section upper limits on a given topological signature. (Such cross section limits can, however, be converted into limits on particle masses within the assumptions of the particular model.) Because simplified models do not describe a full SUSY spectrum, the number of free parameters is small. Furthermore, the parameters are simply the masses of the SUSY particles, in contrast to the grand-unified-theory-scale parameters used in the CMSSM. An advantage of simplified models is that, as a consequence, certain relationships between particle masses that arise with the CMSSM no longer hold, and the spectra can be much more generic.

We consider the “Topology 3 weakino” (T3w) simplified model, which involves the production of two gluinos and their decay via the mechanism shown in Fig. 18. One gluino is forced to decay into two quark jets plus the LSP (\(\widetilde{\chi}^{0} \)) via the three-body decay \(\widetilde{\mathrm{g}}\to\mathrm{q}\bar{\mathrm{q}}\widetilde{\chi}^{0}\), while the other gluino decays via \(\widetilde{\mathrm{g}}\to\mathrm{q}\bar{\mathrm{q}}^{\prime}\widetilde{\chi }^{\pm}\), followed by \(\widetilde{\chi}^{\pm}\to\mathrm{W}^{\pm}\widetilde{\chi}^{0}\). The W± boson can then decay leptonically. The T3w model is specified by masses of the gluino, the LSP (\(\widetilde{\chi}^{0}\)), and an intermediate chargino (\(\widetilde{\chi}^{\pm}\)). We calculate cross section limits as a function of \(M(\widetilde{\mathrm{g}})\), assuming a fixed value for the LSP mass \(M(\widetilde{\chi }^{0})=50~\mathrm{GeV}\) and setting the chargino mass according to \(M(\widetilde{\chi}^{\pm})=0.5( M(\widetilde{\chi}^{0}) + M(\widetilde {\mathrm{g}}))\). The nominal production cross section for the gluino pair production mechanism is given in Ref. [46]. Figure 19 shows the cross sections excluded by each method for this model. The limits fluctuate significantly at low \(M(\widetilde{\mathrm{g}})\) because of the low signal efficiency in this region.

Fig. 18
figure 18

Diagram for production and decay in the T3w simplified model

Fig. 19
figure 19

Excluded cross sections for the LS, L P, and ANN methods for the T3w simplified model

9.3 Alternate model exclusions

The data can be interpreted using a third approach, which is applicable to models that do not fall within the scope of either the CMSSM or the simplified model discussed in this section. A model builder can investigate the sensitivity of any one of the three methods presented in this paper to a given signal hypothesis by applying the event selection requirements listed in Table 1, together with the final requirements that define the signal regions. We provide a simple efficiency model for the most important observables used in the event selections. The efficiency model can then be applied to a basic (pythia) simulation of the signal process.

The efficiency model is based on parametrizations of the efficiencies for the event selection requirements with respect to the main reconstruction objects and quantities, such as H T, , and lepton p T. The efficiency of the analysis for a given model can be estimated by applying these individual reconstruction efficiencies, which are given as a function of the most important parameter (such as lepton p T), to the corresponding kinematic distributions in the model. This procedure would then yield an estimate for the number of signal events from the model. Finally, the sensitivity of the analysis to the model can be obtained by comparing the yield of signal events obtained in this manner with the background yields given in this paper. Kinematic correlations (which can be model dependent) are not taken into account, but this approach nonetheless provides a first approximation to the sensitivity.

The efficiencies for each analysis object are described using “turn-on” curves, which are simply error functions,

$$ \epsilon(x) = \epsilon_\mathrm{plateau} \frac{1}{2} \biggl[ \operatorname{erf} \biggl(\frac{x-x_\mathrm{thresh}}{\sigma}\biggr) + 1 \biggr], $$
(7)

where x represents the variable most relevant for the reconstruction of the particular object. The error function is parametrized in terms of the plateau efficiency, ϵ plateau; the turn-on threshold, x thresh; and the characteristic width of the turn-on region, σ. These parameters are obtained by fitting simulated event samples as a function of the true (generated) value.

The selection efficiency associated with the lepton reconstruction, identification, and isolation requirements is estimated as a function of lepton p T by considering muons and electrons (including those from τ decay) generated in the pythia-simulated hard-scattering process. The lepton isolation requirement has a large effect on the efficiency, which consequently depends on the number of jets in the event. To reduce the model dependence arising from this effect, two categories of leptons are considered. First, we assign zero efficiency to leptons that are within ΔR<0.4 of a quark or gluon with p T>40 GeV in the hard-scattering process. The efficiency for the remaining leptons is described by a turn-on curve whose parameters are listed in Table 12. The efficiencies are specified for both the lepton selection and for the lepton veto.

Table 12 Efficiency-model parameters for lepton efficiencies as a function of xp T. The leptons are required to lie within the fiducial region and must satisfy the p T thresholds specified in Table 1

The number of jets and the resulting H T value for each event are computed using information available at the generator level. The same clustering algorithm used to reconstruct jets in the data is applied to the generator-level particles. The resulting generator-level jets are required to satisfy ΔR>0.3 with respect to the leptons described above. The variable is estimated at the generator level from the transverse momenta of neutrinos and any new weakly interacting particles, such as the \(\widetilde{\chi}^{0}\). The parametrizations of the efficiency turn-on curves for the H T and requirements are listed in Tables 13 and 14, respectively. For the requirements used with the LS method, the information given in these tables generally reproduces the efficiency from full simulation to within about 15 %.

Table 13 Efficiency-model parameters for xH T
Table 14 Efficiency-model parameters for

In the L P method, the variables L P and \(S_{\mathrm{T}}^{\mathrm{lep}}\) are functions of lepton p T and . The modeling of lepton p T is described above. To emulate , one needs to apply both a scale shift and smearing to the generated value. The scale factor is . The value of is about 0.2 at . It falls linearly to about 0.06 at , and it remains at 0.06 for .

In the ANN method, the preselection requirements on H T and are 400 and 100 GeV, respectively. The signal regions are specified by and together with z ANN>0.4, where z ANN is a functionFootnote 1 of n jets, H T, Δϕ(j1,j2), and M T. The efficiency turn-on curve for z ANN>0.4 is approximated by the parameter values ϵ plateau=0.98, x thresh=0.41, and σ=0.1.

With these additional procedures, the emulation of the efficiencies for the L P and ANN methods is found to be accurate to within ∼15 %, as for the LS method.

10 Summary

Using a sample of proton–proton collisions at \(\sqrt{s}=7\) TeV corresponding to an integrated luminosity of 4.98 fb−1, we have performed a search for an excess of events with a single, isolated high-p T lepton, at least three jets, and large missing transverse momentum. To provide a robust and redundant determination of the SM backgrounds, three methods are used, each of which relies primarily on control samples in the data.

The Lepton Spectrum (LS) method exploits the relationship between two key observables, the lepton p T distribution and the distribution. In the dominant SM background processes, which have a single, isolated lepton, this connection arises from the fact that the lepton and neutrino are produced together in the two-body decay of the W boson, regardless of whether the W is produced in \(\mathrm{t}\overline{\mathrm{t}}\) or W+jets events. In many SUSY models, however, the is associated with the production of two neutralinos, which decouples from the lepton p T spectrum. Smaller backgrounds arising from \(\mathrm{t}\overline{\mathrm{t}}\) dilepton events, from τ decays in \(\mathrm{t}\overline{\mathrm{t}}\) or W+jets events, and from QCD multijet processes are also estimated using control samples in the data. In the sample investigated with this method, at least four jets are required, which helps to suppress the background from W+jets events. Nine signal regions are considered, specified by three thresholds on H T and three bins of . The observed yields in each region are consistent with the background estimates based on control samples in the data.

The Lepton Projection (L P) method exploits information on the W-boson polarization in \(\mathrm{t}\overline{\mathrm{t}}\) and W+jets events. The dimensionless L P variable itself is sensitive to the helicity angle of the lepton from W decay, but it also provides discrimination between signal and background through the ratio of the lepton p T and the values, which is small in SUSY-like events. The \(S_{\mathrm{T}}^{\mathrm{lep}}\) variable maps out a diagonal line in the plane of lepton p T vs.  and reflects the W transverse momentum for the boosted W boson. The L P distributions are studied in bins of \(S_{\mathrm{T}}^{\mathrm{lep}}\), and H T, and at least three jets are required. In each signal region, the data are in agreement with expectations from the SM.

The artificial neural network (ANN) method provides a means to obtain the distribution of background events in data by constructing a neural network variable z ANN, which has a very small correlation with . This variable also provides strong discrimination between signal and background events, so that the background regions do not suffer from large signal contamination in the models considered. A key element of the z ANN variable is the transverse mass of the lepton- system, but additional variables, such as the number of observed jets, play a role as well. In the ANN analysis, no excess of events is observed in the signal regions with respect to the SM background prediction.

Because these methods probe extreme kinematic regions of the background phase space, the use of redundant approaches provides confidence in the results. Although the LS and L P methods both make use of information on the W-boson polarization in the background, they are based on different kinematic variables and have different signal regions. The LS method breaks the background into several pieces (single lepton, τ, dilepton, and QCD) and provides a direct background prediction for the distribution. In contrast, the L P method defines a powerful kinematic variable that is used to obtain a global background prediction by extrapolating an overall background shape from a control region into the signal region. The ANN method similarly uses a global approach to estimating the background. The neural-net variable incorporates information used in neither of the other two methods.

The results from each method are interpreted in the context of both the CMSSM and a so-called simplified model, T3w, which has a minimal SUSY particle spectrum. The CMSSM limits exclude gluino masses up to approximately 1.3 TeV in the part of the parameter space in which m 0<800 GeV, but the bound gradually weakens for larger values of m 0. For the T3w simplified model, we obtain cross section upper limits as a function of gluino mass. Finally, we provide an approximate model of our signal efficiency that can be used in conjunction with a simple pythia simulation to determine whether other models can be probed by these data.