Multi-channel search for squarks and gluinos in sqrt(s)=7 TeV pp collisions with the ATLAS detector

A search for supersymmetric particles in final states with zero, one, and two leptons, with and without jets identified as originating from b-quarks, in 4.7 fb-1 of sqrt(s)=7 TeV pp collisions produced by the Large Hadron Collider and recorded by the ATLAS detector is presented. The search uses a set of variables carrying information on the event kinematics transverse and parallel to the beam line that are sensitive to several topologies expected in supersymmetry. Mutually exclusive final states are defined, allowing a combination of all channels to increase the search sensitivity. No deviation from the Standard Model expectation is observed. Upper limits at 95% confidence level on fiducial cross-sections for the production of new particles are extracted. Results are interpreted in the context of the constrained minimal supersymmetric extension to the Standard Model and in supersymmetry-inspired models with diverse, high-multiplicity final states.


Introduction
One of the most promising extensions of the Standard Model, supersymmetry (SUSY) [1][2][3][4][5][6][7][8][9], has been the target of a large number of searches at the LHC. Prompted by the large predicted production cross-section of coloured SUSY particles (sparticles), ATLAS and CMS have performed inclusive searches for strongly produced squarks and gluinos, the superpartners of quarks and gluons [10][11][12][13][14][15][16][17]. Assuming R-parity conservation [18][19][20][21][22], these sparticles are produced in pairs and decay into energetic jets, possibly leptons, and the lightest SUSY particle (LSP, typically the lightest neutralinoχ 0 1 ), which escapes detection and results in missing transverse momentum. For these searches, the selections adopted to discriminate the signal processes from the background typically include requirements on the missing transverse momentum (E miss T ) and the scalar sum of transverse momenta of all selected physics objects (H T ) plus the scalar E miss T . This paper presents a search for strongly produced sparticles that makes use of a variety of final states including high transverse momentum jets and zero, one, or two leptons (electrons or muons). The events are also separated according to the presence of a jet identified as originating from a b-quark (b-tagged jets). Several mutually exclusive search channels are defined, facilitating a simultaneous search in all of the typical final states and increasing the search sensitivity. The search employs a set of observables, called the "razor variables" [23], which make use of both longitudinal and transverse event information. These variables were first employed in SUSY searches by CMS [24,25]. Their use makes this search sensitive to different regions of kinematic phase space relative to the other E miss T -based searches. Thus, these search results complement those already performed by ATLAS. This paper is organised as follows. The main features of the ATLAS detector are presented in Sect. 2. Section 3 introduces the razor variables. Section 4 describes the data sample, basic event selection, and the Monte Carlo simulation used to model the data. Section 5 defines the basic physics objects and event-level variables that are used through the analysis. The search technique is described in Sect. 6, and the background estimation is presented in Sect. 7. The performance of the search and interpretation of the results are presented in Sect. 8. Finally, Sect. 9 includes a summary of the analysis and of its findings.

ATLAS detector
The ATLAS detector comprises an inner tracking detector, a calorimeter, and a muon system [26]. The inner detector includes a silicon pixel detector, a silicon microstrip detector, and a transition radiation tracker. It is immersed in a 2 T axial field and precisely measures the tracks of charged particles in the pseudorapidity region 1 |η| < 2.5. The calorimeter covers the region |η| < 4.9 and is divided into electromagnetic and hadronic compartments. The electromagnetic calorimetry in the central (|η| < 3.2) region is provided by liquid argon sampling calorimeters with lead absorbers. In the barrel region (|η| < 1.4), the hadronic calorimetry is provided by scintillator tiles with steel absorbers, and the more forward (1.4 < |η| < 3.2) region is covered by a liquid argon and copper sampling hadronic calorimeter. The forward calorimetry (|η| > 3.2) uses liquid argon and copper or tungsten absorbers. The muon spectrometer covers |η| < 2.7 and includes a system of air-core toroidal magnets. A variety of technologies are used to provide precision muon tracking and identification for |η| < 2.7 and rapid response for triggering for |η| < 2.4.
ATLAS uses a three-tier trigger system to select events. The first-level (L1) trigger is hardware-based and only uses coarse calorimeter information and muon system information. The calorimeter information available at the lowest level includes basic objects with rough calibration and simple identification of electromagnetic objects (electrons and photons) as distinct from hadronic objects (jets). The second-level (L2) trigger and eventfilter (EF) compose the software-based high-level trigger (HLT), in which full event reconstruction is run, similar to that used offline, in order to accurately identify and measure objects. The L2 only examines η/φ regions that triggered the L1. The EF fully reconstructs events that pass L2.

Razor variable definitions
Searches for sparticles in R-parity-conserving scenarios generally make the assumption that the sparticles are pair-produced and decay subsequently to an LSP that is invisible in the detector. The heavy sparticles produced 1 ATLAS uses a right-handed coordinate system with its origin at the nominal interaction point (IP) in the centre of the detector and the z-axis along the beam pipe. The x-axis points from the IP to the centre of the LHC ring, and the y-axis points upward. Cylindrical coordinates (r, φ) are used in the transverse plane, φ being the azimuthal angle around the beam pipe. The pseudorapidity is defined in terms of the polar angle θ as η = − ln tan(θ/2). are either the same type of particle (pair-production) or are at the same mass scale. Thus, the production and visible energy in the decays are fairly symmetric. Most analyses make use of the transverse balance of typical pp collision events, or exploit the event symmetry in the transverse plane. The razor variables attempt to also include longitudinal information about the event by making several assumptions motivated by the kinematics of the models of interest.
In the rest frame of each heavy sparticle, called the R-frame, the sparticle decays are symmetric. In an attempt to reconstruct the primary produced sparticle pair, the razor calculation clusters all final-state particles into a pair of objects with four-momenta called "mega-jets." Each of these mega-jets is associated with one of the two SUSY decay chains and represents the visible energy-momentum of that produced sparticles. All possible combinations of the four-vectors of the visibly reconstructed/selected objects (signal jets and leptons) are considered when constructing the two megajets. The pair of mega-jets, j 1 and j 2 , that minimises the sum of the squared masses of the four-vectors is selected. Following the prescription in Ref. [23] and for consistency with Ref. [24], all jets and the mega-jets are forced to be massless by setting their energy equal to the magnitude of their three momenta. Studies indicate that neither this choice nor the mega-jet selection, based on minimizing the mega-jet mass squared, have a significant impact on the reach of the razor-based search.
In the R-frames, each heavy sparticle should be nearly at rest with some mass m Heavy . The sparticle decay may then be approximated as a two-body decay to some visible object (a mega-jet) and the invisible, stable LSP. The final visible decay products (i.e. the final-state quarks and gluons, or the observable jets and leptons) have masses far below the SUSY mass scale and can therefore be approximated as being massless. Then the energy of each mega-jet in the R-frame, E 1 and E 2 , becomes: where m LSP is the mass of the LSP. This leads to a characteristic mass, M R , in the R frame of M R = 2 × E 1 = 2 × E 2 , which for m Heavy m LSP is identical to m Heavy . Therefore, in events where heavy particles are pair-produced, M R , which is a measure of the scale of the heaviest particles produced, should form a bump. In tt or W W events, for example, the characteristic mass M R ≈ m top or m W . The product of M R and the Lorentz factor for the boost from the lab to R-frame, M R = γ R × M R , is useful for characterisation of the sparticle mass scale, in part because of its close relation to m Heavy , and in part because Standard Model backgrounds tend to have small values of M R . When expressed in terms of the mega-jet quantities in the lab frame, the expression is given by: where j i,E and j i,z are the energy and longitudinal momentum, respectively, of mega-jet i. The transverse information of the system is taken into account by constructing a transverse mass for the mega-jets, assuming half of the E miss T is associated with each jet: where E miss T is the two-dimensional vector of the E miss T in the transverse plane. When an event contains "fake" E miss T from a detector defect or mismeasurement, the system will tend to have back-to-back mega-jets. In such cases, the vector sum of the two mega-jet momenta will be small. If, on the other hand, there is real E miss T , the mega-jets may not be back-to-back and may even point in the same direction. In these cases, the vector sum, and thus M R T , will have a large value. M R T is another measure of the scale of the event that only uses transverse quantities in contrast to longitudinal quantities in M R .
Finally a razor variable is defined to discriminate between signal and background: This variable takes low values for multijet-like events and tends to be uniformly distributed between 0 and 1 for sparticle decay-like events, providing good discrimination against backgrounds without genuine E miss T . The impact of some important experimental uncertainties, like the jet energy scale uncertainty, are reduced in this ratio. In an analysis based on the razor variables, a cut on R can be used to eliminate these backgrounds before a SUSY search is made in the distribution of the variable M R .

Data and Monte Carlo samples
The data included in this analysis were collected between March and October 2011. After basic trigger and data quality requirements, the full dataset corresponds to 4.7 ± 0.2 fb −1 [27,28].
Events in the zero-lepton channels are selected using a trigger that requires a jet with transverse momentum p T > 100 GeV at L1. In the event filter, H T > 400 GeV is required, where H T is calculated through a scalar sum of the p T of all calorimeter objects with p T > 30 GeV and |η| < 3.2. With the exception of a cross-check of the multijet background estimate, which uses prescaled single-jet triggers, this trigger requirement is fully efficient for the offline selection used in the analysis.
The one-and two-lepton channels make use of the lowest-p T single-lepton triggers available for the entire running period. The muon triggers require a muon with p T > 18 GeV, and the electron triggers require an electron with p T > 22 GeV. Offline, the leading lepton in the event is required to have p T > 20 GeV (p T > 25 GeV) if it is a muon (electron), in order to ensure that the triggers are fully efficient with respect to the offline event selection. For the two-lepton analysis, where there are overlaps in the triggers, the electron trigger takes priority over the muon trigger.
Offline, an event is required to have at least one vertex with at least five tracks associated to it, each with p track T > 400 MeV. This requirement reduces cosmic ray and beam-related backgrounds. The primary vertex is defined as the one with the largest (p track T ) 2 of the associated tracks. Events that suffer from sporadic calorimeter noise bursts or data integrity errors are also rejected.
Monte Carlo (MC) simulated events were used to develop the analysis and assist in estimations of background rates. All MC samples are processed through ATLAS's full detector simulation [29] based on Geant4 [30], which was run with four different configurations corresponding to detector conditions of four distinct operating periods of 2011. The fractions of MC simulation events in these four periods match the fractions of data in each period. During the data collection, the average number of proton-proton collisions per bunch crossing in addition to the one of interest ("event pileup" or simply "pile-up") increased from approximately two to twelve. To mimic the effect of pile-up, additional inelastic proton-proton collisions are generated using Pythia [31] and overlaid on top of every MC event. Within each period, the profile of the average number of events per bunch crossing ( µ ) is re-weighted to match the data in that period. The same trigger selection is applied to the MC simulation events, which are then passed through the same analysis code as the data. Reconstruction and trigger efficiency scale factors are applied to the MC simulation in order to take into account small discrepancies between the data and the MC simulation. Table 1 lists the major backgrounds along with the chosen estimation method (described in Sect. 7) and the primary and alternative MC generators used in this analysis. In all cases, MC@NLO and Alpgen are interfaced to Herwig and Jimmy for the parton shower, hadronisation, and underlying event modelling. The multijet background is normalised to the leading order generator cross-section predicted by Pythia. The tt production cross-section of 166.8 pb is calculated at approximate NNLO in QCD using Hathor [32] with the MSTW2008 NNLO PDF sets [33]. The calculation is cross-checked with an NLO+NNLL calculation [34] implemented in Top++ [35]. The single-top production cross-sections are calculated separately for s-channel, t-channel, and W t production at NNLO [36][37][38].
The W and Z (including Drell-Yan with m > 40 GeV) production cross-sections of 10.46 nb and 0.964 nb are calculated at NNLO using FEWZ [39]. For the production of vector bosons in association with heavy flavour, in accordance with ATLAS measurements [40], the production cross-section for W +b and W + cc are scaled by 1.63, and the cross-section for W +c is scaled by 1.11 compared to the NLO cross-section [41]. Additional uncertainties on the production of W and Z bosons in association with heavy flavour of 45 % for W +b and W + cc, 32 % for W + c, and 55 % for Z + bb are included. Alpgen describes the jet multiplicity and inclusive M R distributions well, but it does not correctly model the vector boson p T distribution. Therefore, the boson p T in the Alpgen samples is re-weighted according to the distribution produced Sherpa. Half of the difference between the weight and unity is applied as a systematic uncertainty on the re-weighting procedure. Further systematic uncertainties on the shapes of Alpgen samples are derived by systematically varying the generator parameters, including matching and factorisation scales. Diboson production cross-sections of 44.92 pb, 17.97 pb, and 9.23 pb for W W , W Z, and ZZ (including off-shell production with m > 12 GeV) are calculated at NLO using MCFM [42]. In order to avoid low-mass resonances, all dilepton events are required to have the invariant mass m > 20 GeV. These cross-sections provide the starting normalisations for all background processes.
Two SUSY-inspired simplified models are used for the interpretation of the results from this search. The first considers gluino pair-production, with the gluino decaying to a tt pair and the LSP via an off-shell stop. This model is generated using Herwig++ [49], with the gluino and LSP masses being the only free parameters. The top quarks are required to be on-shell, limiting the mass splitting between the gluino and the LSP to greater than 2 × m top .
The second considers gluino pair-production, with the gluino decaying to two quarks and a chargino via an off-shell squark. The chargino then decays to a W boson and the LSP. The free parameters of this model are the masses of the gluino, chargino, and LSP. For convenience, two two-dimensional planes are generated: one with the chargino mass exactly between the masses of the gluino and the LSP and one with the mass of the LSP fixed to 60 GeV. Because initial-state radiation can be important for the acceptance of these models when the mass splitting between the gluino and LSP is small, this model is generated using MadGraph [50] with at most one additional jet in the matrix element. Pythia is used for the parton shower and hadronisation. Systematic uncertainties on matrix element matching and initial-state radiation modelling are included, leading to 20 % uncertainties for small mass splittings and small gluino masses, but no uncertainty for mass splittings above 200 GeV and masses above 400 GeV.
Additionally, the results are interpreted in terms of SUSY signal models based on the constrained minimal supersymmetric model (CMSSM or MSUGRA) [18][19][20][21][22]. The parameters of this model are the high-energy-scale universal scalar mass, m 0 , the universal gaugino mass, m 1/2 , the ratio of the vacuum expectation values of the two Higgs fields, tan(β), the tri-linear coupling strength, A 0 , and the sign of the Higgsino mass parameter, µ. Samples are generated in a two-dimensional grid of the m 0 -m 1/2 parameters where tan(β) = 10 and A 0 = 0 are fixed and µ is set positive. This MC data grid is generated using Herwig++ [49], with a more dense population of points at low mass. IsaSUSY [51] is used to run the high-energy-scale parameters down to the weak-scale.
Signal cross-sections are calculated to next-to-leading order in the strong coupling constant, adding the resummation of soft gluon emission at next-to-leadinglogarithmic accuracy (NLO+NLL) [52][53][54][55][56]. The nominal cross-section and the uncertainty are taken from an envelope of cross-section predictions using different PDF sets and factorisation and renormalisation scales, as described in Ref. [57]. For each of these signal models, the luminosity systematic uncertainty of 3.9 % [27,28] and statistical uncertainty, typically of order 10 %, is included. Table 1 Background estimation methods, primary and alternative MC event generators, and normalisation uncertainties for each of the major backgrounds. The backgrounds are constrained using various Control Regions (CRs) that are enriched in certain samples (see Sect. 6). The tt background estimate includes small contributions from ttW and ttZ, generated with MadGraph. The diboson W W background estimate also includes W ± W ± jj generated with MadGraph. The last column of the table indicates the uncertainty on the normalization in the simultaneous fit used to test signal hypotheses. "None" indicates that the normalization is fully constrained in the fit. The grouping indicates the samples that are collected together in the fit. Within a group, the relative normalizations are fixed.

Physics object identification and selection
Events are categorised into six exclusive samples defined by the presence of zero, one, or two leptons, with or without b-tagged jets. The particle candidate selections that define these samples are referred to as the "baseline" object selection. Since a particle may simultaneously satisfy multiple particle hypotheses (e.g. electron and jet), an overlap removal procedure (described below) assigns a unique interpretation to each candidate. The selections are then further refined to define "signal" candidates with higher signal-to-background ratio in each sample. Baseline electrons are required to have E T > 10 GeV, be within the fiducial acceptance of the inner detector (|η| < 2.47), and pass a version of the "medium" selection criteria [58] updated for 2011 running conditions, which requires hadronic calorimeter energy deposition and a calorimetric shower shape consistent with an electron and a match to a good quality inner detector track. Signal electrons are required to be isolated from other objects and satisfy "tight" selections. The tight selection applies stricter track quality and matching than medium and ensures the number of hits in the transition radation tracker is consistent with the electron hypothesis. The isolation requirement is that the sum of the p T of all charged particle tracks associated with the primary vertex within ∆R = 0.2, where ∆R = (∆η) 2 + (∆φ) 2 , of the electron is less than 10 % of the electron E T . In the leptonic channels, if the leading lepton in a data event is an electron, it is additionally required to match to an EF trigger electron. MC simulation events are re-weighted to compensate for mis-modelling of the single-lepton trigger efficiency. The energy of electrons in simulated events is also smeared prior to object selection in order to repro-duce the resolution in Z and J/ψ data. Finally, in order to account for differences in electron reconstruction efficiency, η-and E T -dependent scale factors, derived from Z, W , and J/ψ events in the data, are applied to each simulated electron satisfying overlap removal and selection requirements.
Baseline muons are reconstructed as either a combined track in the muon spectrometer and inner detector, or as an inner detector track matching with a muon spectrometer segment [59]. Tracks are required to have good quality, and the muon is required to have p T > 10 GeV and |η| < 2.4. Signal muons are required to be isolated by ensuring that the sum of the p T of all charged particle tracks associated with the primary vertex within ∆R = 0.2 of the muon is less than 1.8 GeV. Matching to EF trigger muons in data, MC event trigger re-weighting, muon momentum smearing, and MC/data efficiency scaling are performed in a similar way for muons as electrons (described above) [60][61][62].
Calorimeter jets are reconstructed from topological clusters of energy deposited in the calorimeter calibrated at the electromagnetic (EM) scale [63] using the anti-k t jet algorithm [64,65] with a four-momentum recombination scheme and a distance parameter of 0.4. Jets reconstructed with an EM-scale p T > 7 GeV are calibrated to the hadronic scale (particle level) using p T and η-dependent factors, derived from simulation and validated with test beam and collision data [66]. In order to remove specific non-collision backgrounds, events are rejected if they contain a reconstructed jet that does not pass several quality and selection criteria [66]. Signal jets are selected if they lie within |η| < 2.5 with a jet vertex fraction (JVF) of at least 75 %, where the JVF is the fraction of summed p T of the tracks associated with the jet that is carried by tracks consistent with the primary vertex of the event, thus associating the jet with the pp collision of interest. Jets are tagged as heavy flavour using the combined neural network "jet fitter" algorithm [67] with the 60 % efficiency working point. Scale factors for heavy flavour jets are used in MC simulation in order to reproduce the expected b-jet identification performance in data.
In order to ensure that objects are not double counted, overlaps between objects are removed using a hierachical procedure. If any two baseline electrons lie within a distance of ∆R = 0.1 of one another, the electron with the lower calorimeter E T is discarded. Next, jets passing basic selections are required to be at least 0.2 units away from all surviving baseline electrons in ηφ. Electrons are then required to be at least 0.4 units away from surviving jets. Finally, in order to mitigate the effect of jets which have deposited significant energy in the muon spectrometer on mass measurements and reduce the number of events with badly measured missing transverse momentum, muons with p T > 250 GeV within ∆R = 0.2 of a jet with p T > 500 GeV are removed. A negligible number of events in the data are removed by this cut.
Following these overlap removal procedures, the missing transverse momentum and razor variables are calculated. The determination of the missing transverse momentum uses all baseline electrons with E T > 20 GeV, all baseline muons, all calibrated jets with p T > 20 GeV, and EM scale topological calorimeter clusters not belonging to any object. Note that in the MC simulation, objects enter this calculation after the energy or p T smearing described above.
In counting leptons for event classification, baseline electrons and muons are then required to be at least 0.4 units away from all good jets in η-φ. If an electron and muon are separated by ∆R cone < 0.1, neither is counted.
In order to remove events with large missing transverse momentum due to cosmic rays, events are vetoed if they contain a muon in which the transverse and longitudinal impact track parameters are greater than 0.2 mm and 0.1 mm with respect to the primary vertex, respectively. Also vetoed are events with badly measured, non-isolated muons. These muons with large momentum uncertainties are rare in both the signal and background events and can have significant impact on the E miss T and razor variables. During a portion of the run period, a hardware failure resulted in a region of the calorimeter not being read out. For data collected during this period, and for a corresponding fraction of the MC samples, events are rejected if they fail the "smart LAr hole veto" [13]. This ensures that if an event contains one or more jets point-ing to the dead region and those jets may contribute substantially to the missing transverse momentum in the event, the event is discarded.
Signal regions are defined after all overlap removal is complete. Events with no baseline leptons and events with the highest-p T lepton below the leading lepton requirement (25 GeV for electrons, 20 GeV for muons) are accepted into the zero-lepton regions. Events with one leading lepton satisfying all requirements, including that on leading lepton p T (above), and no other baseline leptons with p T > 10 GeV are accepted into the one-lepton regions. Events with exactly one additional signal lepton above 10 GeV and no other baseline leptons are accepted into the two-lepton regions.

Search technique
After sorting events into the six samples described in the previous section, each sample is further divided in the R-M R plane into control regions (CR), which are choosen so that they are dominated by a specific background, and signal regions (SR). Additionally, validation regions (VR) are constructed, which do not constrain the background but are used to evaluate the agreement between data and MC simulation. Table 2 lists these regions, which are also visualised in the R-M R plane in Fig. 1. These regions are binned in either R or M R and then simultaneously fit to MC estimates for background and signal rates with correlations from sample to sample and region to region taken into account. The hadronic (had.) and one-lepton signal regions are divided into events with and without b-tagged jets ("b-tag" and "b-veto," respectively). The two-lepton events are divided into regions with opposite-sign (OS) and same-sign (SS) leptons and regions with opposite-flavour (OF) and same-flavour (SF) leptons. While some background components are sufficiently constrained by the CRs to be left free in the fit, others are constrained to estimates derived from other techniques or MC simulation. Table 1 summarises the backgrounds, the estimation technique, the source of the estimate, and the normalization uncertainty used in the fit. Finally, systematic uncertainties on all backgrounds are included as nuisance parameters. The result is a maximum likelihood fit that encapsulates all knowledge about the background and signal consistently across all channels.
When evaluating a signal sample for exclusion, any signal contamination in the control regions is taken into account for each signal point, as the control region fits are performed for each signal hypothesis. Separately, each signal region (one at a time), along with all control regions, is also fit under the background-only hypothesis. This fit is used to characterise agreement in each signal region with the background-only hypothesis and to extract fiducial cross-section limits and upper limits on the production of events from new physics (N BSM ).
The fit considers several independent background components: tt and single top. A total of five top control regions are defined in the one-and two-lepton channels. The normalisation of this component is allowed to vary freely in the fit. -Bosons, except diboson W W . The inclusion of the W Z and ZZ diboson samples is motivated by the dominance of leptonic Z decays in the two-lepton signal regions, the dominance of Z → νν in the zero-lepton signal regions, and the dominance of W Z → νqq in the one-lepton signal regions. In all of these cases, the experimental uncertainties affect the samples in the same way as they do W +jets or Z+jets, and therefore they are combined in order to treat them as fully correlated. The normalisation of this sample is allowed to vary freely in the fit. Independent validation of the Z+jets background is carried out in two-lepton control regions. The agreement is good between data and MC simulation in both normalisation and shape. -Diboson W W . This sample is constrained with a 30 % cross-section systematic uncertainty. The constraint is necessary because of the relatively small contribution of the sample in most signal and control regions and because no W W -dominated control region can be constructed; if the background were allowed to vary freely, then the fit may find a minimum with an unreasonably large or small contribution from diboson W W events and hide some other effect with an artificial W W normalisation. -Charge flip. Charge mis-identification can occur due to physical effects, like lepton Bremsstrahlung, and detector effects, especially for high-p T leptons with almost straight tracks. These effects generate background in the same-sign dielectron and electronmuon channels. This background is negligable in the dimuon channel, where the contribution from both physical and detector effects is far smaller. The electron charge-flip rate is measured as a function of η in the data [68], allowing MC simulation to model the lesser dependence on p T . These charge-flip rates are applied to opposite-sign MC simulation events, providing an estimate of the overall contribution from charge flip in these channels. The electron p T is additionally shifted and smeared to mimic the effect of charge mis-identification. This shift in the p T is propagated through to the razor variables. The un-certainty from the charge flip probabilities dominates the uncertainty of this background. -Fake leptons. The multijet background in the onelepton signal regions, as well as the W +jets, semileptonic tt, and multijet background in the twolepton signal regions, comes predominantly from hadrons faking electrons and muons. This background is estimated using the "matrix method" [13,68], using the number of baseline leptons not passing signal lepton requirements. The efficiency for a real lepton passing the baseline lepton requirements to pass the signal lepton requirements is estimated using Z MC simulation events. The rejection rate for fake leptons is estimated in data, using samples enriched in fake leptons. For electrons, the factors are derived and applied separately for inclusive samples of events and samples requiring a b-tagged jet. Because this background accounts for all fake background, MC events in the one-lepton (two-lepton) channels are required to have at least one (two) prompt lepton(s) from a τ lepton, W boson, Z boson, or sparticle. The uncertainty on this background estimate has a statistical component from the number of events in the control region and a systematic component from the uncertainty on the scale factors. Some fraction of the events with same-sign, baseline leptons in the data may be due to charge flip. Thus, the matrix method overestimates somewhat the fake lepton background in the dilepton channels. In order to correct for this overlap, opposite-sign events in data containing baseline leptons that do not pass the signal lepton requirements are used. Each event is assigned a weight representing the likelihood of that event being subject to charge mis-identification. The weighted events are then presented as a negative component to the same-sign fake background distribution, such that the contribution to the same sign fake background from originally oppositely charged leptons is subtracted. -Multijets in zero-lepton channel. Two specific control regions constrain this background, and its normalisation is allowed to vary freely in the fit. Several different approaches are used to cross-check this estimate. Prescaled single jet triggers are used to construct independent multijet-enriched control regions at low M R that is free from the inefficiency of the H T -based trigger. The observed number of events in this region are then projected into the signal region using transfer factors from MC simulation. Alternatively, in order to model the mis-measurement of jets in the calorimeter, jets in events collected with these single-jet triggers are smeared according to response functions estimated using data [10]. Both of Table 2 Background control, signal, and validation regions. All signal regions include the overflow in the highest bin. "N/A" means that there is no requirement. The binning of the validation regions does not affect the results, so they are listed as "N/A." The systematic effects included as nuisance parameters in the fit are: the jet energy scale and resolution uncertainties; b-tagging uncertainties; uncertainty on the MC simulation modelling of the JVF; the additional cross-section uncertainty on the production of heavy flavour in association with a vector boson; the uncertainties on trigger efficiency and matching and reconstruction efficiency; a systematic uncertainty on the re-weighting of the W -boson p T ; uncertainties on the missing transverse momentum pile-up dependence and the calibration of energy not associated with an object in the event; the matrix method statistical and systematic uncertainties; the charge flip systematic uncertainties; the diboson W W shape systematic uncertainty taken from comparing Herwig to Alpgen. Where the systematic uncertainties affect object definitions, corrections are propagated to the missing transverse mo-mentum and razor variable calculations. The effects of other uncertainties on the final results are negligible. These uncertainties affect the signal yield and shape in the signal regions, as well as the allowed variation in signal-region background estimates after the control region constraints. In most signal regions, the jet energy scale uncertainty is the dominant experimental uncertainty (from 10 % to 25 %). Figure 2 shows the distributions of M R and jet multiplicity in the zero-lepton multijet control region with a b-tagged jet requirement, with results from the fit to the control regions overlayed. By design, the multijet background is dominant in these regions. The small contribution from tt and W +jets backgrounds are constrained by other control regions in the simultaneous  fit. The hatched area indicates the total systematic uncertainty after the constraints imposed by the fit.

Background fit
The distributions of R and jet multiplicty for the backgrounds after the control region fit in the W → µν+jets control region are shown in Fig. 3. The control region at low R is dominated by fake backgrounds, and at moderate-to-high R they are dominated by W +jets.
The use of an alternate control region with a cut on transverse mass, which significantly reduces the fake contribution, results in a negligible change in the final search results.  trol region by approximately 15-20 % with respect to the unmodified expectation from MC simulation. This shift predominantly affects the semi-leptonic tt background. In the two-lepton analysis, there is a significant contribution to the background expectation from Z-boson events with heavy flavour, particularly at low E miss T . The lowest M R bin shows the most significant disagreement, which demonstrates the importance of shape profiling by binning the control regions. Although in that lowest bin, particularly in the two-muon channel, the MC simulation underestimates the amount of data, a single-binned normalisation of the tt background would result in an overestimation of the background at high M R . The distributions of missing transverse mo-mentum are also shown for the two-lepton control regions.
The distributions of M R for the two Z+jets control regions are shown in Fig. 6. In both the electron and muon channel, good agreement after the control region fit is visible.
The charge flip background is significant in the samesign two-electron channel. Figure 6 also shows the distribution of M R in a charge flip enriched control region. There remain significant uncertainties on the background even after the control region fit, since it is dominated by charge flip and fake leptons, both of which have large systematic uncertainties associated with them. The distribution of dilepton mass is also shown.  Fig. 4 The distribution of R in the one-lepton tt control regions (dots with error bars), the expectation after the control region fit for various backgrounds (filled), and the systematic uncertainty (hatched). Table 3 Results of the background-only fit to the control regions in the zero-and one-lepton control regions, for an integrated luminosity of 4.7 fb −1 . The errors shown are the statistical plus systematic uncertainties.

Control region
Had.  The contributions to each of the control regions before and after the fit to the control regions are shown in Tables 3 and 4.
Various tests of the fit are carried out in order to ensure its stability. As a test of the multijet background constraint and the validity of fitting the M R distributions in those control regions, the control region fit is instead performed in the number of jets with p T > 30 GeV. The p T cut is raised from the baseline selection to make the fit less sensitive to pile-up effects. The expectation for the multijet background in the signal regions is consistent with the main result.
The yields and shapes of the validation regions show good agreement with the Standard Model expectation. The significance of the deviation of the observation from the expectation in each of the signal and validation regions are shown in Figure 7. There is some tension between the same-flavour and opposite-flavour dilepton tt validation regions, but there is no indication of a systematic mis-modelling of any of the major backgrounds. The yields of all validation regions are within 1.2σ of the SM expectations.
The numbers of expected events in each signal region before and after the fit to the control regions are shown in Tables 5 and 6. Additionally, the probability (p 0 -value) that a background-only pseudo-experiment is more signal-like than observed is given for each individual signal region. To obtain these p 0 -values, the fit in the signal region proceeds in the same way as the control-region-only fit, except that the number of events     Fitted background events 5.5 ± 1.5 39 ± 7 10 ± 2 6.6 ± 1.7 5.5 ± 1.7 4.4 ± 1.3 Fitted top events 0.40 ± 0.14 21 ± 3 2.7 ± 0.9 5.0 ± 1.3 1.7 ± 0.6 3.7 ± 1.1 Fitted W /Z events 4.9 ± 1. observed in the signal region is included as an input to the fit. Then, an additional parameter for the non-Standard-Model signal strength, constrained to be nonnegative, is fitted. The shape of the distributions in the signal region is neglected in this fit. Therefore, in order to provide tighter constraints on non-Standard-Model production, in some of the high-count signal regions the M R requirements are tightened. The significance of the excess is given, along with the model-independent upper limit on the number of events and cross-section times acceptance times efficiency from non-Standard-Model production. The distributions in all signal regions as a function of M R of background expectations after the fit to the control region has been performed are shown in Figs. 8 and 9. No significant deviations from the expected background are found. The most significant excess is 1.50 standard deviations from the expectation, in the one electron, b-tagged jet signal region.

Exclusion results
Using these signal regions, the CL S [69] prescription is applied to find 95 % Confidence Level (CL) one-sided limits on the production of SUSY events in various models. A likelihood is constructed, taking into account signal shape information provided by the binning of the signal regions. All fitted nuisance parameters, with their correlations, are included in the likelihood. Because the typical M R of the signal may vary across a signal grid, the use of shape information results in an observed exclusion that is not consistently above or below the expected. The observed limits for the separate zero-, one-, and two-lepton signal regions are also constructed additionally, with all control regions included as constraints. Figure 10 shows exclusion contours for a simplified model with gluino pair-production, where the gluinos decay to a chargino and two quarks and the chargino Table 6 Results of the background-only fit to the control regions in the two-lepton signal regions, for an integrated luminosity of 4.7 fb −1 . The errors shown are the statistical plus systematic uncertainties. The p 0 -values and significances are given for single-bin signal regions, along with the 95 % Confidence Level upper limit on the number of and cross-section for non-Standard-Model production within each signal region.    subsequently decays to a W boson and the LSP. Two planes are shown for this simplified model. The first fixes the chargino mass to be exactly between the LSP and gluino mass and shows the exclusion in the gluinomass-LSP-mass plane. The production cross-section falls smoothly and exponentially with m heavy , while M R and therefore the acceptance times efficiency for a signal region typically rises with the mass splitting, m heavy − m LSP . In the second plane, the LSP mass is fixed to 60 GeV and the exclusion is shown in the gluino mass-x plane, where x = (m chargino − m LSP )/(m gluino − m LSP ). The zero-and one-lepton signal regions with a b-tagged jet requirement do not contribute to the exclusion because these simplified models have only light quarks in the matrix element final state.

Region Deviation
At high x, although the leptons have high-p T , the larger branching fraction of the W to quarks allows the zero-lepton channel to dominate. At moderate x, the leptons allow better discrimination between signal and background in the one-and two-lepton channels. At low x, the leptons have too low p T , and the zero-lepton channel again dominates. At high x, the limit set by this analysis exceeds somewhat that of the dedicated 0-lepton and 1-lepton ATLAS searches [10,13], while at low x the limit is weaker. This asymmetry is produced by the difference in kinematics in these two regions of the plane. At high x, the charginos are almost at rest in the lab frame, and the event topology is dominated by a two-body decay,χ ± 1 → W ±χ0 1 . At low x, on the other hand, the chargino is highly boosted, and the topology is dominated by a three-body decay,g → qq χ ± 1 . Thus, the high-x events have a higher typical R than the lowx events, and the two have approximately the same M R distribution. Figure 11 shows exclusion contours in simplified models with gluino pair-production, where the gluinos decay to the LSP via the emission of a tt pair. The exclusion is presented in the gluino mass-LSP mass plane,  Fig. 9 The two-lepton signal regions for same-flavour (top) and opposite-flavour (bottom) leptons of the same sign (left) and opposite sign (right), after the fit to the control regions has been performed. and, since all top quarks are required to be on-shell, only points with m gluino > m LSP + 2 × m top are considered. The zero-and one-lepton signal regions with a b-tagged jet veto do not contribute to the exclusion, because these models include four top quarks per event. At small mass splitting, the limits here are somewhat stronger than the ATLAS multijet analysis [11], though both are exceeded by those of the dedicated multi-bjet analysis [12]. The three b-tagged jet requirement suppresses the background substantially while preserving the signal acceptance because of the four tops in the event. The combined limit on LSP mass falls more quickly than that of the multi-b-jet analysis because M R is proportional to the mass splitting in the event, here m gluino − m LSP . The zero-lepton razor analysis is limited in this case by the use of the H T trigger, which was chosen to avoid a bias in the M R distribution.

ATLAS
Finally, Figure 12 shows exclusion contours in a plane of MSUGRA with tan(β) = 10, A 0 = 0, and µ > 0. At low m 0 , where squark pair-production is dominant, the zero-lepton channel dominates the exclusion, although it is affected somewhat by the jet multiplicity requirement that is not applied in the dedicated signal region of Ref.
[10], which therefore has more stringent limits. The leptonic channels enter at high m 0 , particularly where longer decay chains are common. The robustness of the individual limits have also been cross checked by removing some of the control regions. For example, removing the zero-and one-lepton control regions from the calculation of the two-lepton limit, the MSUGRA limit changes by less than 20 GeV in m 1/2 . These limits are consistent with those of earlier ATLAS analyses [10,11,13] in the m gluino ≈ m squark region. In this region, the single mass-splitting scale of the main strong production modes should produce a somewhat sharper peak in M R , allowing an improved limit in the shape fit. At large m 0 , the high M R requirement of the all-hadronic signal regions, resulting from the H T   trigger use, produce a somewhat weaker limit than the ATLAS multijet analysis [11].
The complementarity of a search using razor variables can be quantified by studying the overlap of the signal regions with the dedicated searches. Various signal models have been studied to understand this overlap, including both simplified models and full SUSY production models. The overlap between the signal regions presented here and other searches in ATLAS [10, 11, 13, 70] is typically 10-50 %, with similar overlaps in the data. The signal regions of this search access kinematic regions that are different from those of the standard searches. In simplified models in particular, the overlap between the dominant signal regions in the standard ATLAS analyses and the signal regions presented here is below 10-15 %. Thus, the regions of SUSY parameter space and kinematic phase space excluded by this search complement those excluded by earlier AT-LAS searches using the same data sample.