Searches for scalar leptoquarks and differential cross-section measurements in dilepton-dijet events in proton-proton collisions at a centre-of-mass energy of $\sqrt{s}$ = 13 TeV with the ATLAS experiment

Searches for scalar leptoquarks pair-produced in proton-proton collisions at $\sqrt{s}=13$ TeV at the Large Hadron Collider are performed by the ATLAS experiment. A data set corresponding to an integrated luminosity of 36.1 fb$^{-1}$ is used. Final states containing two electrons or two muons and two or more jets are studied, as are states with one electron or muon, missing transverse momentum and two or more jets. No statistically significant excess above the Standard Model expectation is observed. The observed and expected lower limits on the leptoquark mass at 95% confidence level extend up to 1.25 TeV for first- and second-generation leptoquarks, as postulated in the minimal Buchm\"uller-R\"uckl-Wyler model, assuming a branching ratio into a charged lepton and a quark of 50%. In addition, measurements of particle-level fiducial and differential cross sections are presented for the $Z\rightarrow ee$, $Z\rightarrow\mu\mu$ and $t\bar{t}$ processes in several regions related to the search control regions. Predictions from a range of generators are compared with the measurements, and good agreement is seen for many of the observables. However, the predictions for the $Z\rightarrow\ell\ell$ measurements in observables sensitive to jet energies disagree with the data.


Introduction
Leptoquarks (LQs) are features of a number of extensions of the Standard Model (SM) [1][2][3][4][5][6][7][8] and may provide an explanation for the similarities between the quark and lepton sectors in the SM. They also appear in models addressing some of the recent b-flavour anomalies [9][10][11]. They are colour-triplet bosons with fractional electric charge and possess non-zero baryon and lepton number [12]. Scalar and vector LQs have been proposed and are expected to decay directly into lepton-quark pairs. The lepton can be either electrically charged or neutral.
A single Yukawa coupling, λ LQ→ q , determines the coupling strength between scalar LQs and the leptonquark pair [13]. Two additional coupling constants due to magnetic moment and electric quadrupole moment interactions are needed for vector LQs [14].
LQs can be produced both singly and in pairs in proton-proton (pp) interactions; diagrams showing representative single-and pair-production processes are shown in Figure 1. The single-LQ production cross-section depends on λ LQ→ q whereas the pair-production cross-section is largely insensitive to this coupling. For pp interactions with a centre-of-mass energy √ s = 13 TeV, gluon fusion represents the dominant pair-production mechanism for LQ masses below around 1 TeV. The contribution of the qq-annihilation process rises with LQ mass. Only scalar LQ production is considered in this paper because this is less model dependent than vector LQ production. The production of vector LQs depends on additional parameters and a full interpretation in such models is beyond the scope of this analysis. The results obtained here can, however, be regarded as conservative estimates of limits on vector LQ production, since the production cross-section for vector LQs is typically much larger than for scalar LQs, while the kinematic properties used to search for their signature are very similar for both spin hypotheses [8]. The benchmark signal model used in this paper is the minimal Buchmüller-Rückl-Wyler (BRW) model [15]. This model imposes a number of constraints on the properties of the LQs. Couplings are purely chiral and LQs belong to three families, corresponding to the three SM generations, such that only leptons and quarks within a given generation can interact. The requirement of same-generation interactions excludes flavour-changing neutral currents (FCNC) [16]. The branching ratio (BR) of a LQ into different states is taken as a free parameter. In this paper, β denotes the BR for a LQ decay into a charged lepton and quark, LQ → ± q. The BR for a LQ decay into a neutrino and quark is (1 − β).
The decay of pair-produced LQs can lead to final states containing a pair of charged leptons and jets (LQLQ → − q +q ), or one charged lepton, a neutrino and jets (LQLQ → − qνq). The four search channels in this paper correspond to the above decays. The eejj (µµjj) channel comprises exactly two electrons (muons) and at least two jets, and the eνjj (µνjj) channel requires exactly one electron (muon), missing momentum in the transverse plane and at least two jets. The electron and muon channels are treated separately as they correspond to LQs of different generations. The dilepton and lepton-neutrino channels are combined in order to increase the sensitivity to the parameter of the BRW model, β. While the main target of this paper are first-and second-generation LQs, the data selections are chosen such as to retain a certain level of sensitivity to third-generation LQs as well. There are also dedicated searches looking for 3rd generation LQ production [20,30].
The main background processes in this search are Z/γ * +jets, W+jets and tt production. The shapes of these background distributions are estimated from Monte Carlo (MC) simulations while their normalisation is obtained from a simultaneous background-only profile likelihood fit to three data control regions (CRs). The CRs are defined with specific selections such that they are orthogonal to the signal regions (SRs), free from any LQ signal of interest, and enriched in their relevant background process. For the particular benchmark model chosen, further signal and background discrimination could be achieved by exploiting flavour tagging in order to reject events with b-hadrons in the search region. This is not used, in order to maintain sensitivity to a LQ coupling to a b-quark. The shapes and normalisations of other (subdominant) backgrounds are estimated from MC simulation. Background contributions from misidentified leptons are also sub-dominant, and are estimated from data when their contribution is significant.
Several variables that provide separation between signal and background processes are combined into a single discriminant using a boosted decision tree (BDT) method [31].
For each LQ-mass hypothesis and for each lepton flavour, two BDTs are used: one for the dilepton channel and one for the lepton-neutrino channel. The BDT output score is binned and the signal is accumulated at high score values. A single bin at a high score defines the SR for the respective mass hypothesis. The bin range is chosen such that a sensitivity estimator based on the expected number of signal and background events is maximised.
A profile likelihood fit is performed simultaneously to both the dilepton and lepton-neutrino channel for each lepton flavour to extract a limit on the signal yield. This is done for different assumptions of LQ mass and branching ratio β, resulting in exclusion bounds in terms of these parameters.
In addition to the search results, this paper presents a set of particle-level fiducial and differential crosssection measurements in six dilepton-dijet regions which are related to the analysis CRs, which are defined in detail in Section 5. The measurements are made for the dominant process in each region (Z+jets or tt) exclusively. The motivation for making these measurements is as follows. Search CRs are typically in kinematic regions which are not directly covered by dedicated measurements of SM processes. For instance, the ATLAS measurement of Z + jets production described in Ref.
[32] has much looser selections on the transverse momentum (p T ) of leptons and jets than the region that includes Z bosons for this search. Event generators are tuned and validated to ensure that they agree with the available SM data. However, in some cases, although good agreement is seen for the regions covered by dedicated SM measurements, significant disagreements remain possible in more extreme regions where no measurements are available to validate the generator predictions. Indeed, although no disagreement was seen when validating generated Z+jets samples against various SM measurements [33], significant disagreement was observed in the Z+jets CR for this search. Extracting particle-level cross-sections in these CRs and related more extreme regions of phase space can therefore provide complementary information to validate the performance of event generators. Such measurements may lead to improved background modelling in future searches with final states using similar CRs (see for example Ref. [34]). The particle-level selection and the measurement procedure are described in more detail in Section 6.
The paper is organised as follows. Section 2 gives an overview of the ATLAS detector, while Section 3 summarises information on the data set and simulations used. The physics objects used in the analysis are defined in Section 4. The background estimation and CRs used in the process are described in Section 5, and Section 6 provides details on the cross-section extraction. Section 7 gives an overview over the multivariate analysis employed for the search and the SRs defined with it. The systematic uncertainties are summarised in Section 8, and the statistical procedure employed for the search is described in Section 9. Results from both the measurement and the search are given in Section 10. Section 11 concludes the paper.

The ATLAS detector
The ATLAS experiment [35] is a multipurpose detector with a forward-backward symmetric cylindrical geometry and nearly 4π coverage in solid angle. 1 The three major subcomponents are the tracking detector, the calorimeter and the muon spectrometer. Charged-particle tracks and vertices are reconstructed by the inner detector (ID) tracking system, comprising silicon pixel and microstrip detectors covering the pseudorapidity range |η| < 2.5, and a transition radiation straw-tube tracker that covers |η| < 2.0 and provides electron identification. The ID is immersed in a homogeneous 2 T magnetic field provided by a solenoid. Electron, photon, and jet energies are measured with sampling calorimeters. The calorimeter system covers a pseudorapidity range of |η| < 4.9. Within the region |η| < 3.2, barrel and endcap highgranularity lead/liquid argon (LAr) electromagnetic calorimeters are deployed, with an additional thin LAr presampler covering |η| < 1.8, to correct for energy loss in material upstream of the calorimeters. Hadronic calorimetry is provided by a steel/scintillator-tile calorimeter, segmented into three barrel structures within |η| < 1.7, and two copper/LAr hadronic endcap calorimeters. The forward region (3.1 < |η| < 4.9) is instrumented with a LAr calorimeter with copper (electromagnetic) and tungsten (hadronic) absorbers. Surrounding the calorimeters is a muon spectrometer (MS) with air-core toroid magnets to provide precise muon identification and momentum measurements, consisting of a system of precision tracking chambers providing coverage over |η| < 2.7, and detectors with triggering capabilities over |η| < 2.4. A two-level trigger system [36], the first level using custom hardware and followed by a software-based level, is used to reduce the event rate to a maximum of around one kHz for offline storage.

Data sample
This analysis is based on proton-proton collision data at a centre-of-mass energy of √ s = 13 TeV, collected at the LHC during 2015 and 2016. After imposing requirements based on beam and detector conditions and data quality, the data sample corresponds to an integrated luminosity of 36.1 fb −1 . A set of triggers is used to select events [36][37][38]. For the dielectron channel, a two-electron trigger with a transverse energy (E T ) threshold of 17 GeV for each electron was used for the entire data set. In the electron-neutrino channel, events were selected by either of two single-electron triggers, as described below. For the 2015 data set, one trigger required E T above 60 GeV. A second trigger with a higher threshold of 120 GeV but looser identification requirements otherwise was used. For the 2016 data set, the main difference is that the second trigger had a threshold of 140 GeV. The trigger efficiency is at least 95% for electrons above threshold for the kinematic region used in this work.
In the muon channel, the same triggers were used for the µµjj and the µνjj channels. Events were selected by either of two single-muon triggers. For the 2015 data, one trigger required a transverse momentum of at least 26 GeV and applied an isolation criterion. To maintain a high efficiency at high p T , a second trigger with a threshold of 50 GeV and no further requirements was used. The main difference for the 2016 data set is that slightly different isolation requirements were used for the lower-p T trigger. The thresholds were the same as in 2015. For the offline p T values considered in this analysis, the trigger efficiencies have reached their plateau values, which vary between 50% and 80% in the more central detector region and are around 90% for |η| > 1.05.
Multiple pp interactions in the same or neighbouring bunch-crossings can lead to many reconstructed vertices in the beam collision of interest. The primary vertex of the event, from which the leptons are required to originate, is defined as that with the largest sum of squared transverse momenta of its associated tracks. Selected events must contain a primary vertex with at least two associated tracks with p T > 0.4 GeV.

Signal and background simulations
Samples of simulated events with pair-produced scalar LQs with masses between 200 GeV and 1700 GeV, in steps of 50 GeV up to 1500 GeV and thereafter 100 GeV, were generated at next-to-leading order (NLO) in QCD [39]  for the event simulation beyond the hard scattering process, i.e. the parton shower, hadronisation and underlying event, collectively referred to as UEPS. The A14 set of tuned parameters (tune) [43] was used for the UEPS modelling. The NNPDF3.0 NLO [44] parton distribution function (PDF) set was used. The coupling λ LQ→ q , which determines the LQ lifetime and width [13] was set to √ 4πα, where α is the fine-structure constant. This value corresponds to a LQ full width of about 0.2% of its mass; LQs can thus be considered to decay promptly. Samples were generated for β with a value of 0.5.
Events containing W or Z bosons and associated jets [45] were simulated with the S 2.2.1 generator [46]. Matrix elements were calculated in perturbative QCD for up to two partons at NLO and four partons at leading order (LO) using the C [47] and O L [48] matrix element generators. Merging with the S UEPS model was performed with the ME+PS@NLO method [49]. The NNPDF3.0 PDF set [50] at next-to-next-to-leading order (NNLO) was used. Simulated processes in which a Z boson decays into leptons are hereafter termed Drell-Yan processes.
For the simulation of tt and single-top-quark production in the Wt final state and s-channel the P -B 2 [51][52][53][54]  Diboson processes with at least one charged lepton in the final state were simulated using the S 2.1.1 generator. All diagrams with four electroweak vertices were considered. They were calculated for up to one parton at NLO and up to three partons at LO using the C and O L matrix element generators and merged with the S UEPS model using the ME+PS@NLO prescription. The CT10 PDF set was used in conjunction with dedicated parton shower tuning developed by the S authors. The generator cross sections, calculated up to NLO, were used.
To model the effect of multiple proton-proton interactions in the same or neighbouring bunches (pile-up), simulated inclusive proton-proton events were overlaid on each generated signal and background event. The pile-up was simulated with P 8.186 using tune A2 [60] and the MSTW2008LO PDF set [61]. Simulated events were corrected using per-event weights to describe the distribution of the average number of interactions per proton bunch-crossing as observed in data.
The detector response to events in the SM background samples was evaluated with the G 4-based detector simulation [62,63]. A fast simulation employing a parameterisation of calorimeter response [64] and G 4 for the other detector components was used for the signal samples. The standard ATLAS reconstruction software was used for both simulated and pp data. Furthermore, correction factors, termed scale factors, which were derived from data were applied as event weights to the simulation of the lepton trigger, reconstruction, identification, isolation, and impact parameter selection, and of b-tagging efficiencies.

Object selection
Electrons are reconstructed from ID tracks which are matched to energy clusters found in the electromagnetic calorimeter. The reconstruction efficiency is higher than 97% for candidates with p T greater than 30 GeV. An object is identified as an electron following requirements made on the quality of the associated track, shower shapes exploiting the longitudinal segmentation of the electromagnetic calorimeter, leakage into the hadronic calorimeter, the quality of the track-to-cluster matching and measurements of transition radiation made with the TRT [65]. The transverse energy of the electron candidates in the ee j j (eν j j) channel must exceed 40 (65) GeV. Only electron candidates in the pseudorapidity region |η| < 2.47 and excluding the transition region between the barrel and endcap EM calorimeters (1.37 < |η| < 1.52) are used. The electron identification uses a multivariate technique to define different working points of selection efficiency. In the dielectron channel, the 'medium' identification working point with an efficiency above 90% for the candidates considered in this analysis is used. To achieve better rejection against jets misidentified as electrons, a tighter selection is used in the electron-neutrino channel. For the electron-neutrino channel, the 'tight' identification working point is used with an efficiency of 85% or more for candidates with p T above 65 GeV. The electrons are required to be isolated, using both calorimeter-and track-based criteria such that the isolation efficiency is 98%. Finally, the electron candidates have to be compatible with originating from the primary vertex.
Muon tracks are reconstructed independently in the ID and the MS [66]. Tracks are required to have a minimum number of hits in each system, and must be compatible in terms of geometrical and momentum matching. Information from both the ID and MS is used in a combined fit to refine the measurement of the momentum of each muon over |η| < 2.5 [67]. The efficiency for reconstructing muons is 98%. As for the electron channels, muon candidates are required to have p T > 40 GeV and p T > 65 GeV for the µµjj and µνjj searches respectively. They are further required to be compatible with originating from the primary vertex. A track-based isolation requirement is applied to the muons, yielding a selection efficiency of 99%.
Jets are reconstructed using the anti-k t algorithm [68] with a radius parameter R = 0.4 from topological clusters of calorimeter cells which are noise-suppressed and calibrated to the electromagnetic scale. They are calibrated using energy-and η-dependent correction factors derived from simulation and with residual corrections from in situ measurements [69]. The jets must satisfy p T > 60 GeV and |η| < 2.5. Additional jet quality criteria are also applied to remove fake jets caused by detector effects [70]. Furthermore, to eliminate jets containing a large energy contribution from pile-up, jets are tested for compatibility with the hard scatter vertex with a jet vertex tagger discriminant, utilising information from the ID tracks associated with the jet [71].
Jets containing b-hadrons (b-jets) are identified with an algorithm which is based on multivariate techniques. The algorithm combines information from the impact parameters of displaced tracks and from topological properties of secondary and tertiary decay vertices reconstructed within the jet. A working point at which the b-tagging efficiency is around 77% for jets originating from a b-quark is chosen, as determined with a MC simulation of tt processes [72, 73]. Scale factors needed to match the MC performance to data are compatible with unity [74].
The missing transverse momentum is defined as the negative vector transverse momentum sum of the reconstructed and calibrated physics objects, plus an additional soft term [75,76]. The soft term is built from tracks that are not associated with any reconstructed electron, muon or jet, but which are associated with the primary vertex. This procedure makes this component of the missing transverse momentum less sensitive to the effects of pile-up. The absolute value of the missing transverse momentum is denoted by E miss T . In the ν j j channel, an E miss T -related quantity, calculated as S = E miss T / p j1 T + p j2 T + p T is used to further reduce backgrounds from objects wrongly reconstructed as leptons. Here, p j1 T (p j2 T ) is the p T of the leading (subleading) jet and p T is the p T of the charged lepton. Ambiguities in the object identification which arise during reconstruction, i.e. when a reconstructed object can match multiple object hypotheses (electron, muon, jet), are resolved in several steps. First, electrons are removed if they share a track with a muon. An algorithm is also used to remove jet-lepton ambiguities based on the proximity of identified leptons and jets.

Control regions and background estimation
The major SM background processes to the LQ signal correspond to the production of Z/γ * +jets, W+jets and tt events in which at least one top quark decays semileptonically. These backgrounds are determined using data in selected CRs, as described below. Subdominant contributions arising from diboson, single-top, W → τν and Z → ττ production and are estimated entirely from simulation. There are also background contributions due to objects being misidentified as leptons or non-prompt leptons that are produced in the decay of hadrons. These backgrounds are collectively referred to as the fake (lepton) background and are estimated in a data-driven way where relevant.
Three CRs, independent of the SRs used for the leptoquark search (see Section 7), in which any high-mass LQ signal contributions are expected to be negligible are defined in order to estimate the main backgrounds. First, the Z/γ * +jets control region (Z CR) in the dilepton channels is defined by restricting the dilepton invariant mass, m , to values between 70 GeV and 110 GeV. These CRs contain samples of Z/γ * +jets events of purity 92% (94%) for the electron (muon) channel. Next, the W control region (W CR) in the lepton-neutrino channels requires the transverse mass2, m T , to be between 40 GeV and 130 GeV. In addition, E miss T has to be greater than 40 GeV and S > 4. Events that contain one or more b-jets are vetoed to reduce the tt contribution. The purity of the W CR is 74% (80%) for the electron (muon) channel. To improve the description of data, a reweighting of the Z/γ * +jets and W+jets predictions by S 2.2.1 is performed. The weights in this procedure are parameterised as a function of the dijet mass, m j j , in the respective CR, and applied to both CRs and SRs (see Section 7). Figure 2 shows distributions of m j j and leading jet p T in the Z CR for the µµjj channel before and after reweighting. The data are better described by the Monte Carlo simulations following this reweighting. The total uncertainty (statistical, systematic and theory) is shown. The total uncertainty for CR distributions is dominated by the uncertainty in the theoretical predictions which is correlated between bins. Experimental uncertainties from jet energy scale and jet energy resolution are typically around 1%. Similarly, the S 2.2.1 predictions of W+jets require reweighting to match the data. The functional form of the reweighting algorithm is similar to that used for the Z/γ * +jets sample. A systematic uncertainty is included to account for the reweighting procedure (Section 8). In the statistical method used to evaluate the exclusion limits which is described in Section 9, the normalisations of background estimates are adjusted in the fitting procedure. These normalisations are not applied to any of the distributions shown prior to that section. Finally, the tt control region (tt CR) for the µνjj and eνjj channels is defined with the same requirements on m T and E miss T as the W CR, but here the presence of at least two b-jets is required. The purity of the tt CR is 86% (89%) for the electron (muon) channel. The CRs are summarised in Table 1. The Z CRs and other measurement regions are used to extract particle-level differential cross-sections (see Section 6).
This analysis makes use of the matrix method [77] to estimate the background from misidentified electrons for the first-generation LQ search. This background is negligible for the second generation search. The matrix method is based on the estimation of the probabilities for prompt electrons and fake electron candidates that pass identification and isolation selections that are looser than the nominal selection described in Section 4, to also pass the nominal selection. These probabilities are referred to as the prompt rate and fake rate, respectively. In the loose selection, several criteria that are used to suppress fake contributions in the nominal selection are relaxed. In particular, no isolation is required in the loose selection. The prompt rate is estimated to a good approximation from MC simulations, while the fake rate 2 The transverse mass is defined as m T = 2 · p T · E miss T · (1 − cos(∆φ( , E miss T ))) where ∆φ( , E miss T ) is the azimuthal separation between the charged lepton and the E miss T .

Common selections
is estimated from a data sample enriched in fake backgrounds. To suppress contributions from prompt electrons in this sample, events are rejected if they contain at least two electron candidates that pass the 'medium' identification criteria. In both channels, events are also rejected if there are two electron candidates that fulfil the 'loose' object definition specific to the respective channel and have an invariant mass in an interval of ±20 GeV around the Z boson mass.
Backgrounds with non-prompt muons arise from decays of heavy-flavour hadrons inside jets. Their contribution to the dimuon final states is negligible as in the search documented in Ref.
[78]. In leptonneutrino final states, a contamination of about 5% was found [77], corresponding to about half the size of the fake electron background. Tests showed that a contamination of 5% would not alter the results of this analysis, and therefore this background is neglected. The uncertainty in the total background estimation due to neglecting this background is taken to be half the size of the background from fake electrons.
In Figure 3, the distributions of m min LQ and the dilepton invariant mass in the Z CR are shown for the µµjj and eejj channels. The quantities m min LQ ane m max LQ are defined as the lower and higher of the two invariant masses which can be reconstructed using the two lepton-jet pairs in the dilepton channels. In the analysis, the lepton-jet pairing is chosen such that the absolute mass difference between the two LQ candidates is minimised. The full set of experimental and theoretical uncertainties is considered for the uncertainty band. The spectra correspond to the predictions prior to the fit described in Section 9.
Distributions for the eνjj and µνjj channels are similarly shown for the W and tt CRs in Figures 4 and 5, respectively. The E miss T and m T spectra are shown in Figure 4, while the m j j and m T spectra are shown in Figure 5. The data and the predictions in the various CRs are in agreement within uncertainties (described in Section 8) following the reweighting of the S 2.2.1 predictions for Z/γ * +jets and W+jets processes. After the reweighting, the overall normalisation of the simulation is in very good agreement with the data in the V+jets CRs.

Extraction of particle-level cross-sections from the measurement regions
Measurements of the particle-level fiducial and differential cross-sections are made in several regions related to the search CRs described in the previous section. These measurement regions (MRs) correspond 13 tot. unc. [GeV] [GeV] ATLAS tot. unc.

ATLAS
[GeV]  2: Requirements at particle-level for all measurement regions (MRs) where particle-level differential crosssections are extracted. The purity refers to the proportion of the particle-level yield of the MR which is accounted for by the dominant process. The variable S T refers to the scalar sum of the p T of the jets and leptons.
MR Dominant process (purity) Required leptons and jets m selection to kinematic regions where new physics signals have already been excluded. They are chosen because several searches with related final states use similar selections as CRs (see for example Ref. [34]).

Particle-level objects
In order to extract fiducial and differential cross-sections in the MRs, particle-level object selections are defined, and chosen to be as close as possible to their detector-level counterparts described in Section 4.
Prompt dressed3 particle-level electrons and muons are used to define the fiducial region. They are required to have p T > 40 GeV and |η| < 2.5.
Particle-level jets are formed by clustering stable4 particles using the anti-k t (R = 0.4) algorithm [68], excluding muons and neutrinos. Particle-level jets are required to have p T > 60 GeV and |η| < 2.5. The electrons used in the jet clustering are not dressed. To match the requirements of the detector-level jet-lepton ambiguity resolution procedure, any particle-level leptons within ∆R < 0.4 from a selected jet are removed.

Measurement region definitions
The first two MRs are identical to the eejj and µµjj Z CRs described in Table 1. The third is an eµjj region where the leptons are required to have different flavours, and no requirement on the dilepton mass is applied. Finally, three additional regions are defined as for the eejj, µµjj and eµjj MRs, and labelled as extreme with the additional requirement that the scalar sum of the p T of the leptons and the two leading jets (S T ) be above 600 GeV. The requirements for events entering all MRs are summarised in Table 2.
The MRs are defined both at detector level and particle level, where the requirements are identical but use the objects passing the detector-level and particle-level selections respectively. The m j j reweighting is also applied to events from the simulated Drell-Yan sample at particle level, where the particle-level value of m j j is used to calculate the weight. In each MR, the cross-section measurements are made for dominant processes. The contributions from all other processes are subtracted from the data yield, as explained in Section 6.3.

Particle-level measurement method
Each MR is designed such that it is dominated by a single SM process (accounting for more than 85% of the yield, see Table 2), for which the cross-section can be measured exclusively. A simple unfolding procedure to particle-level, based on bin-by-bin correction factors, is used. In this method, the particle-level differential cross-section for the dominant process p in bin i of the distribution of variable X can be expressed as: where the sum runs over all sub-dominant SM processes which contribute to the yield, N i is the number of events observed in the bin, R p i (T p i ) is the yield at detector level (particle level) for process p, w i is the width of the bin and L is the integrated luminosity of the data sample. The bin-by-bin correction procedure doesn't account for migration between bins due to resolution effects when going from particle-level to detector-level distributions. For this reason, the binning of the distributions is chosen to be much broader than the experimental resolution to minimise the migration between bins. This binning is optimised for the measurements to ensure that at least 90% of events passing the particle-level selection remain in the same bin at detector level (for events passing both selections). This binning is not used for the search.
Differential cross-section measurements are made for the following observables: p T of the dilepton system; ∆φ between the leptons; minimum ∆φ between lepton and leading jet; minimum ∆φ between lepton and subleading jet; S T ; leading jet p T ; subleading jet p T ; ∆φ and ∆η between the leading and subleading jets; scalar sum of p T of leading and subleading jets (H T ); and invariant mass of the dijet system.

Multivariate analysis and signal regions
In order to dscriminate between signal and background in the LQ search, the TMVA [31] implementation of a BDT is used. A number of variables are expected to provide discrimination between signal and background. Generally, the p T of the leptons and jets as well as related variables such as the sum of p T will possess higher values for the signal than for the background, in particular for high LQ mass. Since the signature arises from the decay of parent LQs with well-defined masses, mass-sensitive discriminating variables are further candidates for BDT input variables. In the dilepton channels, the lepton-jet masses m min LQ and m max LQ are reconstructed, as described in Section 5. In the lepton-neutrino channels, the mass of one LQ (m LQ ) and the transverse mass of the second LQ (m T LQ ) can be reconstructed. The transverse mass is defined as is the azimuthal angle between the jet and the direction of missing transverse momentum vector. The procedure is analogous to that used in the dilepton channel; both possible pairings of jet and lepton (or E miss T ) are tested and the pairing that results in the smaller absolute difference between m LQ and m T LQ is chosen. Events with negative weights in the simulation are ignored in the training, after verifying that the distributions of important discriminating variables do not differ beyond statistical uncertainties between the events with positive and negative weights, respectively. All background processes are used in the BDT training. The training is done for each LQ mass hypothesis for which signal simulations were produced. To ensure independence from the CRs, in the dilepton channel, only events with a dilepton invariant mass above 130 GeV are considered in the training. Similarly, only events with a transverse mass greater Table 3: BDT input variables in the dilepton and lepton-neutrino channels.

Channel
Input variables GeV are considered in the lepton-neutrino channel. Here, to further suppress the fake-electron contribution, it is required that E miss T > 150 GeV and that S > 3. These requirements define training regions (TRs) and are also applied to define the SRs. The latter regions are selected according to the following criteria: m > 130 GeV ( j j) and m T > 130 GeV, E miss T > 150 GeV, S > 3 ( ν j j). Distributions of variables used for training are shown in Appendix B. Following losses due to acceptance selections and detector inefficiencies, the fraction of events containing pair-produced LQs is estimated from the simulation to rise from less than 1% at low masses (around 200 GeV) to 60-70% for masses around 1500 GeV.
To ensure that no bias is introduced, two BDTs are used for each mass point; one is trained on one half of the events and evaluated on the other half, and vice versa for the second BDT. Different methods of dividing the simulation samples into training and test samples were tested and compared. Table 3 lists the variables that were found to give optimal sensitivity, ordered by their discriminating power in the BDT. In the dilepton channel, these are both reconstructed LQ candidate masses, m min LQ and m max LQ , the dilepton invariant mass, m , the subleading jet p T (p j2 T ) and the subleading lepton p T (p 2 T ). The lepton-neutrino channel also makes use of both of the LQ mass variables, m LQ and m T LQ , and the other variables are the transverse mass, m T , the missing transverse momentum, E miss T , the subleading jet p T , and the charged-lepton p T , p T . It was checked and confirmed that not only the individual input variables but also their pairwise correlations are modelled well in the simulation. Output distributions for the BDT discriminant in the TRs are shown in Figure 6 for a signal sample of LQ mass 1.3 TeV. The contribution of a LQ of mass 1.3 TeV, normalised to the production cross-section, is also shown for the four channels. The background includes the reweighted predictions of Z/γ * +jets and W+jets. An increased separation between signal and background at higher values of the BDT output is seen.
In addition to the selections described earlier, the SRs for this analysis are defined separately for the dilepton and lepton-neutrino channels as a single bin in the output score distribution of the respective BDT. The bin range is determined for each mass point separately by maximising the quantity [79] Z = (s + b) log(1 + s/b) − s, where s and b are the signal and background expectations, respectively. In this optimisation, a number of further requirements are placed on the background expectations: there have to be at least two background events in the chosen region; the statistical uncertainty of the number of background events estimated by the simulation has to be less than 20%; if less than ten background events are expected, the statistical uncertainty is required to be less than 10%. This is done to avoid selecting bins with very low background expectation and thus to increase the fit stability.
In total, there are 58 SRs in the electron and the muon channel each: two SRs ( j j and ν j j) for each of the 29 mass hypotheses considered.

Experimental uncertainties
The uncertainty in the combined 2015+2016 integrated luminosity is 2.1%. It is derived, following a methodology similar to that detailed in Ref. [80], and using the LUCID-2 detector for the baseline luminosity measurements [81], from calibration of the luminosity scale using x-y beam-separation scans.
An uncertainty due to the reweighting of the vertex multiplicity to describe pile-up is estimated by scaling the average number of interactions per bunch-crossing in data by different factors.
The major sources of uncertainties affecting the electron measurement derive from the electron energy scale and resolution measurements. The analysis also considers uncertainties due to the modelling of the efficiencies of the four electron selection criteria: trigger, reconstruction, identification and isolation.
Sources of experimental uncertainties related to muon reconstruction and calibration include uncertainties in the determination of the MS momentum scale, MS momentum resolution, ID momentum resolution and additional charge-dependent corrections. Furthermore, uncertainties in the determination of the four efficiency scale factors, introduced at the end of Section 3.2, for trigger, identification, isolation and track-to-vertex matching are taken into account.
There are two main sources of uncertainty related to the jet reconstruction: jet energy scale (JES) and jet energy resolution (JER). Another source of jet-related uncertainty corresponds to corrections made for the b-tagging efficiency. The uncertainty due to the JES is largely derived from various in situ techniques. Four components of this uncertainty are considered.
The uncertainties due to the electron, muon and jet energy scale and resolution are propagated to the estimation of E miss T . In addition, there is an uncertainty from the soft term, which has three components: one corresponding to the E miss T scale and two components for the E miss T resolution uncertainty. The resolution uncertainty is split into components parallel and perpendicular to an axis in the transverse plane which is defined along the direction of a vectorial sum of all the hard objects in the event (electrons, muons and jets).
An uncertainty in the fake rate of electrons is determined by varying the MC prediction that is used to remove the contributions of prompt electrons from the fake-enriched sample by 30%. The effect is propagated to the fake background estimate. The uncertainty due to neglecting the muon fake background is taken to be half the size of the electron fake background, see Section 5.

Theoretical uncertainties
The S 2.2.1 V+jets samples include event weights reflecting variations of the nominal PDF set (NNPDF3.0) and the use of two different PDF sets: MMHT2014NNLO68CL [82] and CT14NNLO [83]. The NNPDF intra-PDF uncertainty is estimated as the standard deviation of the set of 101 NNPDF3.0 eigenvectors. This procedure takes into account the effect of varying α S by ±0.001 around its nominal value of 0.119.
The envelope of the differences between the nominal NNPDF set and the other two PDF sets is used as an additional uncertainty.
The samples include weights for seven sets of variation of the renormalisation (µ ) and factorisation (µ ) scale, i.e. varying the scales up and down by a factor of two, either together or independently. The envelope of all these variations is taken as an estimate of the scale uncertainty.
A further uncertainty from the reweighting of the V+jets simulations in m j j is considered. The full difference between the reweighted and the unweighted distributions is taken as an estimate of the uncertainty induced by the reweighting.
The uncertainties in the modelling of the production of tt are assessed from a number of alternative simulation samples. Differences between different generators and different models for fragmentation and hadronisation are considered. In addition, parameters affecting initial-and final-state radiation are also varied. For the tt sample the default PDF set is NNPDF3.0 and the PDF uncertainty is estimated from the PDF4LHC15 prescription [84].

Uncertainties for the search
As described in Section 9 three CRs are fit simultaneously to obtain normalisation factors. Sources of theoretical uncertainty described above are taken into account in this procedure. Furthermore, theory uncertainties in the SRs correspond to variations in the output BDT spectra and are thus taken into account in the final limit setting calculation.
The electron energy scale and reconstruction affect the total background yield in the SRs for the electron channels by 2-14%. Similarly uncertainties due to the muon reconstruction and calibration lead to background uncertainties of between 2 and 20% for the muon channels. The impact of uncertainties related to the JES (JER) lead to uncertainties on the predicted background of 5-20% (2-20%). The effect of the soft-term uncertainties in the E miss T estimations leads to variations of the background estimate from less than 1% to 15%. Generally, the experimental uncertainties in the signal yields in the SRs are not larger than 2%.
Uncertainties of approximately 100% for the fake electron background are estimated at low values of lepton-jet mass in the TR and about 20% at masses above 1 TeV. The statistical uncertainty in the electron fake rate determination is below 10%. As discussed earlier, the contribution from fake muons is typically half the size of the electron fake background, and this is used as a (conservative) estimate of the uncertainty in the total background due to neglecting the muon fake contribution.
The final modelling uncertainties in the background yields for Z+jets and W+jets are around 2-10%, with the scale uncertainty typically being the dominant source. Signal predictions have an uncertainty of a similar size. The background from tt production is affected by a theoretical unceratinty of up to 50%, the large value arising due to the statistical imprecision of the MC samples.

Uncertainties for the cross-section measurements
The uncertainties described below are those which apply to the particle-level cross-section measurements in the MRs, in addition to the relevant uncertainties from the previous sections. As is described in Section 6, the cross-section calculation involves the subtraction of non-dominant processes from the observed data, and the application of bin-by-bin correction factors. The experimental uncertainties play a role in both of these steps, since they affect the detector-level yield of both the non-dominant processes (when subtracting from data) and the dominant process (when calculating the correction factors). The dominant experimental uncertainties are the JES and the lepton efficiencies, which have effects of up to 5% and 2% on the fiducial cross-sections respectively. Theory and modelling uncertainties in the dominant process affect both the particle-level and detector-level yields, and therefore largely cancel out when calculating the correction factors T/R (see Eq. 1).
The theory uncertainties in the dominant processes in the Z+jets and tt MRs therefore have only a small impact (≤ 2%) on the fiducial cross-sections. The theory uncertainties in the estimation of non-dominant processes generally have a minimal impact since these processes usually account for only small fractions of the MR yield. The exception is the case of the tt background to the Z MRs, which also contribute a small but non-negligible theory uncertainty.
In addition, bin-by-bin correction uncertainties are derived by comparing the nominal correction factors with those obtained using re-weighted V+jets and alternative tt samples. The size of these modelling uncertainties ranges between 0.5% and 4% of the fiducial cross-sections.
The overall uncertainty of the fiducial cross-sections, accounting for all sources of systematic uncertainty, is between 5% and 9% depending on the MR.

Statistical procedure
The results of the analysis are interpreted using a profile likelihood method. The signal and backgrounds are described by a binned probability density function (p.d.f.) built using either the three CRs described in Section 5, or the two SRs, one in the dilepton channel and one in the lepton-neutrino channel, as described in Section 7. All CRs and SRs consist of only a single bin.
The CRs are enriched in a specific background process and have a negligible signal contamination; they are thus used to normalise the predicted backgrounds to data, whereas the SRs drive the signal extraction. The p.d.f. for the fit to the SRs includes the signal strength, i.e. the scaling factor with respect to the SM predictions for the signal cross-section, as the parameter of interest. Systematic uncertainties are incorporated in the p.d.f. as statistical nuisance parameters. They are introduced as normalisation uncertainties, i.e. they can only affect the relative size of a given background in different regions while preserving the overall number of expected events, in order to remove strong correlations with the normalisation factors for the main backgrounds that are extracted from the CRs and which are used to scale MC yields to match data. Experimental uncertainties are treated as being fully correlated between different physics processes and fit regions. Theoretical or modelling uncertainties are treated as correlated in different fit regions, but uncorrelated between processes. The statistical uncertainties of the MC samples are included in the p.d.f. as nuisance parameters by constraint terms described by a Γ distribution.
The fit is performed in two stages: first, only the CRs are included in the fit in order to extract the normalisation factors for the three main backgrounds. These normalisation factors are then applied to the main backgrounds in the second step, which is a fit to only the SRs. The uncertainty in the normalisation factors from the CR fit is introduced in the form of Gaussian nuisance parameters. The best-fit central values and constraints on other nuisance parameters are not transferred from the CRs to the SRs. The optimal value and error of the signal strength and nuisance parameters as well as their correlations are determined simultaneously when the p.d.f. is fitted to data.
For some systematic uncertainties, certain adjustments are made before they are incorporated in the fit, as described in the following. Uncertainties larger than 50% are capped at 50%, because their large size is due to the statistical uncertainty of simulation samples used to estimate them. This applies in particular to the theoretical uncertainties in the tt estimation. Uncertainties with very asymmetric up and down variations are symmetrised, using the larger of the two variations. In the SR fit, experimental uncertainties that before normalisation have an effect smaller than 2% on the total background or the signal expectation are not considered in the fit for the background or the signal, respectively. Other nuisance parameters are not considered for certain background processes if their effect is less than 2% on that process before normalising the uncertainty.

Results of the cross-section measurements
The measured fiducial particle-level cross-sections are shown in Table 4 for each of the six MRs. The values in all MRs are found to agree with the generator predictions within uncertainties. The uncertainties in the measured cross-sections are in all cases dominated by experimental uncertainties (in particular the jet energy scale). The theory uncertainties in the generator predictions are dominated by variations in the cross-sections, arising from QCD scale variations in the phase-space regions of the measurements. In both the standard and extreme MRs, the measured fiducial cross-sections in the Z MRs are found to agree between electron and muon decay modes, as expected. Examples of the particle-level differential cross-sections are shown in Figures 7 and 8, where they are compared with the cross-sections predicted from the nominal simulation without the m j j reweighting. Predictions from alternative generators are also shown in each case. Differential cross-sections for a larger list of variables are included in Appendix A and in the HEPData [85,86] record for this paper.
For all regions, the nominal generator predictions are found to agree with the measured fiducial crosssections. In the eejj and µµjj MRs (and their extreme counterparts) the predicted differential cross-sections for variables involving jet energies exhibit a degree of mis-modelling by the S 2.2.1 simulation. This can be seen in Figures 7 and 8, for the leading jet p T as an example. Similar mis-modelling is seen for m j j , H T , subleading jet p T , and S T differential measurements. All other quantities are shown to be well modelled by the nominal S 2.2.1 simulation. In all cases, the prediction of an alternative generator, M G 5_ MC@NLO (with events generated at NLO accuracy for Z + 0, 1, 2 jets, showered using P 8 with the A14 tune, and for which different jet multiplicities were merged using the FxFx prescription [87]), is also shown. Typically, the M G 5_ MC@NLO prediction is also discrepant with the data for the same observables, but often in a direction opposite to that of the S 2.2.1 prediction. This suggests that the discrepancies are caused by choices of the generator parameters, which would have been tuned in a different phase-space region given by previous Z + jets measurements. The measurements provided by this paper will therefore help to validate new tunes of the generators to ensure better agreement in this region for future searches.
For the eµjj MRs, the predictions from the nominal P simulation are found to agree with the data for all observables aside from the dilepton p T , where a slight mis-modelling is observed in the high tail, as can be seen in the examples shown in Figures 7 and 8. The predictions of an alternative generator (M G 5_ MC@NLO) are also shown, and these do agree with the data for this observable. This points to the generator tuning choice as the source of the disagreement.  Figure 7: Examples of the measured particle-level differential cross-sections in the eejj, µµjj and eµjj measurement regions. The data are unfolded bin-by-bin exclusively to the dominant process in each channel (Z → ee, Z → µµ and tt, respectively). The MC prediction for the dominant process is also shown, with no m j j reweighting applied. The red band represents the statistical component of the total uncertainty on the measurement which is indicated by the error bar on each point. The variable min∆Φ( j0, l) refers to the minimal difference in φ between the leading jet and a prompt lepton.    Figure 8: Examples of the measured particle-level differential cross-sections in the extreme eejj, µµjj and eµjj measurement regions. The data are unfolded bin-by-bin exclusively to the dominant process in each channel (Z → ee, Z → µµ and tt, respectively). The MC prediction for the dominant process is also shown, with no m j j reweighting applied. The red band represents the statistical component of the total uncertainty on the measurement which is indicated by the error bar on each point. For the leading jet p T measurement, the uncertainty in the second bin is smaller because the bin-by-bin correction factor from the alternative tt samples agrees more closely with the nominal ones than in neighbouring bins. The variable min∆Φ( j0, l) refers to the minimal difference in φ between the leading jet and a prompt lepton.

Results of the leptoquark search
The data event yields (black points) in a representative subset of the SRs are shown in Figure 9 together with the predicted background yields as evaluated with the fit. The bottom panel shows the significance of the deviations, assuming Poisson statistics and taking only statistical uncertainties into account. Even without including systematic uncertainties, no significant excess above the SM expectation is observed in any of the SRs. The modified frequentist CL s method [88] is used to set limits on the strength of the LQ signal.
Each BDT uses as inputs variables that reconstruct the LQ mass. Since these have a high discriminating power, the sensitivity of the SRs exhibits a strong dependence on the LQ mass below values of m LQ around 600 GeV. Therefore, in this regime a simple interpolation of acceptance between simulated mass points does not give a reliable estimate of the limit in the mass interval. To account for this limitation, results are presented separately for low and high mass regions, defined as being less than and greater than 600 GeV, respectively. For masses below around 600 GeV the intervals between mass points are greater than the typical mass resolution for adjacent masses (the converse is true for masses above around 600 GeV).
A coarse scan of the mass is performed in the low mass region in intervals of 50 GeV. Exclusion limits that are obtained at these scan points are used as conservative limits for the intermediate masses between these points. While for masses above ∼ 600 GeV the the mass-scan intervals are smaller than the mass resolution at the test points, the intervals used below ∼ 600 GeV are larger than the mass resolution. Therefore, above ∼ 600 GeV a simple interpolation is used to obtain the limits in the mass intervals. In the range below ∼ 600 GeV this is not appropriate and a different approach is used. Since the limit within a given   SR is strongest at the mass point for which the SR is designed for and weakest at the neighbouring mass points, the limit is estimated for a given mass interval, from m 1 to m 2 , in the following way; two limits are evaluated for m 1 and m 2 using the SR defined by m 1 (i.e. the respective BDT trained for the mass hypothesis m 1 ). Similarly, two estimations are calculated for the SR defined by m 2 . The weaker limit between the two results is retained for a given SR. Of the two remaining results from the two SRs, the stronger limit is used.
The 95% CL upper limits on the cross-section for scalar-LQ pair production, normalised to the predicted cross-section, are presented as a function of m LQ in Figure 10 for first-and second-generation LQs for an assumed value of β = 0.5. Expected limits with their one and two standard-deviation bands are also shown. The observed limits are consistent with the expected limits for all mass points for both channels. Lower mass limits of about 1.25 TeV are obtained, representing an increase of around 400 GeV compared with earlier work [21].
Exclusion contours in the β-m LQ plane are shown in Figure 11 for different values of β for the first two LQ generations. The sensitivity in β is greatest for the lowest mass region considered (∼ 200 GeV) for which the search is sensitive to β values around 10 −2 . For β = 1, a mass of 1400 GeV (1560 GeV) can be excluded for first-generation (second-generation) LQs. Limit values are also given for first-and second-generation LQs in Table 5. The considerably stronger observed limit for the second generation is due to a downward fluctuation in the data in the relevant SRs and the fact that searches for different generations are independent.
[GeV]  Figure 10: Upper limits (observed and expected) on the pair-production cross-section for (top) first-and (bottom) second-generation LQs normalised to the predicted cross-section (σ/σ th ), as a function of LQ mass for an assumed value of β = 0.5. Limits are presented for low (left) and high (right) mass regions, which correspond to masses greater than and less than 600 GeV, respectively. The ± 1σ (±2σ) uncertainty bands on the expected limit represent all sources of systematic and statistical uncertainty. Expected Figure 11: Upper limits (observed and expected) on the branching ratio for (top) first-and (bottom) second-generation LQs into a lepton and quark as a function of LQ mass. Limits are presented for low (left) and high (right) mass regions, which correspond to masses greater than and less than 600 GeV, respectively. The ± 1σ (±2σ) uncertainty bands on the expected limit represent all sources of systematic and statistical uncertainty.

Summary and conclusions
Searches for pair production of first-and second-generation scalar leptoquarks in pp collisions at √ s = 13 TeV have been made using the ATLAS detector at the LHC. The searches exploit a data set corresponding to 36.1 fb −1 of integrated luminosity and probe the lepton-quark and lepton-neutrino LQ decay channels. No significant excess above the SM background expectation is observed in any channel and exclusion limits have been evaluated. The results presented here significantly extend the sensitivity in mass compared to previous searches. Within the minimal Buchmüller-Rückl-Wyler model and assuming a branching ratio for the decay into a charged lepton and a quark of 50%, leptoquarks with masses up to 1.25 TeV are excluded in both channels at 95% CL.
In addition to the search, measurements have been made of particle-level fiducial and differential crosssections for six measurement regions related to the control regions used for the search. Two measurements regions are identical to the Z control regions used in the search, and their fiducial cross-sections are measured to be 3.28 ± 0.22 pb and 3.32 ± 0.23 pb for the Z → ee and Z → µµ channels respectively. The measurements agree with the cross-sections predicted by S 2.2.1, which are 3.24 ± 1.02 pb and 3.12 ± 1.01 respectively. A third measurement region is dominated by tt → eµ, where the fiducial cross-section is measured to be 1.50 ± 0.13 pb, compared to 1.60 ± 0.38 pb predicted by P +P 8. Measurements are also made in three other regions which have an additional S T > 600 GeV requirement.
The differential cross-sections for variables involving jet energies are found to exhibit a degree of mismodelling by the simulation, in particular in the Z measurement regions. Those involving leptons or the opening angle between jets are typically found to be well modelled.

Appendix A MR differential cross-section measurements
The measured differential cross-sections in the MRs are shown in Figure 12 for the eejj channel, in Figure 13 for the µµjj channel and in Figure 14 for the eµjj channel. The measured generator-level differential cross-sections in the eejj channel. The data are unfolded bin-by-bin exclusively to the Z → ee process. The MC prediction for the dominant process is also shown, with no m j j reweighting applied. The red band represents the statistical component of the total uncertainty which is indicated by the error bar on each point. The variable min∆Φ( j1, l) refers to the minimal difference in φ between the subleading jet and a prompt lepton. The variable H T refers to the scalar sum of the jet p T values. The variables ∆η( j j), ∆φ( j j) and ∆φ(ll) refer to the difference in η and φ between the two leading jets or leptons.
The measured differential cross-sections in the MRs where an additional selection of S T > 600 GeV has been applied are shown in Figure 15 for the eejj channel, in Figure 16 for the µµjj channel, and in Figure 17    The measured generator-level differential cross-sections in the µµjj channel. The data are unfolded bin-by-bin exclusively to the Z → µµ process. The MC prediction for the dominant process is also shown, with no m j j reweighting applied. The red band represents the statistical component of the total uncertainty which is indicated by the error bar on each point. The variable min∆Φ( j1, l) refers to the minimal difference in φ between the subleading jet and a prompt lepton. The variable H T refers to the scalar sum of the jet p T values. The variables ∆η( j j), ∆φ( j j) and ∆φ(ll) refer to the difference in η and φ between the two leading jets or leptons. The variable S T refers to the scalar sum of the p T of the jets and leptons.

ATLAS
for the eµjj channel.   The measured generator-level differential cross-sections in the eµjj channel, defined as for the eejj and µµjj MRs, but requiring the two leptons to have different flavours, and with no requirement on the dilepton mass. The data are unfolded bin-by-bin exclusively to the tt process. The MC prediction for the dominant process is also shown. The red band represents the statistical component of the total uncertainty which is indicated by the error bar on each point. The variables min∆Φ( j0, l) and min∆Φ( j1, l) refers to the minimal difference in φ between a prompt lepton and the leading or subleading jet respectively. The variable H T refers to the scalar sum of the jet p T values. The variables ∆η( j j), ∆φ( j j) and ∆φ(ll) refer to the difference in η and φ between the two leading jets or leptons. The variable S T refers to the scalar sum of the p T of the jets and leptons.   The measured generator-level differential cross-sections in the eejj channel, where an additional selection of S T > 600 GeV has been applied. The data are unfolded bin-by-bin exclusively to the Z → ee process. The MC prediction for the dominant process is also shown, with no m j j reweighting applied. The red band represents the statistical component of the total uncertainty which is indicated by the error bar on each point. The variable min∆Φ( j1, l) refers to the minimal difference in φ between the subleading jet and a prompt lepton. The variable H T refers to the scalar sum of the jet p T values. The variables ∆η( j j), ∆φ( j j) and ∆φ(ll) refer to the difference in η and φ between the two leading jets or leptons. The variable S T refers to the scalar sum of the p T of the jets and leptons.       The measured generator-level differential cross-sections in the µµjj channel, where an additional selection of S T > 600 GeV has been applied. The data are unfolded bin-by-bin exclusively to the Z → µµ process. The MC prediction for the dominant process is also shown, with no m j j reweighting applied. The red band represents the statistical component of the total uncertainty which is indicated by the error bar on each point. The variable min∆Φ( j1, l) refers to the minimal difference in φ between the subleading jet and a prompt lepton. The variable H T refers to the scalar sum of the jet p T values. The variables ∆η( j j), ∆φ( j j) and ∆φ(ll) refer to the difference in η and φ between the two leading jets or leptons. The variable S T refers to the scalar sum of the p T of the jets and leptons.     Figure 17: The measured generator-level differential cross-sections in the eµjj channel, defined as for the eejj and µµjj MRs, but requiring the two leptons to have different flavours, and with no requirement on the dilepton mass, where an additional selection of S T > 600 GeV has been applied. The data are unfolded bin-by-bin exclusively to the tt process. The MC prediction for the dominant process is also shown. The red band represents the statistical component of the total uncertainty which is indicated by the error bar on each point. The variables min∆Φ( j0, l) and min∆Φ( j1, l) refers to the minimal difference in φ between a prompt lepton and the leading or subleading jet respectively. The variable H T refers to the scalar sum of the jet p T values. The variables ∆η( j j), ∆φ( j j) and ∆φ(ll) refer to the difference in η and φ between the two leading jets or leptons. The variable S T refers to the scalar sum of the p T of the jets and leptons. Figures 18 and 19 show kinematic distributions for the TRs for the BDT for the dilepton and lepton-neutrino channels, respectively. These regions corresponds to the SRs albeit with the BDT selection removed.       [15] W. Buchmüller