Search for new phenomena in events with same-charge leptons and b-jets in pp collisions at s=13\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$ \sqrt{s}=13 $$\end{document} TeV with the ATLAS detector

A search for new phenomena in events with two same-charge leptons or three leptons and jets identified as originating from b-quarks in a data sample of 36.1 fb−1 of pp collisions at s=13\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$ \sqrt{s}=13 $$\end{document} TeV recorded by the ATLAS detector at the Large Hadron Collider is reported. No significant excess is found and limits are set on vector-like quark, four-top-quark, and same-sign top-quark pair production. The observed (expected) 95% CL mass limits for a vector-like T- and B-quark singlet are mT > 0.98 (0.99) TeV and mB > 1.00 (1.01) TeV respectively. Limits on the production of the vector-like T5/3-quark are also derived considering both pair and single production; in the former case the lower limit on the mass of the T5/3-quark is (expected to be) 1.19 (1.21) TeV. The Standard Model four-top-quark production cross-section upper limit is (expected to be) 69 (29) fb. Constraints are also set on exotic four-top-quark production models. Finally, limits are set on same-sign top-quark pair production. The upper limit on uu → tt production is (expected to be) 89 (59) fb for a mediator mass of 1 TeV, and a dark-matter interpretation is also derived, excluding a mediator of 3 TeV with a dark-sector coupling of 1.0 and a coupling to ordinary matter above 0.31.


Introduction
aforementioned B-, T-, and T 5/3 -quarks and the B −4/3 -quark that has charge −4/3.1 They may appear as singlets, doublets, or triplets under SU (2). In many models, the VLQ couple predominantly to thirdgeneration SM quarks in order to address the naturalness problem, mostly driven by the couplings between the top quark and the Higgs boson [2]. Therefore, in this paper it is assumed that couplings to first-and second-generation SM quarks are negligible. Several production and decay scenarios could lead to an enhanced rate of multilepton events [2,17,18]. The Band T-quarks could decay via both the charged and neutral current channels: B → Wt, Hb, Z b, and T → W b, Ht, Zt, with model-dependent branching ratios. The most likely scenarios resulting in same-charge lepton pair or trilepton production are where two or more of the vector bosons decay leptonically. Results are given for the SU(2) singlet models of Ref. [2], as well as in a model-independent framework where all branching ratios are considered. The only decay mode of the T 5/3 quark is into W + t → W + W + b. If both W bosons decay leptonically, then a same-charge lepton pair is produced from a single T 5/3 decay. Therefore, results for the T 5/3 are presented for both pair and single production. The single production results depend on the assumed strength of the T 5/3 tW coupling. Figure 1 shows typical Feynman diagrams leading to the signature considered in this paper.

Four-top-quark production
Four-top-quark production is expected to occur in the SM with a next-to-leading-order cross-section of 9.2 fb at √ s = 13 TeV [19] and leads to a same-charge lepton pair or a trilepton final state with a branching ratio of 12.1%, including leptonically decaying τ-leptons. In addition, the four-top-quark production rate could be enhanced in several BSM scenarios. Three benchmarks are considered in this paper. The first is based on an effective field theory (EFT) approach where the BSM contribution is represented via a contact interaction (CI) independently of the details of the underlying theory: where t R is the right handed top spinor, γ µ are the Dirac matrices, C 4t is a dimensionless constant and Λ is the new-physics energy scale. Only the contact interaction operator with right-handed top quarks is considered as left-handed top operators are already strongly constrained by electroweak precision data [20]. The four-top-quark production mechanism in this model is shown in Figure 2(a).
The second BSM four-top-quark production model is one with two universal extra dimensions (2UED) that are compactified in the real projective plane geometry (RPP), as described in Ref. [21]. The compactification of the two extra dimensions, characterised by the radii R 4 and R 5 , leads to the discretisation of the momenta along these directions with the allowed values labelled by the integers i and j. Each momentum state appears as a particle called a Kaluza-Klein (KK) excitation with a mass m, defined by (i, j) values and later referenced as a 'tier'. At leading order, the mass of a KK excitation of a particle with a mass m 0 is The additional mass differences within a given tier (i, j) are due to next-to-leading-order corrections and are small compared with the masses [21]. By using the notations m KK = 1/R 4 and ξ = R 4 /R 5 , Eq.
The four-top-quark signal of the model considered in this paper arises from pair-produced particles of tier (1, 1), which then chain-decay into the lightest particle of this tier, the KK excitation of the photon, A (1,1) , by emitting SM particles [22], as shown in Figure 2(b). This heavy photon A (1,1) decays into tt with a branching ratio assumed to be 100%. Therefore, additional quarks and leptons are expected to be produced in association with the four-top-quark system, which makes this signature quite different from the other considered benchmarks, as shown in Figure 2. In addition, cosmological observations constrain m KK between 600 GeV and 1000 GeV [22,23], leading to typical resonance masses between 0.6 TeV and 2 TeV depending on the ratio ξ of the two compactification radii. This analysis probes different scenarios varying both m KK and ξ, where the four-top-quark signal arises from particles of tier (1, 1) [22]. The third BSM four-top-quark production model is one with two Higgs doublets Φ 1 and Φ 2 (2HDM), which spontaneously break the electroweak symmetry SU(2) L × U(1) Y [24]. In this model, Φ 1 couples only to down-type quarks and leptons, and Φ 2 couples only to up-type quarks and neutrinos [25]. The parameter space is constrained to avoid large FCNC at tree level, resulting in four different sets of Yukawa couplings between the Higgs doublets and SM fermions. Among these, the Type-II 2HDM is considered.
Measurements of the properties of the SM Higgs boson constrain all 2HDM types to be in the so-called alignment limit [25], where the mass eigenstates are aligned with the gauge eigenstates in the new scalar sector. In this model, the tttt final state arises from the production of heavy neutral Higgs bosons H (scalar) and A (pseudo-scalar) in association with a tt pair, with the H or A boson decaying into tt as shown in Figure 2(c): gg → ttH/A → tttt.
In the alignment limit, the scalar and the pseudo-scalar Higgs boson have the same mass m H/A and both contribute to the four-top-quark production with similar kinematics. The cross-section predicted by this model depends on m H/A and the ratio tan β of vacuum expectation values of the two Higgs doublets. This benchmark is particularly interesting since the four-top-quark kinematics are rather soft compared with the CI signature, especially at low masses where the direct search for H/A → tt loses sensitivity due to interference effects with the SM tt production [26].

Same-sign top-quark pair production
Same-sign top-quark pair production (tt) is suppressed to a negligible level in the SM but allowed in BSM models. This signature is distinct from VLQ or tttt production, and is treated separately in the analysis.
In particular, only positively charged lepton pairs are considered for this signal (since tt production has a cross-section higher by a typical factor 100 thantt production at the LHC). The kinematic criteria also differ from those applied in the VLQ and four-top-quark searches. The considered benchmark is a generic dark-matter model relying on an effective theory invariant under SU(2) L × U(1) Y [27]. In this model, a top quark is produced in association with an FCNC mediator V which could then decay into dark-matter χ or SM particles tū/tu: where g SM and g DM represent the coupling strengths of the mediator to SM and dark-matter particles, respectively, and L kin [ χ, V µ ] represents the kinetic term of the mediator and the dark-matter fields. The tt final state could arise if V couples to the top quark, in both the tand s-channels, leading to the three processes shown in Figure 3, with a relative contribution which depends on the total width of the mediator.
The results are interpreted for each process in Figure 3 independently, allowing constraints to be placed on generic FCNC via the process uu → tt as well as specific resonances decaying into tū. The results are also interpreted for different values of the mass of the mediator m V , g SM , and g DM , taking into account width effects. This provides additional sensitivity to dark-matter mediators when its branching ratio into SM particles is sizeable,2 where a direct search based on final states with missing transverse energy and a top quark [28] might lose sensitivity. system covers the range |η| < 2.4 with resistive plate chambers in the barrel, and thin gap chambers in the endcap regions.
The ATLAS detector has a two-level trigger system to select events for offline analysis [31]. The first-level trigger is implemented in hardware and uses a subset of detector information to reduce the event rate to a design value of 100 kHz. This is followed by a software-based high-level trigger which reduces the event rate to about 1 kHz.

Data sample and trigger requirements
The data were recorded in LHC proton-proton (pp) collisions at √ s = 13 TeV in 2015 and 2016, corresponding to an integrated luminosity of 36.1 ± 0.8 fb −1 . The luminosity and its uncertainty are derived, following a methodology similar to that detailed in Ref.
[32], from a calibration of the luminosity scale using xy beam-separation scans. In this dataset the average number of simultaneous pp interactions per bunch crossing in addition to the triggered hard-scatter interaction, pile-up, was approximately 24. Data-quality requirements were applied to ensure that events were selected only from periods where all subdetectors were operating at nominal conditions, and where the LHC beams were in stable-collision mode. The events used in the analysis were required to have at least one primary vertex formed from at least two charged-particle tracks with transverse momentum p T > 0.4 GeV, and to have been triggered either by two leptons or a single high-p T lepton. Only triggers with loose lepton quality and isolation requirements were used, since tight requirements at the trigger level would complicate the estimation of the background originating from fake or non-prompt leptons. The dilepton triggers provide sensitivity at low lepton p T values, and the single-lepton triggers provide additional efficiency for high-p T leptons. The p T thresholds for the dilepton triggers varied from 8 to 24 GeV depending on the lepton flavours and the year in which the event was recorded. The single-muon trigger had a p T threshold of 50 GeV; the corresponding single-electron trigger had a p T threshold of 24 GeV for data recorded in 2015 and 60 GeV for data recorded in 2016. The trigger efficiency depends on the lepton flavour combination, but in all cases is > 95% for events of interest in this analysis.

Object selection criteria
This analysis makes use of reconstructed electrons, muons, jets, b-tagged jets, and missing transverse momentum. Their selection is described in this section and summarised in Table 1. Electrons are reconstructed from clusters of energy deposits in electromagnetic calorimeter cells with a matching inner detector track [33]. The candidate electrons are required to have p T > 28 GeV and be in the |η| < 2.47 region, excluding the transition region between the barrel and endcap calorimeters (1.37 < |η| < 1.52). For events with two electrons or one electron and one muon, electrons with |η| > 1. 37 are not considered since such events are subject to backgrounds from electron charge misidentification, which has a substantially higher probability of occurring for electrons at high |η|, as detailed in Section 7. Muons are reconstructed from tracks in the muon spectrometer and inner detector [34]. They must have p T > 28 GeV and |η| < 2.5.
Electrons and muons are required to be consistent with originating from the primary event vertex, using the quantities |d 0 /σ d 0 |, where d 0 is the impact parameter relative to the primary vertex in the x-y plane and σ d 0 is its uncertainty, and |z 0 sin θ|, where z 0 is the r-φ projection of the impact point onto the beamline.
All leptons are required to satisfy either relaxed or nominal identification criteria. The nominal sample, which is a subset of the relaxed sample, is used in the final analysis, and the relaxed sample is used to estimate one component of the reducible background as described in Section 7. For electrons, the relaxed (nominal) selection requires that the electron satisfies the likelihood medium (tight) requirements defined in Ref.
[33], while for muons both the relaxed and nominal selections require that the muon satisfies the medium criteria defined in Ref. [34]. Nominal leptons are required to be isolated from other activity in the detector: the scalar sum of the p T of tracks within a variable-size cone around the lepton (excluding its own track), must be less than 6% of the lepton p T . The track isolation cone size for electrons (muons) ∆R = (∆η) 2 + (∆φ) 2 is given by the smaller of ∆R = 10 GeV/p T and ∆R = 0.2 (0.3). In addition, in the case of electrons the sum of the transverse energy of the calorimeter energy clusters in a cone of ∆R = 0.2 around the electron (excluding the energy from the electron itself) must be less than 6% of the electron p T .
Jets are reconstructed from clusters of energy in the calorimeter using the anti-k t algorithm [35] with a radius parameter of 0.4. Jets are considered if p T > 25 GeV and |η| < 2.5. Quality criteria are applied to jets to ensure that they are not reconstructed from detector noise, beam losses, or cosmic rays [36]. If any jet fails to satisfy these criteria, the event is vetoed. To reject jets from pile-up, an observable called the jet vertex tagger (JVT) is formed by combining variables that discriminate pile-up jets from hard-scattering jets [37]. Jets with p T < 60 GeV and |η| < 2.4 that have associated tracks are subject to a requirement on JVT that is 92% efficient for hard-scattering jets while rejecting 98% of pile-up jets. If such jets have no associated tracks they are removed. Jets containing a b-hadron are identified using a multivariate technique [38]. An operating point is defined by a threshold in the range of discriminant output values, and is chosen to provide specific b-, c-, and light-jet efficiencies in simulated tt events. The operating point used in this analysis has a 77% b-jet efficiency with rejection factors of 6 and 134 for cand light-jets, respectively.
The missing transverse momentum is calculated as the negative vectorial sum of the transverse momenta of reconstructed calibrated objects in the event. Its magnitude is denoted E miss T , and is computed using electrons, photons, hadronically decaying τ-leptons, jets and muons as well as a soft term calculated with tracks matched to the primary vertex which are not associated with any of these objects [39].
A set of requirements are applied to resolve overlaps between reconstructed objects. This procedure is applied to the leptons satisfying the relaxed selection criteria. In the first step, electrons which share a track with a muon are removed, to avoid cases where muon radiation would mimic an electron. Next, the jet closest to an electron within ∆R y ≡ (∆y) 2 + (∆φ) 2 = 0.2 is removed to avoid double counting. Then, to reduce the contributions from non-prompt electrons originating from heavy-flavour decays, electrons within ∆R y = 0.4 of any remaining jets are removed. Finally, the overlap between muons and jets is considered: jets with less than three tracks and within ∆R y = 0.4 of a muon are removed. Muons are then removed if they are within ∆R y = 0.04 + 10 GeV/p T,µ of remaining jets.
The events are preselected if they contain at least one jet, and at least two leptons that satisfy the nominal selection criteria. If exactly two of the three highest-p T leptons satisfy the nominal criteria, they must have the same electric charge, and if these two leptons are electrons, a quarkonia/Z-veto is applied to their invariant mass: m ee > 15 GeV and |m ee − 91 GeV| > 10 GeV. Events that satisfy these criteria are called 'same-charge lepton' events. If the three highest-p T leptons satisfy the nominal criteria, no requirement is imposed on their charge or on the invariant mass of any pair. These events are called 'trilepton' events. Same-charge lepton and trilepton events are treated separately in the analysis. Table 1: Summary of object identification and definition. 'ID quality' refers to the identification criteria used for each object type. For electrons, 'mediumLH' and 'tightLH' refer to the likelihood medium and tight requirements defined in Ref.
[33]; for muons the criteria for 'medium' ID quality are defined in Ref. [34]. For jets, 'cleaning' means applying a procedure to reduce contamination from spurious jets [36], and 'JVT' means applying criteria to select jets that are consistent with being produced at the primary vertex rather than from pile-up [37]. For b-jets, 'MVA77' refers to placing a requirement on the multivariate discriminant defined in Ref.
[38] that is 77% efficient for b-jets in simulated tt events.

Simulation
Monte Carlo (MC) simulation was used to model the signals and the irreducible backgrounds. Evt-Gen v1. The main sources of irreducible backgrounds are ttV production (where V represents either a W or Z boson), ttH production, and diboson production. Smaller contributions from triboson, V H, ttt, ttWW, t ZW, and t Z production are shown in the tables and figures as 'Other bkg'. The SM four-topquark production is included as a background for all BSM searches, but is considered as the signal in the search for SM four-top-quark production. The matrix elements for ttV, ttH, ttt, tttt, ttWW, and t ZW production processes were modelled with MG5_ MC@NLO v2.2.2 and P v8.186 for hadronisation and showering, using the NNPDF3.0NLO PDF set. Next-to-leading-order (NLO) matrixelement calculation was used for ttV, ttH and t ZW while leading-order (LO) calculation was used for ttt, tttt and ttWW.  [52]. The V H production process was modelled using P v8.186, with the NNPDF2.3LO PDF set. The crosssections for all processes are calculated at NLO in QCD, except for t Z where the leading-order calculation is used.

Simulated P
v8.186 minimum-bias events were overlaid on each simulated event to model the effects of pile-up; the generated events were then reweighted so that the distribution of the number of interactions per bunch crossing matched the distribution observed in the data. The response of the ATLAS detector for most samples was modelled using G 4 [53] within the ATLAS simulation infrastructure [54]. The ttt, single T 5/3 , and same-sign top-quark pair production samples were processed with a fast simulation that relies on a parameterisation of the calorimeter response [55]. Events were reconstructed using the same algorithms as used for the collider data. Corrections were applied to the simulated events to account for differences observed in trigger efficiencies, object identification efficiencies and resolutions when comparing the simulation with data.

Estimation of reducible backgrounds
In addition to the irreducible backgrounds described above, there are reducible backgrounds where a jet or lepton from heavy-flavour hadron decay mimics a prompt lepton5 (called 'fake/non-prompt lepton background' in the following), or the charge of a lepton is misidentified. These backgrounds are estimated using data-driven techniques.
The fake/non-prompt lepton background yield is estimated with the matrix method [56,57], which uses the relaxed and nominal lepton categories defined in Table 1. The fraction of prompt leptons satisfying the relaxed criteria that also satisfy the nominal criteria is referred to as r. Similarly, the fraction of fake/nonprompt leptons satisfying the relaxed requirements that also satisfy the nominal requirements is referred to as f . Using the measured values of r and f , the number of events with at least one non-prompt/fake lepton in the nominal sample can be inferred from the numbers of relaxed and nominal leptons in the relaxed sample, and this number is taken as the fake/non-prompt yield. A Poisson likelihood approach is used to estimate the final fake/non-prompt yield and its statistical uncertainty. This approach guarantees that the estimated yield is not negative, and provides a more reliable estimate of the statistical uncertainty in regions with a small number of selected events.
Single-lepton control regions enriched in prompt and fake/non-prompt leptons are used to measure r and f . The criteria used to select the single-lepton events are different for electrons and muons due to the 5 Prompt leptons are leptons which do not originate from hadron decays or conversion processes. different sources of fake/non-prompt leptons for each flavour. For electrons, r is measured using events with E miss T > 150 GeV, where the dominant contribution is from W → eν, and f is measured using events with the transverse mass of the E miss T -lepton system6 m T (W) < 20 GeV and E miss T + m T (W) < 60 GeV, where the dominant contribution is from multijet production (including heavy-flavour production) where one or more jets are misidentified as electrons. For muons, r is measured using events with m T (W) > 100 GeV, a sample dominated by W → µν, and f is measured using events where the transverse impact parameter of the muon relative to the primary vertex is more than five standard deviations away from zero, consistent with muons originating from heavy-flavour hadron decays. The small contribution of prompt leptons in the control samples used to measure f is estimated from simulation and this contribution is subtracted from the sample. The values of r and f are parameterised in terms of variables of the leptons (|η|, p T , and the angular distance to the nearest jet) and the number of b-tagged jets. For muons, r ranges from 55% to 97% while f ranges from 7% to 30%. For electrons, r ranges from 70% to 95% while f ranges from 8% to 30%. In general, the values of r and f are smaller for leptons near a jet, and larger for high-p T leptons.
The second reducible background, corresponding to events where the charge of a lepton is misidentified, is considered only for electrons since the probability of muon charge misidentification is negligibly small. There are two primary mechanisms by which the electron charge can be misidentified: the first is the 'trident' process in which an electron emits an energetic bremsstrahlung photon, which subsequently produces an e + e − pair. This can result in a track of the incorrect charge being associated with the electron. The second is the mismeasurement of the curvature of the electron track. The probability for an electron to have its charge incorrectly reconstructed is measured using a sample of dielectron events with invariant mass consistent with the Z boson. The trident process can result in misidentified charge for an electron that is also likely to be considered fake/non-prompt due to the presence of nearby charged tracks. To avoid double-counting the background contribution from such electrons, the matrix method is used to subtract the fake/non-prompt electron yield from the Z sample. The charge misidentification probability is calculated in bins of electron |η| and p T , using a likelihood fit that adjusts these binned probabilities to find the best agreement with the observed numbers of same-charge and opposite-charge electron pairs. The charge misidentification probability varies from 2 × 10 −5 (for electrons at low p T and small |η|) to 10 −2 for electrons at high p T and |η| near the edge of the barrel calorimeter; for electrons with larger values of |η| the probability can reach 10%.
Since charge misidentification is negligible for muons and not relevant for trilepton events (for which no lepton charge requirements are imposed), the background from charge misidentification (called charge mis-ID hereafter) only appears in ee or eµ events. To estimate its yield, ee and eµ events are selected using all the criteria applied in the analysis, with the exception that the leptons are required to have opposite charge. Then the charge misidentification probabilities are applied to this sample to determine the background yield.

Signal and validation regions
Several signal regions (SR) are defined to represent the broad range of BSM signals considered. The selection criteria are designed to maximise the sensitivity to the signals. The signal regions are separated into two categories: one category is designed for maximal sensitivity to VLQ and four-top-quark production, while the second category is optimised for the same-sign top-quark pair production searches. For the VLQ and four-top-quark searches, the preselected sample is first split according to the numbers of leptons (two or three) and b-tagged jets (one, two, or greater than two). Within each of the resulting subsamples, requirements are placed on H T and E miss T , where H T is the scalar sum of the p T of all selected jets and leptons, to maximise the average sensitivity for the signal models considered. In addition, to fully exploit specific features of VLQ and four-top-quark signatures, the signal regions with at least three b-tagged jets are further split. Relaxed H T and high jet multiplicity requirements are sensitive to the four-top-quark signature, while high H T and low jet multiplicity requirements enhance sensitivity to the VLQ signature. For all the signal regions described above, lepton flavours are considered inclusively to increase the number of data events in the loosely selected samples used to estimate the reducible backgrounds. The values of r and f appropriate to each lepton's flavour are used to estimate the fake/non-prompt lepton background. The selection criteria are summarised in Table 2, and the selection efficiencies for some signal models are shown in Table 3.
The same-sign top-quark selection requires exactly two leptons with positive charge, reflecting the preponderance of tt overtt production in pp collisions by a typical factor of 100. Additional criteria are imposed to maximise the sensitivity of the search: at least one b-tagged jet, H T greater than 750 GeV, E miss T greater than 40 GeV, and the azimuthal separation between the two leptons |∆φ | greater than 2.5. Since the optimal kinematic selection is looser than for VLQ and four-top-quark signal regions, more statistics are available for estimating the reducible backgrounds, so the lepton flavours (ee, eµ, and µµ) are treated separately in the search for same-sign top-quark pair production. These selection criteria are summarised in Table 4 and the selection efficiencies for the three same-sign top-quark pair signal processes are shown in Table 5.
In addition to the signal regions, a set of validation regions (VR) with criteria similar to those used for the SR, in which the expected signal yield is small, are defined. The VR are used to verify that the background is correctly modelled in regions that are kinematically similar to the signal regions. The definitions of the validation regions are presented in Tables 2 and 4, and the corresponding expected and observed yields are shown in Tables 6 and 7. These tables also report the probability for the expected background to fluctuate to equal or exceed the observed yield in each validation region; the smallest such probability is 0.10, which occurs in VR2b2 . The distributions of E miss T and H T in each validation region are shown in Figures 4-6. The χ 2 probabilities for compatibility of the observed and expected distributions are reasonable when all systematic uncertainties, including their bin-to-bin correlations, are considered. The smallest probability is 2%, which occurs for the E miss T distribution in VR1b2 . The systematic uncertainties are described in Section 9. Data Other bkg Total bkg unc.     Figure 5: Distributions of H T in each of the validation regions used for the four-top-quark and VLQ searches. The first (second) column shows distributions of dilepton (trilepton) events while each row corresponds to a given b-tagged jet multiplicity. The uncertainty, shown as the hashed region, includes both the statistical and systematic uncertainties from each background source. Table 2: Definitions of the validation and corresponding signal regions for the four-top-quark and VLQ searches, where N j is the number of jets, N b is the number of b-tagged jets, and N is the number of leptons. The name of each signal (validation) region begins with "SR" ("VR"), with the rest of the name indicating the number of leptons and number of b-tagged jets required. The suffix "_L" denotes the signal regions with relaxed H T but stricter N j requirements. For regions that require two leptons, the leptons must have the same charge. Events that appear in any of the signal regions are vetoed in the validation regions.  Table 4: Definitions of the validation and signal regions for the same-sign top-quark pair production search, where N b is the number of b-tagged jets, N is the number of leptons, and |∆φ | is the azimuthal angle between the leptons. The name of each signal (validation) region begins with "SR" ("VR"). The validation region is inclusive in lepton flavour.

ATLAS
e + e + SRtteµ e + µ + SRttµµ µ + µ +    Table 7: Expected background and observed event yields in the validation region for the same-sign top-quark pair production search. The 'Other bkg' category includes contributions from all rare SM processes listed in Section 6. The first uncertainty is statistical and the second is systematic. The p-value is the probability for the expected background to fluctuate to equal or exceed the observed yield.
Source VRtt Other bkg 0.7 ± 0.1 ± 0.6 Charge mis-ID 4.0 ± 0.2 ± 1.4 Fake/non-prompt 4.7 randomly into four subsamples, computing the efficiencies in each of them, and observing the variation in the fake/non-prompt lepton yield. This variation is then divided by two since each of the subsamples has only one fourth of the statistics of the full sample. This procedure accounts for any correlations in the efficiencies. The third uncertainty is estimated by varying the normalisation of the MC subtraction in the fake control sample by 10%. The resulting uncertainty depends on the region the fake/non-prompt lepton background is estimated in, since the fake sample can vary kinematically, but is generally around 40−50% of the expected fake/non-prompt lepton yield for the dominant uncertainty in the signal regions. The first is the dominant uncertainty, particularly from variations in the fake-lepton efficiency when the selection criteria for the control samples are changed.
The uncertainties of the charge mis-ID background arise from uncertainties of the measured rates for electron charge misidentification and uncertainties of the fake/non-prompt lepton background. The uncertainties of the charge misidentification rates include the following contributions: the statistical uncertainty of the likelihood fit to determine the rates (≈ 15%), the changes in rates observed when the mass windows used to define the Z → ee and sideband regions are varied from 0 GeV to 20 GeV (≈ 6%), and the differences observed between the results of the likelihood fit and the true rates when the method is applied to simulation (≈ 5%). These uncertainties sum in quadrature to about 20% of the expected charge mis-ID yield in the signal regions. The systematic uncertainty of the fake/non-prompt component is estimated as described above, and impacts the charge mis-ID background through a variation in the fake/non-prompt background that is subtracted when calculating the charge misidentification rates (≈ 10%). This component of the uncertainty is anti-correlated between the fake/non-prompt and charge mis-ID backgrounds.
Since the optimised selection criteria result in small expected background yields in the signal regions, the dominant uncertainty in the analysis is statistical. Among the systematic uncertainties, the leading contributors are from uncertainties of the fake/non-prompt lepton background estimate, the modelling of the irreducible backgrounds (in terms of both their production cross-sections and acceptance) and uncertainties of the efficiency for identifying b-jets. Summaries of the leading sources of systematic uncertainty in each signal region are provided in Tables 8 and 10 for the total background yield, and in Tables 9 and 11 for representative signal models (a T vector-like quark with m T = 1 TeV, and exclusive tt production with m V = 2 TeV, respectively).  Lepton ID  2  1  1  1  3  3  2  3  efficiency  Pile-up  5  2  3  3  3  5  1  6  reweighting  Luminosity  1  1  2  2  2  2  2  2  Fake/non-prompt  20  12  13  8  7  2  3  1  Charge mis-ID  2  3  1  2  ----Cross-section  25  13  22  32  32  26  21 24 × acceptance Table 9: Uncertainty of the event yields in the signal regions for a representative signal (vector-like T quark, m T = 1 TeV) due to the leading sources of experimental systematic uncertainty. The expected yield for this signal in each region is also given.

Results
To test for the presence of a BSM signal, the observed numbers of events in a set of signal regions are compared with the expected background yields in those regions. The searches for VLQ and four-top-quark production use the combination of the signal regions defined in Table 2, while the searches for tt production use the combination of the signal regions defined in Table 4. In the case where the SM four-top-quark production is probed, this process is removed from the background contribution. In all other cases, the quoted significances refer to BSM benchmarks.
A Poisson likelihood ratio test is used to assess the probability that the observed yields are compatible with the sum of the expected background and signal, with the nominal signal cross-section scaled by a value µ. Systematic uncertainties are introduced as nuisance parameters that have Gaussian or log-normal constraints corresponding to their uncertainty values. For any given choice of µ the likelihood ratio q µ is compared with the distribution of values that would be expected under the background-only and signal plus background hypotheses. The probabilities p b (µ) of the background fluctuating to be more signal-like than the data, and p s+b (µ) of the signal plus background fluctuating to be more background-like than the data are both by comparing q µ with these distributions. The values of p b (µ) and p s+b (µ) are derived using the asymptotic approximation described in Ref.
[59]. The quantity R CL s [60] is then defined as .
If the data are statistically consistent with the background expectation, R CL s (µ) will tend to decrease as µ increases. All values of µ for which R CL s (µ) is less than 0.05 are considered as being excluded at 95% confidence level (CL). If, for a particular signal model, R CL s (µ = 1) is less than 0.05, that model is excluded.
The observed yields in each of the signal regions, along with the expected yields from background sources and some representative BSM physics models are shown in Tables 12 and 13 and in Figure 7. There are no statistically significant differences between the event yields and the expected background, although in two of the signal regions, SR3b2 _L and SR3b3 _L, the event yield exceeds the background by 1.7 and 1.8 standard deviations, respectively. The resulting combined significance depends on the signal being considered, reaching 3.0 standard deviations for SM four-top-quark production (where this contribution is not included among the backgrounds), while a significance of 0.9 standard deviations is expected. More than half of the excess is observed in events with two muons, three b-tagged jets and H T around 700 GeV. The largest significance for any of the BSM models considered is 2.3 standard deviations, which is obtained for the 2HDM model. Therefore no evidence of BSM signals is found, and limits are set as detailed below.
Several studies were done to validate the background estimate. One potential issue was noted when applying the matrix method for muons to the same sample of events used to calculate the fake/non-prompt muon efficiency, where the predicted yield was observed to deviate from data at the level of 1.2 standard deviations near ∆R(µ, jet) = 1.0. Applying a two-dimensional parameterisation of the efficiencies in p T,µ × ∆R(µ, jet) substantially improves the level of agreement, and the background in the signal regions was recomputed with this parameterisation. In addition, the prompt-and fake-lepton efficiencies used in the estimation of the fake/non-prompt lepton background were recomputed using different requirements for the number of b-tagged jets in the control regions (this test is especially important for electrons, where the fraction of candidates arising from photon conversion versus heavy-flavour decay varies strongly with the presence or absence of a b-tagged jet), and also using a completely different set of control regions (dilepton events where a tag-and-probe procedure was applied). The fake/non-prompt lepton background was also estimated using the fake-factor method [61] instead of the matrix method. The level of compatibility between the expected background and observed data yields was similar in all of these variations.
Further, the events in the signal regions were scrutinised to determine if some of them might have arisen from detector defects or other anomalies. The distribution of objects in η, φ, and p T was found to be consistent with expectations, as was the temporal distribution of the events across the data-taking period. The reconstructed muon candidates in these events were inspected, and their features (such as the χ 2 of their fitted tracks, and compatibility of the momenta measured in the inner detector and the muon spectrometer) were found to be unremarkable. The three-lepton samples were split between those with and without a lepton pair that formed a Z-boson candidate. In the subsample with a Z-boson candidate, four events are observed with an expected background of 2.4 ± 0.6, while in the subsample without a Z-boson candidate, five events are observed with an expected background of 1.7 ± 0.6. The composition of b-tagged jets (the fractions of such jets that arise from b-, c-, or light-quarks or gluons) was studied in simulated background events. It was found that the dominant source of b-tagged jets in both the signal and validation regions was in fact b-jets, which accounted for 76 -95% of the b-tagged jets in each region. In addition, the kinematic properties of the events were compared with the expectations from the BSM four-top-quark production benchmark models, and found to agree poorly with all of them, particularly in the b-tagged jet multiplicity.   : Expected background and observed event yields in the signal regions for the same-sign top-quark pair production search. The 'Other bkg' category includes contributions from all rare SM processes listed in Section 6. The first uncertainty is statistical and the second is systematic. The significance is the number of standard deviations by which the tt signal plus background hypothesis is preferred to the background-only hypothesis. It is calculated using the same procedure used to calculate the reported limits.
Source SRttee SRtteµ SRttµµ ttW 0.91 ± 0.09 ± 0.19 2.64 ± 0.15 ± 0.48 1.86 ± 0.13 ± 0.37 tt Z 0.35 ± 0.07 ± 0.09 0.91 ± 0.09 ± 0.12 0.47 ± 0.08 ± 0.09 Dibosons 0.40 ± 0.45 ± 0.09 1.4 ± 0.6 ± 0.9 0.5 ± 0.5 ± 0.5 ttH 0.19 ± 0.06 ± 0.02 0.53 ± 0.08 ± 0.08 0.58 ± 0.07 ± 0.05 tttt 0.12 ± 0.02 ± 0.06 0.30 ± 0.02 ± 0.15 0.22 ± 0.03 ± 0.11 Other bkg 0.29 ± 0.06 ± 0.13 0.51 ± 0.08 ± 0.16 0.33 ± 0.08 ± 0.12 Fake/non-prompt 3.4 Limits on Band T-quark pair production are set in two scenarios. In the first, it is assumed that the branching ratios are given by the singlet model of Ref. [2]. These branching ratios vary slightly with the VLQ mass; they are approximately (0.48, 0.27, 0.25) for B → (Wt, Z b, Hb) and (0.49, 0.22, 0.27) for T → (W b, Zt, Ht). The resulting 95% CL upper limits on the production cross-section as a function of the VLQ mass are shown in Figure 8. Lower limits on the Band T-quark masses are extracted from these cross-section limits, resulting in observed (expected) excluded mass m B < 1.00 TeV (1.01 TeV) and m T < 0.98 TeV (0.99 TeV). The expected and observed limits agree well in spite of the observed excesses in some signal regions because the expected yield of VLQ in those regions is small. In the second scenario, no assumptions are made about the branching ratio, and lower limits on the masses are determined for any possible set of branching ratios, as shown in Figure 9. Since a single T 5/3 quark could decay into a same-charge lepton pair, limits on both single and pair production of T 5/3 quarks are set. If only pair production is considered, then the cross-section limit as a function of mass is unambiguous since the only allowed decay channel is T 5/3 → Wt, as shown in Figure 10(a). The corresponding lower observed (expected) limit on the T 5/3 -quark mass is 1.19 TeV (1.21 TeV). If single production is considered in addition to pair production, the limit depends on the assumed strength of the T 5/3 tW coupling, as shown in Figure 10 Limits on four-top-quark production are set in a variety of models. The observed (expected) limit on the cross-section assuming SM kinematics is 69 fb (29 fb), and the observed (expected) limit assuming kinematics from the EFT model is 39 fb (21 fb). These results are summarised in Table 14, along with the limits on the ratio of the contact interaction strength to the cut-off scale in the EFT model. The latter limit can also be expressed in the plane of the interaction strength |C 4t | versus cut-off scale Λ, as shown in Figure 11(a). Limits on the 2UED/RPP model are shown in Figures 11(b) and 11(c). The 2UED/RPP limits corresponds to an observed lower limit on the parameter m KK of 1.45 TeV, where a limit of 1.48 TeV is expected in the background-only model. Limits on four-top-quark production in the 2HDM interpretation are shown in Figure 12. Two scenarios are considered: one where only the heavy scalar Higgs boson contributes to the process pp → ttH and H → tt, and one where the heavy scalar   Limits on same-sign top-quark pair production are set using the signal regions dedicated to this signature (SRttee, SRtteµ, and SRttµµ). These are interpreted in the context of a dark-matter model with three parameters: the mass of the exotic FCNC mediator particle m V , and the couplings g DM and g SM of the mediator to dark-matter and SM particles, respectively. In the context of this model, three different same-sign top-quark pair production processes are considered: i) production via a t-channel mediator, ii) production via an on-shell s-channel mediator, and iii) production via an off-shell s-channel mediator. Limits on the production cross-section for each mechanism as a function of m V are shown in Figure 13. They are independent of the model parameter values and constrain generic processes such as uu → tt to a cross-section of less than 89 fb, where a limit of 59 fb is expected in the background-only model. More generally, the total signal includes the three contributions with a relative importance which depends on the mediator's total width, and thus on the model parameters. Limits in the plane of g SM versus m V for three values of g DM are shown in Figure 14 where the width effects are taken into account. As a reference, the observed upper limit on g SM is 0.31 when m V is taken to be 3 TeV and g DM is taken to be one, where a limit of 0.28 is expected in the background-only hypothesis. Assuming that m V = 1 TeV and g DM = 1, the observed (expected) upper limit on g SM is 0.14 (0.13). In all plots, the expected 95% CL limits are shown with their ±1 and ±2 standard deviation bands. In the context of the two-Higgs-doublet model, the Higgs boson width can be large for low tan β. In spite of that, it was checked that the signal efficiency has a negligible dependence on tan β in the region of interest.  Figure 13: Expected and observed 95% CL upper limits on the cross-section for (a) prompt tt production, (b) on-shell mediator, (c) off-shell mediator subprocesses of the same-sign top-quark pair production. In all plots, the expected 95% CL limits are shown with their ±1 and ±2 standard deviation bands. Each subprocess is considered as a generic BSM signature and therefore no theory prediction is shown.

Conclusion
A search for processes beyond the Standard Model is performed using 36.1 fb −1 of proton-proton collisions at √ s = 13 TeV recorded by the ATLAS detector at the LHC, based on events with at least two leptons, including a pair of the same electric charge, at least one b-tagged jet, sizeable missing transverse momentum, and large H T . Several BSM processes are considered that could enhance the yield of such events over the small expected background. The search is performed in the context of BSM models, with signal regions defined for different models. No significant excess over the expected background is observed. The regions of parameter space excluded by the data are quantified by setting limits at 95% confidence level. The masses of vector-like Tand B-quarks are (expected to be) constrained to m T > 0.98 TeV (0.99 TeV), m B > 1.00 TeV (1.01 TeV) assuming branching ratios of the W, Z, and H decay modes as predicted by a singlet model, and the mass of the vector-like T 5/3 quark is (expected to be) constrained to m T 5/3 > 1.19 TeV (1.21 TeV) based only on pair production and assuming a branching ratio B(T 5/3 → Wt) = 100%. With single T 5/3 production included, the observed (expected) lower mass limit is 1.6 TeV (1.7 TeV) for a T 5/3 tW coupling of 1.0. The four-top-quark production cross-section is (expected to be) less than 69 fb (29 fb) assuming SM kinematics and less than 39 fb (21 fb) assuming kinematics from EFT model. The lower limit on the Kaluza-Klein mass in the context of models with two universal extra dimensions, is (expected to be) 1.45 TeV (1.48 TeV). Finally, limits are set on a dark-matter model based on a flavour changing neutral current producing a pair of top quarks with the same electric charge. The uu → tt cross-section is (expected to be) lower than 89 fb (59 fb) for a FCNC mediator mass of 1 TeV. Considering a full dark-matter model with a dark-sector coupling g DM = 1, the observed (expected) excluded values for the coupling to SM particles are g SM > 0.31 (0.28) for a mediator mass of m V = 3 TeV, and g SM > 0.14 (0.13) for m V = 1 TeV.