1 Introduction

Final states with two leptons of same charge, denoted as same-sign (SS) dileptons, are produced rarely by standard model (SM) processes in proton–proton (\(\mathrm{p}\mathrm{p}\)) collisions. Because the SM rates of SS dileptons are low, studies of these final states provide excellent opportunities to search for manifestations of physics beyond the standard model (BSM). Over the last decades, a large number of new physics mechanisms have been proposed to extend the SM and address its shortcomings. Many of these can give rise to potentially large contributions to the SS dilepton signature, e.g., the production of supersymmetric (SUSY) particles [1, 2], SS top quarks [3, 4], scalar gluons (sgluons) [5, 6], heavy scalar bosons of extended Higgs sectors [7, 8], Majorana neutrinos [9], and vector-like quarks [10].

In the SUSY framework [11,12,13,14,15,16,17,18,19,20], the SS final state can appear in R-parity conserving models through gluino or squark pair production when the decay of each of the pair-produced particles yields one or more \(\mathrm{W}\) bosons. For example, a pair of gluinos (which are Majorana particles) can give rise to SS charginos and up to four top quarks, yielding signatures with up to four \(\mathrm{W}\) bosons, as well as jets, \(\mathrm{b}\) quark jets, and large missing transverse momentum (\(E_{\mathrm{T}}^{\text {miss}}\)). Similar signatures can also result from the pair production of bottom squarks, subsequently decaying to charginos and top quarks.

While R-parity conserving SUSY models often lead to signatures with large \(E_{\mathrm{T}}^{\text {miss}}\), it is also interesting to study final states without significant \(E_{\mathrm{T}}^{\text {miss}}\) beyond what is produced by the neutrinos from leptonic \(\mathrm{W}\) boson decays. For example, some SM and BSM scenarios can lead to the production of SS or multiple top quark pairs, such as the associated production of a heavy (pseudo)scalar, which subsequently decays to a pair of top quarks. This scenario is realized in Type II two Higgs doublet models (2HDM) where associated production with a single top quark or a \(\mathrm{t}\overline{\mathrm{t}}\) pair can in some cases provide a promising window to probe these heavy (pseudo)scalar bosons [21,22,23].

This paper extends the search for new physics presented in Ref. [24]. We consider final states with two leptons (electrons and muons) of same charge, two or more hadronic jets, and moderate \(E_{\mathrm{T}}^{\text {miss}}\). Compared to searches with zero or one lepton, this final state provides enhanced sensitivity to low-momentum leptons and SUSY models with compressed mass spectra. The results are based on an integrated luminosity corresponding to 35.9\(\,\text {fb}^{-\text {1}}\) of \(\sqrt{s} = 13\,\text {TeV} \) proton–proton collisions collected with the CMS detector at the CERN LHC. Previous LHC searches in the SS dilepton channel have been performed by the ATLAS [25,26,27] and CMS [24, 28,29,30,31,32] Collaborations. With respect to Ref. [24], the event categorization is extended to take advantage of the increased integrated luminosity, the estimate of rare SM backgrounds is improved, and the (pseudo)scalar boson interpretation is added.

The results of the search are interpreted in a number of specific BSM models discussed in Sect. 2. In addition, model-independent results are also provided in several kinematic regions to allow for further interpretations. These results are given as a function of hadronic activity and of \(E_{\mathrm{T}}^{\text {miss}}\), as well as in a set of inclusive regions with different topologies. The full analysis results are also summarized in a smaller set of exclusive regions to be used in combination with the background correlation matrix to facilitate their reinterpretation.

2 Background and signal simulation

Monte Carlo (MC) simulations are used to estimate SM background contributions and to estimate the acceptance of the event selection for BSM models. The MadGraph 5_amc@nlo 2.2.2 [33,34,35] and powheg  v2 [36, 37] next-to-leading order (NLO) generators are used to simulate almost all SM background processes based on the NNPDF3.0 NLO [38] parton distribution functions (PDFs). New physics signal samples, as well as the same-sign \(\mathrm{W}^{\pm } \mathrm{W}^{\pm }\) process, are generated with MadGraph 5_amc@nlo at leading order (LO) precision, with up to two additional partons in the matrix element calculations, using the NNPDF3.0 LO [38] PDFs. Parton showering and hadronization, as well as the double-parton scattering production of \(\mathrm{W}^{\pm } \mathrm{W}^{\pm }\), are described using the pythia  8.205 generator [39] with the CUETP8M1 tune [40, 41]. The Geant4 package [42] is used to model the CMS detector response for background samples, while the CMS fast simulation package [43] is used for signal samples.

To improve on the MadGraph modeling of the multiplicity of additional jets from initial-state radiation (ISR), MadGraph \(\mathrm{t}\overline{\mathrm{t}}\) MC events are reweighted based on the number of ISR jets (\(N_J^\mathrm{ISR}\)), so as to make the light-flavor jet multiplicity in dilepton \(\mathrm{t}\overline{\mathrm{t}}\) events agree with the one observed in data. The same reweighting procedure is applied to SUSY MC events. The reweighting factors vary between 0.92 and 0.51 for \(N_J^\mathrm{ISR}\) between 1 and 6. We take one half of the deviation from unity as the systematic uncertainty in these reweighting factors.

The new physics signal models probed by this search are shown in Figs. 1 and 2. In each of the simplified SUSY models [44, 45] of Fig. 1, only two or three new particles have masses sufficiently low to be produced on-shell, and the branching fraction for the decays shown are assumed to be 100%. Gluino pair production models giving rise to signatures with up to four \(\mathrm{b}\) quarks and up to four \(\mathrm{W}\) bosons are shown in Fig. 1a–e. In these models, the gluino decays to the lightest squark (\(\widetilde{\mathrm{g}}\rightarrow \widetilde{\mathrm{q}} \mathrm{q} \)), which in turn decays to same-flavor (\(\widetilde{\mathrm{q}} \rightarrow \mathrm{q} \widetilde{\chi }^{0}_{1} \)) or different-flavor (\(\widetilde{\mathrm{q}} \rightarrow \mathrm{q} ' \widetilde{\chi }^\pm _{1} \)) quarks. The chargino decays to a \(\mathrm{W}\) boson and a neutralino (\(\widetilde{\chi }^\pm _{1} \rightarrow \mathrm{W}^{\pm } \widetilde{\chi }^{0}_{1} \)), where the \(\widetilde{\chi }^{0}_{1}\) escapes detection and is taken to be the lightest SUSY particle (LSP). The first two scenarios considered in Fig. 1a, b include an off-shell third-generation squark (\(\widetilde{\mathrm{t}} \) or \(\widetilde{\mathrm{b}} \)) leading to the three-body decay of the gluino, \(\widetilde{\mathrm{g}}\rightarrow \mathrm{t}\overline{\mathrm{t}} \widetilde{\chi }^{0}_{1} \) (\(\mathrm{T}1{\mathrm{t} \mathrm{t} \mathrm{t} \mathrm{t}}\)) and \(\widetilde{\mathrm{g}}\rightarrow \overline{\mathrm{t}}\mathrm{b} \widetilde{\chi }^{+}_{1} \) (\(\mathrm{T}5{\mathrm{t} \mathrm{t} \mathrm{b} \mathrm{b}}\mathrm{W}\mathrm{W}\)), resulting in events with four \(\mathrm{W}\) bosons and four \(\mathrm{b}\) quarks. In the \(\mathrm{T}5{\mathrm{t} \mathrm{t} \mathrm{b} \mathrm{b}}\mathrm{W}\mathrm{W}\) model, the mass splitting between chargino and neutralino is set to \(m_{\widetilde{\chi }^\pm _{1}} - m_{\widetilde{\chi }^{0}_{1}} = 5\,\text {GeV} \), so that two of the \(\mathrm{W}\) bosons are produced off-shell and can give rise to low transverse momentum (\(p_{\mathrm{T}}\)) leptons. The next two models shown (Fig. 1c, d) include an on-shell top squark with different mass splitting between the \(\widetilde{\mathrm{t}} \) and the \(\widetilde{\chi }^{0}_{1}\), and consequently different decay modes: in the \(\mathrm{T}5{\mathrm{t} \mathrm{t} \mathrm{t} \mathrm{t}}\) model the mass splitting is equal to the top quark mass (\(m_{\widetilde{\mathrm{t}}} - m_{\widetilde{\chi }^{0}_{1}} = m_{\mathrm{t}}\)), favoring the \(\widetilde{\mathrm{t}} \rightarrow \mathrm{t} \widetilde{\chi }^{0}_{1} \) decay, while in the \(\mathrm{T}5{\mathrm{t} \mathrm{t} \mathrm{c} \mathrm{c}}\) model the mass splitting is only \(20\,\text {GeV} \), favoring the flavor changing neutral current \(\widetilde{\mathrm{t}} \rightarrow \mathrm{c} \widetilde{\chi }^{0}_{1} \) decay. In Fig. 1e, the decay proceeds through a virtual light-flavor squark, leading to a three-body decay to \(\widetilde{\mathrm{g}}\rightarrow \mathrm{q} \mathrm{q} ' \widetilde{\chi }^\pm _{1} \), resulting in a signature with two \(\mathrm{W}\) bosons and four light-flavor jets. The two \(\mathrm{W}\) bosons can have the same charge, giving rise to SS dileptons. This model, \(\mathrm{T}5{\mathrm{q} \mathrm{q} \mathrm{q} \mathrm{q}}\mathrm{W}\mathrm{W}\), is studied as a function of the gluino and \(\widetilde{\chi }^{0}_{1}\) mass, with two different assumptions for the chargino mass: \(m_{\widetilde{\chi }^\pm _{1}} = 0.5(m_{\widetilde{\mathrm{g}}} + m_{\widetilde{\chi }^{0}_{1}})\), producing mostly on-shell \(\mathrm{W}\) bosons, and \(m_{\widetilde{\chi }^\pm _{1}} = m_{\widetilde{\chi }^{0}_{1}}+20\,\text {GeV} \), producing off-shell \(\mathrm{W}\) bosons. Finally, Fig. 1f shows a model of bottom squark production followed by the \(\widetilde{\mathrm{b}} \rightarrow \mathrm{t} \widetilde{\chi }^\pm _{1} \) decay, resulting in two \(\mathrm{b}\) quarks and four \(\mathrm{W}\) bosons. This model, \(\mathrm{T}6{\mathrm{t} \mathrm{t}}\mathrm{W}\mathrm{W}\), is studied as a function of the \(\widetilde{\mathrm{b}}\) and \(\widetilde{\chi }^\pm _{1}\) masses, keeping the \(\widetilde{\chi }^{0}_{1}\) mass at 50\(\,\text {GeV}\), resulting in two of the \(\mathrm{W}\) bosons being produced off-shell when the \(\widetilde{\chi }^\pm _{1}\) and \(\widetilde{\chi }^{0}_{1}\) masses are close. The production cross sections for SUSY models are calculated at NLO plus next-to-leading logarithmic (NLL) accuracy [46,47,48,49,50,51].

The processes shown in Fig. 2, \(\mathrm{t}\overline{\mathrm{t}} \mathrm{H} \), \(\mathrm{t} \mathrm{H} \mathrm{q} \), and \(\mathrm{t} \mathrm{W}\mathrm{H} \), represent the top quark associated production of a scalar (\(\mathrm{H} \)) or a pseudoscalar (\(\mathrm{A}\)). The subsequent decay of the (pseudo)scalar to a pair of top quarks then gives rise to final states including a total of three or four top quarks. For the purpose of interpretation, we use LO cross sections for the production of a heavy Higgs boson in the context of the Type II 2HDM of Ref. [23]. The mass of the new particle is varied in the range [350, 550]\(\,\text {GeV}\), where the lower mass boundary is chosen in such a way as to allow the decay of the (pseudo)scalar into on-shell top quarks.

Fig. 1
figure 1

Diagrams illustrating the simplified SUSY models considered in this analysis

Fig. 2
figure 2

Diagrams for scalar (pseudoscalar) boson production in association with top quarks

3 The CMS detector and event reconstruction

The central feature of the CMS detector is a superconducting solenoid of 6\(\,\text {m}\) internal diameter, providing a magnetic field of 3.8\(\,\text {T}\). Within the solenoid volume are a silicon pixel and strip tracker, a lead tungstate crystal electromagnetic calorimeter (ECAL), and a brass and scintillator hadron calorimeter (HCAL), each composed of a barrel and two endcap sections. Forward calorimeters extend the pseudorapidity (\(\eta \)) coverage provided by the barrel and endcap detectors. Muons are measured in gas-ionization detectors embedded in the steel flux-return yoke outside the solenoid. A more detailed description of the CMS detector, together with a definition of the coordinate system used and the relevant kinematic variables, can be found in Ref. [52].

Events of interest are selected using a two-tiered trigger system [53]. The first level (L1), composed of custom hardware processors, uses information from the calorimeters and muon detectors to select events at a rate of around 100\(\,\text {kHz}\) within a time interval of less than 4 \(\,\upmu \text {s}\). The second level, known as the high-level trigger (HLT), consists of a farm of processors running a version of the full event reconstruction software optimized for fast processing, and reduces the event rate to less than 1\(\,\text {kHz}\) before data storage.

Events are processed using the particle-flow (PF) algorithm [54, 55], which reconstructs and identifies each individual particle with an optimized combination of information from the various elements of the CMS detector. The energy of photons is directly obtained from the ECAL measurement. The energy of electrons is determined from a combination of the electron momentum at the primary interaction vertex as determined by the tracker, the energy of the corresponding ECAL cluster, and the energy sum of all bremsstrahlung photons spatially compatible with the electron track [56]. The energy of muons is obtained from the curvature of the corresponding track, combining information from the silicon tracker and the muon system [57]. The energy of charged hadrons is determined from a combination of their momentum measured in the tracker and the matching ECAL and HCAL energy deposits, corrected for the response function of the calorimeters to hadronic showers. Finally, the energy of neutral hadrons is obtained from the corresponding corrected ECAL and HCAL energy.

Hadronic jets are clustered from neutral PF candidates and charged PF candidates associated with the primary vertex, using the anti-\(k_{\mathrm{T}}\) algorithm [58, 59] with a distance parameter \(R = \sqrt{\smash [b]{ (\Delta \eta )^2 + (\Delta \phi )^2} }\) of 0.4. Jet momentum is determined as the vectorial sum of all PF candidate momenta in the jet. An offset correction is applied to jet energies to take into account the contribution from additional proton–proton interactions (pileup) within the same or nearby bunch crossings. Jet energy corrections are derived from simulation, and are improved with in situ measurements of the energy balance in dijet and photon+jet events [60, 61]. Additional selection criteria are applied to each event to remove spurious jet-like features originating from isolated noise patterns in certain HCAL regions. Jets originating from \(\mathrm{b}\) quarks are identified (b tagged) using the medium working point of the combined secondary vertex algorithm CSVv2 [62]. The missing transverse momentum vector \({\vec p}_{\mathrm{T}}^{\text {miss}}\) is defined as the projection on the plane perpendicular to the beams of the negative vector sum of the momenta of all reconstructed PF candidates in an event [63]. Its magnitude is referred to as \(E_{\mathrm{T}}^{\text {miss}}\). The sum of the transverse momenta of all jets in an event is referred to as \(H_{\mathrm{T}}\).

4 Event selection and search strategy

The event selection and the definition of the signal regions (SRs) follow closely the analysis strategy established in Ref. [24]. With respect to the previous search, the general strategy has remained unchanged. We target, in a generic way, new physics signatures that result in SS dileptons, hadronic activity, and \(E_{\mathrm{T}}^{\text {miss}}\), by subdividing the event sample into several SRs sensitive to a variety of new physics models. The number of SRs was increased to take advantage of the larger integrated luminosity. Table 1 summarizes the basic kinematic requirements for jets and leptons (further details, including the lepton identification and isolation requirements, can be found in Ref. [24]).

Table 1 Kinematic requirements for leptons and jets. Note that the \(p_{\mathrm{T}}\) thresholds to count jets and b-tagged jets are different

Events are selected using triggers based on two sets of HLT algorithms, one simply requiring two leptons, and one additionally requiring \(H_{\mathrm{T}} > 300\,\text {GeV} \). The \(H_{\mathrm{T}}\) requirement allows for the lepton isolation requirement to be removed and for the lepton \(p_{\mathrm{T}}\) thresholds to be set to 8\(\,\text {GeV}\) for both leptons, while in the pure dilepton trigger the leading and subleading leptons are required to have \(p_{\mathrm{T}} > 23~(17)\,\text {GeV} \) and \(p_{\mathrm{T}} > 12~(8)\,\text {GeV} \), respectively, for electrons (muons). Based on these trigger requirements, leptons are classified as high (\(p_{\mathrm{T}} > 25\,\text {GeV} \)) and low (\(10< p_{\mathrm{T}} < 25\,\text {GeV} \)) momentum, and three analysis regions are defined: high-high (HH), high-low (HL), and low-low (LL).

The baseline selection used in this analysis requires at least one SS lepton pair with an invariant mass above 8\(\,\text {GeV}\), at least two jets, and \(E_{\mathrm{T}}^{\text {miss}} > 50\,\text {GeV} \). To reduce Drell–Yan backgrounds, events are rejected if an additional loose lepton forms an opposite-sign same-flavor pair with one of the two SS leptons, with an invariant mass less than 12\(\,\text {GeV}\) or between 76 and 106\(\,\text {GeV}\). Events passing the baseline selection are then divided into SRs to separate the different background processes and to maximize the sensitivity to signatures with different jet multiplicity (\(N_\text {jets}\)), flavor (\(N_{\mathrm{b}}\)), visible and invisible energy (\(H_{\mathrm{T}}\) and \(E_{\mathrm{T}}^{\text {miss}}\)), and lepton momentum spectra (the HH/HL/LL categories mentioned previously). The \(m_\mathrm{T}^{\text {min}}\) variable is defined as the smallest of the transverse masses constructed between \({\vec p}_{\mathrm{T}}^{\text {miss}}\) and each of the leptons. This variable features a cutoff near the \(\mathrm{W}\) boson mass for processes with only one prompt lepton, so it is used to create SRs where the nonprompt lepton background is negligible. To further improve sensitivity, several regions are split according to the charge of the leptons (\(++\) or −−), taking advantage of the charge asymmetry of SM backgrounds, such as \(\mathrm{t}\overline{\mathrm{t}} \mathrm{W}\) or \(\mathrm{W}\) \(\mathrm{Z}\), with a single \(\mathrm{W}\) boson produced in \(\mathrm{p}\mathrm{p}\) collisions. Only signal regions dominated by such backgrounds and with a sufficient predicted yield are split by charge. In the HH and HL categories, events in the tail regions \(H_{\mathrm{T}} > 1125\,\text {GeV} \) or \(E_{\mathrm{T}}^{\text {miss}} > 300\,\text {GeV} \) are inclusive in \(N_\text {jets}\), \(N_{\mathrm{b}}\), and \(m_\mathrm{T}^{\text {min}}\) in order to ensure a reasonable yield of events in these SRs. The exclusive SRs resulting from this classification are defined in Tables 2, 3 and 4.

The lepton reconstruction and identification efficiency is in the range of 45–70% (70–90%) for electrons (muons) with \(p_{\mathrm{T}} >25\,\text {GeV} \), increasing as a function of \(p_{\mathrm{T}}\) and converging to the maximum value for \(p_{\mathrm{T}} >60\,\text {GeV} \). In the low-momentum regime, \(15<p_{\mathrm{T}} <25\,\text {GeV} \) for electrons and \(10<p_{\mathrm{T}} <25\,\text {GeV} \) for muons, the efficiencies are 40% for electrons and 55% for muons. The lepton trigger efficiency for electrons is in the range of 90–98%, converging to the maximum value for \(p_{\mathrm{T}} >30\,\text {GeV} \), and around 92% for muons. The chosen b tagging working point results in approximately a 70% efficiency for tagging a \(\mathrm{b}\) quark jet and a <1% mistagging rate for light-flavor jets in \(\mathrm{t}\overline{\mathrm{t}}\) events [62]. The efficiencies of the \(H_{\mathrm{T}}\) and \(E_{\mathrm{T}}^{\text {miss}}\) requirements are mostly determined by the jet energy and \(E_{\mathrm{T}}^{\text {miss}}\) resolutions, which are discussed in Refs. [60, 61, 64].

Table 2 Signal region definitions for the HH selection. Regions split by charge are indicated with (++) and (\(-\) \(-\)). All unlabeled region are included in the SR above them, for example the unlabeled regions between SR3 and SR11 are included in SR3, with the exception of the regions to the right of SR42-45, which are included in those regions
Table 3 Signal region definitions for the HL selection. Regions split by charge are indicated with (\(++\)) and (\(-\) \(-\)). All unlabeled region are included in the SR above them, for example the unlabeled regions between SR3 and SR8 are included in SR3, with the exception of the regions to the right of SR34-37, which are included in those regions
Table 4 Signal region definitions for the LL selection. All SRs in this category require \(N_\text {jets} \ge 2\)

5 Backgrounds

Standard model background contributions arise from three sources: processes with prompt SS dileptons, mostly relevant in regions with high \(E_{\mathrm{T}}^{\text {miss}}\) or \(H_{\mathrm{T}}\); events with a nonprompt lepton, dominating the overall final state; and opposite-sign dilepton events with a charge-misidentified lepton, the smallest contribution. In this paper we use the shorthand “nonprompt leptons” to refer to electrons or muons from the decays of heavy- or light-flavor hadrons, hadrons misidentified as leptons, or electrons from conversions of photons in jets.

Several categories of SM processes that result in the production of electroweak bosons can give rise to an SS dilepton final state. These include production of multiple bosons in the same event (prompt photons, \(\mathrm{W}\), \(\mathrm{Z}\), and Higgs bosons), as well as single-boson production in association with top quarks. Among these SM processes, the dominant ones are \(\mathrm{W}\) \(\mathrm{Z}\), \(\mathrm{t}\overline{\mathrm{t}} \mathrm{W}\), and \(\mathrm{t}\overline{\mathrm{t}} \mathrm{Z} \) production, followed by the \(\mathrm{W}^{\pm }\mathrm{W}^{\pm }\) process. The remaining SM processes are grouped into two categories, “Rare” (including \(\mathrm{Z} \mathrm{Z} \), \(\mathrm{W}\) \(\mathrm{W}\) \(\mathrm{Z}\), \(\mathrm{W}\mathrm{Z} \mathrm{Z} \), \(\mathrm{Z} \mathrm{Z} \mathrm{Z} \), \(\mathrm{t} \mathrm{W}\mathrm{Z} \), \(\mathrm{t} \mathrm{Z} \mathrm{q} \), as well as \(\mathrm{t}\overline{\mathrm{t}} \mathrm{t}\overline{\mathrm{t}} \) and double parton scattering) and “X+\({\upgamma }\)” (including \(\mathrm{W}{\upgamma }\), \(\mathrm{Z} {\upgamma }\), \(\mathrm{t}\overline{\mathrm{t}} {\upgamma }\), and \(\mathrm{t} {\upgamma }\)). The expected yields from these SM backgrounds are estimated from simulation, accounting for both the theoretical and experimental uncertainties discussed in Sect. 6.

For the \(\mathrm{W}\) \(\mathrm{Z}\) and \(\mathrm{t}\overline{\mathrm{t}} \mathrm{Z} \) backgrounds, a three-lepton (3L) control region in data is used to scale the simulation, based on a template fit to the distribution of the number of b jets. The 3L control region requires at least two jets, \(E_{\mathrm{T}}^{\text {miss}} > 30\,\text {GeV} \), and three leptons, two of which must form an opposite-sign same-flavor pair with an invariant mass within \(15\,\text {GeV} \) of the \(\mathrm{Z}\) boson mass. In the fit to data, the normalization and shapes of all the components are allowed to vary according to experimental and theoretical uncertainties. The scale factors obtained from the fit in the phase space of the 3L control region are \(1.26 \pm 0.09\) for the \(\mathrm{W}\) \(\mathrm{Z}\) process, and \(1.14 \pm 0.30\) for the \(\mathrm{t}\overline{\mathrm{t}} \mathrm{Z} \) process.

The nonprompt lepton background, which is largest for regions with low \(m_\mathrm{T}^{\text {min}}\) and low \(H_{\mathrm{T}}\), is estimated by the “tight-to-loose” method, which was employed in several previous versions of the analysis [28,29,30,31,32], and significantly improved in the latest version [24] to account for the kinematics and flavor of the parent parton of the nonprompt lepton. The tight-to-loose method uses two control regions, the measurement region and the application region. The measurement region consists of a sample of single-lepton events enriched in nonprompt leptons by requirements on \(E_{\mathrm{T}}^{\text {miss}}\) and transverse mass that suppress the \(\mathrm{W}\rightarrow \ell \nu \) contribution. This sample is used to extract the probability for a nonprompt lepton that satisfies the loose selection to also satisfy the tight selection. This probability (\(\epsilon _\mathrm{TL}\)) is calculated as a function of lepton \(p_{\mathrm{T}} ^\text {corr}\) (defined below) and \(\eta \), separately for electrons and muons, and separately for lepton triggers with and without an isolation requirement. The application region is a SS dilepton region where both of the leptons satisfy the loose selection but at least one of them fails the tight selection. This region is subsequently divided into a set of subregions with the exact same kinematic requirements as those in the SRs. Events in the subregions are weighted by a factor \(\epsilon _\mathrm{TL} / (1-\epsilon _\mathrm{TL})\) for each lepton in the event failing the tight requirement. The nonprompt background in each SR is then estimated as the sum of the event weights in the corresponding subregion. The \(p_{\mathrm{T}} ^\text {corr}\) parametrization, where \(p_{\mathrm{T}} ^\text {corr}\) is defined as the lepton \(p_{\mathrm{T}}\) plus the energy in the isolation cone exceeding the isolation threshold value, is chosen because of its correlation with the parent parton \(p_{\mathrm{T}}\), improving the stability of the \(\epsilon _\mathrm{TL}\) values with respect to the sample kinematics. To improve the stability of the \(\epsilon _\mathrm{TL}\) values with respect to the flavor of the parent parton, the loose electron selection is adopted. This selection increases the number of nonprompt electrons from the fragmentation and decay of light-flavor partons, resulting in \(\epsilon _\mathrm{TL}\) values similar to those from heavy-flavor parent partons.

The prediction from the tight-to-loose method is cross-checked using an alternative method based on the same principle, similar to that described in Ref. [65]. In this cross-check, which aims to remove kinematic differences between measurement and application regions, the measurement region is obtained from SS dilepton events where one of the leptons fails the impact parameter requirement. With respect to the nominal method, the loose lepton definition is adapted to reduce the effect of the correlation between isolation and impact parameter. The predictions of the two methods are found to be consistent within systematic uncertainties.

Charge misidentification of electrons is a small background that can arise from severe bremsstrahlung in the tracker material. Simulation-based studies with tight leptons indicate that the muon charge misidentification probability is negligible, while for electrons it ranges between \(10^{-5}\) and \(10^{-3}\). The charge misidentification background is estimated from data using an opposite-sign control region for each SS SR, scaling the control region yield by the charge misidentification probability measured in simulation. A low-\(E_{\mathrm{T}}^{\text {miss}}\) control region, with \(\mathrm{e}^+\mathrm{e}^-\) pairs in the \(\mathrm{Z}\) boson mass window, is used to cross-check the MC prediction for the misidentification probability, both inclusively and – where the number of events in data allows it – as a function of electron \(p_{\mathrm{T}}\) and \(\eta \).

6 Systematic uncertainties

Several sources of systematic uncertainty affect the predicted yields for signal and background processes, as summarized in Table 5. Experimental uncertainties are based on measurements in data of the trigger efficiency, the lepton identification efficiency, the b tagging efficiency [62], the jet energy scale, and the integrated luminosity [66], as well as on the inelastic cross section value affecting the pileup rate. Theoretical uncertainties related to unknown higher-order effects are estimated by varying simultaneously the factorization and renormalization scales by a factor of two, while uncertainties in the PDFs are obtained using replicas of the NNPDF3.0 set [38].

Experimental and theoretical uncertainties affect both the overall yield (normalization) and the relative population (shape) across SRs, and they are taken into account for all signal samples as well as for the samples used to estimate the main prompt SS dilepton backgrounds: \(\mathrm{W}\) \(\mathrm{Z}\), \(\mathrm{t}\overline{\mathrm{t}} \mathrm{W}\), \(\mathrm{t}\overline{\mathrm{t}} \mathrm{Z} \), \(\mathrm{W}^{\pm } \mathrm{W}^{\pm }\). For the \(\mathrm{W}\) \(\mathrm{Z}\) and \(\mathrm{t}\overline{\mathrm{t}} \mathrm{Z} \) backgrounds, the control region fit results are used for the normalization, so these uncertainties are only taken into account for the shape of the backgrounds. For the smallest background samples, Rare and X+\({\upgamma }\), a 50% uncertainty is assigned in place of the scale and PDF variations.

The normalization and the shapes of the nonprompt lepton and charge misidentification backgrounds are estimated from control regions in data. In addition to the statistical uncertainties from the control region yields, dedicated systematic uncertainties are associated with the methods used in this estimate. For the nonprompt lepton background, a 30% uncertainty (increased to 60% for electrons with \(p_{\mathrm{T}} > 50\,\text {GeV} \)) accounts for the performance of the method in simulation and for the differences in the two alternative methods described in Sect. 5. In addition, the uncertainty in the prompt lepton yield in the measurement region, relevant when estimating \(\epsilon _\mathrm{TL}\) for high-\(p_{\mathrm{T}}\) leptons, results in a 1–30% effect on the estimate. For the charge misidentification background, a 20% uncertainty is assigned to account for possible mismodeling of the charge misidentification rate in simulation.

Table 5 Summary of the sources of uncertainty and their effect on the yields of different processes in the SRs. The first eight uncertainties are related to experimental and theoretical factors for processes estimated using simulation, while the last four uncertainties are assigned to processes whose yield is estimated from data. The first seven uncertainties also apply to signal samples. Reported values are representative for the most relevant signal regions

7 Results and interpretation

A comparison between observed yields and the SM background prediction is shown in Fig. 3 for the kinematic variables used to define the analysis SRs: \(H_{\mathrm{T}}\), \(E_{\mathrm{T}}^{\text {miss}}\), \(m_\mathrm{T}^{\text {min}}\), \(N_\text {jets}\), and \(N_{\mathrm{b}}\). The distributions are shown after the baseline selection defined in Sect. 4. The full results of the search in each SR are shown in Fig. 4 and Table 6. The SM predictions are generally consistent with the data. The largest deviations are seen in HL SR 36 and 38, with a local significance, taking these regions individually or combining them with other regions adjacent in phase space, that does not exceed 2 standard deviations.

These results are used to probe the signal models discussed in Sect. 2: simplified SUSY models, (pseudo)scalar boson production, four top quark production, and SS top quark production. We also interpret the results as model-independent limits as a function of \(H_{\mathrm{T}}\) and \(E_{\mathrm{T}}^{\text {miss}}\). With the exception of the new (pseudo)scalar boson limits, the results can be compared to the previous version of the analysis [24], showing significant improvements due to the increase in the integrated luminosity and the optimization of SR definitions.

To obtain exclusion limits at the 95% confidence level (CL), the results from all SRs – including signal and background uncertainties and their correlations – are combined using an asymptotic formulation of the modified frequentist CL\(_\mathrm{s}\) criterion [67,68,69,70]. When testing a model, all new particles not included in the specific model are considered too heavy to take part in the interaction. To convert cross section limits into mass limits, the signal cross sections specified in Sect. 2 are used.

Fig. 3
figure 3

Distributions of the main analysis variables: \(H_{\mathrm{T}}\)  (a), \(E_{\mathrm{T}}^{\text {miss}}\)  (b), \(m_\mathrm{T}^{\text {min}}\)  (c), \(N_\text {jets}\)  (d), and \(N_{\mathrm{b}}\)  (e), after the baseline selection requiring a pair of SS leptons, two jets, and \(E_{\mathrm{T}}^{\text {miss}} > 50\,\text {GeV} \). The last bin includes the overflow events and the hatched area represents the total uncertainty in the background prediction. The upper panels show the ratio of the observed event yield to the background prediction

Fig. 4
figure 4

Event yields in the HH (a), HL (b), and LL (c) signal regions. The hatched area represents the total uncertainty in the background prediction. The upper panels show the ratio of the observed event yield to the background prediction

Table 6 Number of expected background and observed events in different SRs in this analysis

The observed SUSY cross section limits as a function of the gluino and LSP masses, as well as the observed and expected mass limits for each simplified model, are shown in Fig. 5 for gluino pair production models with each gluino decaying through a chain containing off- or on-shell third-generation squarks. These models, which result in signatures with two or more \(\mathrm{b}\) quarks and two or more \(\mathrm{W}\) bosons in the final state, are introduced in Sect. 2 as \(\mathrm{T}1{\mathrm{t} \mathrm{t} \mathrm{t} \mathrm{t}}\), \(\mathrm{T}5{\mathrm{t} \mathrm{t} \mathrm{b} \mathrm{b}}\mathrm{W}\mathrm{W}\), \(\mathrm{T}5{\mathrm{t} \mathrm{t} \mathrm{t} \mathrm{t}}\), and \(\mathrm{T}5{\mathrm{t} \mathrm{t} \mathrm{c} \mathrm{c}}\). Figure 6 shows the limits for a model of gluino production followed by a decay through off-shell first- or second-generation squarks and a chargino. Two different assumptions are made on the chargino mass, taken to be between that of the gluino and the LSP. These \(\mathrm{T}5{\mathrm{q} \mathrm{q} \mathrm{q} \mathrm{q}}\mathrm{W}\mathrm{W}\) models result in no \(\mathrm{b}\) quarks and either on-shell or off-shell \(\mathrm{W}\) bosons. Bottom squark pair production followed by a decay through a chargino, \(\mathrm{T}6{\mathrm{t} \mathrm{t}}\mathrm{W}\mathrm{W}\), resulting in two \(\mathrm{b}\) quarks and four \(\mathrm{W}\) bosons, is shown in Fig. 7. For all of the models probed, the observed limit agrees well with the expected one, extending the reach of the previous analysis by 200–300\(\,\text {GeV}\) and reaching 1.5, 1.1, and 0.83\(\,\text {TeV}\) for gluino, LSP, and bottom squark masses, respectively.

Fig. 5
figure 5

Exclusion regions at 95% CL in the \(m_{\widetilde{\chi }^{0}_{1}}\) versus \(m_{\widetilde{\mathrm{g}}}\) plane for the \(\mathrm{T}1{\mathrm{t} \mathrm{t} \mathrm{t} \mathrm{t}}\)  (a) and \(\mathrm{T}5{\mathrm{t} \mathrm{t} \mathrm{b} \mathrm{b}}\mathrm{W}\mathrm{W}\)  (b) models, with off-shell third-generation squarks, and the \(\mathrm{T}5{\mathrm{t} \mathrm{t} \mathrm{t} \mathrm{t}}\)  (c) and \(\mathrm{T}5{\mathrm{t} \mathrm{t} \mathrm{c} \mathrm{c}}\)  (d) models, with on-shell third-generation squarks. For the \(\mathrm{T}5{\mathrm{t} \mathrm{t} \mathrm{b} \mathrm{b}}\mathrm{W}\mathrm{W}\) model, \(m_{\widetilde{\chi }^\pm _{1}} = m_{\widetilde{\chi }^{0}_{1}} + 5\,\text {GeV} \), for the \(\mathrm{T}5{\mathrm{t} \mathrm{t} \mathrm{t} \mathrm{t}}\) model, \(m_{\widetilde{\mathrm{t}}} - m_{\widetilde{\chi }^{0}_{1}} = m_{\mathrm{t}}\), and for the \(\mathrm{T}5{\mathrm{t} \mathrm{t} \mathrm{c} \mathrm{c}}\) model, \(m_{\widetilde{\mathrm{t}}} - m_{\widetilde{\chi }^{0}_{1}} = 20\,\text {GeV} \) and the decay proceeds through \(\widetilde{\mathrm{t}} \rightarrow \mathrm{c} \widetilde{\chi }^{0}_{1} \). The right-hand side color scale indicates the excluded cross section values for a given point in the SUSY particle mass plane. The solid, black curves represent the observed exclusion limits assuming the NLO+NLL cross sections [46,47,48,49,50,51] (thick line), or their variations of ±1 standard deviation (thin lines). The dashed, red curves show the expected limits with the corresponding ±1 and ±2 standard deviation experimental uncertainties. Excluded regions are to the left and below the limit curves

Fig. 6
figure 6

Exclusion regions at 95% CL in the plane of \(m_{\widetilde{\chi }^{0}_{1}}\) versus \(m_{\widetilde{\mathrm{g}}}\) for the \(\mathrm{T}5{\mathrm{q} \mathrm{q} \mathrm{q} \mathrm{q}}\mathrm{W}\mathrm{W}\) model with \(m_{\widetilde{\chi }^\pm _{1}}=0.5(m_{\widetilde{\mathrm{g}}} + m_{\widetilde{\chi }^{0}_{1}})\) (a) and with \(m_{\widetilde{\chi }^\pm _{1}} = m_{\widetilde{\chi }^{0}_{1}} + 20\,\text {GeV} \) (b). The notations are as in Fig. 5

Fig. 7
figure 7

Exclusion regions at 95% CL in the plane of \(m_{\widetilde{\chi }^\pm _{1}}\) versus \(m_{\widetilde{\mathrm{b}}}\) for the \(\mathrm{T}6{\mathrm{t} \mathrm{t}}\mathrm{W}\mathrm{W}\) model with \(m_{\widetilde{\chi }^{0}_{1}}=50\,\text {GeV} \). The notations are as in Fig. 5

The observed and expected cross section limits on the production of a heavy scalar or a pseudoscalar boson in association with one or two top quarks, followed by its decay to top quarks, are shown in Fig. 8. The limits are compared with the total cross section of the processes described in Sect. 2. The observed limit, which agrees well with the expected one, excludes scalar (pseudoscalar) masses up to 360 (410)\(\,\text {GeV}\).

The SM four top quark production, \(\mathrm{p}\mathrm{p}\rightarrow \mathrm{t}\overline{\mathrm{t}} \mathrm{t}\overline{\mathrm{t}} \), is normally included among the rare SM backgrounds. When treating this process as signal, its observed (expected) cross section limit is determined to be 42 (\(27^{+13}_{-8}\))\(\,\text {fb}\) at 95% CL, to be compared to the SM expectation of \(9.2^{+2.9}_{-2.4}\) \(\,\text {fb}\) [33]. This is a significant improvement with respect to the observed (expected) limits obtained in the previous version of this analysis, 119 (\(102^{+57}_{-35}\))\(\,\text {fb}\) [24], as well as the combination of those results with results from single-lepton and opposite-sign dilepton final states, 69 (\(71^{+38}_{-24}\))\(\,\text {fb}\)  [71].

The results of the search are also used to set a limit on the production cross section for SS top quark pairs, \(\sigma (\mathrm{p}\mathrm{p}\rightarrow \mathrm{t} \mathrm{t}) + \sigma (\mathrm{p}\mathrm{p}\rightarrow \overline{\mathrm{t}}\overline{\mathrm{t}})\). The observed (expected) limit, based on the kinematics of a SM \(\mathrm{t}\overline{\mathrm{t}}\) sample and determined using the number of b jets distribution in the baseline region, is 1.2 (\(0.76^{+0.3}_{-0.2}\))\(\,\text {pb}\) at 95% CL, significantly improved with respect to the 1.7 (\(1.5^{+0.7}_{-0.4}\))\(\,\text {pb}\) observed (expected) limit of the previous analysis [24].

Fig. 8
figure 8

Limits at 95% CL on the production cross section for heavy scalar (a) and pseudoscalar (b) boson in association to one or two top quarks, followed by its decay to top quarks, as a function of the (pseudo)scalar mass. The red line corresponds to the theoretical cross section in the (pseudo)scalar model

7.1 Model-independent limits and additional results

The yields and background predictions can be used to test additional BSM physics scenarios. To facilitate such reinterpretations, we provide limits on the number of SS dilepton pairs as a function of the \(E_{\mathrm{T}}^{\text {miss}}\) and \(H_{\mathrm{T}}\) thresholds in the kinematic tails, as well as results from a smaller number of inclusive and exclusive signal regions.

The \(E_{\mathrm{T}}^{\text {miss}}\) and \(H_{\mathrm{T}}\) limits are based on combining HH tail SRs, specifically SR42–45 for high \(E_{\mathrm{T}}^{\text {miss}}\) and SR46–51 for high \(H_{\mathrm{T}}\), and employing the CL\(_\mathrm{s}\) criterion without the asymptotic formulation as a function of the minimum threshold of each kinematic variable. These limits are presented in Fig. 9 in terms of \(\sigma \! \mathcal {A} \epsilon \), the product of cross section, detector acceptance, and selection efficiency. Where no events are observed, the observed and expected limits reach 0.1\(\,\text {fb}\), to be compared with a limit of 1.3\(\,\text {fb}\) obtained in the previous analysis [24].

Fig. 9
figure 9

Limits on the product of cross section, detector acceptance, and selection efficiency, \(\sigma \! \mathcal {A} \epsilon \), for the production of an SS dilepton pair as a function of the \(E_{\mathrm{T}}^{\text {miss}}\)  (a) and of \(H_{\mathrm{T}}\)  (b) thresholds

Results are also provided in Table 7 for a small number of inclusive signal regions, designed based on different topologies and a small number of expected background events. The background expectation, the event count, and the expected BSM yield in any one of these regions can be used to constrain BSM hypotheses in a simple way.

In addition, we define a small number of exclusive signal regions based on integrating over the standard signal regions. Their definitions, as well as the expected and observed yields, are specified in Table 8, while the correlation matrix for the background predictions in these regions is given in Fig. 10. This information can be used to construct a simplified likelihood for models of new physics, as described in Ref. [72].

Table 7 Inclusive SR definitions, expected background yields, and observed yields, as well the observed 95% CL upper limits on the number of signal events contributing to each region. No uncertainty in the signal acceptance is assumed in calculating these limits. A dash (–) means that the selection is not applied
Table 8 Exclusive SR definitions, expected background yields, and observed yields. A dash (–) means that the selection is not applied
Fig. 10
figure 10

Correlations between the background predictions in the 15 exclusive regions

8 Summary

A sample of same-sign dilepton events produced in proton–proton collisions at 13\(\,\text {TeV}\), corresponding to an integrated luminosity of 35.9\(\,\text {fb}^{-\text {1}}\), has been studied to search for manifestations of physics beyond the standard model. The data are found to be consistent with the standard model expectations, and no excess event yield is observed. The results are interpreted as limits at 95% confidence level on cross sections for the production of new particles in simplified supersymmetric models. Using calculations for these cross sections as functions of particle masses, the limits are turned into lower mass limits that are as high as 1500\(\,\text {GeV}\) for gluinos and 830\(\,\text {GeV}\) for bottom squarks, depending on the details of the model. Limits are also provided on the production of heavy scalar (excluding the mass range 350–360\(\,\text {GeV}\)) and pseudoscalar (350–410\(\,\text {GeV}\)) bosons decaying to top quarks in the context of two Higgs doublet models, as well as on same-sign top quark pair production, and the standard model production of four top quarks. Finally, to facilitate further interpretations of the search, model-independent limits are provided as a function of \(H_{\mathrm{T}}\) and \(E_{\mathrm{T}}^{\text {miss}}\), together with the background prediction and data yields in a smaller set of signal regions.