1 Introduction

The Standard Model (SM) is a remarkably successful theory, yet it is clear that this theory is not a complete description of nature. The discovery in 2012 of the SM Higgs boson [1,2,3,4], by the ATLAS and CMS collaborations, confirmed the mechanism of the electroweak symmetry breaking and highlighted the hierarchy problem [5,6,7,8]. Supersymmetry (SUSY) [9,10,11,12,13,14], a theoretical extension to the SM, resolves the hierarchy problem by introducing a new fermion (boson) supersymmetric partner for each boson (fermion) in the SM. In SUSY models that conserve R-parity [15], the SUSY particles are produced in pairs. Furthermore, the lightest supersymmetric particle (LSP) is stable and weakly interacting, thus constituting a viable dark-matter candidate [16, 17].

In SUSY scenarios the partners of the SM Higgs boson (h) and the gauge bosons, known as the higgsinos, winos (partners of the \(\mathrm {SU(2)_L}\) gauge fields), and bino (partner of the \(\mathrm {U(1)}\) gauge field) are collectively referred to as electroweakinos. Charginos \(\tilde{\chi }_{i}^{\pm }\) \((i=1,2)\) and neutralinos \(\tilde{\chi }_{j}^{0}\) \((j=1,2,3,4)\) are the electroweakino mass eigenstates which are linear superpositions of higgsinos, winos, and bino. For the models considered in this paper, the lightest neutralino (\(\tilde{\chi }_{1}^{0}\)) is a bino-like LSP. The lightest chargino (\(\tilde{\chi }_{1}^{\pm }\)) and next-to-lightest neutralino (\(\tilde{\chi }_{2}^{0}\)) are wino-like and nearly mass degenerate.

Naturalness considerations [18, 19] suggest that the lightest of the electroweakinos have masses near the electroweak scale. In scenarios where the strongly produced SUSY particles are heavier than a few \(\text {TeV}\), the direct production of electroweakinos may be the dominant SUSY production mechanism at the Large Hadron Collider (LHC). The lightest chargino and next-to-lightest neutralino can decay via and respectively [20,21,22] in scenarios where the lepton superpartners are heavier than the electroweakinos. In this case the decay via the Higgs boson is dominant for many choices of SUSY parameters, as long as . Scenarios with light electroweakinos also provide a possible explanation for the discrepancy between the muon anomalous magnetic moment \(g-2\) measurement and the SM predictions [23, 24, 24].

This paper presents a search for direct production of electroweakinos in proton–proton (pp) collisions produced at the LHC at \(\sqrt{s} = 13\) \(\text {TeV}\). This analysis is designed to be sensitive to direct production of a chargino and a neutralino that promptly decay as and . The search targets a W boson which decays into an electron or muon (and corresponding neutrino) and a Higgs boson which decays into a pair of b-quarks, as shown in Fig. 1. The signature consists of exactly one light lepton (e or \(\mu \)), two jets originating from the fragmentation of b-quarks, and missing transverse momentum (\({\varvec{p}}_{\mathrm {T}}^{\mathrm {miss}}\)) from neutralinos and neutrinos. A set of simplified SUSY models is used to optimise the search and interpret the results. The branching ratios of and are assumed to be 100%. The branching ratio of \(h \rightarrow b{\bar{b}}\) is taken to be 58.3% as expected for the SM Higgs boson.

Previous searches for charginos and neutralinos at the LHC targeting decays via the Higgs boson have been reported by the ATLAS [25] and CMS [26] collaborations. Because of increased integrated luminosity and an improved two-dimensional fit procedure, the search presented here significantly extends the SUSY parameter space sensitivity beyond that of the previously published 13 \(\text {TeV}\)ATLAS search [25] for the same final state.

Fig. 1
figure 1

A diagram illustrating the signal scenario considered for the production of a chargino and a next-to-lightest neutralino

2 ATLAS detector

The ATLAS detector [27] is a multipurpose particle detector with a nearly 4\(\pi \) coverage in solid angle.Footnote 1 It consists of an inner tracking detector surrounded by a thin superconducting solenoid providing a 2 T axial magnetic field, electromagnetic and hadron calorimeters, and a muon spectrometer. The inner tracking detector covers the pseudorapidity range \(|\eta | < 2.5\). It consists of silicon pixel, silicon microstrip, and transition radiation tracking detectors. A new inner pixel layer, the insertable B-layer [28, 29], was added at a mean radius of 3.3 cm before the start of 2015 data taking period, improving the identification of b-jets. Lead/liquid-argon (LAr) sampling calorimeters provide electromagnetic (EM) energy measurements with high granularity. A steel/scintillator-tile hadron calorimeter covers the central pseudorapidity range (\(|\eta | < 1.7\)). The endcap and forward regions are instrumented with LAr calorimeters for EM and hadronic energy measurements up to \(|\eta | = 4.9\). The muon spectrometer surrounds the calorimeters and is based on three large air-core toroidal superconducting magnets with eight coils each. The field integral of the toroids ranges between 2.0 and 6.0 Tm across most of the detector. The muon spectrometer includes a system of precision tracking chambers and fast detectors for triggering. A two-level trigger system [30] is used to select events. The first-level trigger is implemented in hardware and uses a subset of the detector information to keep the accepted rate below 100 kHz. This is followed by a software-based trigger that reduces the accepted event rate to 1 kHz on average depending on the data-taking conditions.

3 Dataset and simulated events

The results were obtained using 139 \(\mathrm {fb}^{-1}\) of pp LHC collision data collected between 2015 and 2018 by the ATLAS detector, with a centre-of-mass energy of 13 \(\text {TeV}\) and a 25 ns proton bunch crossing interval. In 2015–2016 the average number of interactions per bunch crossing (pile-up) was \(\langle \mu \rangle =20\), increasing to \(\langle \mu \rangle =38\) in 2017 and to \(\langle \mu \rangle =37\) in 2018. The uncertainty in the combined 2015–2018 integrated luminosity is 1.7% [31], obtained using the LUCID-2 detector [32] for the primary luminosity measurements.

Monte Carlo (MC) simulated datasets are used to model the SM backgrounds and evaluate signal selection efficiency and yields. All simulated samples were produced using the ATLAS simulation infrastructure [33] and Geant 4 [34], or a faster simulation based on a parameterisation of the calorimeter response and Geant 4 for the other detector systems. All simulated events were generated with a varying number of inelastic pp interactions overlaid on the hard-scattering event to model the multiple proton–proton interactions in the same and nearby bunch crossings. The pile-up events are generated with Pythia  8.186 [35] using the NNPDF2.3LO set of PDFs [36] and the A3 tune [37]. The simulated events were reconstructed with the same algorithms as those used for data.

The backgrounds considered in this analysis are: \(t{\bar{t}}\) pair production; single-top production (s-channel, t-channel, and associated Wt production); W / Z+jets production; \(t{\bar{t}}\) production with an electroweak boson (ttV); Higgs boson production (tth, Vh); and diboson (WW, WZ, ZZ) and triboson (VVV where \(V=W,Z\)) production. Background samples were simulated using different MC event generators depending on the process. All background processes were normalised to the best available theoretical calculation of their respective cross-sections. The Sherpa samples used for W+jets modelling include up to two partons at NLO and four partons at LO using Comix [38] and OpenLoops [39, 40] and merged with the Sherpa parton shower [41] according to the ME+PS@NLO prescription [42,43,44,45] using the set of tuned parameters developed by the Sherpa authors. The event generators, the parton shower and hadronisation routines, and the underlying-event parameter tunes and parton distribution function (PDF) sets used in simulating the SM background processes, along with the accuracy of the theoretical cross-sections, are all summarised in Table 1.

For all samples showered with Pythia, the EvtGen v1.2.0 [46] program was used to simulate the properties of the bottom- and charm-hadron decays. Several samples produced without detector simulation were employed to estimate systematic uncertainties associated with the specific configuration of the MC generators used for the nominal SM background samples. They include variations of the renormalisation and factorisation scales, the CKKW-L [47] matching scale, as well as different PDF sets and fragmentation/hadronisation models. Details of the MC modelling uncertainties are discussed in Sect. 7.

The SUSY signal samples were generated using MadGraph5_aMC@NLO  v2.6.2 [48] and Pythia  8.230 with the A14 [49] set of tuned parameters for the modelling of the parton showering (PS), hadronisation and underlying event. The matrix element (ME) calculation is performed at tree level and include the emission of up to two additional partons. The ME–PS matching is done using the CKKW-L prescription, with a matching scale set to one quarter of the chargino and next-to-lightest neutralino mass. The NNPDF2.3LO [36] PDF set was used.

Signal cross-sections are calculated at next-to-leading-order (NLO) accuracy in the strong coupling constant, adding the resummation of soft gluon emission at next-to-leading-logarithm accuracy (NLO+NLL) [50,51,52,53]. The nominal cross-section and its uncertainty are taken as the midpoint and half-width of an envelope of cross-section predictions using different PDF sets and factorisation and renormalisation scales, as described in Ref. [54]. The simplified model has two parameters, the first being the mass of the \(\tilde{\chi }^\pm _1\) and \(\tilde{\chi }^0_2\) (which are assumed to be equal), and the second being the mass of the \(\tilde{\chi }^0_1\). The signal cross-sections decrease as the \(\tilde{\chi }^\pm _1/\tilde{\chi }^0_2\) mass increases, ranging from 769 fb for a 250 \(\text {GeV}\) \(\tilde{\chi }^\pm _1/\tilde{\chi }^0_2\) mass to 1.3 fb for a 1000 \(\text {GeV}\) \(\tilde{\chi }^\pm _1/\tilde{\chi }^0_2\) mass.

Table 1 Overview of MC generators used for different simulated event samples

4 Event reconstruction

Events are required to have at least one reconstructed interaction vertex with a minimum of two associated tracks each having \(p_{\text {T}} >500\) \(\text {MeV}\). In events with multiple vertices, the one with the highest sum of squared transverse momenta of associated tracks is chosen as the primary vertex (PV) [70]. A set of baseline quality criteria are applied to reject events with non-collision backgrounds or detector noise [71].

Two identification levels are defined for leptons and jets: ‘baseline’ and ‘signal’. Baseline leptons and jets are selected with looser identification criteria, and are used in computing the missing transverse momentum as well as in resolving possible reconstruction ambiguities. Signal leptons and jets are a subset of the baseline objects with tighter quality requirements which are used to define the search regions. Isolation criteria, defined with a list of tracking-based and calorimeter-based variables, are used to select signal leptons by discriminating against semileptonic heavy-flavour decays and jets misidentified as leptons.

Electron candidates are reconstructed from energy deposits in the electromagnetic calorimeter that are matched to charged-particle tracks in the inner detector (ID) [72]. Baseline electrons are required to satisfy \(p_{\text {T}} >7\) \(\text {GeV}\) and \(|\eta |<2.47\). They are identified using the ‘loose’ operating point provided by a likelihood-based algorithm, described in Ref.  [72]. The number of hits in the innermost pixel layer is used to discriminate between electrons and converted photons. The longitudinal impact parameter \(z_0\) relative to the PV is required to satisfy \(|z_0\sin \theta | <0.5\) mm. The ‘tight’ likelihood operating point is applied for signal electron identification and the significance of the transverse impact parameter \(d_0\) must satisfy \(|d_0/\sigma (d_0)|<5\). Signal electron candidates with \(p_{\text {T}} < 200\) \(\text {GeV}\) are further refined using the FCLoose isolation working point, while those with larger \(p_{\text {T}} \) are required to pass the FCHighPtCaloOnly isolation working point, as described in Ref.  [72].

Muon candidates are reconstructed from matching tracks in the ID and muon spectrometer, refined through a global fit which uses the hits from both subdetectors [73]. Baseline muons must have \(p_{\text {T}} >6\) \(\text {GeV}\) and \(|\eta |<2.7\), and satisfy the ‘medium’ identification criteria. Similarly to electrons, the longitudinal impact parameter \(z_0\) relative to the PV is required to satisfy \(|z_0\sin \theta | <0.5\) mm. Signal muon candidates are further defined with tighter pseudorapidity and impact parameter requirements, \(|\eta |<2.5\) and \(|d_0/\sigma (d_0)|<3\). The FCLoose isolation working point is also required for signal muons [73].

Jets are reconstructed from three-dimensional topological energy clusters in the calorimeters using the anti-\(k_t\) algorithm [74] with a radius parameter \(R=0.4\) [75]. Baseline jets are selected in the region \(|\eta |<4.5\) and have \(p_{\text {T}} >20\) \(\text {GeV}\). To suppress jets from pile-up interactions, the jets with \(|\eta |<2.8\) and \(p_{\text {T}} <120\) \(\text {GeV}\) are required to satisfy the ‘medium’ working point of the jet vertex tagger (JVT), a tagging algorithm that identifies jets originating from the PV using track information [76, 77]. The selection of signal jets is further refined by requiring them to be in the region \(|\eta |<2.8\) and have \(p_{\text {T}} >30\) \(\text {GeV}\).

Jets containing b-hadrons are identified as ‘b-tagged’ using the MV2c10 algorithm, a multivariate discriminant based on the track impact parameters and displaced secondary vertices [78]. These b-tagged jets are reconstructed in the region \(|\eta |<2.5\) and have \(p_{\text {T}} >30\) \(\text {GeV}\). The b-tagging working point provides an efficiency of 77% for jets containing b-hadrons in simulated \(t\bar{t}\) events, with rejection rates of 110 and 4.9 for light-flavour jets and jets containing c-hadrons, respectively [79].

To resolve the reconstruction ambiguities between electrons, muons and jets, an overlap removal procedure is applied to baseline objects. First, any electron sharing the same ID track with a muon is rejected. If it shares the same ID track with another electron, the one with lower \(p_{\text {T}}\) is discarded. Next, jets are rejected if they lie within \(\Delta R = 0.2\) of a muon or if the muon is matched to the jet through ghost association [80]. Subsequently, electrons within a cone of size \(\Delta R = \min (0.4,0.04+10\) \(\text {GeV}\) \(/p_{\text {T}})\) around a jet are removed. Last, muons within a cone, defined in the same way as for electrons, around any remaining jet are removed.

The missing transverse momentum \({\varvec{p}}_{\mathrm {T}}^{\mathrm {miss}}\), with magnitude \(E_{\text {T}}^{\text {miss}}\)   is calculated as the negative vectorial sum of the transverse momentum of all baseline reconstructed objects (electrons, muons, jets and photons [81]) and the soft term. The soft term includes all tracks associated with the PV but not matched to any reconstructed physics object. Tracks not associated with the PV are not considered in the \(E_{\text {T}}^{\text {miss}}\) calculation, improving the \(E_{\text {T}}^{\text {miss}}\) resolution by suppressing the effect of pile-up [82, 83].

Corrections are applied to simulated events in order to account for the trigger, particle identification, and reconstruction efficiency differences between data and simulation.

5 Event selection

Events are recorded with the lowest-threshold \(E_{\text {T}}^{\text {miss}}\) trigger available, which is fully efficient for selecting events when the offline requirement of \(E_{\text {T}}^{\text {miss}} > 240\) \(\text {GeV}\) is applied. To target the signal events, which have a leptonically decaying W boson and a Higgs boson decaying into a \(b{\bar{b}}\) pair, events are required to have exactly one signal electron or muon (but not both) and either two or three signal jets, two of which must be b-tagged. The signal regions (SR) are defined using variables which suppress background contributions and increase the sensitivity for signal. These variables are based on the kinematic properties of the b-jets, the lepton and the missing transverse momentum, and are defined as follows:

  • The invariant mass of the two b-jets, \(m_{{b\bar{b}}}\), is required to be in the range \(100< m_{{b\bar{b}}} < 140\) \(\text {GeV}\), in order to preferentially select b-jets from the Higgs boson decays.

  • The invariant mass of the lepton and the leading b-jet is denoted by \(m(\ell ,b_1)\). For \(t\bar{t}\) or single-top (particularly the Wt-channel) backgrounds, if the lepton and the leading b-jet originate from the same top-quark, the \(m(\ell ,b_1)\) distribution has an endpoint at \(\sqrt{m^{2}(t)-m^{2}(W)}\). For signal events, the lepton and b-jet are produced from the \(\tilde{\chi }_{1}^{\pm }\) and \(\tilde{\chi }_{2}^{0}\) decay chains, respectively. The distribution of the invariant mass depends on the mass of the SUSY particles. For signal events with high-mass \(\tilde{\chi }_{1}^{\pm }\)/\(\tilde{\chi }_{2}^{0}\), this observable provides good discrimination against background events.

  • The transverse mass, \(m_\mathrm {T}\), is defined from the lepton transverse momentum \({{\varvec{p}}}_\mathrm {T}^{\ell }\) and \({\varvec{p}}_{\mathrm {T}}^{\mathrm {miss}}\) as

    $$\begin{aligned} m_{\mathrm {T}} = \sqrt{2 p_{\mathrm {T}}^{\ell } E_{\text {T}}^{\text {miss}} (1-\cos [\Delta \phi ({{\varvec{p}}}_{\mathrm {T}}^{\ell },{\varvec{p}}_{\mathrm {T}}^{\mathrm {miss}})])}, \end{aligned}$$

    where \(\Delta \phi ({{\varvec{p}}}_{\mathrm {T}}^{\ell },{\varvec{p}}_{\mathrm {T}}^{\mathrm {miss}})\) is the azimuthal angle between \({{\varvec{p}}}_{\mathrm {T}}^{\ell }\) and \({\varvec{p}}_{\mathrm {T}}^{\mathrm {miss}}\). For W+jets and semileptonic \(t\bar{t}\) events, in which one on-shell W boson decays leptonically, the observable has an upper endpoint at the W boson mass. The \(m_\mathrm {T}\) distribution for signal events extends significantly above m(W).

  • The contransverse mass [84, 85] of two b-jets, \(m_\mathrm { CT} \), is defined as:

    $$\begin{aligned} m_\mathrm { CT} = \sqrt{2 p_{\text {T}} ^{b_1} p_{\text {T}} ^{b_2} \left( 1+\cos \Delta \phi _{bb} \right) }, \end{aligned}$$

    where \(p_{\text {T}} ^{b_1}\) and \(p_{\text {T}} ^{b_2}\) are the transverse momenta of the two leading b-jets and \(\Delta \phi _{bb}\) is the azimuthal angle between them. For the \(t\bar{t}\) background, the observable has an upper endpoint at \((m^{2}(t)-m^{2}(W))/m(t)\). A requirement that \(m_\mathrm { CT} \) be larger than 180 \(\text {GeV}\) efficiently suppresses the \(t\bar{t}\) background.

An overview of the signal region definitions is provided in Table 2. Three separate classes of signal regions are defined, progressively targeting increasing mass differences between the (and its mass-degenerate wino partner) and the . These regions are labelled SR-LM, SR-MM and SR-HM to indicate low (LM), medium (MM) and high (HM) mass differences respectively. Requirements on \(m_\mathrm {T}\) make the three regions mutually exclusive. Of the three signal regions, SR-LM selects the smallest values of \(m_\mathrm {T}\). It targets signal models with a mass-splitting between the (and hence the ) and the that is similar to the Higgs boson mass. The other two signal regions select progressively larger mass differences by requiring larger values of \(m_\mathrm {T}\). The signal region with the highest requirement on \(m_\mathrm {T}\), SR-HM, also requires \(m(\ell ,b_1) > 120\) \(\text {GeV}\) in order to further suppress \(t\bar{t}\) and single-top background events. The three signal regions otherwise share a common set of selections on \(E_{\text {T}}^{\text {miss}}\), \(m_{{b\bar{b}}}\) and \(m_\mathrm { CT}\).

When setting model-dependent exclusion limits (‘excl.’), each of the three SRs is binned in three \(m_\mathrm { CT} \) regions, thus providing nine bins in total for a simultaneous two-dimensional fit in \(m_\mathrm { CT} \) and \(m_\mathrm {T}\) across the three SRs. This multi-bin approach enhances the sensitivity to a range of SUSY scenarios with different properties. For model-independent limits and null-hypothesis tests (‘disc.’ for discovery), the various \(m_\mathrm { CT} \) bins are merged for each of the three SRs. The requirement of \(m(\ell ,b_1) > 120\) \(\text {GeV}\) is only applied in SR-HM. Furthermore, the upper bound on \(m_\mathrm {T}\) is removed for SR-LM and SR-MM. The fit strategy is detailed in Section 6. The systematic uncertainties, fit and results discussed in the following sections are based on the exclusion SRs, while the model-independent results are based on the discovery SRs.

Table 2 Overview of the selection criteria for the signal regions. Each of the three ‘excl.’ SRs is binned in three \(m_\mathrm { CT}\) regions for a total of nine ‘excl.’ bins

6 Background estimation

The expected backgrounds in each signal region are determined in a profile likelihood fit, referred to as a ‘background-only fit’. In this fit, the normalisation of the backgrounds is adjusted to match the data in control regions with negligible signal contamination. The resulting normalisation factors are then used to correct the expected yields of the corresponding backgrounds in the various signal regions. The control regions – as detailed in the following – are designed to be enriched in the major background processes: \(t\bar{t}\), single-top and W+jets processes. The control region for single-top has a similar composition in single-top processes as the signal regions, therefore a single scale factor is used. All CRs are designed to be non-overlapping with the signal regions and also mutually exclusive. A probability density function is defined for each of the control regions. The inputs are the observed event yield and the predicted background yield from simulation with Poisson statistical uncertainties as well as with systematic uncertainties as nuisance parameters. The nuisance parameters are constrained by Gaussian distributions with widths corresponding to the sizes of the uncertainties. The uncertainties do not only vary the scales, but also account for bin-to-bin transitions. The systematic uncertainties are detailed in Sect. 7. The product of all the probability density functions forms the likelihood. Normalisation and nuisance parameters are correlated in all regions participating in the fit. The likelihood is maximised by adjusting the normalisation and nuisance parameters. The extrapolation of the adjusted normalisation and nuisance parameters to the signal regions is checked in validation regions (VR), as defined below, which kinematically resemble the signal regions but are expected to have less signal. The VRs do not overlap with the CRs or SRs.

Subdominant background processes, such as Z+jets, diboson and multiboson, \(t\bar{t}\) +V, \(t\bar{t}\) +h and Vh, which have no dedicated control regions, are normalised to the cross-sections indicated in Table 1. In the same way as for the dominant backgrounds, their expected yields in the SRs are subject to statistical and systematic uncertainties. Backgrounds with fake leptons such as jets misreconstructed as a lepton, and events with leptons originating from a jet produced by heavy-flavour quarks or from photon conversions are estimated using a matrix method as described in Ref.  [86], and found to be negligible in all regions.

The \(t{\bar{t}}\) background estimation relies on a set of three CRs (labelled TR-LM, TR-MM, TR-HM), each with an \(m_\mathrm {T}\) selection the same as in the SRs. In order to obtain samples enriched in \(t{\bar{t}}\) events, the requirement on \(m(\ell ,b_1)\) is removed and the selection criteria for \(m_\mathrm { CT}\) and \(m_{{b\bar{b}}}\) are inverted relative to the SRs. These three control regions are fit simultaneously to obtain a single normalisation factor. The W+jets contributions in the SRs are constrained by a single CR (labelled WR) defined similarly to the SRs but with less stringent lower bounds on \(m_\mathrm {T}\) and an off-peak region for \(m_{{b\bar{b}}}\). The fraction of events with heavy flavor hadrons in simulated W+jet events was found similar between the WR and the SRs. Events in the single-top CR (labelled STR) must satisfy the SR requirements except that this CR requires \(m_\mathrm {T} > 100\) \(\text {GeV}\) and \(m_{{b\bar{b}}} > 195\) \(\text {GeV}\). The \(t{\bar{t}}\) purity varies from  79% in TR-LM to  86% in TR-MM. The purity of the single-top (W+jets) is  52% ( 53%) in STR (WR).

Two sets of VRs are defined for each SR, including the off-peak (\(m_{{b\bar{b}}}\) <100 or >140 GeV) and the on-peak \(m_{{b\bar{b}}}\) regions, with the same \(m_\mathrm {T}\) as in the SR. The on-peak VRs validate the extrapolation from the CRs to the SRs in \(m_{{b\bar{b}}}\), and the off-peak VRs validate the extrapolation in \(m_\mathrm { CT}\). The validation regions share the same \(m_\mathrm {T}\) binning as the signal regions, denoted as LM, MM and HM. The background modelings are validated in low, medium and high \(m_\mathrm {T}\) regions separately. A summary of all CR and VR selection criteria is reported in Table 3.

Table 3 Overview of the CR and VR definitions. All regions partially share the same selection as the SR for all variables except \(m(\ell ,b_1)\), which is not used in the CR and VR definitions

7 Systematic uncertainties

Systematic uncertainties are evaluated for all simulated signal and background events. For the dominant backgrounds with dedicated control regions, the systematic uncertainties impact the extrapolation from the control regions to the corresponding signal regions. For all other backgrounds estimated from simulation, the uncertainties affect the overall cross-section normalisation and the acceptance of the analysis selection. Uncertainties arising from theoretical modelling and detector effects are estimated and discussed below. A breakdown of the dominant systematic uncertainties in background estimates in the various exclusion signal regions is summarised in Table 4. The uncertainties in the scale factor fits to the control regions are listed as ‘Normalisation of dominant backgrounds’.

Several uncertainties in the theoretical modelling of the single-top and \(t\bar{t}\) backgrounds are considered. Uncertainties due to the choice of hard-scatter generation program are estimated by comparing Powheg-Box generated events, showered using Pythia 8, with events generated by aMC@NLO and showered with Pythia 8, while those due to the choice of parton shower model are evaluated by comparing Powheg-Box generated samples showered using Pythia 8 with Powheg-Box samples showered using Herwig 7 [87]. The uncertainties from the modelling of initial- and final-state radiation are assessed by varying the renormalisation and factorisation scales up and down by a factor of two, with the radiation setting varied as well [88]. For single-top Wt production, the impact of interference between single-resonant and double-resonant top-quark production is estimated by comparing the nominal sample generated using the diagram removal method with a sample generated using the diagram subtraction method. For the different signal regions, the dominant uncertainty sources are the \(t\bar{t}\) parton shower in SR-LM (10%), and the single-top generator uncertainties in SR-MM (10%) and SR-HM (21%).

Theory uncertainties affecting the generator predictions for W / Z+jets, diboson, triboson and \(t\bar{t}\) +W / Z samples are estimated by taking the envelope of the seven-point variations of the renormalisation and factorisation scales. For W / Z+jets, the uncertainties from the PDF variations, as well as from the variations of matching and resummation scales are also considered. Additionally, an overall 6% (20%) systematic uncertainty in the inclusive cross-section is assigned for the diboson (triboson) sample [89]. Similar cross-section uncertainties are also assigned for other small background contributions.

Theory uncertainties in the expected yields for SUSY signals are estimated by varying by a factor of two the parameters corresponding to the factorisation, renormalisation, and CKKW-L matching scales, as well as the Pythia  8 shower tune parameters. The overall uncertainties range from about 10% in the region with a large splitting between the \(\tilde{\chi }_{2}^{0}/\tilde{\chi }_{1}^{\pm }\) and \(\tilde{\chi }_{1}^{0}\) masses to about 25% in the mass spectra with small mass splitting.

The dominant detector systematic effects are the uncertainties associated with the jet energy scale (JES) and jet energy resolution (JER), the \(E_{\text {T}}^{\text {miss}}\) modelling, and pile-up. The jet uncertainties are measured as a function of the \(p_{\text {T}}\) and \(\eta \) of the jet, the pile-up conditions and the jet flavour composition. They are determined using a combination of data and simulation, through measurements of the jet \(p_{\text {T}}\) balance in dijet, Z+jets and \(\gamma \)+jets events [90]. The systematic uncertainties in the \(E_{\text {T}}^{\text {miss}}\) modelling are derived by propagating the uncertainties in the energy and momentum scale of each of the objects entering the calculation, and the uncertainties in the soft term’s resolution and scale [83]. A pile-up reweighting procedure is applied to simulation to match the distribution of the number of reconstructed vertices observed in data. The corresponding uncertainty is derived by a reweighting in which \(\langle \mu \rangle \) is varied by \(\pm 4\%\). The experimental uncertainties have a less significant impact than the theoretical ones in all signal regions: the largest experimental source contributes 5–10% depending on the SR. The MC statistical uncertainties contribute 5–18% depending on the SR.

Table 4 Breakdown of the dominant systematic uncertainties in background estimates in the various exclusion signal regions. The individual uncertainties can be correlated, and do not necessarily add up in quadrature to the total background uncertainty. The percentages show the size of the uncertainty relative to the total expected background

8 Results

The observed event yield in each of the exclusion signal regions is summarised in Table 5 along with the corresponding Standard Model predictions obtained from the background-only fit. The background normalisation factors are \(1.02^{+0.07}_{-0.09}\) for \(t\bar{t}\), \(0.6^{+0.5}_{-0.25}\) for single top, and \(1.22^{+0.26}_{-0.24}\) for W+jets.

Table 5 Background fit results for the exclusion SR regions. The errors shown are the statistical plus systematic uncertainties. Uncertainties in the fitted yields are symmetric by construction, except where the negative error is truncated at an event yield of zero

In Fig. 2 the post-fit \(m_\mathrm { CT}\) distributions in the \(t\bar{t}\) control regions TR-LM, TR-MM, and TR-HM are compared with the data. For the W boson and single-top control regions the \(m_{{b\bar{b}}}\) distribution is shown. Figure 3 shows the post-fit \(m_\mathrm { CT}\) distributions after all of the validation region selection requirements are applied except the \(m_\mathrm { CT}\) cut. The data and the background expectation in all validation regions agree well within two standard deviations. Therefore no further systematic uncertainty is applied on the background estimation in the signal regions.

The compatibility of the observed and expected event yields in control, validation, exclusion, and discovery signal regions is illustrated in Fig. 4. No significant excess over the SM prediction is observed in data. Figure 5 shows the post-fit \(m_\mathrm { CT}\) distributions in SR-LM, SR-MM, and SR-HM. The uncertainty bands include all statistical and systematic uncertainties. The dashed lines represent the benchmark signal points.

Fig. 2
figure 2

The post-fit \(m_\mathrm { CT}\) distributions in TR-LM, TR-MM, and TR-HM are shown as well as the post-fit \(m_{{b\bar{b}}}\) distributions in WR and STR. The uncertainty bands plotted include all statistical and systematic uncertainties. The overflow events, where present, are included in the last bin

Fig. 3
figure 3

The post-fit \(m_\mathrm { CT}\) distributions are shown in each of the validation regions (VR-onLM, VR-onMM, VR-onHM, VR-offLM, VR-offMM, and VR-offHM) after all the selection requirements are applied other than the \(m_\mathrm { CT}\) cut. The uncertainty bands plotted include all statistical and systematic uncertainties. The overflow (underflow) events, where present, are included in the last (first) bin. The line with an arrow indicates the \(m_\mathrm { CT}\) cut used in VR selection

Fig. 4
figure 4

Comparison of the observed and expected event yields in control, validation, exclusion, and discovery signal regions. Uncertainties in the background estimates include both the statistical (in the simulated event yields) and systematic uncertainties. The bottom panel shows the significance [91] of the differences between the observed and expected yields. Not all regions shown here are statistically independent

Fig. 5
figure 5

The post-fit \(m_\mathrm { CT}\) distributions in the exclusion signal regions (SR-LM, SR-MM, and SR-HM). The uncertainty bands plotted include all statistical and systematic uncertainties. The dashed lines represent the benchmark signal samples. The overflow events, where present, are included in the last bin

Model-dependent exclusion limits at 95% confidence level (CL) are placed on the signal model. These limits are shown as a function of the masses of the supersymmetric particles in Fig. 6. They are determined interpolating between the simulated mass points, but no smoothing procedure is applied so that local fluctuations in the limit can be present. A likelihood similar to the one used in the background-only fit, but with additional terms for the SRs, is used for the calculation. The exclusion SRs thus participate in the fit and are used to constrain normalisation and nuisance parameters. A signal is allowed in this likelihood in both the CRs and SRs. The VRs are not used in the fit. The \(\mathrm {CL_s}\) method [92] is used to derive the confidence level of the exclusion for a particular signal model; signal models with a \(\mathrm {CL_s}\) value below 0.05 are excluded at 95% CL. The uncertainties in the observed limit are calculated by varying the cross-section for the signal up and down by its uncertainty. Due to a modest excess observed in some bins of the exclusion signal regions, the observed limit is weaker than the expected limit and extends up to about 740 \(\text {GeV}\) in for massless and up to  \(\text {GeV}\) for  \(\text {GeV}\). Benefiting from the increased integrated luminosity and the improved two-dimensional fit procedure, the current observed limit exceeds the previous ATLAS limit by about 200 \(\text {GeV}\) in for a massless .

Fig. 6
figure 6

Model-dependent exclusion contour at 95% CL on the production of a chargino and a next-to-lightest neutralino. The observed limit is given by the solid line with the signal cross-section uncertainties shown by the dotted lines as indicated in the text. Expected limits are given by the dashed line with uncertainties shown by the shaded band

Table 6 summarises the observed (\(S_{\mathrm {obs}}\)) and expected (\(S_{\mathrm {exp}}\)) 95% CL upper limits on the number of signal events and on the observed visible cross-section, \(\langle \epsilon {\mathrm { \sigma }}\rangle _{\mathrm { obs}}^{95}\), for each of the three cumulative discovery signal regions. These cumulative signal regions are those defined to test for the presence of any beyond-the-Standard-Model (BSM) physics processes, where in every case the upper bound on \(m_\mathrm {T}\) is also removed. Upper limits on contributions from new physics processes are estimated using the so-called ‘model-independent fit’, where a generic BSM process is assumed to contribute only to the SR and not to the CRs, thus giving a conservative background estimate in the SR. When normalised to the integrated luminosity of the data sample, the results can be interpreted as corresponding to observed upper limits \(\langle \epsilon {\mathrm { \sigma }}\rangle _{\mathrm { obs}}^{95}\), defined as the product of the production cross-section, the acceptance, and the selection efficiency of a BSM signal. The \(p_{0}\) values, which represent the probability of the SM background alone to fluctuate to the observed number of events or higher, are also provided. All numbers are calculated from pseudo-experiments.

Table 6 Left to right: 95% CL upper limits on the visible cross-section (\(\langle \epsilon \sigma \rangle _{\mathrm { obs}}^{95}\)) and on the number of signal events (\(S_{\mathrm { obs}}^{95}\) ). The third column (\(S_{\mathrm { exp}}^{95}\)) shows the expected 95% CL upper limit (and its \(\pm 1\sigma \) excursions) on the number of signal events if no BSM signal is present. The last three columns indicate the \(\text {CL}_{\text {B}}\) value, i.e. the confidence level observed for the background-only hypothesis, the discovery p-value (\(p_{0}\)) and the significance Z [91]

9 Conclusion

The results of a search for electroweakino pair production \(pp \rightarrow \tilde{\chi }^\pm _1 \tilde{\chi }^0_2\) in which the chargino (\(\tilde{\chi }^\pm _1\)) decays into a W boson and the lightest neutralino (\(\tilde{\chi }^0_1\)), while the heavier neutralino (\(\tilde{\chi }^0_2\)) decays into the Standard Model 125 \(\text {GeV}\) Higgs boson and a second \(\tilde{\chi }^0_1\) are presented. The analysis is performed using \(pp\) collisions provided by the LHC at a centre-of-mass energy of 13 \(\text {TeV}\). Data collected with the ATLAS detector between 2015 and 2018 are used, corresponding to an integrated luminosity of \(139~\mathrm {fb}^{-1}\). No significant deviation from the expected Standard Model background is observed. Limits are set on the direct production of the electroweakino in simplified models. Masses of \(\tilde{\chi }^{\pm }_{1}/\tilde{\chi }^{0}_{2}\) up to 740 \(\text {GeV}\) are excluded at 95% confidence level for a massless \(\tilde{\chi }^{0}_{1}\). The current search improves on the previous ATLAS limit by about 200 \(\text {GeV}\) in for a massless .