1 Introduction

Measurements of \(W^{+}W^{-}\) (referred to hereafter as WW) production provide important tests of the electroweak (EWK) gauge structure of the Standard Model (SM) of particle physics, and WW production is also an important background process in searches for physics beyond the SM (BSM physics). In searches for supersymmetry [1,2,3,4,5,6] (SUSY) where WW events are a significant background, a semi-data-driven approach is often taken, that involves normalising the simulated Monte Carlo (MC) samples to data in a control region (CR), designed to be kinematically similar to the search regions but enriched in SM WW production. Significant deviations of these scaling factors from unity suggest mismodelling in the phase space targeted by the search, but it can be difficult to make comparisons with the level of agreement observed in precision SM measurements because the scaling factors refer to detector-level quantities which are subject to mis-measurement and inefficiency. Producing ‘unfolded’ particle-level measurements, which are corrected for these effects and can directly be compared with the prediction of a MC event generator, in event topologies associated with search results is a novel way to address this whilst simultaneously extending the programme of precision SM measurements at the LHC. The ATLAS experiment [7] has previously reported differential measurements of \(t\bar{t}\) and Z+jets production in regions related to a search for leptoquarks in dilepton+dijet events [8]. This paper presents the first effort to measure WW production cross-sections in topologies associated with SUSY searches.

Inclusive and fiducial WW production cross-sections have been measured in proton–proton (pp) collisions at \(\sqrt{s}=7~\hbox {TeV}\) [9, 10], 8 TeV [11,12,13] and 13 TeV [14,15,16,17] at the LHC, as well as in \(e^+e^-\) collisions at LEP [18] and in \(p\bar{p}\) collisions at the Tevatron [19,20,21]. This analysis complements existing ATLAS measurements of WW production at 13 TeV in 0-jet events [15] and in \(\ge 1\)-jet events [16] by measuring differential cross-sections in a fiducial region kinematically close to the WW control region used in a previous search for electroweak production of supersymmetric charginos or sleptons [22]. That search targeted electroweak production of SUSY particles decaying into final states with two leptons (electrons or muons) and missing transverse momentum using 139 fb\(^{-1}\) of pp collisions at 13 TeV collected during Run 2 of the LHC and is referred to hereafter as the ‘EWK 2\(\ell \)+0-jets search’. In that search, WW production was the main background process and the associated theoretical uncertainties were among the dominant systematic uncertainties in the search regions. The present measurement targets event topologies with higher values of the dilepton invariant mass, \(m_{e\mu }\), and the magnitude of the missing transverse momentum, \(E_{\textrm{T}}^{{\textrm{miss}}}\), than were used in previous measurements, and can thus be used to provide additional constraints on BSM physics, and probe the expected SM backgrounds for future searches.

The \(WW \rightarrow e^{\pm }\nu \mu ^{\mp }\nu \) decay channel is studied in events with no identified jets with a transverse momentum \(p_{\textrm{T}}>20~\hbox {GeV}\) and pseudorapidity \(|\eta |<2.4\),Footnote 1 and with \(E_{\textrm{T}}^{{\textrm{miss}}}\) between 60 and 80 GeV. Missing transverse momentum is calculated so as to represent the momentum imbalance in the plane transverse to the colliding beams. High values of \(E_{\textrm{T}}^{{\textrm{miss}}}\) can be produced when weakly interacting neutral particles escape the detector unseen, and \(E_{\textrm{T}}^{{\textrm{miss}}}\) is thus an important variable in many BSM searches. This analysis also imposes requirements on the dilepton invariant mass that are more stringent than those in the 36 fb\(^{-1}\) WW+0-jet measurement [15]. The dominant background process is top-quark production (\(t\bar{t}\) and single-top Wt), which is estimated using the same data-driven method as was used in the EWK 2\(\ell \)+0-jets search. The measurements are performed in a fiducial phase space close to the geometric and kinematic acceptance of the experimental analysis. Differential cross-section measurements are performed for six variables, which are the same as those considered in the 36 fb\(^{-1}\) WW+0-jet measurement [15]:

  • The rapidity of the dilepton system, \(|y_{e\mu }|\).

  • The azimuthal separation between the two leptons, \(|\Delta \phi _{e\mu }|\).

  • The angular variable \(\cos \theta ^{*}=|\tanh (\Delta y(e\mu )/2)|\), which is longitudinally boost invariant and sensitive to the spin structure of the produced dileptons [23], and where \(\Delta y(e\mu )\) is the difference between the electron and muon rapidities.

  • The transverse momentum of the leading lepton, \(p_{\textrm{T}}^{\textrm{lead}\, \ell }\).

  • The invariant mass of the dilepton system, \(m_{e\mu }\).

  • The transverse momentum of the dilepton system, \(p_{\textrm{T}}^{e\mu }\).

In this paper, \(|y_{e\mu }|\), \(|\Delta \phi _{e\mu }|\) and \(\cos \theta ^{*}\) are referred to collectively as ‘angular’ variables, as they probe angular correlations and are sensitive to the spin structure of the WW production system, and \(p_{\textrm{T}}^{\textrm{lead}\, \ell }\), \(m_{e\mu }\) and \(p_{\textrm{T}}^{e\mu }\) are referred to collectively as ‘scale’ variables, as they characterise the energy scale of the process.

The rest of this paper is structured as follows. First, Sect. 2 describes the ATLAS detector and then Sect. 3 presents the analysis that is performed to measure the fiducial and differential cross-sections. This includes the data and MC samples used, the reconstructed-object definitions and event selections used to define the detector-level signal regions, and the SM background estimation, as well as the systematic uncertainties considered and the unfolding techniques used to correct detector-level information back to particle level. Finally, the results are reported in Sect. 4, and Sect. 5 presents the conclusions.

2 ATLAS detector

The ATLAS experiment at the LHC is a multipurpose particle detector with a forward–backward symmetric cylindrical geometry and a near \(4\pi \) coverage in solid angle. It consists of an inner tracking detector surrounded by a thin superconducting solenoid providing a 2 T axial magnetic field, electromagnetic and hadron calorimeters, and a muon spectrometer. The inner tracking detector covers the pseudorapidity range \(|\eta | < 2.5\). It consists of silicon pixel, silicon microstrip, and transition radiation tracking detectors. Lead/liquid-argon (LAr) sampling calorimeters provide electromagnetic (EM) energy measurements with high granularity. A steel/scintillator-tile hadron calorimeter covers the central pseudorapidity range \((|\eta | < 1.7).\) The endcap and forward regions are instrumented with LAr calorimeters for both the EM and hadronic energy measurements up to \(|\eta | = 4.9\). The muon spectrometer surrounds the calorimeters and is based on three large superconducting air-core toroidal magnets with eight coils each. The field integral of the toroids ranges between 2.0 and 6.0 T across most of the detector. The muon spectrometer includes a system of precision tracking chambers and fast detectors for triggering. A two-level trigger system is used to select events. The first-level trigger is implemented in hardware and uses a subset of the detector information to accept events at a rate below 100 kHz. This is followed by a software-based trigger that reduces the accepted event rate to 1 kHz on average depending on the data-taking conditions. An extensive software suite [24] is used in the reconstruction and analysis of real and simulated data, in detector operations, and in the trigger and data acquisition systems of the experiment.

3 Analysis

3.1 Data and simulated event samples

This analysis uses pp collision data at a centre-of-mass energy of \(\sqrt{s} = 13~\hbox {TeV}\) collected by the ATLAS detector during the second data-taking run of the LHC, which took place between 2015 and 2018. After applying standard data-quality requirements for LHC and detector operations [25], this dataset corresponds to a total integrated luminosity of 139 fb\(^{-1}\) with an uncertainty of 1.7% [26], obtained using the LUCID-2 sub-detector [27] for the primary luminosity measurements. Candidate events were selected by a trigger that required at least one electron–muon pair [28, 29]. The trigger-level thresholds for the \(p_{\textrm{T}}\) of the leptons were 17 GeV for the electron and 14 GeV for the muon. The thresholds applied in the lepton offline selection ensured that trigger efficiencies are constant in the relevant phase space.

Simulated MC event samples are used for the SM background estimates and to correct the signal distributions for detector effects. These were processed through a full simulation of the ATLAS detector [30] based on Geant4 [31] and reconstructed with the same algorithms as those used for the data. The generation of the simulated event samples includes the effect of multiple pp interactions per bunch crossing (pile-up), as well as changes in detector response because of interactions in bunch crossings before or after the one containing the hard interaction. Differences between data and simulation in the lepton reconstruction efficiency, energy scale, energy resolution and trigger efficiency [32, 33], and in the b-tagging efficiency [34], are treated through correction factors that are derived from data and applied as weights to the simulated events. The MC samples are also reweighted so that the distribution of the average number of interactions per bunch crossing reproduces the observed distribution in the data.

Table 1 Simulated background event samples with the corresponding matrix element and parton shower (PS) generators, cross-section order in \(\alpha _{\textrm{s}}\) used to normalise the event yield, underlying-event tune and the generator PDF sets used. Where used, the label ‘V’ refers to a W or Z boson

Simulated WW signal samples were produced by summing \(q\bar{q}\)- and gg-initiated samples. The \(q\bar{q}\)-initiated WW signal was simulated at next-to-leading-order (NLO) accuracy in QCD using the Powheg Box v2 [35,36,37] generator interfaced to Pythia 8.186 [38] for the modelling of the parton shower, hadronisation, and underlying event, with parameter values set according to the AZNLO tune [39]. The CT10nlo parton distribution function (PDF) set [40] was used for the hard-scattering processes, whereas the CTEQ6L1 PDF set [41] was used for the parton shower [42]. The events were normalised to the cross-section calculated to next-to-next-to-leading order (NNLO) in QCD [43]. Loop-induced \(gg\rightarrow WW \rightarrow \ell \nu \ell \nu \) events were simulated at LO with up to one additional parton emission using Sherpa  2.2.2, with virtual QCD corrections provided by the Open Loops library [42, 44,45,46]. The \(gg \rightarrow WW\) process was normalised to its inclusive NLO QCD cross-section [47]. An alternative sample of \(q\bar{q} \rightarrow WW\) events was simulated using Sherpa  2.2.2 [42, 48] with matrix elements at NLO accuracy in QCD for up to one additional parton emission and at LO accuracy for up to three additional parton emissions. For the Sherpa  \(q\bar{q}\)- and gg-initiated samples the NNPDF3.0nnlo set of PDFs was used [49], along with the dedicated set of tuned parton-shower parameters developed by the Sherpa  authors. No alternative simulation is considered for the \(gg\rightarrow WW\) process, which contributes only a small fraction of the signal.

Table 1 summarises the generators used for the SM backgrounds along with the relevant PDF sets, the set of tuned parameters used to configure the hadronisation and underlying event, and the cross-section order in \(\alpha _{\textrm{s}}\) used to normalise the event yields for these samples. This study uses the same simulated samples and groupings for the SM background processes as the EWK 2\(\ell \)+0-jets search [22]. The ‘Others’ category groups together processes that produce small or negligible contributions to the signal regions of the search, and includes Drell–Yan, \(t\bar{t} +V\) and Higgs boson production. Further information about the simulations of \(t\bar{t}\), single-top (Wt), multiboson and boson-plus-jet processes can also be found in the relevant public ATLAS notes [42, 50,51,52].

3.2 Event reconstruction and selection

Events are required to have at least one reconstructed vertex with at least two associated tracks with \(p_{\textrm{T}} >400~\hbox {MeV}\). When more than one vertex is reconstructed, the one with the highest \(\sum p_{\textrm{T}}^{2}\) of associated tracks is taken to be the primary vertex. All final-state objects (electrons, muons and jets in this study) are required to satisfy ‘baseline’ criteria to ensure they are well-reconstructed and originate from the primary vertex, and additional ‘signal’ criteria are applied to define the objects used in the measurement. Baseline electrons are required to have \(p_{\textrm{T}} >10~\hbox {GeV}\), pseudorapidity \(|\eta |< 2.47\) and a longitudinal impact parameter \(z_0\), relative to the primary vertex, satisfying \(|z_0 \sin \theta | < 0.5\) mm; baseline muons must fulfill the same criteria except \(|\eta |<2.6\). Electrons must satisfy a Loose likelihood-based identification requirement [32], while muons must satisfy the Medium identification requirements defined in Ref. [33]. Signal electrons are required to satisfy a Tight identification requirement [32] and the track associated with the signal electron is required to have \(\vert d_0\vert /\sigma (d_0) < 5\), where \(d_0\) is the transverse impact parameter relative to the primary vertex and \(\sigma (d_0)\) is its uncertainty, whilst for signal muons the associated track must have \(\vert d_0\vert /\sigma (d_0) < 3\). The signal-lepton isolation criteria used in the EWK 2\(\ell \)+0-jets search [22] are also applied in this study. Hadronic jets are reconstructed from energy deposits in topological clusters of calorimeter cells [78, 79] using the anti-\(k_t\) algorithm [80], as implemented in the FastJet package [81], with a radius parameter \(R=0.4\). They are then calibrated by the application of a jet energy scale derived from 13 TeV data and simulation [82]. To reduce the effects of pile-up, for jets with \(|\eta |< 2.5\) and \(p_{\textrm{T}}<120\,\textrm{GeV}\) a significant fraction of the tracks associated with each jet must have an origin compatible with the primary vertex, as defined by the jet vertex tagger [83]. For jets with \(|\eta |>2.5\) and \(p_{\textrm{T}}<60\,\textrm{GeV}\), similar pile-up suppression is achieved through the forward jet vertex tagger [84]. Finally, events are rejected if they contain a jet that does not satisfy the jet-quality requirements [85, 86]; this removes events impacted by detector noise or non-collision backgrounds. Jets containing b-hadrons (‘b-jets’) are identified by the MV2c10 boosted decision tree algorithm  [34], using quantities such as the impact parameters of associated tracks along with well-reconstructed secondary vertices. A selection that provides 85% efficiency for tagging b-jets in simulated \(t\bar{t}\) events is used in this study. Only jet candidates with \(p_{\textrm{T}}>20\,{\textrm{GeV}}\) and \(|\eta |<2.4\) are considered,Footnote 2 although all jets with \(|\eta |<4.9\) are included in the calculation of missing transverse momentum and in the procedure to remove reconstruction ambiguities that could lead to double counting of baseline objects. This procedure is applied as follows:

  • jet candidates within \(\Delta R' =\sqrt{(\Delta y)^2+(\Delta \phi )^2} = 0.2\) of an electron candidate are removed;

  • jets with fewer than three tracks that lie within \(\Delta R' =0.4\) of a muon candidate are removed;

  • electrons and muons within \(\Delta R' = 0.4\) of the remaining jets are discarded, to reject leptons from the decay of b- or c-hadrons;

  • electron candidates are rejected if they are found to share an inner-detector track with a muon.

The measurements are performed in events with exactly one signal electron and one signal muon with opposite electric charge and each satisfying \(p_{\textrm{T}}>25~\hbox {GeV}\), and a veto on additional baseline leptons and hadronic jets. The multiplicities of non-b-tagged jets and b-tagged jets are considered separately in the background estimation for this study: events with exactly one b-tagged jet with a veto on additional non-b-tagged jets are used to estimate and validate the top-quark background. Requirements are also placed on the missing transverse momentum (\({\textbf{p}}^{\textrm{miss}}_{\textrm{T}}\)), which has magnitude \(E_{\textrm{T}}^{{\textrm{miss}}}\). This is defined as the negative vector sum of the transverse momenta of all identified physics objects (electrons, photons, muons and jets), plus an additional ‘soft term’ to include low-momentum tracks associated with the primary vertex but not with these physics objects. The \(E_{\textrm{T}}^{{\textrm{miss}}}\) value is adjusted for the calibration of the selected physics objects [87].

To access a region of phase space similar to the WW control region in the EWK 2\(\ell \)+0-jets search, additional requirements are placed on the following variables in this study:

  • The invariant mass of the dilepton system, \(m_{e\mu }>100~\hbox {GeV}\).

  • The magnitude of the missing transverse momentum vector, \(E_{\textrm{T}}^{{\textrm{miss}}} \in [60, 80]~\hbox {GeV}\).

  • The ‘stransverse mass’ variable, \(m_{\textrm{T}2} \in [60, 80]~\hbox {GeV}\) [88, 89], with \(m_{\textrm{T}2}\) defined as:

    $$\begin{aligned}{} & {} m_{\textrm{T}2} ( {\textbf{p}}_{{\textrm{T}},1}, {\textbf{p}}_{{\textrm{T}},2}, {\textbf{p}}^{\textrm{miss}}_{\textrm{T}}) = \min _{{\textbf{q}}_{{\textrm{T}},1} + {\textbf{q}}_{{\textrm{T}},2} = {\textbf{p}}^{\textrm{miss}}_{\textrm{T}}}\\{} & {} \quad \left\{ \max [\;m_{\textrm{T}}( {\textbf{p}}_{{\textrm{T}},1}, {\textbf{q}}_{{\textrm{T}},1} ), m_{\textrm{T}}( {\textbf{p}}_{{\textrm{T}},2}, {\textbf{q}}_{{\textrm{T}},2})\;]\right\} , \end{aligned}$$

    where \(m_{\textrm{T}}\) is the transverse mass defined as \(m_{\textrm{T}}=\sqrt{2\times |{\textbf{p}}_{\textrm{T,a}}|\times |{\textbf{p}}_{\textrm{T,b}}|\times (1-{\textrm{cos}}(\Delta \phi ))}\), and \(\Delta \phi \) is the azimuthal angle between the particles with transverse momenta \({\textbf{p}}_{\textrm{T,a}}\) and \({\textbf{p}}_{\textrm{T,b}}\). The vectors \({\textbf{p}}_{{\textrm{T}},1}\) and \({\textbf{p}}_{{\textrm{T}},2}\) are the transverse momenta of the two leptons, and \({\textbf{q}}_{{\textrm{T}},1}\) and \({\textbf{q}}_{{\textrm{T}},2}\) satisfy \({\textbf{p}}^{\textrm{miss}}_{\textrm{T}}= {\textbf{q}}_{{\textrm{T}},1} + {\textbf{q}}_{{\textrm{T}},2}\). The \(m_{\textrm{T}2}\) variable was designed to be sensitive to the mass scales of pair-produced heavy particles that each decay semi-invisibly. The minimisation is performed over all the possible decompositions of \({\textbf{p}}^{\textrm{miss}}_{\textrm{T}}\) into two hypothetical invisible particles with momenta \({\textbf{q}}_{{\textrm{T}},1}\) and \({\textbf{q}}_{{\textrm{T}},2}\). For \(t\bar{t}\) or WW decays, assuming an ideal detector with perfect momentum resolution, \(m_{\textrm{T}2}( {\textbf{p}}_{\mathrm{T,\ell _1}}, {\textbf{p}}_{\mathrm{T,\ell _2}},{\textbf{p}}^{\textrm{miss}}_{\textrm{T}})\) has a kinematic endpoint at the mass of the W boson [89]. The signal regions of the EWK 2\(\ell \)+0-jets search required higher values, \(m_{\textrm{T}2} >100~\hbox {GeV}\).

Table 2 Summary of the selection criteria used for the signal region in this study. The same selections are used at detector level and particle level

In the EWK 2\(\ell \)+0-jets search, the top-quark contamination in events with a jet veto was observed to increase with \(m_{\textrm{T}2}\) in the region \(m_{\textrm{T}2} \in [60, 100]~\hbox {GeV}\). To maximise WW purity, the control region required \(m_{\textrm{T}2} \in [60, 65]~\hbox {GeV}\) and \(E_{\textrm{T}}^{{\textrm{miss}}}\in [60, 100]~\hbox {GeV}\) with validation of the estimate being performed in events with \(m_{\textrm{T}2} \in [65, 100]~\hbox {GeV}\) and \(E_{\textrm{T}}^{{\textrm{miss}}}>60~\hbox {GeV}\). Since \(m_{\textrm{T}2}\) is sensitive to the angular separation of the lepton pair, the \(m_{\textrm{T}2}\) range is widened for the analysis described in this paper to provide a broader phase space for measuring angular distributions. Since \(E_{\textrm{T}}^{{\textrm{miss}}}\) and \(m_{\textrm{T}2}\) are correlated, the \(E_{\textrm{T}}^{{\textrm{miss}}}\) range is tightened to reduce the top-quark contamination. These changes increase the number of events in the region used to perform the differential cross-section measurements without reducing the WW purity. The previously used requirement on the ‘object-based \(E_{\textrm{T}}^{{\textrm{miss}}}\) significance’ [90] is removed to simplify the definition of the fiducial region at particle level.Footnote 3 The definition of the signal region used for this measurement is summarised in Table 2. The same selections are used at particle level when defining the fiducial region used for the fiducial and differential cross-section calculations, as discussed in Sect. 3.4.

Figure 1 shows detector-level comparisons between the data and the SM processes for the six variables that are unfolded to particle level in this study.

Fig. 1
figure 1

Signal region detector-level distributions of \(|y_{e\mu }|\) (top left), \(|\Delta \phi _{e\mu }|\) (top right), \(\cos \theta ^{*}\) (middle left), \(p_{\textrm{T}}^{\textrm{lead}\, \ell }\) (middle right), \(m_{e\mu }\) (bottom left), and \(p_{\textrm{T}}^{e\mu }\) (bottom right). Data are indicated by black markers along with the distribution for the WW signal and background SM processes. The last bin of each scale-variable distribution contains overflow events. The lower panels show the ratio of data to the total SM background prediction. The uncertainty bands shown include statistical and systematic uncertainties, excluding theory uncertainties in the WW signal. ‘FNP leptons’ refers to the background from fake/non-prompt leptons, calculated using the data-driven matrix method

3.3 Background estimation

The estimation of the SM backgrounds in this study uses the same techniques as those used in the EWK 2\(\ell \)+0-jets search [22]. For the search, the SM backgrounds were classified into irreducible backgrounds from processes producing prompt leptons and reducible backgrounds containing one or more fake/non-prompt (FNP) leptons. The main irreducible backgrounds were SM diboson (WWWZZZ) and top-quark (\(t\bar{t}\) and Wt) production, which were estimated from simulated events and normalised using a simultaneous likelihood fit to data in dedicated control regions (CRs). The yields and shapes of kinematic distributions of the relevant backgrounds were then validated in a set of validation regions (VRs). Three CRs were used: CR-WW, targeting WW production; CR-VZ, targeting WZ and ZZ production, which were normalised by using a single parameter in the likelihood fit to the data; and CR-top, targeting \(t\bar{t}\) and single-top-quark production, which were also normalised by using a single parameter in the likelihood fit to the data. Both CR-VZ and CR-top require high \(m_{{\textrm{T}}2}\), and high values of \(E_{\textrm{T}}^{{\textrm{miss}}}\) and its significance. CR-VZ uses same-flavour (dielectron and dimuon) events with a jet veto and requires the dilepton invariant mass to be consistent with an on-shell Z boson. CR-top requires one electron and one muon, one b-tagged jet with a veto on additional non-b-tagged jets. The remaining background from FNP leptons was estimated from data using the matrix method [91]. In this study, WW is the target signal process, with the remaining processes being backgrounds that are subtracted from the data prior to calculating the fiducial and differential cross-sections.

Fig. 2
figure 2

Detector-level distributions of \(|y_{e\mu }|\) (top left), \(|\Delta \phi _{e\mu }|\) (top right), \(\cos \theta ^{*}\) (middle left), \(p_{\textrm{T}}^{\textrm{lead}\, \ell }\) (middle right), \(m_{e\mu }\) (bottom left), and \(p_{\textrm{T}}^{e\mu }\) (bottom right) in the top validation region. Data are indicated by black markers along with the distribution for the WW signal and background SM processes. The last bin of each scale-variable distribution contains overflow events. The lower panels show the ratio of data to the total SM background prediction. The uncertainty bands shown include statistical and systematic uncertainties, excluding theory uncertainties in the WW signal. ‘FNP leptons’ refers to the background from fake/non-prompt leptons, calculated using the data-driven matrix method

The statistical interpretation for the search was performed using the HistFitter framework [92]. The likelihood for the ‘background-only’ fit used to constrain the background normalisation factors was a product of Poisson probability density functions describing the observed number of events in each CR and Gaussian distributions that constrain the nuisance parameters associated with the systematic uncertainties. Poisson distributions were used for MC statistical uncertainties. Further details of the likelihood fit can be found in the EWK 2\(\ell \)+0-jets search paper [22]. After the fit, the normalisation factors returned for the WW, \(t\bar{t}\) and single-top-quark, and WZ/ZZ processes were \(1.25 \pm 0.11\), \(0.82 \pm 0.06\) and \(1.18 \pm 0.05\) respectively (where the errors include both statistical and systematic uncertainties), which for diboson processes were applied to MC samples scaled to NLO QCD cross-sections (the NNLO QCD cross-sections were not used in the original search paper because the samples were normalised to the data in the control regions). Good agreement, within about one standard deviation, was observed for the yields and kinematic distributions in all VRs when applying these normalisation factors and their corresponding uncertainties. The deviation of the WW normalisation factor from unity by more than \(1\sigma \) suggests there is tension between the SM and data in the parameter space probed by the search, and this is tested in the present study.

In this study, the normalisation factors from the EWK 2\(\ell \)+0-jets search are applied directly to the VZ (WZ/ZZ) and top \((t\bar{t}, Wt)\) backgrounds that are subtracted from the data when performing the cross-section measurements described in Sect. 3.4. The uncertainties in the normalisation factors are propagated through the calculation as discussed in Sect. 3.5. The correlations and constraints that the likelihood fit imposes on the nuisance parameters describing the systematic uncertainties are not applied in this study; the systematic uncertainties are instead assumed to take their nominal values as discussed in Sect. 3.5. This approach is designed to be conservative, but it has negligible impact on the results because no significant constraints were observed in the EWK 2\(\ell \)+0-jets search. To validate the use of the original top normalisation factor from the EWK 2\(\ell \)+0-jets search in the adjusted phase space of this study, an additional validation exercise is performed to check the modelling of the top-quark background in a region with the same selection as in Table 2 but requiring exactly one b-tagged jet. Good agreement is observed across all six distributions considered for differential cross-section measurements, as shown in Fig. 2.

3.4 Fiducial cross-section determination

The differential cross-sections are measured in the fiducial phase space of the \(WW \rightarrow e^{\pm }\nu \mu ^{\mp }\nu \) decay channel using particle-level implementations of the selection criteria defined in Table 2. The particle-level quantities associated with simulated events are calculated using the SimpleAnalysis [93] framework. The signal particle-level distributions produced by SimpleAnalysis have been validated against the Rivet [94] toolkit that enables further reinterpretation of SM measurements and validation of MC generators. The Rivet routine for this measurement is available on HepData [95].

Electrons and muons are required to originate from the hard interaction and not from hadron decays. Electrons and muons from leptonically decaying \(\tau \)-leptons are included in the fiducial region. The momenta of photons that are emitted in a cone of size \(\Delta R'=0.1\) around the lepton direction and do not originate from hadron decays are added to form ‘dressed’ leptons. Particle-level jets are reconstructed using the anti-\(k_t\) algorithm [80] with radius parameter \(R=0.4\) from visible stable final-state particles, excluding prompt dressed leptons. The particle-level missing transverse momentum is defined as the vectorial sum of the momenta of invisible particles in the event. For SM processes this is the sum of the neutrino momenta.

The fiducial cross-section is calculated as:

$$\begin{aligned} \sigma _{WW} = \frac{N_{\textrm{obs}} -N_{\textrm{bkg}}}{C\cdot {\mathcal {L}}}, \end{aligned}$$

where \(N_{\textrm{obs}}\) is the observed number of data events in the fiducial region, \(N_{\textrm{bkg}}\) is the predicted number of background events, \({\mathcal {L}}\) the integrated luminosity, and C is a correction factor to account for limited acceptances and detector inefficiencies. It is calculated using MC simulation as the number of simulated signal events passing the detector-level event selection divided by the number of events in the fiducial phase space. In this study, \(C=0.55 \pm 0.08\) is applied, where the uncertainty comes from statistical, experimental and theoretical sources, as described in Sect. 3.5.

The differential cross-sections are calculated using the iterative Bayesian unfolding (IBU) technique [96, 97] as implemented in the RooUnfold package [98]. This unfolding technique corrects the detector-level distributions of data (with the non-WW backgrounds subtracted) for bin-to-bin migrations due to the event reconstruction. It also applies fiducial corrections (corresponding to events that are reconstructed in the signal region but originate outside the fiducial region at particle level) and reconstruction efficiency corrections (due to events that lie inside the fiducial region at particle level but do not enter the signal region due to detector inefficiencies). The bins chosen for the differential measurements were optimised for a desired level of statistical uncertainty and to reduce the migration of events between particle-level and detector-level bins. The number of iterations used in IBU is also optimised by considering the bias due to the assumed true distribution and the resulting statistical uncertainty of the measurement, with too many iterations generating high statistical uncertainties and too few iterations biasing the measurements towards the MC prediction. In this study, two iterations are chosen for \(\cos \theta ^{*}\) and \(|\Delta \phi _{e\mu }|\), three iterations are used for \(m_{e\mu }\) and \(p_{\textrm{T}}^{\textrm{lead}\, \ell }\), and four iterations are used for \(|y_{e\mu }|\) and \(p_{\textrm{T}}^{e\mu }\). In addition to the bias tests (discussed in Sect. 3.5) to measure any systematic effects due to the use of the signal WW MC sample in the unfolding procedure, several signal injection tests were performed using SUSY models for chargino-pair production that were on the edge of the exclusion sensitivity in the EWK 2\(\ell \)+0-jets search. These are important checks of the validity of using these measurements to calculate constraints on BSM physics. Detector-level distributions of WW plus injected BSM signal were input to the unfolding calculation to test whether the particle-level WW plus BSM distribution could be recovered. The unfolding calculation matched the expected WW plus BSM distributions for a range of SUSY models displaying different kinematics because of their different SUSY particle masses. The results of the BSM injection tests are available on HepData [95].

3.5 Systematic uncertainties

Systematic uncertainties in the WW differential cross-sections measured in this study arise from experimental sources (which impact the subtracted non-WW backgrounds, and the calculation used to correct the signal for detector effects), uncertainties in the modelling of the top-quark background (which includes theoretical uncertainties, and uncertainties associated with the data-driven background estimate), and signal modelling. Statistical uncertainties associated with the MC samples used for the signal and background processes, and with the observed data distributions, also impact the unfolded distributions.

The sources of experimental uncertainty considered in the EWK 2\(\ell \)+0-jets search [22] are also considered in this study. The dominant experimental uncertainties are due to the calibration of the jet energy scale and resolution [79, 82]. Additional uncertainties that arise from the lepton reconstruction efficiency, lepton energy scale and lepton energy resolution, and differences between the trigger efficiencies in data and simulation are grouped into the lepton uncertainties category. There are also uncertainties in the scale factors applied to the simulated samples to account for differences between data and simulation in the b-jet identification efficiency, and an uncertainty in \({\textbf{p}}^{\textrm{miss}}_{\textrm{T}}\) associated with the soft-term resolution and scale [87]. Finally, an uncertainty is assigned to the reweighting procedure (pile-up reweighting) applied to simulated events to match the distribution of the number of interactions per bunch crossing observed in data.

Several sources of uncertainty in the modelling of \(t\bar{t}\) and Wt events are accounted for by varying the normalisation and shape of the subtracted backgrounds. For \(t\bar{t}\) production, uncertainties in the parton shower simulation are estimated from differences between samples generated with Powheg Box interfaced to either Pythia 8.186 or Herwig  7.04 [99, 100]. Uncertainties in the modelling of initial- and final-state radiation are estimated by comparing the nominal sample with two alternative samples generated with Powheg Box interfaced to Pythia 8.186 but with the radiation settings varied [101]. Finally, an additional uncertainty associated with the choice of event generator is estimated by comparing the nominal samples with samples generated with MadGraph5_aMC@NLO interfaced to Pythia 8.186 [102]. For single-top-quark production, an uncertainty is assigned to the treatment of the interference between the Wt and \(t\bar{t}\) samples. This is done by comparing the nominal sample generated using the diagram removal method with a sample generated using the diagram subtraction method [101].

Of the systematic uncertainties considered in the EWK 2\(\ell \)+0-jets search, uncertainties in the data-driven estimate of FNP leptons and theoretical uncertainties in the diboson WZ/ZZ backgrounds are not applied in this study because these processes contribute little to the subtracted backgrounds. Additional systematic uncertainties are applied to the unfolding to account for the uncertainty in the normalisation of the top-quark and VZ backgrounds, although the VZ normalisation uncertainties are observed to be negligible. The luminosity uncertainty (1.7%) is applied to the subtracted backgrounds that are not estimated using data-driven techniques.

Tests were performed to estimate the bias introduced by using information from the nominal signal MC sample in the unfolding procedure. This includes a data-driven test, whereby MC simulated WW signal events are reweighted at generator level to obtain better agreement between the detector-level signal and the background-subtracted data. The nominal unfolding procedure is then applied to the reweighted detector-level signal distributions to check whether the reweighted particle-level distributions can be reproduced. The impact of theoretical uncertainties in the signal modelling is evaluated by using the detector-level signal distributions with the alternative Sherpa  \(qq\rightarrow WW\) signal sample introduced in Sect. 3.1 as input to the nominal unfolding procedure, and comparing the result with the alternative particle-level signal distribution. In all tests the expected particle-level distributions were accurately recovered so no additional uncertainties were assigned to the unfolding procedure.

Finally, statistical uncertainties from the data are calculated using pseudo-experiments that vary the data distributions according to their Poisson uncertainties in each bin, which are then passed through the unfolding calculation. Statistical uncertainties associated with the simulated MC samples are evaluated using a similar technique.

4 Results

The measured fiducial cross-section for \(WW \rightarrow e^{\pm }\nu \mu ^{\mp }\nu \) production in the phase space defined in Table 2 is:

$$\begin{aligned}{} & {} \sigma (WW\rightarrow e^{\pm }\nu \mu ^{\mp }\nu )\\{} & {} \quad = 19.2 \pm 0.3~({\textrm{stat}}) \pm 2.5~({\textrm{syst}}) \pm 0.4~({\textrm{lumi}})~{\textrm{fb}}\\{} & {} \quad = 19.2 \pm 2.6~({\textrm{total}})~{\textrm{fb}}. \end{aligned}$$

Table 3 shows the relative impact of the categories of systematic uncertainties discussed in Sect. 3.5 on the measured fiducial cross-section. The largest contribution is from the experimental jet uncertainty, which contributes a \(12\%\) uncertainty to the measured fiducial cross-section. The jet uncertainties are higher than in the previous ATLAS 13 TeV WW+0-jet measurement [15] and this can be attributed to the lower \(p_{\textrm{T}}\) threshold used to define the jet veto.

Table 3 Breakdown of the relative uncertainties per category and the total uncertainty on the fiducial cross-section measurement
Fig. 3
figure 3

Measured fiducial differential cross-sections of WW production for (top to bottom) \(|y_{e\mu }|\), \(|\Delta \phi (e\mu )|\) and \(\cos \theta ^{*}\). The measured cross-section values are shown as points with dark bands giving the statistical uncertainty and light bands indicating the size of the total uncertainty. The results are compared with the \(q\bar{q}\)-initiated predictions from Powheg Box v2 + Pythia 8.186 and Sherpa  2.2.2, each combined with Sherpa  2.2.2 + Open Loops (LO+PS) for the gg-initiated states. The k-factors refer to the corrections applied to scale the predictions of qq-initiated and gg-initiated processes to NNLO and NLO accuracy in QCD respectively. The right column shows a breakdown of contributions to the uncertainties in the unfolded measurement

Fig. 4
figure 4

Measured fiducial differential cross-sections of WW production for (top to bottom) \(p_{\textrm{T}}^{\textrm{lead}\, \ell }\), \(m_{e\mu }\) and \(p_{\textrm{T}}^{e\mu }\). The last bin is inclusive in the measured observable and for \(p_{\textrm{T}}^{e\mu }\) the first bin contains the underflow bin. The measured cross-section values are shown as points with dark bands giving the statistical uncertainty and light bands indicating the size of the total uncertainty. The results are compared with the \(q\bar{q}\)-initiated predictions from Powheg Box v2 + Pythia 8.186 and Sherpa  2.2.2, each combined with Sherpa  2.2.2 + Open Loops (LO+PS) for the gg-initiated states. The k-factors refer to the corrections applied to scale the predictions of qq-initiated and gg-initiated processes to NNLO and NLO accuracy in QCD respectively. The right column shows a breakdown of contributions to the uncertainties in the unfolded measurement

The measured value is compatible with the nominal predictions of 17.8 fb and 17.1 fb from Powheg Box v2 + Pythia 8.186 and Sherpa  2.2.2, respectively, where both are combined with Sherpa   2.2.2 + Open Loops (LO+PS) for the gg-initiated states. The ratio of the measured cross-section to the nominal Powheg Box v2 + Pythia 8.186 prediction is 1.08. To compare this ratio with the detector-level WW normalisation factor of \(1.25 \pm 0.11\) in the EWK 2\(\ell \)+0-jets search [22], the former must be multiplied by 1.13 to account for the NLO cross-section calculation, which is included in this study but not in the EWK 2\(\ell \)+0-jets search. This gives a ratio of 1.22, which is consistent with the normalisation factor from the EWK 2\(\ell \)+0-jets search.

Particle-level differential cross-sections for the six variables targeted in this study are presented in Fig. 3 for the angular variables and Fig. 4 for the scale variables. In each case, the right-hand plot shows the impact of the uncertainties, grouped into the categories discussed in Sect. 3.5, on the measurement.

The dilepton rapidity distribution has a maximum between \(|y_{e\mu }|=0\) and \(|y_{e\mu }|=1\), consistent with central production of a massive diboson system. The \(|\Delta \phi _{e\mu }|\) distribution peaks at \(|\Delta \phi _{e\mu }|=2\). The shape of this distribution is influenced by the \(m_{\textrm{T}2}\) selection defining the fiducial region: high \(|\Delta \phi _{e\mu }|\) values are associated with back-to-back leptons, which typically give lower \(m_{\textrm{T}2}\) values than are considered in this study. Conversely, the highest \(m_{\textrm{T}2}\) values (which for WW production should occur around 90 GeV in the absence of detector effects) are often associated with collinear leptons (low \(|\Delta \phi _{e\mu }|\)), which are also excluded from the fiducial region. The \(\cos \theta ^{*}\) distribution peaks around \(\cos \theta ^{*}=0.8\), with higher values being suppressed by the rapidity acceptance of the fiducial phase space. The distributions of the scale variables all show the expected characteristic fall for high values of the variable. The fiducial phase-space acceptance also suppresses the leading-lepton and dilepton \(p_{\textrm{T}}\) distributions at lower \(p_{\textrm{T}}\) values.

Table 4 Chi-squared per number of degrees of freedom \(\chi ^{2}/\text {NDF}\) for a comparison of unfolded distributions with different theory predictions. The calculation takes into account bin-by-bin correlations of systematic and statistical uncertainties. Uncertainties in the theory predictions are not considered

The measurements are compared with the qq-initiated NLO QCD+PS predictions from Powheg Box v2+Pythia 8.186 and Sherpa  2.2.2, each combined with Sherpa  2.2.2 + Open Loops (LO+PS) for the gg-initiated states. For the angular variables, the region with \(|\Delta \phi _{e\mu }|<1.5\) is underestimated by both theory predictions, which is consistent with observations in the previous ATLAS 13 TeV WW+0-jet measurement [15]. The region \(\cos \theta ^{*}>0.8\) is also underestimated by 10–30% by both predictions. This corresponds to a rapidity difference of \(|\Delta y(e\mu )|\ge 2.2\) between the leptons. For the distribution of dilepton rapidity \(|y_{e\mu }|\), the theory shows reasonable agreement with the measurement. The predictions for the scale variables show good agreement with the data except for low values of \(p_{\textrm{T}}^{{\textrm{lead}}\, \ell }\), where both predictions underestimate the cross-section by 20–25%. Global \(\chi ^{2}\) calculations are carried out for all predictions and are displayed in Table 4. Uncertainties in the theory predictions are not considered. The largest \(\chi ^{2}/{\textrm{NDF}}\) is 24.1/6 corresponding to the comparison between the \(q\bar{q}\rightarrow WW\) (Sherpa  2.2.2) + \(gg\rightarrow WW\) (Sherpa  2.2.2 + Open Loops) prediction and the unfolded distribution for \(p_{\textrm{T}}^{{\textrm{lead}}\, \ell }\).

5 Conclusion

The cross-section for \(WW \rightarrow e^{\pm }\nu \mu ^{\mp }\nu \) production in pp collisions at \(\sqrt{s}=13~\hbox {TeV}\) is measured with the ATLAS detector at the LHC in a fiducial phase-space characterised by the absence of jets and additional leptons, the presence of a high dilepton invariant mass \(m_{e\mu }\), and with values of \(E_{\textrm{T}}^{{\textrm{miss}}}\) and the stransverse mass \(m_{{\textrm{T}}2}\) motivated by the control regions used in supersymmetry searches [22]. The measured cross-section is \(\sigma (WW \rightarrow e^{\pm }\nu \mu ^{\mp }\nu )= 19.2 \pm 0.3~({\textrm{stat}}) \pm 2.5~({\textrm{syst}}) \pm 0.4~({\textrm{lumi}})~{\textrm{fb}}\). Differential cross-sections for three variables sensitive to the energy scale of the event and three variables sensitive to the angular correlations of the leptonic decay products are compared with two theoretical SM predictions from perturbative QCD calculations. Good agreement is observed for most distributions within the uncertainties. The largest discrepancies occur at low values of \(|\Delta \phi _{e\mu }|<1.5\), high values of \(\cos \theta ^{*}>0.8\) and low \(p_{\textrm{T}}^{{\textrm{lead}}\, \ell }\), which is consistent with the observations of the previous ATLAS WW+0-jet measurement [15]. This study validates the SM in a new and interesting region motivated particularly by searches for supersymmetry and provides benchmark measurements that can be used to improve future SM predictions and calculate additional constraints on BSM models.