Search for additional heavy neutral Higgs and gauge bosons in the ditau final state produced in 36

A search for neutral heavy resonances is performed in the W W → e νμν decay channel using pp collision data corresponding to an integrated luminosity of 36 . 1 fb − 1 , collected at a centre-of-mass energy of 13 TeV by the ATLAS detector at the Large Hadron Collider. No evi-dence of such heavy resonances is found. In the search for production via the quark–antiquark annihilation or gluon– gluon fusion process, upper limits on σ X × B ( X → W W ) as a function of the resonance mass are obtained in the mass range between 200 GeV and up to 5 TeVfor various benchmark models: a Higgs-like scalar in different width scenarios, a two-Higgs-doublet model, a heavy vector triplet model, and a warped extra dimensions model. In the vector-boson fusion process, constraints are also obtained on these resonances, as well as on a Higgs boson in the Georgi–Machacek model and a heavy tensor particle coupling only to gauge bosons.


Introduction
The measured properties [1][2][3][4] of the Higgs boson discovered in 2012 by the ATLAS [5] and CMS [6] collaborations at the Large Hadron Collider (LHC) are, within experimental uncertainties, consistent with those predicted for the Standard Model (SM) Higgs boson, h. Nevertheless, the SM is thought to be an incomplete theory and many scenarios beyond the SM (BSM) predict an extended Higgs sector [7,8]. Diboson vector and tensor resonances are also predicted in several other extensions to the SM, such as in composite Higgs models [9,10] and models with warped extra dimensions [11][12][13][14].
This article reports on the results of a search for heavy neutral resonances decaying into two W bosons, which then decay into the eνμν final state, either directly or via leptonic tau decays with additional neutrinos. The analysis is based on the full pp collision dataset collected by the ATLAS detector in 2015 and 2016 at the centre-of-mass energy of e-mail: atlas.publications@cern.ch √ s = 13 TeV, corresponding to an integrated luminosity of 36.1 fb −1 .
The results are interpreted in terms of different benchmark models. For the case of a scalar resonance produced by gluon-gluon fusion (ggF) or vector-boson fusion (VBF), two scenarios with different intrinsic widths are considered. Constraints on the heavy neutral scalar in two-Higgs-doublet models (2HDM) are also obtained. The neutral member of the fiveplet in the Georgi-Machacek (GM) model [15,16] also serves as a reference model in the VBF production mode. The parameterisation of heavy vector triplet (HVT) Lagrangians [17,18] permits the interpretation of searches for spin-1 resonances in a generic way. The bulk Randall-Sundrum (RS) model [11,19] features a spin-2 Kaluza-Klein (KK) graviton excitation (G KK ) decaying into W W , while a tensor resonance signal in the VBF production mode is based on an effective Lagrangian model (ELM) [20].
A previous search for a heavy Higgs boson in the eνμν final state was performed by ATLAS [21] based on a data sample with an integrated luminosity of 20.3 fb −1 at √ s = 8 TeV. The CMS Collaboration also published a search for a high-mass scalar decaying into two W bosons in the fully leptonic final state [22], using datasets at √ s = 7 and 8 TeV with integrated luminosities of 5.1 and 19.5 fb −1 , respectively. A search for heavy resonances in the RS models in the leptonic decays of the W W channel, using a dataset of 4.7 fb −1 at 7 TeV [23], was reported by the ATLAS Collaboration. The ATLAS and CMS collaborations have obtained constraints on the HVT and bulk RS models, based on other decay modes of the V V channels, with V being either a W or a Z boson [24][25][26][27][28][29][30][31][32][33][34][35][36]. The search in the eνμν decay mode is complementary to searches performed in other decay modes. In particular, the sensitivity to low mass resonances is higher in the fully leptonic final state than in final states that include jets due to background from jet production.
The article is organised as follows. Section 2 presents the various models used in this analysis. Section 3 describes the ATLAS detector. The data and simulated event samples are discussed in Sect. 4. The event reconstruction and selection are described in Sects. 5 and 6, respectively, followed by the background estimation techniques in Sect. 7. Systematic uncertainties are discussed in Sect. 8 and the results are presented in Sect. 9. Finally, the conclusions are given in Sect. 10.

Theoretical models
The different signal models studied are presented in Table 1. One scenario for the heavy scalar assumes that the scalar has a width much smaller than the detector resolution. This is referred to as the narrow-width approximation (NWA). Larger widths (large-width assumption, LWA) of 5, 10 and 15% of the heavy Higgs boson mass, are also considered. The choice of the width range for the heavy Higgs boson is motivated by the fact that, for several of the most relevant BSM models, widths above 15% are already excluded by indirect limits [37].
The 2HDM comes in different types [38], defined by assumptions about the couplings of each of the Higgs doublets and the discrete symmetries imposed. This analysis considers Type I, where one Higgs doublet couples to vector bosons while the other couples to fermions, and Type II of the minimal supersymmetric (SUSY)-like model in which one Higgs doublet couples to up-type quarks and the other one to down-type quarks and charged leptons. This analysis uses a generic charge-conjugation-and parity-conserving (CP-conserving) 2HDM with a softly broken Z 2 symmetry [38] which has several free parameters: (i) four masses m h , m H , m A and m H ± for the two CP-even neutral states, the pseudo-scalar and the charged Higgs boson pair, respectively, (ii) a mixing angle α between the CP-even neutral Higgs fields, and (iii) the ratio of the vacuum expectation values of the two Higgs doublets tan β = υ 2 /υ 1 . The bench- Table 1 Summary of the different signal models and resonances considered in the analysis. The resonance spin and production mode are also specified with ggF for gluon-gluon fusion, qqA for quark-antiquark annihilation and VBF for vector-boson fusion  [39,40]. The GM model extends the Higgs sector with the addition of a real and a complex triplet of SU(2) L in a way which preserves the SM value of ρ = M 2 W /(M 2 Z cos 2 θ W ) = 1 at tree level, with m W , m Z and θ W being the W and Z boson mass and the weak mixing angle, respectively. The physical states include a fermiophobic fiveplet, H 0 5 , H ± 5 , and H ±± 5 , of custodial SU(2) symmetry which couples preferentially to vector bosons [41]. For that reason, the GM model is less constrained [42], when produced by the VBF process, than other standard benchmark models of a triplet Higgs field, such as the little Higgs model [43] or the left-right symmetric model [44]. The model has many parameters [45,46], but, if the other new Higgs bosons are heavier than those of the H 5 multiplet, the only production mode is via the VBF process. The cross section and decay width into V V are then proportional to a single parameter, sin 2 θ H , which characterises the fraction of the gauge boson masses generated by the triplet Higgs fields.
The HVT Lagrangian [18] parameterises the couplings of the new spin-1 heavy bosons to SM particles in a generic manner and allows their mixing with SM gauge bosons. The s-channel production mechanism of the heavy gauge bosons is primarily via qq annihilation (qqA). The HVT bosons couple to the Higgs boson and SM gauge bosons with coupling strength c h g V and to the fermions with coupling strength g 2 c F /g V , where g is the SM SU(2) L gauge coupling, c h and c F are multiplicative factors that modify the couplings to the Higgs boson and to the fermions, and g V represents its coupling strength to the W and Z bosons. For the case of vector-boson fusion, it is assumed that there is no coupling to fermions so that non-VBF production processes are suppressed.
The spin-2 G KK is the first Kaluza-Klein excitation of the graviton in the RS model with a warped extra dimension [11,19], where the SM fields are localised in the bulk [12][13][14]. This model is characterised by the dimensionless coupling constant k/M Pl ∼ O(1) where k determines the curvature of the space, and whereM Pl = M Pl / √ 8π is the reduced Planck scale.
For the VBF production mode, the spin-2 signal is based on an effective Lagrangian approach with as a characteristic energy scale of the underlying new physics [20], Here, f i are variable coupling parameters, T μν is the spin-2 singlet field, B αν and W αν i are the electroweak field strength tensors, and is the scalar Higgs field. The covariant deriva- where σ i are the Pauli matrices, Y the weak hypercharge, and g and g the corresponding gauge coupling constants. The model differs from the RS model in that the couplings to fermions or gluons are not included in the Lagrangian. Also, the BSM amplitude is multiplied by a form factor which is a function of a cut-off scale f f and a suppression power n f f in order to preserve unitarity at high energies: where p 2 1 and p 2 2 are the squared invariant masses of the incoming electroweak bosons and k 2 sp2 is the squared invariant mass of the sum of the initial boson momenta, equivalent to that of an s-channel spin-2 particle. The specific parameter settings for the signal models used are given in Sect. 4.

ATLAS detector
The ATLAS detector [47,48] is a general-purpose particle detector used to investigate a broad range of physics processes. It includes an inner tracking detector (ID) surrounded by a thin superconducting solenoid, electromagnetic and hadronic calorimeters and a muon spectrometer (MS) incorporating three large superconducting toroidal magnets with eight coils each. The ID consists of fine-granularity silicon pixel and microstrip detectors, and a straw-tube tracker. It is immersed in a 2 T axial magnetic field produced by the solenoid and provides precision tracking for charged particles in the range |η| < 2.5, where η is the pseudorapidity of the particle. 1 The straw-tube detector also provides transition radiation measurements for electron identification. The calorimeter system covers the pseudorapidity range |η| < 4.9. It is composed of sampling calorimeters with either liquid argon (LAr) or scintillator tiles as the active medium, and lead, steel, copper, or tungsten as the absorber material. The MS provides muon identification and momentum measurements for |η| < 2.7. The ATLAS detector has a two-level trigger system [49] to select events for further analysis. 1 ATLAS uses a right-handed coordinate system with its origin at the nominal interaction point (IP) in the centre of the detector and the zaxis along the beam pipe. The x-axis points from the IP to the centre of the LHC ring, and the y-axis points upwards. Cylindrical coordinates (r, φ) are used in the transverse plane, φ being the azimuthal angle around the beam pipe. The pseudorapidity is defined in terms of the polar angle θ as η = − ln tan(θ/2). Angular distance is measured in units of R ≡ ( η) 2 + ( φ) 2 .

Data and simulation samples
The data used in this analysis were collected with a singleelectron or single-muon trigger. These triggers have a transverse energy or momentum threshold, E T or p T , that depends on the data-taking period, with the lowest threshold varying between 20 and 26 GeV. The trigger efficiency for W W events passing the offline event selection (Sect. 6) is greater than 99%. Data quality criteria are applied to ensure that events are recorded with stable beam conditions and with all relevant subdetector systems operational.
Samples of simulated signal and background events are used to optimise the event selection and to estimate the signal acceptance and the background yields from various SM processes.
The  [60] for the underlying event. The mass of the heavy Higgs boson signals considered in this analysis spans the range between 200 GeV and 4 (3) TeV for the ggF-induced (VBF-induced) signals. Both NWA and LWA samples were generated in steps of 100 GeV up to 1 TeV, and in steps of 200 GeV thereafter.
The Powheg-Box samples describe the production of a ggF-induced heavy Higgs boson in association with one jet at leading-order (LO) precision, while further jets are emulated by the parton shower generator, Pythia. A more precise calculation of higher jet multiplicities is provided by using MadGraph5_aMC@NLO 2.3.2 to simulate gg → H events in association with up to two jets at NLO precision. Here, the overlap between identical final states generated at the matrix element (ME) and the parton shower (PS) stage is removed using FxFx merging [61]. The fraction of ggF events passing the event selection requirements of the N jet = 1 and N jet ≥ 2 VBF categories (defined later in Sect. 6) predicted by the Powheg-Box event generator is reweighted to match that of the MadGraph5_aMC@NLO FxFx samples. The corresponding scale factors are calculated for several hypothetical heavy Higgs boson masses. It is the largest, 1.14, for the 200 GeV mass point, and decreases with increasing res-onance mass to a value of 0.85 for the 4 TeV mass point, for the N jet = 1 VBF category. The corresponding numbers are 0.91 and 0.73 for the N jet ≥ 2 VBF category.
Benchmark samples for the GM, HVT and bulk RS models were generated at LO using MadGraph5_aMC@NLO interfaced to Pythia 8.186 with the NNPDF23LO PDF set. A value of sin θ H = 0.4 is chosen for the GM benchmark model. For the HVT interpretation in the qq annihilation mode, samples were generated according to the extended gauge symmetry model A [18] with g V = 1. In the VBF mode, samples were generated using the same g V value but setting the couplings to the fermions to zero so that the new vector boson couples only to the SM vector and Higgs bosons. For the bulk RS model, a curvature scale parameter k/M Pl of either 0.5 or 1 is considered. The ELM VBF spin-2 signals were generated at LO with VBFNLO 3.0.0 beta 2 [62] with the NNPDF30LO PDF set [63] and using the following parameter setting [20]: f f = 3 TeV, n f f = 4, = 1.5 TeV and f 1 = f 2 = f 5 = 1. The mass range considered is between 200 GeV and 5 TeV for the KK graviton signal, between 250 GeV and 5 TeV for the HVT qqA signal, between 200 GeV and 1 TeV for the GM and ELM VBF signals, and between 300 GeV and 1 TeV for the HVT VBF signal.
The main sources of SM background include events from the production of single top quarks, tt, dibosons (W W , W Z and Z Z), Z /γ * +jets and W +jets. Single-topquark simulated events were generated with Powheg-Box 2.0 [64,65] using the CT10 NLO PDF set interfaced to Pythia 6.428 [66] for parton showering and hadronisation, with the Perugia2012 tune [67] and CTEQ6L1 PDF [68] to describe the underlying event. The tt events were generated with Powheg-Box 2.0 [69] using the NNPDF30NLO PDF set [63] interfaced to Pythia 8.186 for parton showering and hadronisation, with the A14 tune and CTEQ6L1 PDF to describe the underlying event. The sample was generated by setting the resummation damping parameter h damp to 1.5 times the top-quark mass, m top , which was set to 172.5 GeV. The h damp parameter controls the ME/PS matching and effectively regulates the highp T radiation. The Evt-Gen 1.2.0 [70] package was used to model the properties of the bottom and charm hadron decays. Diboson samples were generated with Sherpa 2.1.1 [71][72][73][74][75] for the gg production processes and Sherpa 2.2.1 for the qq production processes, using the CT10 NLO and NNPDF30NNLO PDF sets, respectively. The Sherpa event generator for the latter processes produces up to one additional parton at NLO and up to three additional partons at LO. Production of W and Z bosons in association with jets was also simulated using Sherpa 2.1.1 with the CT10 NLO PDF set, where b-and c-quarks are treated as massive particles. The gg → W W production also includes the contribution of the SM Higgs boson at 125 GeV and the interference effects between the continuum and Higgs resonance processes. The VBF part of SM Higgs boson production was generated with Powheg-Box [54] interfaced to Pythia 8.186 for parton showering and hadronisation.
The effect of multiple pp interactions in the same and neighbouring bunch crossings (pile-up) was included by overlaying minimum-bias collisions, simulated with Pythia 8.186, on each generated signal and background event. The number of overlaid collisions is such that the distribution of the average number of interactions per pp bunch crossing in the simulation matches the pile-up conditions observed in the data, which is about 25 interactions per bunch crossing on average. The generated samples were processed through a Geant4-based detector simulation [76,77], followed by the standard ATLAS reconstruction software used for collision data.

Event reconstruction
Events used in this analysis are required to have at least one primary vertex with a minimum of two associated tracks, each with transverse momentum p T > 400 MeV. If there is more than one vertex reconstructed in an event that meets these conditions, the one with the highest sum of track p 2 T is chosen as the primary vertex.
Electrons are reconstructed from clusters of energy deposits in the electromagnetic calorimeter that match a track reconstructed in the ID. They are identified using the likelihood identification criteria described in Ref. [78]. The electrons used in this analysis are required to pass the "Medi-umLH" selection for p T > 25 GeV 2 or the "TightLH" selection for p T < 25 GeV and be within |η| < 2.47, excluding the transition region between the barrel and endcaps in the LAr calorimeter (1.37 < |η| < 1.52). These "MediumLH" and "TightLH" selection categories have identification efficiencies of 84 and 74%, respectively, for electrons with p T of 25 GeV. The corresponding probabilities to misidentify hadrons as electrons are approximately 0.5 and 0.3%, respectively.
Muons are reconstructed by combining ID and MS tracks that have consistent trajectories and curvatures [79]. The muon candidates used in this analysis are required to have |η| < 2.5 and pass the "Medium" selection for p T > 25 GeV or the "Tight" selection for p T < 25 GeV, defined on the basis of the quality of the reconstruction and identification. These selections have a reconstruction efficiency of approximately 96 and 92%, respectively, for muons originating from the decay of W bosons [80]. The corresponding probabilities to misidentify hadrons as muons are approximately 0.2 and 0.1%, respectively.
To ensure that leptons originate from the interaction point, a requirement of |d 0 |/σ d 0 < 5 (3) is imposed on the electrons (muons) and |z 0 sin θ | < 0.5 mm is applied to both lepton types. Here d 0 and z 0 are the transverse and longitudinal impact parameters of the lepton with respect to the primary vertex, respectively, and σ d 0 is the uncertainty in the measured value of d 0 . In addition, electrons and muons are required to be isolated from other tracks and calorimetric activities by applying p T -and η-dependent isolation criteria. For muons, the calorimeter isolation is based on energy deposits in the calorimeter within a cone R of 0.2 around the muons. The muon track isolation uses a variable cone size starting at R = 0.3 and shrinking with increasing p T of the muon [81]. The same calorimeter isolation is used for electrons, and the electron track isolation uses a variable cone size starting at R = 0.2. The efficiency of these isolation requirements is 90% for both lepton types with p T of 25 GeV, increasing to 99% at 60 GeV.
Jets are reconstructed from three-dimensional clusters of energy deposits in the calorimeters using the anti-k t algorithm [82] with a radius parameter of R = 0.4 implemented in the FastJet package [83]. The four-momenta of the jets are calculated as the sum of the four-momenta of their constituents, which are assumed to be massless. Jets are corrected for energy from pile-up using the pile-up subtraction based on jet areas [84]. The jet energy scale is estimated in Ref. [85]. Jets are required to have p T > 30 GeV and |η| < 4.5.
For jets with p T < 60 GeV and |η| < 2.5, the multivariate "jet vertex tagger" algorithm [86] is used to suppress jets from pile-up interactions. To avoid double counting, jets of any transverse momentum are discarded if they are within a cone of size R = 0.2 around an electron candidate or if they have fewer than three associated tracks and are within a cone of size R = 0.2 around a muon candidate. However, if a jet with three or more associated tracks is within a cone of size R < 0.4 of a muon candidate, or the separation between an electron and any jet is within 0.2 < R < 0.4, the corresponding muon or electron candidate is rejected.
To estimate the number of b-tags in the event, jets with p T > 20 GeV and within |η| < 2.5 are considered to contain a b-hadron if they yield a b-tagging algorithm discriminant value exceeding a reference value. The MV2c10 algorithm [87,88] is chosen at the 85% b-tagging efficiency benchmark point, estimated from b-jets in simulated tt events. The misidentification rate for jets which originate from a light quark or gluon is less than 1%, while it is approximately 17% for c-jets.
The missing transverse momentum, with magnitude E miss T , is calculated as the negative vectorial sum of the transverse momenta of calibrated electrons, muons, and jets originating from the primary vertex, as well as tracks with p T > 500 MeV compatible with the primary vertex and not associated with any of these [89].

Event selection
As a first step, W W candidate events are selected by requiring two oppositely charged, different-flavour leptons (e or μ). Both leptons must satisfy the minimal quality criteria discussed in Sect. 5. When ordered in p T , these leptons are called the leading and subleading ones, p ,(sub)lead T . In order to suppress the background from diboson processes, a veto is imposed on events with an additional lepton with p ,other T > 15 GeV. Table 2 summaries the selections and the definition of signal regions (SRs). The variables used in the selections are the most discriminating ones chosen by a boosted decision tree (BDT) [90], based on the NWA signal samples. These are p ,lead T , the invariant mass of the leading and subleading leptons, m , and the pseudorapidity difference between the two leptons, η . The first two variables provide good separation between a heavy resonance signal and the W W and top-quark background. The separation of signal from background based on the η distribution is found to have a reasonable efficiency and allows, at the same time, a control region to be defined for the W W background (Sect. 7.2). For each selected variable, the selection criterion is set by maximising the signal significance in the presence of background. The optimised selection is checked to be applicable to the LWA signals.
In order to further suppress the top-quark background, events with at least one b-tagged jet (N b-tag ≥ 1) are rejected Common selections where p T and φ are the transverse momentum and azimuthal angle of a given lepton and φ E miss T is the azimuthal angle of the missing transverse momentum vector.
Three event categories are defined: two disjoint categories optimised for the VBF production, VBF N jet = 1 and VBF N jet ≥ 2 (SR VBF1J and SR VBF2J ), and one quasi-inclusive category (excluding the VBF phase space) dedicated to the ggF or qqA signal (SR ggF ). For the VBF N jet = 1 category, two discriminating variables are used to minimise the contribution of the ggF signal: the pseudorapidity of the jet, η j , and the minimum value of the pseudorapidity difference between the jet and either of the leptons, min(| η j |). For the VBF N jet ≥ 2 category, the invariant mass, m j j , and the rapidity difference, y j j , of the two leading jets are used to select the VBF signal.
The NWA and LWA signal acceptance times the efficiency, after all selection requirements for a 700 GeV ggF signal, is approximately 50% in the quasi-inclusive ggF category and 5% or less in the VBF N jet = 1 and N jet ≥ 2 categories. For a 700 GeV VBF signal, it is between 15 and 25% for the three event categories. The acceptance times efficiency for the three event categories combined, as a function of resonance mass, is shown in Fig. 1 for the different signals. For the spin-1 and spin-2 signals, the range up to 1 TeV is considered in the case of VBF model processes. For samples with lower resonance masses, the acceptance times efficiency is lower because the leptons are softer. This is also the reason why the search is limited to signal mass values greater than about 200 GeV. The same selection is applied to all models and the different selection efficiencies between the models are mainly due to different η distributions for the different spin states.
The discriminating variable used for the statistical analysis (Sect. 9) in this search is the transverse mass defined as and p T is the transverse momentum vector of the leading and subleading leptons.

Background estimation
The dominant background for the eνμν final state is due to events with top quarks and due to SM W W events. Additional contributions to the background arise from V +jets and the diboson processes V Z, V γ and V γ * . Since the discriminating variable used for this search is the transverse mass, m T , both the normalisation and the shape of the background m T distribution must be estimated. The shape of the background is modelled using simulated events while the top-quark and W W background normalisations are determined by a simultaneous fit (Sect. 9) to the data in m T -binned distributions in the signal regions and the total event yields in control regions. The normalisation factors of the fit, named "post-fit" normalisation factors 3 hereafter, provide the best overall matching  between the number of observed data events and the corresponding SM background expectations in all the signal and control regions. The control regions are defined by criteria similar to those used for the signal regions, but with some requirements loosened or reversed to obtain signal-depleted samples, enriched in the relevant background. These criteria are summarised in Table 3.
The following subsections describe the methods used to estimate the most important background processes, namely top quark, W W , and W +jets. The Z /γ * +jets and non-W W diboson background contributions are small. The Z /γ * +jets Monte Carlo (MC) samples are normalised using NNLO cross sections [91] and the non-W W ones with NLO cross sections from the Sherpa event generator. The small background from the m h 125 GeV Higgs boson resonance and its off-shell component is included and its interference with the continuum W W background is taken into account.

Top-quark background
Events containing top quarks can be produced as a tt pair or as a single top quark in association with either a W boson or a quark of another flavour. In this analysis, contributions from tt and single-top-quark events are estimated together, with their relative contributions determined by their predicted cross sections and by their relative acceptances obtained from MC simulation. The single-top-quark contribution varies from about 10 to 30% depending on the signal event category.
The normalisation of the top-quark background for the quasi-inclusive ggF category is determined in a control region (Top CR ggF ) where one jet is required to be b-tagged in addition to the signal region selection. The purity of the top-quark background in this CR is high (97%) and thus allows the modelling of the MC simulation to be validated. The distribution of the simulated leading lepton p T in the Top CR ggF is found to disagree with the data and the ratio between the data and the simulation decreases with increasing p ,lead T . The simulated distribution is corrected in the SR ggF and corresponding CRs with factors obtained by fitting the ratio with a linear function. The correction varies between + 4 and −10% as p ,lead T increases from 50 to 200 GeV. The top-quark background control regions for the VBF categories (Top CR VBF ) have a small number of data events and are therefore merged. At least one jet is required to be btagged. In addition, the selection thresholds imposed on m and p ,(sub)lead T are relaxed to 10 and 25 GeV, respectively, and the selection on | η | and max(m W T ) is removed. The threshold value on m of 10 GeV is used to suppress background contributions from low-mass resonances decaying into different-flavour final states via τ + τ − . In this control region, the purity of the top-quark background is 96%, and no mis-modelling of the p ,lead T distribution is observed. The post-fit normalisation factors from the simultaneous fit are 0.96 ± 0.05 and 1.12 +0. 13 −0.12 in the ggF and the VBF control regions, respectively, where the uncertainty quoted corresponds to the combined statistical and systematic uncertainties. Figure 2 shows the m T distributions in the ggF and VBF top-quark CRs. The different background components are scaled according to the event yields obtained from the simultaneous fit. In the control regions the fit uses only the integrated event yields. The shape of the distributions is compared between data and MC predictions and found to be in good agreement after the application of the p ,lead T correction described above for the ggF top-quark CR. The shapes of the m T distribution for 700 GeV and 2 TeV NWA Higgs boson signals are also shown, normalised to the expected limits on σ H × B(H → W W ) from this analysis. The ggF contribution from the SM Higgs boson is included in the W W component. The SM Higgs boson VBF contribution is negligibly small and is not shown in this and following figures.

W W background
The W W CR for the quasi-inclusive ggF category (W W CR ggF ) uses the same selection as for the SR except for | η | which is reversed so that the CR and SR are orthogonal. The selection conditions are shown in Table 3. The m T distributions of the qq → W W Sherpa MC sample in the     While the integrated yields of the distributions agree within 3% in both the SR ggF and the W W CR ggF , a small m T shape difference is observed, particularly in the SR. The m T distributions of the Sherpa samples are thus reweighted to the combined NNLO QCD and NLO EW predictions. The postfit normalisation factor obtained from the simultaneous fit for the W W contributions in the quasi-inclusive ggF categories is 1.14 ± 0.09, where the uncertainty quoted corresponds to the combined statistical and systematic uncertainties. The postfit purity of the W W background in the control region is 51%.
In order to select more data events, the W W CR for the N jet = 1 VBF category (W W CR VBF1J ) uses a slightly different selection (shown in Table 3) from the one in the SR, but still disjoint from the SR. The normalisation factor obtained from the same simultaneous fit for the W W contribution in the W W CR VBF1J is 1.0 ± 0.2, where the uncertainty quoted corresponds to the combined statistical and systematic uncertainties. The post-fit purity of the W W background in the control region is 44%.
The W W contribution in the N jet ≥ 2 VBF category is about 20%, and its prediction is taken from simulation because it is difficult to isolate a kinematic region with a sufficient number of W W events and with a small contamination from the top-quark background. Figure 3 shows the m T distributions in the W W CR ggF and CR VBF1J . The different background contributions are scaled according to the event yields obtained from the simultaneous fit. For the W W control regions only integrated event yields are used in the fit, like in the fits of the top control regions.

W +jets background
Events with W bosons produced in association with jets may enter the SR when a jet is misidentified as a lepton. Due to the difficulties in accurately modelling the misidentification process in the simulation, the W +jets background contribution is estimated using the data-driven method developed for the SM h → W W analysis [94]. A sample of events is used which satisfies all event selection criteria, except that one of the two lepton candidates fails to meet the quality criteria for being an identified lepton but satisfies a less restrictive selection, referred to as "anti-identified". Anti-identified muons (electrons) have loosened isolation and impact parameter (likelihood identification) selection criteria as compared to the identified selection. From this data sample the non-W +jets contribution, dominated by top-quark and W W background processes, is subtracted on the basis of MC predictions. The W +jets purity of the samples is 46, 59 and 22% for the quasi-inclusive ggF, N jet = 1 and N jet ≥ 2 VBF categories, respectively.
The W +jets contamination in the signal region is then determined by scaling the number of events in the background-subtracted data sample by an extrapolation factor, which is the ratio of the number of identified leptons to the number of anti-identified leptons in a data sample   of dijet events in bins of lepton p T and η. The dijet sample is collected using prescaled lowp T single-lepton triggers with thresholds of 12 GeV for electrons and 14 GeV for muons. Events are selected with exactly one candidate lepton, back-to-back with the leading jet. Electroweak processes in the dijet event sample, dominated by W +jets and Z /γ * background contributions, are subtracted. The dominant systematic uncertainty in the estimation of the W +jets background is due to the differences between dijet and W +jets sample characteristics. All systematic uncertainties associated with this background estimate are listed in Sect. 8.1.

Systematic uncertainties
In this section, experimental and theoretical uncertainties in the normalisation and shape of the m T distributions of the background and the signal are described. Except for those explicitly mentioned here, the shape uncertainties are small and thus neglected. Overall, the systematic uncertainty dominates, except in the tails of the m T distributions where the statistical uncertainty is larger.

Experimental uncertainties
The dominant sources of experimental uncertainty in the signal and background yields are the jet energy scale and resolution (Jet) [85], the b-tagging efficiency (b-tag) [87], and the pile-up modelling [86]. Other systematic uncertainties such as those associated with trigger efficiencies, lepton reconstruction and identification efficiencies, lepton momentum scales and resolutions [78,80], missing transverse momentum reconstruction [89] and the jet vertex tagger [86] are also considered when evaluating systematic effects on the shape and normalisation of the background, or the shape and efficiency of the signal yield. The uncertainty associated with the pile-up modelling is assessed by performing a variation of ±9% in the number of simulated pile-up interactions to cover the uncertainty in the ratio of the predicted and measured cross sections of non-diffractive inelastic events producing a hadronic system of mass m X,had > 13 GeV [95].
For the main background from top-quark and W W processes, the impact of the most important experimental systematic uncertainties is summarised in Tables 4 and 5 together with dominant theoretical uncertainties. The maximum changes in yield for the up and down variations are shown in the various signal and control regions. The correlation between the SRs and CRs is taken into account in the simultaneous fit.
Systematic effects due to lepton identification efficiencies, momentum and scale resolutions, are found to be approximately 1%. They are not shown in the tables. The last column in the tables shows the total uncertainty, including these small uncertainty sources.
The data-driven W +jets background estimate is subject to several sources of systematic uncertainty. The subtraction of the subdominant electroweak processes (Sect. 7.3) has a significant impact on the extrapolation factor calculation at high lepton p T . The subtraction is varied, as described in Ref.
[94], and the variation of the event yield in the signal region is taken as the uncertainty. The method assumes that the extrapolation factors of the dijet and W +jets samples are equal. Differences in the jet flavour composition between dijet and W +jets events introduce an additional systematic uncertainty. This is evaluated as the sum in quadrature of two contributions: differences between the extrapolation factors calculated with dijet samples and Z +jets samples in data, and differences between the extrapolation factors evaluated with W +jets and Z +jets MC samples. Finally, the statistical uncertainties of the different data and MC samples used to evaluate the extrapolation factors are taken as an additional source of systematic uncertainty. The overall relative systematic uncertainty of the W +jets background is found to be approximately 35% for each of the three signal event categories, with the dominant uncertainty being associated with the jet flavour composition.
The uncertainty in the total 2015 and 2016 integrated luminosity is 2.1%. It is derived, following a methodology similar to that detailed in Ref.
[96], from van der Meer scans performed in August 2015 and May 2016, calibrated at high luminosity by various luminosity detectors.

Theoretical uncertainties of the background
For background sources which are normalised using control regions, theoretical uncertainties are evaluated for the extrapolation from the control region to the signal region.
For the top-quark and W W background, theoretical uncertainties in the extrapolation are evaluated according to the prescription from the LHC Higgs Cross Section Working Group [97]. The uncertainties include the impact of missing higher-order corrections, PDF variations and other MC modelling. The dominant theoretical uncertainties are shown in Tables 4 and 5.
For the top-quark background, the uncertainty from the event generator and parton shower modelling (ME+PS) is estimated by comparing the nominal Powheg-Box+Pyhtia8 generated samples with those from an alternative event generator, Sherpa 2.2.1. The uncertainty named "Scale" corresponds to variations of the renormalisation μ R and factorisation μ F scales as well as h damp . The variations for μ R and μ F are between 0.5 and 2 from their nominal scale of m 2 top + p 2 T , with p T being the top-quark transverse momentum. The parameter h damp is varied between m top and 2 · m top from its nominal scale h damp = 1.5 · m top . In the analysis the single-top-quark and tt processes are studied together. An uncertainty of 20% [98,99] is assigned to the relative contribution of the single-top-quark processes, corresponding to the source "Single top" in Table 4. The PDF uncertainty is obtained by taking the envelope of the uncertainty of the NNPDF30NLO PDF set and its differences in central value with the CT14 [100] and MMHT 2014 [101] PDF sets, following the recommendations of Ref.
[55]. The PDF uncertainties are m T dependent and increase from 2 to 10% with m T . This m T dependence is taken into account in the signal regions. In the ggF quasi-inclusive category, two additional shape systematic uncertainties associated with the scale variations and the p T reweighting for the leading lepton in the top-quark background are applied, the latter corresponding to ±50% of the reweighting correction. These two uncertainties are comparable and vary from a few percent at low m T to about 10% at m T 1 TeV, without affecting the integrated event yield of the top-quark background in the category.
For the W W background, the ME+PS modelling uncertainty is obtained by comparing the nominal Sherpa 2.2.1 sample with an alternative sample generated with Powheg-Box+Pythia8. The renormalisation, factorisation, and resummation scales are varied separately by factors of 0.5 and 2. The uncertainty corresponding to the factorisation scale variation is smaller than the other uncertainties and is not shown. The PDF uncertainty for the W W background is obtained and treated in the same way as for the top-quark background. In the ggF quasi-inclusive category, an additional shape uncertainty from ME+PS is applied. It varies from a few percent at low m T to about 20% at m T 1 TeV. There are no significant shape uncertainties in the m T distributions in the VBF categories.
In addition to the scale uncertainties described above, a relative uncertainty of ±50% is assigned to the reweighting corrections of the qq → W W Sherpa sample to the combined NNLO QCD and NLO EW predictions in the ggF SR and W W CR.
The gg → (h * ) → W W process, where the SM 125 GeV Higgs boson is off-shell, is modelled at leading order with the Sherpa event generator with a K -factor of 1.7 that is used to account for higher-order cross-section corrections with an uncertainty of 60%, following the studies in Refs. [102][103][104][105].
Other small background processes, such as W Z, Z Z, Z /γ * +jets and W W in the N jet ≥ 2 VBF category, do not have their own control regions. They are normalised to the theoretical predictions. The uncertainties in their yields due to the uncertainties in the predictions are evaluated with the same prescription as described above. The impact of these uncertainties is small (see Tables 6, 7 in Sect. 9).

Theoretical uncertainties in the signal predictions
Theoretical uncertainties in the signal acceptance include effects due to the choice of QCD renormalisation and factorisation scales, the PDF set as well as the underlying-event modelling, the parton shower model and the parton shower tune. These uncertainties are evaluated separately in each of Table 6 Event yields in the signal and control regions for the quasiinclusive ggF category. The predicted background yields and uncertainties are calculated after the simultaneous fit to the data in all the SRs and the CRs including those from  the three event categories as a function of the resonance mass and independently for ggF-and VBF-induced resonances. The effect of missing higher-order corrections in QCD on the signal acceptance is estimated by varying the renormalisation and factorisation scales independently by factors of 0.5 and 2 from the nominal scale of m 2 H + p 2 T,H , with m H and p T,H being the mass and the transverse momentum of the heavy Higgs boson, respectively. The acceptance values obtained with these modified MC samples are compared to the signal acceptance of the nominal sample. For resonances produced via ggF, these uncertainties are found to be negligible in the quasi-inclusive ggF and N jet = 1 VBF categories, while in the N jet ≥ 2 VBF category they range between 2.5 and 0.2% for a resonance mass varying from 200 GeV to 4 TeV (unless stated otherwise, the following uncertainties are quoted for the same mass range). For resonances produced via vector-boson fusion, these uncertainties range from 0.9 to 2.8% in the quasi-inclusive ggF category, from 1.9 to 3.6% in the N jet = 1 VBF category and from 1.0 to 7.3% in the N jet ≥ 2 VBF category.
The PDF-induced uncertainties in the signal acceptance are determined in the same way as for the top-quark and W W background processes. For the ggF-induced (VBF-induced) signal, these uncertainties reach 0.4% (1.7%), 1.5% (1.2%) and 1.6% (1.5%) for the quasi-inclusive ggF, N jet = 1 and N jet ≥ 2 VBF event categories, respectively.
The uncertainties corresponding to the parton shower tune and the underlying event are derived by moving independently, up or down, the Pythia internal parameters that are associated with final-state radiation or the multiple parton interactions to study separately their influence on the signal acceptance of the various signal mass points. These uncertainties are compared for each event category and mass point to the uncertainties from the choice of parton shower model, which are estimated by comparing the results obtained for the nominal parton shower generator to those obtained using Herwig++ [106,107]. The tune uncertainties are found to be smaller than the shower uncertainties for all mass points. Thus only the latter uncertainties are considered in the final results. The corresponding uncertainties for ggF-induced signals increase from 1.3 to 3.1%, from 13 to 28%, and from 2.3 to 15% for increasing resonance masses in the quasi-inclusive ggF, N jet = 1 and N jet ≥ 2 VBF categories, respectively. The uncertainties for VBF-induced signals increase from 4.3 to 19%, from 5.1 to 9.0%, and from 3.3 to 8.0% in the three categories.
In addition, uncertainties due to missing higher-order corrections in QCD are evaluated for ggF-induced processes for each event category, considering also event migration effects between different event categories. This follows the method proposed by Stewart and Tackmann [108]. The corresponding uncertainties range from 3 to 10% for the quasi-inclusive ggF category and from 4 to 30% (30-60) for the N jet = 1 (N jet ≥ 2) VBF event categories.

Results
The statistical method used to interpret the results of the search is described in Ref. [109]. A likelihood function L Upper limits at 95% CL on the total ggF and VBF Higgs boson production cross section times branching fraction σ H × B(H → W W ) in the eνμν channel, for a signal at 800 GeV as a function of the ggF cross section divided by the combined ggF and VBF production cross section. The inner and outer bands show the ±1σ and ±2σ ranges around the expected limit is defined as the product of Poisson probabilities associated with the number of events in bins of the m T distributions in the signal regions and of the total yields in the control regions. Each source of systematic uncertainty is parameterised by a corresponding nuisance parameter θ constrained by a Gaussian function.
The m T distributions in the signal regions are divided into 18 (8) bins for the ggF quasi-inclusive (N jet = 1 and ≥ 2 VBF) categories. The bins are of variable size to reflect the increasing width of the m T distribution of the expected signal with increasing mass, while keeping the statistical precision of the background contributions in each bin sufficiently high.
The numbers of events predicted and observed in the signal and control regions are shown for the quasi-inclusive ggF categories in Table 6 and for the VBF N jet = 1 and ≥ 2 categories in Table 7. These yields are obtained from a simultaneous fit to the data in all the SRs and the CRs. The fitted signal event yield is consistent with zero. The background compositions depend strongly on the event categories: the top-quark and W W processes are comparable in SR ggF and SR VBF1J while the top-quark events dominate in SR VBF2J . The large reduction of the total background uncertainty is due to strong anti-correlations between some of the uncertainty sources of the top-quark and W W background. The m T distributions in SR ggF , SR VBF1J and SR VBF2J are shown in Fig. 4. As no excess over the background prediction is observed, upper limits at 95% confidence level (CL) are set on the production cross section times the branching fraction, σ X × B(X → W W ), for signals in each benchmark model.
The 95% CL upper limits are computed using the modified frequentist method known as CL s [110], using the asymptotic approximation of the distribution of a test statistic [111], q μ , a function of the signal strength μ, defined as the ratio of the measured σ X × B(X → W W ) to that of the prediction: The quantitiesμ andθ are those values of μ and θ , respectively, that unconditionally maximise L. The numerator depends on the valuesθ μ that maximise L for a given value of μ.
Limits are obtained separately for ggF and VBF production for the NWA and LWA signal hypotheses. To derive the expected limits on the ggF (VBF) production modes, the VBF (ggF) production cross section is set to zero so that the     To derive the observed limits on the ggF (VBF) production mode, the VBF (ggF) production cross section is treated as a nuisance parameter in the fit and profiled, in the same way as dealing with the normalisation factors of the different background processes. This approach avoids making any assumption about the presence or absence of the signal in any of these production modes. Figure 5 shows the 95% CL upper limits on σ H × B(H → W W ) as a function of m H for a Higgs boson in the NWA scenario in the mass range 200 GeV ≤ m H ≤ 4(3) TeV for the ggF (VBF) production. Values above 6.4 pb (1.3 pb) at m H = 200 GeV and above 0.008 pb (0.006 pb) at 4 (3) TeV are excluded at 95% CL by the quasi-inclusive ggF (VBF) NWA analysis. The main systematic uncertainties affecting the limits are the p T correction for the leading lepton in the top-quark background, scale variations for the top-quark background, the parton shower modelling of the W W MC generator, and the jet energy scale and resolution uncertainties. Limits are consistent with those expected in the absence of a signal over the investigated mass range. The fact that the observed limits are more stringent than the expected ones for mass values beyond 2 TeV is explained by the deficit in data at the high m T tail in Fig. 4. These limits are extracted using the asymptotic approximation and their accuracy is verified to be consistent within about 5% at 800 GeV and better than 20% at 2 TeV and beyond using pseudo-experiments.
The analysis can be extended to a more general case where the relative fraction of the ggF production cross section varies over the total ggF and VBF production cross section. The corresponding 95% CL upper exclusion limits for a signal at 800 GeV are shown in Fig. 6. The dependence of the limits on the ggF fraction for other masses is similar but becomes slightly stronger (weaker) for lower (higher) mass values. The limit values for a ggF fraction of 0 and 1 are comparable with the VBF and ggF limits shown in Fig. 5 at the same mass value. The VBF limits are tighter than the ggF limits since the VBF N jet ≥ 2 signal region has the smallest background contribution and thus is the most sensitive.
The NWA exclusion limit shown above can be further translated to exclusion contours in the 2HDM for the phase space where the narrow-width approximation is valid. The 95% CL exclusion contours for Type I and Type II in the plane of tan β and cos(β − α) for three mass values of 200, 300 and 500 GeV are shown in Fig. 7. For a fixed value of cos(β − α) = −0.1, 95% CL exclusion limits on tan β as a function of the heavy Higgs boson mass are shown in Fig. 8. The coupling of the heaviest CP-even Higgs boson to vector bosons is proportional to cos(β − α) and in the decoupling limit cos(β − α) → 0, the light CP-even Higgs boson is indistinguishable from a SM Higgs boson with the same mass. The range of cos(β − α) and tan β explored is limited to the region where the assumption of a heavy narrow-width Higgs boson with negligible interference is valid. When calculating the limits at a given choice of cos(β − α) and tan β, [ Fig. 12 Upper limits at 95% CL on the resonance production cross section times branching fraction σ X × B(X → W W ) in the eνμν channel, for a graviton signal with two different couplings of k/M Pl = 1 (left) and k/M Pl = 0.5 (right), and for an ELM spin-2 VBF signal (bottom).
The inner and outer bands show the ±1σ and ±2σ ranges around the expected limit. The full curves without dots correspond to the predicted theoretical cross sections the relative rate of ggF and VBF production in the fit is set to the prediction of the 2HDM for that parameter choice. The white regions in the exclusion plots indicate regions of parameter space which are not excluded by the present analysis.
For the LWA scenario, the interference effects among the heavy boson, the light Higgs boson at 125 GeV and the SM W W continuum background were studied and found to have negligible impact on the exclusion limits. The 95% CL upper limits are shown in Fig. 9. The limits for signal widths of 5, 10 and 15% are comparable with those from the NWA scenario for the VBF signals while for the ggF signals, the limits weaken slightly at high masses as the width increases. For the LWA 15% case, the upper exclusion limit ranges between 5.2 pb (1.3 pb) at m H = 200 GeV and 0.02 pb (0.006 pb) at 4 (3) TeV for the ggF (VBF) signals. Figure 10 shows the limits on the resonance production cross section times branching fraction σ X × B(X → W W ) and sin θ H for a scalar GM signal with masses between 200 GeV and 1 TeV. At the observed limit, the width is narrower than the experimental resolution [46]. The current sensitivity is not sufficient to exclude the benchmark model with sin θ H = 0.4.
Limits are derived in the mass range from 250 GeV to 5 TeV and from 300 GeV to 1 TeV for a qqA and VBF HVT signal, respectively, as shown in Fig. 11. For the qqA production, signals below about 1.3 TeV are excluded at 95% CL. No limit can be set for the VBF production in the benchmark model that assumes a coupling strength to gauge bosons g V = 1 and a coupling to fermions c F = 0. The model has an intrinsic width much narrower than the detector resolution. Figure 12 shows the limits on a G KK → W W signal for two different couplings: k/M Pl = 1 and k/M Pl = 0.5, for masses between 200 GeV and 5 TeV, and for an ELM spin-2 VBF signal for masses between 200 GeV and 1 TeV. The observed limits exclude a KK graviton signal lighter than 1.1 TeV (750 GeV) with the higher (lower) coupling, while the current sensitivity is not sufficient to exclude the ELM spin-2 VBF signal.

Conclusion
A search for heavy neutral resonances decaying into a W W boson pair in the eνμν channel performed by the ATLAS Collaboration at the LHC is presented. The search uses proton-proton collision data collected at a centre-of-mass energy of 13 TeV corresponding to an integrated luminosity of 36.1 fb −1 . No significant excess of events beyond the Standard Model background prediction is found in the mass range between 200 GeV and up to 5 TeV. Upper lim-its are set on the product of the production cross section and the X → W W branching fraction in several scenarios: a high-mass Higgs boson with a narrow width or with intermediate widths (of 5, 10 and 15% of the heavy Higgs boson mass), as well as other spin-0, spin-1, and spin-2 signals. For the narrow-width heavy Higgs boson signals, values above 6.4 pb at m H = 200 GeV and above 0.008 pb at 4 TeV are excluded at 95% confidence level for the gluon-gluon fusion production mode.