Determination of spin and parity of the Higgs boson in the \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$WW^*\rightarrow e \nu \mu \nu $$\end{document}WW∗→eνμν decay channel with the ATLAS detector

Studies of the spin and parity quantum numbers of the Higgs boson in the \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$WW^* \rightarrow e \nu \mu \nu $$\end{document}WW∗→eνμν final state are presented, based on proton–proton collision data collected by the ATLAS detector at the Large Hadron Collider, corresponding to an integrated luminosity of 20.3 fb\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$^{-1}$$\end{document}-1 at a centre-of-mass energy of \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\sqrt{s}=8$$\end{document}s=8 TeV. The Standard Model spin-parity \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$J^{CP} = 0^{++}$$\end{document}JCP=0++ hypothesis is compared with alternative hypotheses for both spin and CP. The case where the observed resonance is a mixture of the Standard-Model-like Higgs boson and CP-even (\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$J^{CP} = 0^{++}$$\end{document}JCP=0++) or CP-odd (\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$J^{CP} = 0^{+-}$$\end{document}JCP=0+-) Higgs boson in scenarios beyond the Standard Model is also studied. The data are found to be consistent with the Standard Model prediction and limits are placed on alternative spin and CP hypotheses, including CP mixing in different scenarios.


Introduction
This paper presents studies of the spin and parity quantum numbers of the newly discovered Higgs particle [1,2] in the W W * → eνμν final state, where only final states with opposite-charge, different-flavour leptons (e, μ) are considered. Determining the spin of the newly discovered resonance and its properties under charge-parity (CP) conjugation is of primary importance to firmly establish its nature, and in particular whether it is the Standard Model (SM) Higgs boson or not. Compared to the previous ATLAS publication [3], this paper contains significant updates and improvements: the SM Higgs-boson hypothesis is compared with improved spin-2 scenarios. The case where the observed resonance 1 has J P = 1 + or 1 − is not studied in this paper as it is already excluded by previous publications both by the ATLAS [3] and CMS collaborations [4].
To simulate the alternative Higgs-boson hypotheses, the MadGraph5_aMC@NLO [5] generator is adopted. It includes terms of higher order (α 3 S ) in the Lagrangian, in 1 In the following the abbreviated notation J P is used instead of J C P . e-mail: atlas.publications@cern.ch contrast to the JHU [6,7] event generator used in the previous publication [3]. In the context of this study, the 1-jet final state, which is more sensitive to contributions from the higher-order terms, is analysed, in addition to the 0-jet final state. Furthermore, the parity of the Higgs resonance is studied by testing the compatibility of the data with a beyond-the-Standard-Model (BSM) CP-even or CP-odd Higgs boson [8]. Finally, the case where the observed resonance is a mixed CP-state, namely a mixture of a SM Higgs boson and a BSM CP-even or CP-odd Higgs boson, is investigated.
This study follows the recently published H → W W * analysis [9] in the 0-and 1-jet channels with one major difference: the spin and parity analysis uses multivariate techniques to disentangle the various signal hypotheses and the backgrounds from each other, namely Boosted Decision Trees (BDT) [10]. The reconstruction and identification of physics objects in the event, the simulation and normalisation of backgrounds, and the main systematic uncertainties are the same as described in Ref. [9]. This paper focuses in detail on the aspects of the spin and parity analysis that differ from that publication.
The outline of this paper is as follows: Sect. 2 describes the theoretical framework for the spin and parity analysis, Sect. 3 discusses the ATLAS detector, the data and Monte Carlo simulation samples used. The event selection and the background estimates are described in Sects. 4 and 5, respectively. The BDT analysis is presented in Sect. 6, followed by a description of the statistical tools used and of the various uncertainties in Sects. 7 and 8, respectively. Finally, the results are presented in Sect. 9.

Theoretical framework for the spin and parity analyses
In this section, the theoretical framework for the study of the spin and parity of the newly discovered resonance is discussed. The effective field theory (EFT) approach is adopted in this paper, within the Higgs characterisation model [8] implemented in the MadGraph5_aMC@NLO [5] generator. Different hypotheses for the Higgs-boson spin and parity are studied. Three main categories can be distinguished: the hypothesis that the observed resonance is a spin-2 resonance, a pure CP-even or CP-odd BSM Higgs boson, or a mixture of an SM Higgs and CP-even or CP-odd BSM Higgs bosons. The latter case would imply CP violation in the Higgs sector. In all cases, only the Higgs boson with a mass of 125 GeV is considered. In case of CP mixing, the Higgs boson would be a mass eigenstate, but not a CP eigenstate. The approach used by this model relies on an EFT, which by definition is only valid up to a certain energy scale . This Higgs characterisation model considers that the resonance structure recently observed corresponds to one new boson with J P = 0 ± , 1 ± or 2 + and with mass of 125 GeV, assuming that any other BSM particle exists at an energy scale larger than . The EFT approach has the advantage of being easily and systematically improvable by adding higherdimensional operators in the Lagrangian, which effectively corresponds to adding higher-order corrections, following the same approach as that used in perturbation theory. The cutoff scale is set to 1 TeV in this paper, to account for the experimental results from the LHC and previous collider experiments that show no evidence of new physics at lower energy scales. More details can be found in Ref. [8]. In the EFT approach adopted, the Higgs-boson couplings to particles other than W bosons are ignored as they would impact the signal yield with no effects on the H → W W * decay kinematics, which is not studied in this analysis.
This section is organised as follows. Higgs-like resonances in the framework of the Higgs characterisation model are introduced in Sects. 2.1.1 and 2.2.1, for spin-2 and spin-0 particles, respectively. The specific benchmark models under study are described in Sects. 2.1.2 and 2.2.2.

Spin-2 theoretical model
Given the large number of possible spin-2 benchmark models, a specific one is chosen, corresponding to a gravitoninspired tensor with minimal couplings to the SM particles [11]. In the spin-2 boson rest frame, its polarisation states projected onto the parton collision axis can take only the values of ±2 for the gluon fusion (ggF) process and ±1 for the qq production process. For the spin-2 model studied in this analysis, only these two production mechanisms are considered. The Lagrangian L p 2 for a spin-2 minimal coupling model is defined as: where T p μν is the energy-momentum tensor, X μν 2 is the spin-2 particle field and V and f denote vector bosons (Z , W , γ and gluons) and fermions (leptons and quarks), respectively. The κ p are the couplings of the Higgs-like resonance to particles, e.g. κ q and κ g label the couplings to quarks and gluons, respectively.
With respect to the previous publication [3], the spin-2 analysis presented in this paper uses the MadGraph5_aMC@NLO [5] generator, which includes higher-order tree-level QCD calculations. As discussed in the following, these calculations have an important impact on the Higgs-boson transverse momentum p H T distribution, compared to the studies already performed using a Monte Carlo (MC) generator at leading order [6,7]. In fact, when κ q is not equal to κ g (non-universal couplings), due to orderα 3 S terms, a tail in the p H T spectrum appears. For leading-order (LO) effects, the qq and ggF production processes are completely independent, but the beyond-LO processes contain diagrams with extra partons that give rise to a term proportional to (κ q − κ g ) 2 , which grows with the centre-of-mass energy squared of the hard process (s) as s 3 /(m 4 2 ) (where m is the mass of the spin-2 particle), and leads to a large tail at high values of p H T . The distributions of some spin-sensitive observables are affected by this tail. For a more detailed discussion, see Ref. [8]. This feature appears in final states with at least one jet, which indeed signals the presence of effects beyond leading order. Therefore, the 1-jet category is analysed in addition to the 0-jet category in this paper, in order to increase the sensitivity for these production modes. Figure 1 shows the p H T distribution for the 0-and 1-jet final states at generator level after basic selection requirements (the minimum p T required for the jets used for this study is 25 GeV). Three different signal hypotheses are shown: one corresponding to universal couplings, κ g = κ q , and two examples of non-universal couplings. The tail at high values of p H T is clearly visible in the 1-jet category for the cases of non-universal couplings.
This p H T tail would lead to unitarity violation if there were no cutoff scale for the validity of the theory. By definition, in the context of the EFT approach, at a certain scale , new physics should appear and correct the unitarityviolating behaviour, even below the scale . There is a model-dependent theoretical uncertainty on the p T scale at which the EFT would be corrected by new physics: this uncertainty dictates the need to study benchmarks that use different p H T cutoffs, as discussed in the following subsection.

Choice of spin-2 benchmarks
Within the spin-2 model described in the previous section, a few benchmarks, corresponding to a range of possible sce-  Fig. 1 The distribution of the transverse momentum of the Higgs boson, p H T , at the Monte Carlo event-generator level for 0-jet (left) and 1-jet (right) final states. Three spin-2 signal hypotheses are shown: κ g = κ q = 1, κ g = 0.5, κ q = 1 and κ g = 1, κ q = 0. The last bin in each plot includes the overflow narios, are studied in this paper. In order to make sensible predictions for the spin-sensitive observables in the case of non-universal couplings, a cutoff on the Higgs-boson transverse momentum is introduced at a scale where the EFT is assumed to still be valid: this is chosen to be one-third of the scale , corresponding to p T < 300GeV. On the other hand, the lowest possible value up to which the EFT is valid by construction is the mass of the resonance itself; therefore it is important to study the effect of a threshold on p H T at 125 GeV.
Five different hypotheses are tested against the data: • universal couplings: κ g = κ q ; • κ g = 1 and κ q = 0, with two p H T cutoffs at 125 and 300 GeV; • κ g = 0.5 and κ q = 1, with two p H T cutoffs at 125 and 300 GeV.
The case κ g = 0 and κ q = 1 is not considered here, because it leads to a p H T distribution which disagrees with the data, as shown in the H → γ γ and H → Z Z differential cross-section measurements [12,13].

Spin-0 and CP-mixing theoretical models
In the case where the spin of the Higgs-like resonance is zero, there are several BSM scenarios that predict the parity of the Higgs particle to be either even or odd [14]. Another interesting possibility is that the Higgs-like resonance is not a CP eigenstate, but a mixture of CP-even and CP-odd states. This would imply CP violation in the Higgs sector, which is possible in the context of the Minimal Supersymmetric Standard Model [15] or of two Higgs-doublet models [16]. This CP violation might be large enough to explain the prevalence of matter over antimatter in the universe.
In the adopted EFT description, the scalar boson has the same properties as the SM Higgs boson, and its interactions with the SM particles are described by the appropriate operators. The BSM effects are expressed in terms of interactions with SM particles via higher-dimensional operators.
The effective Lagrangian L W 0 adopted for this study, in order to describe the interactions of W bosons with scalar and pseudoscalar states, is expressed as: where W μν = ∂ μ W ± ν − ∂ ν W ± μ , W μν = 1/2 · μνρσ W ρσ and μνρσ is the Levi-Civita tensor, while X 0 represents the spin-0 Higgs-boson field [8]. In the SM, the coupling of the Higgs boson to the W bosons is given by g HWW , while the angle α describes the mixing between CP-even and CP-odd states. The notation c α ≡ cos α, s α ≡ sin α is used in the Lagrangian. The dimensionless coupling parameters κ i are real and describe CP violation in the most general way. The parameter κ SM describes the deviations of the Higgs-boson coupling to the vector boson W from those predicted by the SM, while κ AWW and κ HWW are the BSM CP-odd and CP-even coupling parameters, respectively. 2 The mixing between the CP-even SM Higgs boson and the CP-even BSM Higgs boson can be achieved by changing the relative strength of the couplings κ SM and κ HWW . The cos α term multiplies both the SM and BSM CP-even terms in the Lagrangian and therefore its value does not change the relative importance of those contributions. This is different from the mixing of CP-even and CP-odd states, as a sin α term multiplies the CP-odd state in the Lagrangian. The last term of the Lagrangian is due to derivative operators which are relevant in the case one of the two vector bosons is off-shell.
The higher-dimensional operator terms in the Lagrangian are the terms that contain κ AWW and κ HWW and are suppressed by a factor 1/ . The SM Higgs boson is described by the first term of the Lagrangian, corresponding to the following choice of parameters: κ SM = 1, κ AWW = κ HWW = 0 and |c α | = 1. The derivative operator (the κ H∂W term) described in the Lagrangian of Eq. (2) would modify the results below the sensitivity achievable with the available data statistics. In fact, the effects on the kinematic distributions introduced by the derivative operator in the same range of variation of κ HWW are at most 10-20 % of the ones produced by κ HWW itself. Since the present analysis is barely sensitive to κ HWW , the even smaller κ H∂W variations are not studied further, and the corresponding term in the Lagrangian is neglected.

Choice of CP benchmarks
The following approach to study different CP hypotheses under the assumption of a spin-0 hypothesis is taken in this paper. First of all, in the fixed-hypothesis scenario, the cases where the observed resonance is a pure BSM CP-even or CP-odd Higgs boson are considered. In addition, the mixing between the CP-even SM and BSM CP-odd or CP-even Higgs bosons is studied. In the CP-odd case, the mixing depends on the value of κ AWW and on the mixing angle α. As can be deduced from Eq. (2), varying α or κ AWW has an equivalent effect on the kinematic variable distributions; therefore in this paper only the α parameter is varied while κ AWW is kept constant. The scan range of α covers the entire range from −π/2 to π/2 as the final state kinematic distributions differ for positive and negative values of α. On the other hand, the mixing between the CP-even SM and CP-even BSM Higgs bosons depends exclusively on the value of κ HWW and not on the value of α.
To summarise, four hypotheses are tested against the data in this paper (for the cutoff value = 1 TeV): • Compare the SM Higgs-boson case with the pure BSM CP-even case, defined as κ SM = 0, κ AWW = 0, κ HWW = 1, c α = 1. • Compare the SM Higgs-boson case with the BSM CPodd case, defined as κ SM = 0, κ AWW = 1, κ HWW = 0, c α = 0. • Scan over tan α: under the assumption of a mixing between a CP-even SM Higgs boson and a CP-odd BSM Higgs boson. The mixing parameter is defined as is the vacuum expectation value and tan α is the only variable term (corresponding to variations of c α between −1 and 1). The other parameters are set as follows: κ SM = 1, κ AWW = 1, κ HWW = 0. • Scan over κ HWW : under the assumption of a mixing between a CP-even SM Higgs boson and a CP-even BSM Higgs boson. The mixing parameter is defined as κ HWW /κ SM , whereκ HWW = (1/4) · (v/ ) · κ HWW and the only variable term is κ HWW (corresponding to variations ofκ HWW /κ SM between −2.5 and 2.5). For larger values of this ratio, the kinematic distributions of the final-state particles asymptotically tend to the ones obtained in presence of a pure CP-even BSM Higgs boson. The latter is used as the last point of the scan. The other parameters are set as follows: In the case of CP-mixing, only one MC sample is generated (see Sect. 3), and all other samples are obtained from it by reweighting the events on the basis of the matrix element amplitudes derived from Eq. (2). The precision of this procedure is verified to be better than the percent level. The mixing parameters used to produce this sample are chosen such that the kinematic phase space for all CP-mixing scenarios considered here was fully populated, and the values of the parameters are: In addition, it is interesting to study the case where the SM, the BSM CP-even and the CP-odd Higgs bosons all mix. Unfortunately, in the H → W W * channel, the present data sample size limits the possibility to constrain such a scenario, which would imply a simultaneous scan of two parameters tan α and κ HWW . In particular this is due to the lack of sensitivity in the κ HWW scan, consequently, as already stated, both the two and the three parameter scans, including in addition the derivative operators, are not pursued further. These studies are envisaged for the future.

ATLAS detector, data and MC simulation samples
This section describes the ATLAS detector, along with the data and MC simulation samples used for this analysis.

The ATLAS detector
The ATLAS detector [17] is a multipurpose particle detector with approximately forward-backward symmetric cylindrical geometry and a near 4π coverage in solid angle. 3 The inner tracking detector (ID) consists of a silicon-pixel detector, which is closest to the interaction point, a siliconstrip detector surrounding the pixel detector, both covering up to |η| = 2.5, and an outer transition-radiation straw-tube tracker (TRT) covering |η| < 2. The ID is surrounded by a thin superconducting solenoid providing a 2 T axial magnetic field.
A highly segmented lead/liquid-argon (LAr) sampling electromagnetic calorimeter measures the energy and the position of electromagnetic showers over |η| < 3.2. The LAr calorimeter includes a presampler (for |η| < 1.8) and three sampling layers, longitudinal in shower depth, up to |η| < 2.5. LAr sampling calorimeters are also used to measure hadronic showers in the end-cap (1.5 < |η| < 3.2) and both the electromagnetic and hadronic showers in the forward (3.1 < |η| < 4.9) regions, while an iron/scintillator tile sampling calorimeter measures hadronic showers in the central region (|η| < 1.7).
The muon spectrometer (MS) surrounds the calorimeters and is designed to detect muons in the pseudorapidity range |η| < 2.7. The MS consists of one barrel (|η| < 1.05) and two end-cap regions. A system of three large superconducting air-core toroid magnets provides a magnetic field with a bending integral of about 2.5 T·m (6 T·m) in the barrel (end-cap) region. Monitored drift-tube chambers in both the barrel and end-cap regions and cathode strip chambers covering 2.0 < |η| < 2.7 are used as precision measurement chambers, whereas resistive plate chambers in the barrel and thin gap chambers in the end-caps are used as trigger chambers, covering up to |η| = 2.4.
A three-level trigger system selects events to be recorded for offline analysis. The first-level trigger is hardware-based, while the higher-level triggers are software-based.

Data and Monte Carlo simulation samples
The data and MC simulation samples used in this analysis are a subset of those used in Ref. [9] with the exception of the specific spin/CP signal samples produced for this paper.
The data were recorded by the ATLAS detector during the 2012 LHC run with proton-proton collisions at a centreof-mass energy of 8 TeV, and correspond to an integrated luminosity of 20.3 fb −1 . This analysis uses events selected by triggers that required either a single highp T lepton or two leptons. Data quality requirements are applied to reject Footnote 3 continued is along the beam direction. Cylindrical coordinates (r, φ) are used in the plane transverse to the beam, with φ the azimuthal angle around the beam axis. Transverse components of vectors are indicated by the subscript T. The pseudorapidity is defined in terms of the polar angle θ as η = − ln tan(θ/2). The angular distance between two objects is defined as R = ( η) 2 + ( φ) 2 . events recorded when the relevant detector components were not operating correctly.
Dedicated MC samples are generated to evaluate all but the W +jets and multi-jet backgrounds, which are estimated using data as discussed in Sect. 5. Most samples use the Powheg [18] generator, which includes corrections at nextto-leading order (NLO) in α S for the processes of interest. In cases where higher parton multiplicities are important, Alpgen [19] or Sherpa [20] provide merged calculations at tree level for up to five additional partons. In a few cases, only leading-order generators (such as AcerMC [21] or gg2VV [22]) are available. Table 1 shows the event generator and production cross-section times branching fraction used for each of the signal and background processes considered in this analysis. The matrix-element-level Monte Carlo calculations are matched to a model of the parton shower, underlying event and hadronisation, using either Pythia6 [23], Pythia8 [24], Herwig [25] (with the underlying event modelled by Jimmy [26]), or Sherpa. Input parton distribution functions (PDFs) are taken from CT10 [27] for the Powheg and Sherpa samples and CTEQ6L1 [28] for the Alpgen + Herwig and AcerMC samples. The Drell-Yan (DY) sample (Z /γ * +jets) is reweighted to the MRST PDF set [29].
The effects of the underlying event and of additional minimum-bias interactions occurring in the same or neighbouring bunch crossings, referred to as pile-up in the following, are modelled with Pythia8, and the ATLAS detector response is simulated [30] using either Geant4 [31] or Geant4 combined with a parametrised Geant4-based calorimeter simulation [32].
For the signal, the ggF production mode for the H → W W * signal is modelled with Powheg + Pythia8 [33,34] at m H = 125GeV for the SM Higgs-boson signal in the spin-2 analysis, whereas MadGraph5_aMC@ NLO [5] is used for the CP analysis. The H + 0, 1, 2 partons samples are generated with LO accuracy, and subsequently showered with Pythia6. For the BSM signal, the Mad-Graph5_aMC@NLO generator is used in all cases. For the CP analysis, all samples (SM and BSM) are obtained by using the matrix-element reweighting method applied to a CP-mixed sample, as mentioned in Sect. 2.2.1, to provide a description of different CP-mixing configurations. The PDF set used is CTEQ6L1. To improve the modelling of the SM Higgs-boson p T , a reweighting scheme is applied that reproduces the prediction of the next-tonext-to-leading-order (NNLO) and next-to-next-to-leadinglogarithms (NNLL) dynamic-scale calculation given by the HRes2.1 program [35,36]. The BSM spin-0 Higgs-boson p T is reweighted to the same distribution.
Cross-sections are calculated for the dominant diboson and top-quark processes as follows: the inclusive W W cross-section is calculated to NLO with MCFM [37]; non- Table 1 Monte Carlo samples used to model the signal and background processes. The corresponding cross-sections times branching fractions, σ · B, are quoted at √ s = 8TeV. The branching fractions include the decays t → W b, W → ν, and Z → (except for the pro-cess Z Z → νν). Here refers to e, μ, or τ . The neutral current Z /γ * → process is denoted Z or γ * , depending on the mass of the produced lepton pair. The parameters κ g , κ q are defined in Sect. 2 MadGraph5_aMC@NLO + Pythia6 -Signal samples used in CP-mixing analysis resonant gluon fusion is calculated and modelled to LO in α S with gg2VV, including both W W and Z Z production and their interference; tt production is normalised to the calculation at NNLO in α S , with resummation of higher-order terms to NNLL accuracy, evaluated with Top++2.0 [38]; singletop-quark processes are normalised to NNLL, following the calculations from Refs. [39,40] and [41] for the s-channel, t-channel, and W t processes, respectively. The W W background and the dominant backgrounds involving top-quark production (tt and W t) are modelled using the Powheg + Pythia6 event generator [42][43][44][45]. For W W , WZ, and Z Z production via non-resonant vector boson scattering, the Sherpa generator provides the LO cross-section and is used for event modelling. The negligible vector-boson-scattering (VBS) Z Z process is not shown in Table 1 but is included in the background modelling for completeness. The process W γ * is defined as associ-ated W +Z /γ * production, containing an opposite-charge same-flavour lepton pair with invariant mass m less than 7 GeV. This process is modelled using Sherpa with up to one additional parton. The range m > 7 GeV is simulated with Powheg + Pythia8 and normalised to the Powheg cross-section. The use of Sherpa for W γ * is due to the inability of Powheg + Pythia8 to model invariant masses down to the production threshold. The Sherpa sample requires two leptons with p T > 5 GeV and | η | < 3. The jet multiplicity is corrected using a Sherpa sample generated with 0.5 < m < 7 GeV and up to two additional partons, while the total cross-section is corrected using the ratio of the MCFM NLO to Sherpa LO calculations in the same restricted mass range. A similar procedure is used to model Z γ * , defined as Z /γ * pair production with one sameflavour opposite-charge lepton pair having m ≤ 4 GeV and the other having m > 4 GeV.
The W γ and DY processes are modelled using Alpgen + Herwig with merged tree-level calculations of up to five jets. The merged samples are normalised to the NLO calculation of MCFM (for W γ ) or the NNLO calculation of DYNNLO [46] (for DY). The W γ sample is generated with the requirements p γ T > 8 GeV and R(γ , ) > 0.25. A Sherpa sample is used to accurately model the Z (→ )γ background. The photon is required to have p γ T > 8 GeV and R(γ , ) > 0.1; the lepton pair must satisfy m > 10 GeV. The cross-section is normalised to NLO using MCFM. Events are removed from the Alpgen + Herwig DY samples if they overlap with the kinematics defining the Sherpa Z (→ )γ sample.

Event selection
The object reconstruction in terms of leptons, jets, and missing transverse momentum, as well as the lepton identification and isolation criteria, which were optimised to minimise the impact of the background from misidentified isolated prompt leptons, are the same as described in detail in Ref. [9]: these aspects are therefore not discussed in this paper. The selection criteria and the analysis methodology used for the spin/CP studies described here are different however, since they are motivated not only by the need to distinguish the background processes from the Higgs-boson signal, but also by the requirement to optimise the separation power between different signal hypotheses. Thus, several selection requirements used in Ref. [9] are loosened or removed in the selection described below.
This section is organised in four parts. First, the event preselection is described, followed by the discussion of the spinand parity-sensitive variables. These variables motivate the choice of topological selection requirements in the 0-jet and 1-jet categories described in the last two sections. All selection criteria are summarised in Table 2 and the corresponding expected and observed event yields are presented in Table 3.

Event preselection
The W W → eνμν final state chosen for this analysis consists of eμ pairs, namely pairs of opposite-charge, differentflavour, identified and isolated prompt leptons. This choice is based on the expected better sensitivity of this channel compared to the same-flavour channel, which involves a large potential background from Z /γ * → ee/μμ processes. The preselection requirements are designed to reduce substantially the dominant background processes to the Higgs-boson signal (see Sect. 5) and can be summarised briefly as follows: • The leading lepton is required to have p T > 22 GeV to match the trigger requirements.
• The subleading lepton is required to have p T > 15 GeV.
• The mass of the lepton pair is required to be above 10 GeV. • The missing transverse momentum in the event is required to be p miss T > 20 GeV. • The event must contain at most one jet with p T > 25 GeV and |η| < 4.5. The jet p T is required to be higher than 30 GeV in the forward region, 2.4 < |η| < 4.5, to minimise the impact of pile-up.
This analysis considers only eμ pairs in the 0-jet and 1-jet categories for the reasons explained in Sect. 1. Each category is analysed independently since they display rather different background compositions and signal-to-background ratios.

Spin-and CP-sensitive variables
The shapes of spin-and CP-sensitive variable distributions are discussed in this section for the preselected events. Figures 2 and 3 show the variables used to discriminate different spin-2 signal hypotheses from the SM Higgsboson hypothesis for the 0-jet and the 1-jet category, respectively. For both the 0-jet and the 1-jet categories, the most sensitive variables are p T (transverse momentum of the Table 3 Expected event yields in the signal regions (SR) for the 0and 1-jet categories (labelled as 0j and 1j, respectively). For the dominant backgrounds, the expected yields are normalised using the control regions defined in Sect. 5. The expected contributions from various processes are listed, namely the ggF SM Higgs-boson production (N ggF ), and the background contribution from W W (N W W ), top quark (top-quark pairs N tt , and single-top quark N t ), Drell-Yan Z /γ * to τ τ (N DY,τ τ ), misidentified leptons (N W +jets ), W Z/Z Z/W γ (N VV ) and Drell-Yan Z /γ * to ee/μμ (N DY,SF ). The total sum of the backgrounds (N bkg ) is also shown together with the data. Applying the p H T requirement in the 0-jet category does not change substantially the event yields, while it has an effect in the 1-jet category, as expected. The errors on the ratios of the data over total background, N bkg , only take into account the statistical uncertainties on the observed and expected yields  hypotheses, namely J P = 2 + , κ g = 0.5, κ q = 1 (dashed yellow line), J P = 2 + , κ g = 1, κ q = 0 (blue dashed line) and J P = 2 + , κ g = κ q (green dashed line). The expected shapes for the sum of all backgrounds, including the data-derived W +jets background, is also shown (solid black line). The last bin in each plot includes the overflow dilepton system), m , φ (φ angle between the two leptons) and m T (transverse mass of the dilepton and missing momentum system). These variables are the same as those used for the spin-2 analysis in the previous publication [3].
Similarly, Figs. 4 and 5 show the the variables that best discriminate between an SM Higgs boson and a BSM CP-even or CP-odd signal, respectively. The BSM CP-even variables are the same as those used in the spin-2 analysis, apart from the p miss T variable which is substituted for m T . The variables for the CP-odd analysis are m , E νν , p T , φ , where E νν = p 1 T − 0.5 p 2 T + 0.5 p miss T , p 1 T and p 2 T are respectively the transverse momenta of the leading and subleading leptons, and p T is the absolute value of their difference.
The CP-mixing analysis studies both the positive and negative values of the mixing parameter, as explained in Sect. 2.2.2. In the BSM CP-even benchmark scan, for negative values of the mixing parameter, interference between the SM and BSM CP-even Higgs-boson couplings causes a cancellation that drastically changes the shape of the The expected shapes for the sum of all backgrounds, including the data-derived W +jets background, is also shown (solid black line). The last bin in each plot includes the overflow While for positive values ofκ HWW /κ SM (Fig. 6, left) and for the SM Higgs-boson hypothesis, the φ distribution peaks towards low values, when reaching the maximum of the interference (at aboutκ HWW /κ SM ∼ −1), the mean of the φ distribution slowly moves towards higher values. This significantly improves the separation power between the SM and the BSM CP-even Higgs-boson hypotheses (Fig. 6,  right). For values ofκ HWW /κ SM < −1, the peak of distribution gradually moves back to low values of φ , as in the case of the SM Higgs-boson hypothesis. The sum of the backgrounds is also shown on the same figure. The other CP-sensitive variables exhibit a similar behaviour in this specific region of parameter space. The impact of this feature on the results is discussed in Sect. 9.3. 4.3 Event selection in the 0-jet and 1-jet categories Table 2 summarises the preselection requirements discussed in Sect. 4.1, together with the selections applied specifically to the 0-jet and 1-jet categories. These selection requirements are optimised in terms of sensitivity for the different spin and CP hypotheses studied while maintaining the required rejection against the dominant backgrounds. In general, they are looser than those described in Ref. [9], which were optimised for the SM Higgs boson.
Some of these looser selection requirements are applied to both the 0-jet and 1-jet categories: • The mass of the lepton pair, m , must satisfy m < 80 GeV, a selection which strongly reduces the dominant WW continuum background.
Events in the 0-jet category are required to also satisfy p T > 20 GeV, while events in the 1-jet category, which suffer potentially from a much larger background from topquark production, must also satisfy the following requirements: • Using the direction of the missing transverse momentum a τ -lepton pair can be reconstructed with a mass m τ τ by applying the collinear approximation [48]; m τ τ is required to pass the m τ τ < m Z − 25 GeV requirement to reject Z /γ * → τ τ events.
where φ is the angle between the lepton transverse momentum and p miss T , is required to satisfy m T > 50 GeV to reject the W +jets background.
• The total transverse mass of the dilepton and missing transverse momentum system, m T , is required to satisfy m T < 150 GeV.
For alternative spin-2 benchmarks with non-universal couplings, as listed in Sect. 2.1.2, an additional requirement on the reconstructed Higgs-boson transverse momentum p H T is applied in the signal and control regions for all MC samples and data. The p H T variable is reconstructed as the transverse component of the vector sum of the four-momenta of both leptons and the missing transverse energy. Table 3 shows the number of events for data, expected SM signal and the various background components after event selection. The background estimation methods are described in detail in Sect. 5. Good agreement is seen between the observed numbers of events in each of the two categories and the sum of the total background and the expected sig-nal from an SM Higgs boson. The 0-jet category is the most sensitive one with almost three times larger yields than the 1jet category. As expected, however, the requirements on p H T affect mostly the 1-jet category, which is sensitive to possible tails at high values of p H T , as explained in Sect. 2.1.2. Figures 7 and 8 show the distributions of discriminating variables used in the analysis after the full selection for the 0-jet and 1-jet categories, respectively. These figures show reasonable agreement between the data and the sum of all expected contributions, including that from the SM Higgs boson.

Backgrounds
The background contamination in the signal region (SR) is briefly discussed in the previous section. This section is dedicated to a more detailed description of backgrounds and their determination. The following physics processes relevant for this analysis are discussed: • W W : non-resonant W -boson pair production; • top quarks (labelled as Top): top-quark pair production (tt) and single-top-quark production (t); • misidentified leptons (labelled as W +jets): W -boson production, in association with a jet that is misidentified as a lepton, and dijet or multi-jet production with two misidentifications; • Z /γ * decay to τ τ final states.
Other smaller backgrounds, such as non-W W dibosons (W γ , W γ * , WZ and Z Z) labelled as V V in the following, as well as the very small Z /γ * → ee or μμ contribution, are estimated directly from simulation with the appropriate theoretical input as discussed in Sect. 3.
The dominant background sources are normalised either using only data, as in the case of the W +jets background, or using data yields in an appropriate control region (CR) to normalise the MC predictions, as for W W , Z /γ * → τ τ and top-quark backgrounds. The event selection in control regions is orthogonal to the signal region selection but as close as possible to reduce the extrapolation uncertainties from the CRs to the SR. The requirements that define these regions are listed in Table 4.
The control regions, for example the W W CR, are used to determine a normalisation factor, β, defined by the ratio of the observed to expected yields of W W candidates in the CR, where the observed yield is obtained by subtracting the non-W W contributions from the data. The estimate B est SR for the background under consideration, in the SR, can be written as:  The extrapolation factor α has uncertainties which are common to all MC-simulation derived backgrounds: • uncertainty due to higher perturbative orders in QCD not included in the MC simulation, evaluated by varying the renormalisation and factorisation scales by factors onehalf and two; • uncertainty due to the PDF choice, estimated by taking the largest difference between the nominal PDF set (e.g. CT10) and two alternative PDF sets (e.g. MSTW2008 [49] and NNPDF2.3 [50]), with the uncertainty determined from the error eigenvectors of the nominal PDF set added in quadrature; • uncertainty due to modelling of the underlying event, hadronisation and parton shower (UE/PS), evaluated by comparing the predictions from the nominal and alternative parton shower models, e.g. Pythia and Herwig.
The section is organised as follows. Section 5.1 describes the W W background -the dominant background in both the 0-and 1-jet categories. Section 5.2 describes the background from the top-quark production, the second largest background in the 1-jet category. The Z /γ * → τ + τ − background Table 5 Theoretical uncertainties (in %) on the extrapolation factor α for W W , top-quark and Z /γ * → τ τ backgrounds. "Total" refers to the sum in quadrature of all uncertainties. The negative sign indicates anti-correlation with respect to the unsigned uncertainties for categories in the same column. The uncertainties on the top-quark background extrapolation factor in the 0-jet category are discussed in Sect. 5 is described in Sect. 5.3, while the data-derived estimate of the W +jets background is briefly described in Sect. 5.4. The extrapolation factor uncertainties are summarised in Table 5.
More details can be found in Ref. [9].

Non-resonant W -boson pairs
Non-resonant W -boson pair production is the dominant (irreducible) background in this analysis. Only some of the kinematic properties allow resonant and non-resonant production to be distinguished. The W W background is normalised using a control region which differs from the signal region in having a different range of dilepton invariant mass, m . The leptons from non-resonant W W production tend to have a larger opening angle than the resonant W W production. Furthermore, the Higgs-boson mass is lower than the mass of the system formed by the two W bosons. Thus, the nonresonant W W background is dominant at high m values. The 0-jet W W control region is defined after applying the p T criterion by changing the m requirement to 80 < m < 150 GeV. The 1-jet W W control region is defined after the m T criterion by requiring m > 80 GeV. The purity of the W W control region is expected to be 69 % in the 0jet category and 43 % in the 1-jet category. Thus, the dataderived normalisation of the main non-W W backgrounds, the top-quark and Drell-Yan backgrounds, is applied in the W W CR as described in the following two subsections. Other small backgrounds are normalised using MC simulation. The CR normalisation is applied to the combined W W estimate independently of the production (qq, qg or gg) process. The φ and m distributions in the W W control region are shown in Fig. 9 for the 0-jet and 1-jet final states.
Apart from the sources discussed in the previous section, the extrapolation factor α has uncertainties due to the generator choice, estimated by comparing the Powheg + Herwig and aMC@NLO + Herwig generators, and due to higher-order electroweak corrections determined by reweighting the MC simulation to the NLO electroweak calculation. All uncertainties are summarised in Table 5.

Top quarks
The top-quark background is one of the largest backgrounds in this analysis. Top quarks can be produced in pairs (tt) or individually in single-top processes in association with a W boson (W t) or lighter quark(s) (single-t). The topquark background normalisation from data is derived independently of the production process.
For the 0-jet category, the control region is defined by applying the preselection cuts including the missing transverse momentum threshold, with an additional requirement of φ < 2.8 to reduce the Z /γ * → τ τ background. The top-quark background 0-jet CR is inclusive in the number of jets and has a purity of 74 %. The extrapolation parameter α is determined as described in Eq. (3). The value of α is corrected using data in a sample containing at least one b-tagged jet [9].
The resulting normalisation factor is 1.08 ± 0.02 (stat.). The total uncertainty on the normalisation factor is 8.1 %. The total uncertainty includes variations of the renormalisation and factorisation scales, PDF choice and parton shower model. Also the uncertainty on the tt and W t production cross-sections and on the interference of these processes is included. An additional theoretical uncertainty is evaluated on the efficiency of the additional selection after the jet-veto requirement. Experimental uncertainties on the simulationderived components are evaluated as well.
In the 1-jet category, the top-quark background is the second leading background, not only in the signal region  Fig. 10.
The extrapolation uncertainty is estimated using the above mentioned sources of theoretical uncertainties and the additional uncertainties specific to the top-quark background: tt and single-top cross-sections and the interference between single and pair production of top quarks. A summary of the uncertainties is given in Table 5.

Drell-Yan
The Drell-Yan background is dominated by Z /γ * → τ τ events with τ -leptons decaying leptonically. The Z /γ * → τ τ 0-jet control region is defined by applying the preselection requirements, adding m < 80 GeV and reversing the φ criterion, φ > 2.8. The purity of this control region is expected to be 90 %. The Z /γ * → τ τ 1-jet control region is defined by applying the preselection requirements, b-veto, m T > 50 GeV as in the signal region but requiring |m τ τ − m Z | < 25 GeV. The purity of the 1-jet control region is about 80 %. The Z /γ * → τ τ predictions in the 0-and 1-jet categories are estimated using the extrapolation from the control region to the signal region and to the W W control region, as there is a 4-5 % contamination of Z /γ * → τ τ events in the W W control region. The φ and m distributions in the Z /γ * → τ τ control region are shown in Fig. 11 for the 0-jet and 1-jet final states.
A mismodelling of the transverse momentum of the Z boson p Z T , reconstructed as p T , is observed in the DYenriched region. The mismodelling is more pronounced in the 0-jet category. The Alpgen + Herwig MC generator does not adequately model the parton shower of the soft jets which balance p T in events with no selected jets. A correction, based on weights derived from a data-to-MC comparison in the Z mass peak, is therefore applied to MC events in bins of p T in the 0-jet category. The weights are applied to p Z T at generator-level for all lepton flavour decays. Apart from the above mentioned sources of theoretical uncertainties, one additional uncertainty on the p Z Treweighting in the 0-jet category is estimated by comparing the difference between the nominal (derived in the Z mass peak) and the alternative (derived in the Z mass peak but after the p miss T > 20 GeV criterion) set of weights. All uncertainties are summarised in Table 5.

Misidentified leptons
The W +jets background is estimated in the same way as in Ref. [9], where a detailed description of the method can be found. The W +jets control sample contains events where one of the two lepton candidates satisfies the identification and isolation criteria for the signal sample, and the other lepton fails to meet these criteria but satisfies less restrictive criteria (these lepton candidates are called "anti-identified"). Events in this sample are otherwise required to satisfy all of the signal selection requirements. The dominant component of this sample (85-90 %) is due to W +jets events in which a jet produces an object reconstructed as a lepton. This object may be either a non-prompt lepton from the decay of a hadron containing a heavy quark, or a particle (or particles) originating from a jet and reconstructed as a lepton candidate.
The W +jets contamination in the signal region is obtained by scaling the number of events in the data control sample by an extrapolation factor. This extrapolation factor is measured in a data sample of jets produced in association with Z bosons reconstructed in either the ee or μμ final state (referred to as the Z +jets control sample below). The factor is the ratio of the number of identified lepton candidates satisfying all lepton selection criteria to the number of anti-identified leptons measured in bins of anti-identified lepton p T and η. Each number is corrected for the presence of processes other than Z +jets.
The composition of the associated jets -namely the fractions of jets due to the production of heavy-flavour quarks, light-flavour quarks and gluons -in the Z +jets sample and the W +jets sample are different. Monte Carlo simulation is used to correct the extrapolation factors and to determine the associated uncertainty. Other important uncertainties on the Z +jets extrapolation factor are due to the limited number of jets that meet the lepton selection criteria in the Z +jets control sample and the uncertainties on the contributions from other physics processes.
The total systematic uncertainty on the corrected extrapolation factors varies as a function of the p T of the antiidentified lepton; this variation is from 29 to 61 % for antiidentified electrons and from 25 to 46 % for anti-identified muons. The systematic uncertainty on the corrected extrapolation factor dominates the systematic uncertainty on the W +jets background.

BDT analysis
Both the spin and the CP analysis employ a BDT algorithm 4 to distinguish between different signal hypotheses. In all cases, two discriminants are trained to separate the signals from each other, or from the various background components, using the discriminating variables described in Sect. 4.2. The resulting two-dimensional BDT output is then used to construct a binned likelihood, which is fitted to the data to test its compatibility with the SM or BSM Higgs hypotheses, using the fit procedure presented in Sect. 7.
Before the training, the same preselection and some of the selection cuts listed in Table 2 are applied to data and on all MC predictions for background and signal. The addi- 4 A decision tree is a collection of cuts used to classify events as signal or background. The classification is based on a set of discriminating variables (BDT input variables) on which the algorithm is trained. The input events are repeatedly split using this information. At each split, the algorithm finds the variable and the optimal selection cut on this variable, that give the best separation between signal and background. Finally, an overall output weight (BDT output) is assigned to each event: the larger the weight, the more signal-like the event is classified to be. More details can be found in Ref. [10]. tional selection requirements adopted for both the 0-and 1-jet categories are m < 100 GeV and on p H T for the spin-2 non-universal coupling models. The loosening of the m requirement with respect to the one applied in the full event selection is meant to increase the number of MC events for training. In the 0-jet category a requirement p T > 20 GeV is applied while the φ cut is omitted, whereas the latter is needed in the 1-jet category due to the large DY background. All background samples are used in the training and each one is weighted by the corresponding production cross-section.

Spin analysis
The spin analysis presented here follows closely the strategy of Ref. [3] for the 0-jet category, while the 1-jet category has been added and is treated likewise. For each category, one BDT discriminant (called BDT 0 in the following) is trained to discriminate between the SM hypothesis and the background, and a second one (BDT 2 ) to discriminate between the alternative spin-2 hypotheses and the background. This results in five BDT 2 trainings for the alternative spin-2 models defined in Sect. 2.1.2 and one BDT 0 training for the SM Higgs boson.
The distributions of the input variables used for BDT 0 and BDT 2 in the 0-jet and 1-jet categories, respectively, are shown in Figs. 2 and 3 (see Sect. 4.2).
The BDT discriminant distributions (also referred to as BDT output distributions) for the 0-jet and 1-jet signal region are shown in Figs. 12 and 13 for the case of universal couplings and of non-universal ones with p H T < 125 GeV, respectively. The plots for non-universal couplings and p H T < 300 GeV are very similar to the ones obtained using the requirement p H T < 125 GeV except for the BSM signal distribution. The SM Higgs signal is normalised using the SM Higgs-boson production cross-section. Good agreement between data and MC simulation is observed in those distributions once the SM signal is included.

CP analysis
The CP analysis -which includes both the fixed-hypothesis test and the CP-mixing scan -uses only the 0-jet category. In this case as well, two BDT discriminants are trained: the first, BDT 0 , is identical to the one described above for the spin analysis (SM Higgs-boson signal versus background, using m , p T , φ and m T as input variables, as shown in Fig. 2). The second BDT, however, called BDT CP in the following, is trained to discriminate between the SM signal and signal for the alternative hypothesis without any background component. The training obtained using the two pure CP-even or CP-odd hypotheses is then applied to all the CP-mixing scenarios. As described in Sect CP-even scenario, as shown in Fig. 4, and m , φ , E νν and p T for the CP-odd scenario, as shown in Fig. 5. The different training strategy adopted for BDT CP and BDT 2 is motivated by the intrinsic difference between the spin and CP analyses: while, in the former case, the spin-2 signal is more background-like (its shape is similar to that of the dominant W W background), in the latter case, the different signal hypotheses result in shapes of the input variable distributions which are quite similar to each other, while they remain different from the background shape. Therefore, for the CP analysis, the best separation power is obtained by training BDT CP to discriminate between the SM and BSM hypotheses.
The BDT CP output distributions for the SM versus BSM CP-odd and CP-even hypotheses are shown in Fig. 14. Good agreement between data and MC simulation is also found in this case, once the SM Higgs-boson signal is included.

Fit procedure
This section discusses the statistical approach adopted in this paper. First, the rebinning of the two-dimensional BDT output distribution is discussed. The rebinning is applied for both analyses: the fixed-hypothesis tests and the CP-mixing anal-ysis. Afterwards the statistical procedure for the individual analyses is presented.
The two-dimensional BDT 0 × BDT 2 output (or BDT 0 × BDT CP for the CP analysis) distribution is unrolled row by row to a one-dimensional distribution. After the unrolling, bins with less than one background event are merged. The latter threshold is applied to the sum of weighted background events, i.e. after the normalisation to the corresponding crosssection and luminosity and the application of the post-fit scale factors to the background processes. This is done independently in the 0-jet and 1-jet categories and for all benchmarks and scans where a retraining of the BDT has occurred. Such a procedure is not intended to improve the expected sensitivity per se, rather to stabilise the fit in the presence of a large number of free parameters.

Procedure for the fixed-hypothesis test
The statistical analysis of the data employs a binned likelihood L(ε, μ, θ ) constructed with one parameter of interest, ε, which represents the fraction of SM Higgs-boson events with respect to the expected signal yields, and can assume only discrete values ε = 0 (for the alternative ALT hypothesis) and ε = 1 (for the SM hypothesis).  L(ε, μ, θ ), summing over the bins (N bins ) of the unrolled BDT output distributions, per jet category in the spin-2 analysis case. S SM,i and S ALT,i are the signal yields for the SM and alternative hypothesis, respectively, while B i refers to the total background. Systematic uncertainties are represented through the N sys nuisance parameters θ , constrained by the auxiliary measurements A(θ |θ), whereθ is the central value of the measurement. The full likelihood can then be written as: The analysis is designed to rely on shape information to distinguish between different signal hypotheses. The overall signal normalisation μ is obtained from the fit and, in the case of the spin analysis, as a combination over both jet categories. Further details of the various likelihood terms can be found in Ref. [9]. The compatibility of the data and two signal hypotheses is then estimated using a test statistic defined as: For both the numerator and denominator, the likelihood is maximised independently over all nuisance parameters to obtain the maximum likelihood estimatorsμ andθ. Pseudoexperiments for the two hypotheses (ε = 0, 1) are used to obtain the corresponding distributions of the test statistic q and subsequently to evaluate the p values, which define the expected and observed sensitivities for various hypotheses. The expected p values are calculated using the fitted signal strength in data, p SM exp, μ=μ for the SM hypothesis, and p ALT exp, μ=μ for the alternative hypothesis. In addition, for the SM hypothesis the expected p value fixing the signal normalisation to the SM prediction, p SM exp, μ=1 , is given. The observed p values, p SM obs and p ALT obs , are defined as the probability of obtaining a q value smaller (larger) than the observed value under the SM (alternative) signal hypothesis. Pseudoexperiments are needed because the asymptotic approximation [51] does not hold when the parameter of interest, ε in this case, takes only discrete values (0 or 1), and in particular −2 ln(L) does not follow a χ 2 distribution.
The confidence level (CL) for excluding an alternative BSM hypothesis in favour of the SM is evaluated by means of a CL estimator [52]: which normalises the rejection power of the alternative hypothesis, p ALT , to the compatibility of the data with the SM case, 1 − p SM .

Procedure for CP-mixing analysis
The likelihood definition for the CP-mixing analysis is the same as for the spin analysis, with ε = 1 corresponding to the SM signal hypothesis and ε = 0 corresponding to the alternative CP hypothesis. Whereas for the fixed-hypothesis test, the sensitivities are estimated by means of pseudo-experiments and follow the procedure explained above, for the CP-mixing analysis, the simpler asymptotic approximation is used, since the fraction of BSM signal events is now considered a continuous parameter. Results using the asymptotic approximation are cross-checked with pseudo-data for a few values of the scan parameter. The fits to data and to the MC expectation under the SM hypothesis are performed for each value of the scan parameter. Two fits to the SM expectation are evaluated: fixing the signal normalisation to the SM expectation and to the observed SM signal normalisation. From the fit, the value of the log-likelihood (LL) is extracted, as a function of the CP-mixing fraction. The maximum of the LL curve is determined and its difference from all other values is computed, −2 LL. The 1σ and 2σ confidence levels are then found at −2 LL = 1 and −2 LL = 3.84, respectively.

Systematic uncertainties
This section describes the systematic uncertainties considered in this analysis, which are divided into two categories: experimental uncertainties and theoretical ones which affect the shape of the BDT output distribution. The systematic uncertainties specific to the normalisation of individual backgrounds are described in Sect. 5.

Experimental uncertainties
The jet-energy scale and resolution and the b-tagging efficiency are the dominant sources of experimental uncertainty in this category, followed by the lepton resolution, identification and trigger efficiencies and the missing transverse momentum measurement. The latter is calculated as the negative vector sum of the momentum of objects selected according to the ATLAS identification algorithms, such as leptons, photons, and jets, and of the remaining soft objects (referred  Pile-up The number of pile-up events is varied by 10 % Luminosity 2.8 % [53] to as soft terms in the following) that typically have low values of p T [9]. The various systematic contributions taken into account in the analysis are listed in Table 6. More information on the experimental systematic uncertainties can be found in Ref. [9].
In the likelihood fit, the experimental uncertainties are varied in a correlated way across all backgrounds and across signal and control regions, so that the uncertainties on the extrapolation factors α described in Sect. 5 are correctly propagated. All sources in Table 6 are analysed to evaluate their impact on both the yield normalisation and on the shape of the BDT discriminant distributions. Shape uncertainties are ignored if they are smaller than 5 % (smaller than the statistical uncertainty) in each bin of the distributions under study. Normalisation uncertainties are ignored as well if they are below 0.1 %.

Modelling uncertainties
The dominant background is SM W W production, and therefore uncertainties on the shape and yield in the signal region for this background require special attention. The uncertainties on the W W normalisation are discussed in Sect. 5.1; the shape uncertainties are addressed in this section.
An important uncertainty arises from the modelling of the shape of the W W background in the signal region, which is obtained using the same procedure adopted in the evaluation of the theoretical uncertainty on the W W extrapolation parameter. The scale uncertainty on the MC prediction of the BDT discriminants was studied by varying the factorisation and renormalisation scales up and down by a factor of two. The parton shower and generator uncertainties are estimated by comparing the Herwig and Pythia par-ton shower programs and by comparing Powheg + Herwig and aMC@NLO + Herwig, respectively. Finally, the PDF uncertainty is estimated by combining the CT10 PDF error set with the difference between the central values of NNPDF2.3 and CT10. The procedure is repeated for each of the final BDT output distributions and for each benchmark of the spin and parity analyses.
Modifications to the shape of the final BDT distribution from PDF and scale variations are found to be negligible, and well within the statistical uncertainty of the Monte Carlo predictions. Therefore they are included in the fit model only as overall normalisation effects. The parton shower and generator uncertainties were found to be statistically significant; therefore, a bin-by-bin shape uncertainty is applied.
The interference between the gg → W W and the gg → H processes is not taken into account in this study because of its negligible effect. In fact it results in a 4 % decrease in the total yield of events after the selection criteria and is of the same order as in Ref. [9]. These results confirm the expectations in Ref. [54].
The signal final-state observables are affected by the underlying Higgs-boson p T distribution. The Higgs-boson p T distribution for a spin-0 particle is given by the p H Treweighted Powheg + Pythia generator prediction as mentioned in Sect. 3. All spin-0 samples are reweighted to the same p H T distribution to avoid any impact of the difference in the Higgs-boson p T predictions between Mad-Graph5_aMC@NLO and Powheg on the CP-analysis results. No additional shape uncertainty is considered. For the spin-2 benchmarks no theoretical uncertainties on the Higgs-boson p H T are considered, because they are negligible compared to the effect of the choice of p H T requirement in the non-universal couplings models.

Ranking of systematics
The impact of each systematic variation on the CL s estimator gives the measure of the relevance of the systematic uncertainty on the obtained result. The systematic uncertainties that are found to be most important in the various fixedhypothesis tests are listed for the different cases in Table 7.
The W W modelling uncertainty dominates in all three benchmarks, and another common large uncertainty is due to the W +jets background estimate. The spin-2 and CP-odd analyses are affected by the Z /γ * → τ τ modelling uncertainty. In addition, the CP-odd analysis is impacted by the modelling uncertainties on the non-W W background. The impact of systematics on the CL s estimator is larger for the CP-even case than for other benchmarks because of the lower sensitivity of the CP-even analysis.

Results
The results of the studies of the spin and parity quantum numbers are presented in this section. The SM J P = 0 + hypothesis is tested against several alternative spin/parity hypotheses, and the mixture of the SM Higgs and a BSM CP-even or CP-odd Higgs bosons is studied by scanning all possible mixing combinations.
This section is organised as follows. The event yields and the BDT output distributions after the fit to data are presented in Sect. 9.1. The results of the fixed-hypotheses tests for spin-2 benchmarks are discussed in Sect. 9.2 and the results for spin-0 and CP-mixed tests are shown in Sect. 9.3.

Yields and distributions
The post-fit yields for all signals and backgrounds are summarised in Table 8 for the spin and CP analyses. They account for changes in the normalisation factors and for pulls of the nuisance parameters. All the systematic uncertainties discussed in Table 5 and Sect. 8 are included in the fit. The fitted signal yields vary significantly in the BSM scenarios because of the differences in the shapes of the input variable distributions between the benchmark models. A striking example is given by the benchmark models with non-universal couplings: the fitted signal yield varies considerably between the p H T < 125 GeV and p H T < 300 GeV selections because of the presence of the tail at high p H T values discussed in Sect. 2.1.1. The yield fitted under the SM hypothesis, 270 ± 70 events (see Table 8), is in good agreement with the signal expectation of 238 events, corresponding to the ggF signal strength measured in Ref. [9].

Spin-2 results
The compatibility of the spin-2 signal model with the observed data is calculated following the prescription explained in Sect. 7.1 for five different benchmarks discussed in Sect. 2.1.2. The expected distributions of the test statistic q, derived from pseudo-experiments, are shown for the universal couplings case in Fig. 15 for 0-and 1-jet combined. The q distributions are symmetric and have no overflow or underflow bins. The expected and observed significances and CL s are summarised in Table 9. The expected significance p SM exp, μ=μ using the observed SM normalisation is higher than p SM exp, μ=1 , because the observed SM yields in Table 8 are larger than the expected SM yields in Table 3. The SM hypothesis is favoured in all tests in data and the alternative model is disfavoured at 84.5 % CL for the model with universal couplings and excluded at 92.5-99.4 % CL for the benchmark models with non-universal couplings. The exclusion limits for non-universal couplings are stronger for a p H T Table 8 Post-fit event yields for the 0-and 1-jet categories for various signal hypotheses. The number of events observed in data, the signal and the total background yields, including their respective post-fit systematic uncertainties, are shown in the top part of the table, assuming in each case the alternative signal hypothesis. The spin-2 κ g = κ q benchmark is used as an example in the bottom part of the  Fig. 15 Test-statistic distribution for the spin-2 benchmark with universal couplings (κ g = κ q ) including all systematic uncertainties, with 0-and 1-jet categories combined. The median of the expected distributions for the SM (dashed red line) and the spin-2 Higgs-boson signal (dashed blue line) is also shown, together with the observed result (solid black line) from the fit to the data. The shaded areas are used to compute the observed p values cut above 300 GeV because of the enhanced sensitivity at high values of the Higgs-boson p T .
The one-dimensional distribution of the unrolled post-fit BDT output distribution is presented in Fig. 16 for the κ g = 1, κ q = 0 and p H T < 125 GeV scenario in the 0-jet case. The distributions are shown for the SM and alternative signal hypotheses separately and compared with the data after the subtraction of all backgrounds. Both the signal and background yields are normalised to the post-fit values. The distributions are ordered in terms of increasing signal yield and, for visualisation purposes, only contain bins that have at least three signal events and a signal-to-background ratio of at least 0.02.

Spin-0 and CP-mixing results
Similar to the spin-2 fixed-hypothesis tests, the CP-even BSM Higgs and the CP-odd BSM Higgs-boson hypotheses are tested against the SM Higgs-boson hypothesis. The expected distributions of the test statistic q, derived from pseudo-experiments for the SM versus BSM CP-odd and CPeven pure states, are shown in Fig. 17. The distributions are symmetric and have no overflow or underflow bins. The overlap of the test-statistic distributions for the SM hypothesis and the alternative hypothesis indicates the sensitivity of the analysis to distinguish them. The expected sensitivity is higher for the CP-odd hypothesis than for the CP-even hypothesis. The expected and observed significances and CL s values are summarised in Table 9 Fig. 18. These distributions show the one-dimensional unrolled BDT output for the SM and alternative signal hypotheses separately and compare them with the data after background subtraction. Both the signals and the background yields are normalised to the post-fit values. The distributions are ordered by increasing signal, and they contain bins that have at least three signal events and are above a signal-to-background threshold (S/B) of 0.035. As already mentioned above, these plots are intended for illustrative purposes only. The figure shows that the SM Higgs-   Fig. 16 The unrolled one-dimensional BDT output after background subtraction and using post-fit normalisations, in the case of the spin-2 benchmark with non-universal couplings (κ g = 1, κ q = 0), requiring the Higgs-boson p T to be below 125 GeV. The background yields are taken from the fit results, assuming the SM signal hypothesis in the left-hand plot, and the alternative spin-2 hypothesis in the right-hand plot boson hypothesis is preferred over the pure BSM CP-even or CP-odd cases. The S/B ratio used for the CP analysis is higher than the one used for the spin-2 analysis because on average the bins with the highest significance have a higher S/B in the CP-mixing than in the spin-2 BDT output.
The compatibility of the CP-mixed signal plus background with the observed data is calculated following the prescription explained in Sect. 7.2 for the two different scans (mixing of an SM Higgs boson with a BSM CP-even or CP-odd boson) as discussed in Sect. 2.2.1. The scan results are presented in Fig. 19.
In the case of the BSM CP-odd mixing scan (top row of Fig. 19), the expected and observed curves are slightly asymmetric, but the sensitivity to the sign of the scan parameter is small. Due to higher observed yields for the SM hypothesis, the expected curve using the observed yields (μ =μ) is above the expected curve for the yields fixed to the SM expectation (μ = 1). The minimum of the −2 LL curve is very broad and lies at −0.2. The value at 0 corresponds to the SM hypothesis. The values of (κ AWW /κ SM ) · tan α below −6 and above 5 can be excluded at 95 % CL, while values below −1.6 and above 1.3 at 68 % CL. The fitted signal yields and their relative uncertainties, for the SM and alternative signal hypotheses, are very stable throughout the scan. They are given in Table 8 for the fixed-hypothesis case.
The plot on the bottom of Fig. 19 shows the result of the BSM CP-even scan as a function ofκ HWW /κ SM . The separa-  Fig. 18 The unrolled one-dimensional BDT output after background subtraction in the case of the pure BSM CP-odd (top) and BSM CPeven (bottom) benchmarks. The background yields are taken from the fit results, assuming the SM signal hypothesis in the left-hand plots, and the alternative hypothesis in the right-hand plots tion power between the SM Higgs-boson hypothesis and the BSM CP-even mixed hypothesis is enhanced in the region around −1, the observed minimum of the −2 LL distribution, because of the interference effect explained in Sect. 4.2. The fitted signal yield, both for the SM and alternative signal hypotheses, is stable for values outside the observed minimum region and similar to the values given in Table 8 for the fixed-hypothesis case. In the region around the minimum, the fitted BSM signal yield is higher, reaching about 370 events. These variations are expected from the significant shape dif-ferences of the input variable distributions in this region of the parameter scan, as described in Sect. 4.2. The relative uncertainty is stable throughout the scan, with values around 30 %.
The observed minimum of the −2 LL curve is at −1.3 and is compatible with the SM hypothesis within 1.9σ . To further study the compatibility of the SM signal hypothesis with the observed result, several scans are performed, by fitting, instead of the real data, pseudo-data generated around the expected signal-plus-background post-fit BDT distribu- tion. This means that the nuisance parameters from this test are obtained from the fit of the SM signal to the data. Distributions similar to the one observed in the data are reproduced by pseudo-data. Furthermore, a fixed-hypothesis test is also performed, where the compatibility of the observed data with the SM Higgs boson versus the CP-even mixed signal corresponding toκ HWW /κ SM = −1.3 is studied, resulting in a 1 − CL s of 43 % in favour of the SM and of 93 % in favour of the alternative hypothesis.
Values of the mixing parameter,κ HWW /κ SM , above 0.4 and below −2.2 can be excluded at 95 % CL, as well as in the region between −0.85 and −1. Values above −0.5 and below −1.5, as well as between −1.2 and −0.65, can be excluded at 68 % CL.

Conclusions
The Standard Model J P = 0 + hypothesis for the Higgs boson is compared to alternative spin/parity hypotheses using 20.3 fb −1 of the proton-proton collision data collected by the ATLAS experiment at the LHC at √ s = 8 TeV and corresponding to the full data set of 2012. The Higgs-boson decay W W * → eνμν is used to test several alternative models, including BSM CP-even and CP-odd Higgs bosons, and a graviton-inspired J P = 2 + model with minimal couplings to the Standard Model particles. In addition to the tests of pure J P states, two scenarios are considered where all the CP mixtures of the SM Higgs boson and a BSM CP-even or CP-odd Higgs boson are tested.
For the spin-2 benchmarks, the SM hypothesis is favoured in all tests in data and the alternative model is disfavoured at 84.5 % CL for the model with universal couplings and excluded at 92.5-99.4 % CL for the benchmark models with non-universal couplings.
The SM Higgs-boson hypothesis is tested against a pure BSM CP-even or CP-odd Higgs-boson hypothesis: the results prefer the SM Higgs-boson hypothesis, excluding the alternative hypothesis at the 70.8 and 96.5 % levels, respectively.
The data favour the Standard Model quantum numbers in all cases apart from the scan of a CP-mixed state with a BSM CP-even Higgs boson, where the data prefer a mixed state withκ HWW /κ SM = −1.3, which is compatible with the SM hypothesis within 1.9σ . Theκ HWW /κ SM values can be excluded at 95 % CL above 0.4 and below −2.2, as well in the region between −0.85 and −1. For the mixing with a BSM CP-odd Higgs boson, the (κ AWW /κ SM ) · tan α values above 5 and below −6 can be excluded at 95 % CL. The preferred value corresponds to (κ AWW /κ SM ) · tan α = −0.2, which is compatible with the SM to within 0.5σ . des Sciences Semlalia, Université Cadi Ayyad, LPHEA-Marrakech, Marrakech, Morocco; (d) Faculté des Sciences,