Measurements of the Higgs boson production cross section and couplings in the W boson pair decay channel in proton-proton collisions at \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\sqrt{s}=13\,\text {Te\hspace{-.08em}V} $$\end{document}s=13TeV

Production cross sections of the standard model Higgs boson decaying to a pair of W bosons are measured in proton-proton collisions at a center-of-mass energy of 13\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\,\text {Te\hspace{-.08em}V}$$\end{document}TeV. The analysis targets Higgs bosons produced via gluon fusion, vector boson fusion, and in association with a W or Z boson. Candidate events are required to have at least two charged leptons and moderate missing transverse momentum, targeting events with at least one leptonically decaying W boson originating from the Higgs boson. Results are presented in the form of inclusive and differential cross sections in the simplified template cross section framework, as well as couplings of the Higgs boson to vector bosons and fermions. The data set collected by the CMS detector during 2016–2018 is used, corresponding to an integrated luminosity of 138\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\,\text {fb}^{-1}$$\end{document}fb-1. The signal strength modifier \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\mu $$\end{document}μ, defined as the ratio of the observed production rate in a given decay channel to the standard model expectation, is measured to be \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\mu = 0.95^{+0.10}_{-0.09}$$\end{document}μ=0.95-0.09+0.10. All results are found to be compatible with the standard model within the uncertainties.


Introduction
After the observation of a scalar particle compatible with the standard model (SM) Higgs boson by the ATLAS and CMS Collaborations in 2012 [1][2][3], the two experiments have focused on performing precision measurements of the properties of the new particle. The large data sample collected at the CERN LHC during the data taking periods through 2018 allowed the measurement of the Higgs boson quantum numbers and couplings to other SM particles with an unprecedented level of accuracy [4]. All results reported so far are compatible with the SM within the current uncertainties.
Among all the Higgs boson decay channels predicted by the SM, the one into a pair of W bosons has the second largest branching fraction (≈22%), while benefitting from a lower background with respect to the more probable decay in a pair of b quarks. This combination makes this channel one of the most sensitive for measuring the production cross section of the Higgs boson and its couplings to SM particles. This paper presents the measurement of the Higgs boson properties in the H → WW decay channel targeting the gluon fusion (ggH) and vector boson fusion (VBF) production mechanisms, as well as associated production with a vector boson (VH, where V stands for either a W or a Z boson). The measurement utilizes final states with at least two charged leptons arising either from the associated vector boson or from the products of the H → WW decays. In all cases at least one of the W bosons originating from the Higgs boson is required to decay leptonically.
The properties of the Higgs boson are probed by measuring the inclusive cross sections for each production mechanism, as well as the production cross sections in finer phase spaces defined according to the simplified template cross section (STXS) framework [5]. In addition, measurements of the Higgs boson couplings to fermions and vector bosons are presented.
The analysis is based on proton-proton (pp) collision data produced at the LHC at √ s = 13 TeV and collected by the CMS detector during 2016-2018, for a total integrated luminosity of about 138 fb −1 . This paper builds on previous analyses published by the CMS Collaboration in the H → WW channel focused on the inclusive production cross section and coupling measurements at √ s = 7, 8, and 13 TeV [6, 7], and on differential fiducial production cross section measurements at 8 TeV [8] and 13 TeV [9]. Similar measurements have also been reported in several Higgs boson decay channels by the ATLAS and CMS Collaborations [10][11][12][13][14].
Results reported in this paper show an overall improvement of the measurement accuracy thanks to new analysis techniques specifically devised to increase the sensitivity to particular production mechanisms (e.g., VBF with a different-flavor lepton pair in the final state), to the inclusion of new channels that have not been investigated in Run 2 before, such as VBF and VH production with a same-flavor pair of charged leptons and a hadronically decaying V, and ZH production with a three-lepton final state, and to the larger integrated luminosity analyzed. Moreover, WH production with two same sign leptons is measured for the first time in CMS. Tabulated results are provided in the HEPData record for this analysis [15]. This paper is organized as follows. A brief overview of the CMS apparatus is given in Section 2. The data set and simulated samples used are described in Section 3. Sections 4-8 describe in detail the event selection and categorization strategy, as well as the discriminating variables used to target each final state. The estimation of the backgrounds is described in Section 9, and the sources of systematic uncertainty and their treatment are given in Section 10. Results are presented in Section 11. Finally, closing remarks are given in Section 12.

The CMS detector and event reconstruction
The CMS apparatus is a general purpose detector designed to tackle a wide range of measurements. The central feature of CMS is a superconducting solenoid of 6 m internal diameter, providing a magnetic field of 3.8 T. Within the solenoid volume are a silicon pixel and strip tracker, a lead tungstate crystal electromagnetic calorimeter (ECAL), and a brass and scintillator hadron calorimeter (HCAL), each composed of a barrel and two endcap sections. Forward calorimeters extend the pseudorapidity (η) coverage provided by the barrel and endcap detectors. Muons are detected in gas-ionization chambers embedded in the steel flux-return yoke outside the solenoid.
The events of interest are selected using a two-tiered trigger system. The first level, composed of custom hardware processors, uses information from the calorimeters and muon detectors to select events at a rate of around 100 kHz within a fixed latency of about 4 µs [16]. The second level, known as the high-level trigger (HLT), consists of a farm of processors running a version of the full event reconstruction software optimized for fast processing, and reduces the event rate to around 1 kHz before data storage [17]. Events passing the trigger selection are stored for offline reconstruction. A more detailed description of the CMS detector, together with a definition of the coordinate system and the kinematic variables, can be found in Ref. [18].
Muons are identified and their momenta are measured in the range |η| < 2.4 by matching tracks in the muon system and the silicon tracker. The single muon trigger efficiency exceeds 90% over the full η range, and the efficiency to reconstruct and identify muons is greater than 96%. The relative transverse momentum (p T ) resolution for muons with p T up to 100 GeV is 1% in the barrel and 3% in the endcaps [19,20].
Electrons are identified and their momenta are measured in the interval |η| < 2.5 by combining tracks in the silicon tracker with spatially compatible energy deposits in the ECAL, also accounting for the energy of bremsstrahlung photons likely originating from the electron track. The single electron trigger efficiency exceeds 90% over the full η range. The efficiency to reconstruct and identify electrons ranges between 60 and 80% depending on the lepton p T . The momentum resolution for electrons with p T ≈ 45 GeV from Z → ee decays ranges from 1.7 to 4.5% depending on the η region. The resolution is generally better in the barrel than in the endcaps and also depends on the bremsstrahlung energy emitted by the electron as it traverses the material in front of the ECAL [21].
In order to achieve better rejection of nonprompt leptons, increasing the sensitivity of the analysis, leptons are required to be isolated and well reconstructed by imposing a set of requirements on the quality of the track reconstruction, shape of calorimetric deposits, and energy flux in the vicinity of the particle trajectory. On top of these criteria, a selection on a dedicated multivariate analysis (MVA) tagger developed for the CMS ttH analysis [22], referred to as ttHMVA, is added in all analysis categories for muon candidates. In categories targeting the VH production modes with leptonically decaying V boson, it is found that adding a selection on the ttHMVA tagger for electrons improves the sensitivity of the analysis.
Multiple pp interaction vertices are identified from tracking information by use of the adaptive vertex fitting algorithm [23]. The primary vertex is taken to be the vertex corresponding to the hardest scattering in the event, evaluated using tracking information alone, as described in Section 9.4.1 of Ref. [24].
The particle-flow (PF) algorithm [25] aims to reconstruct and identify each individual particle in an event, with an optimized combination of information from the various elements of the CMS detector. The energy of muons is obtained from the curvature of the corresponding track. The Table 1: Trigger requirements on the data set used in the analysis.

Trigger
Year Requirements Single electron 2016 p T > 25 GeV, |η| < 2.1 or p T > 27 GeV, 2.1 < |η| < 2.5 2017 p T > 35 GeV, |η| < 2.5 2018 p T > 32 GeV, |η| < 2.5 Single muon 2016 p T > 24 GeV, |η| < 2.4 2017 p T > 27 GeV, |η| < 2.4 2018 p T > 24 GeV, |η| < 2.4 Double electron All years p T1 > 23 GeV, p T2 > 12 GeV, |η 1,2 | < 2.5 Double muon All years p T1 > 17 GeV, p T2 > 8 GeV, |η 1,2 | < 2.4 Electron-muon All years p T1 > 23 GeV, p T2 > 12 GeV p T2 > 8 GeV in first part of 2016 data taking |η e | < 2.5, |η µ | < 2.4 require the presence of either one or two electrons or muons, satisfying isolation and identification requirements. For the 2016 data set, the single-electron trigger requires a p T threshold of 25 GeV for electrons with |η| < 2.1 and 27 GeV for 2.1 < |η| < 2.5. For the single-muon trigger the p T threshold is 24 GeV for |η| < 2.4. In the dielectron (dimuon) trigger the p T thresholds of the leading (highest p T ) and trailing (second-highest p T ) electron (muon) are respectively 23 (17) and 12 (8) GeV. In the dilepton eµ trigger, the p T thresholds are 23 and 12 GeV for the leading and trailing lepton, respectively. For the first part of data taking in 2016, a lower p T threshold of 8 GeV for the trailing muon was used. In the 2017 data set, the p T thresholds of the single electron and single muon triggers are raised respectively to 35 and 27 GeV, while they are set to 32 and 24 GeV in the 2018 data set. For both 2017 and 2018 data sets, the p T thresholds of the dilepton triggers are kept the same as the last part of the 2016 data set. The trigger selection is summarized in Table 1.
Monte Carlo (MC) event generators are used in the analysis to model the signal and background processes. Three independent sets of simulated events, corresponding to the 2016, 2017, and 2018 data sets, are used for each process of interest, in order to take into account yeardependent effects in the CMS detector, data taking, and event reconstruction. Despite different matrix element generators being used for different processes, all simulated events corresponding to a given data set share the same set of parton distribution functions (PDFs), underlying event (UE) tune, and parton shower (PS) configuration. The PDF set used is NNPDF 3.0 [38,39] at NLO for 2016 and NNPDF 3.1 [40] at NNLO for 2017 and 2018. The CUETP8M1 [41] tune is used to describe the UE in 2016 simulations, while the CP5 [42] tune is adopted in 2017 and 2018 simulated events. For all the simulations, the matrix-element event generators are interfaced with PYTHIA [43] 8.226 in 2016, and 8.230 in 2017 and 2018, for the UE description, PS, and hadronization.
Simulated events are used in the analysis to model Higgs boson production through ggH, VBF, VH, and associated production with top quarks (ttH) or bottom quarks (bbH), although ttH and bbH have a negligible contribution in the analysis phase space. All Higgs boson production processes except bbH are generated using the POWHEG v2 [44][45][46][47][48][49][50] event generator, which describes Higgs boson production at next-to-leading order (NLO) accuracy in quantum chromodynamics (QCD), including finite quark mass effects. Instead, bbH production is simulated using the MADGRAPH5 aMC@NLO v2.2.2 generator [51]. The ZH production process is simulated including both gluon-and quark-induced contributions. The MINLO HVJ [49] extension of POWHEG v2 is used for the simulation of WH and quark-induced ZH production, providing a description of VH+0-and 1-jet processes with NLO accuracy. For ggH production, the simulated events are reweighted to match the NNLOPS [52,53] prediction in the hadronic jet multiplicity (N jet ) and Higgs boson transverse momentum (p H T ) distributions, according to a two-dimensional map constructed using these observables. Moreover, for a better description of the phase space with more than one jet, the MINLO HJJ [54] generator is used, giving NLO accuracy for N jet ≥ 2 and leading order (LO) accuracy for N jet ≥ 3. The simulated samples are normalized to the cross sections recommended in Ref. [55]; in particular, the next-to-nextto-next-to-leading order cross section is used to normalize the ggH sample. The Higgs boson mass (m H ) in the event generation is assumed to be 125 GeV, while the value of 125.38 GeV [56] is used for the calculation of cross sections and branching fractions, yielding values of 48.31 pb, 3.77 pb, 1.36 pb, 0.88 pb, and 0.12 pb for the ggH, VBF, WH, quark-induced ZH, and gluoninduced ZH processes, respectively, and 22.0% for the H → WW branching ratio [55]. The decay to a pair of W bosons and subsequently to leptons or hadrons is performed using the JHUGEN [57] v5.2.5 generator in 2016, and v7.1.4 in 2017 and 2018, for ggH, VBF, and quarkinduced ZH samples. The Higgs boson and W boson decays are performed using PYTHIA 8.212 for the other signal simulations. For the ggH, VBF, and VH production mechanisms, additional Higgs boson simulations are produced using the POWHEG v2 generator, where the Higgs boson decays to a pair of τ leptons. These events are treated as signal in the analysis, with the exception of the measurement in the STXS framework, in which they are treated as background.
The background processes are simulated using several event generators. The quark-initiated nonresonant WW process is simulated using POWHEG v2 [58] with NLO accuracy for the inclusive production. The MCFM v7.0 [59][60][61] generator is used for the simulation of gluoninduced WW production at LO accuracy, and the normalization is chosen to match the NLO cross section [62]. The nonresonant electroweak (EW) production of WW pairs with two additional jets (in the vector boson scattering topology) is simulated at LO accuracy with MAD-GRAPH5 aMC@NLO v2.4.2 using the MLM matching and merging scheme [63]. Top quark pair production (tt), as well as single top quark processes, including tW, s-, and t-channel contributions, are simulated with POWHEG v2 [64][65][66]. The Drell-Yan (DY) production of chargedlepton pairs is simulated at NLO accuracy with MADGRAPH5 aMC@NLO v2.4.2 with up to two additional partons, using the FxFx matching and merging scheme [67]. Production of a W boson associated with an initial-state radiation photon (Wγ) is simulated with MAD-GRAPH5 aMC@NLO v2.4.2 at NLO accuracy with up to one additional parton in the matrix element calculations and the FxFx merging scheme. Diboson processes containing at least one Z boson or a virtual photon (γ * ) with mass down to 100 MeV are generated with POWHEG v2 [58] at NLO accuracy. Production of a W boson in association with a γ * (Wγ * ) for masses below 100 MeV is simulated by PYTHIA 8.212 in the parton showering of Wγ events. Triboson processes with inclusive decays are also simulated at NLO accuracy with MADGRAPH5 aMC@NLO v2.4.2.
For all processes, the detector response is simulated using a detailed description of the CMS detector, based on the GEANT4 package [68]. The distribution of the number of pileup interactions in the simulation is reweighted to match the one observed in data. The average number of pileup interactions was 23 (32) in 2016 (2017 and 2018).
The efficiency of the trigger system is evaluated in data on a per-lepton basis by selecting dilepton events compatible with originating from a Z boson. The per-lepton efficiencies are then combined probabilistically (i.e., the overall efficiency for an event passing any of the triggers listed above is calculated) to obtain the overall efficiencies of the trigger selections used in the analysis. The procedure has been validated by comparing the resulting efficiencies with MC simulation of the trigger. A correction has been derived as a function of ∆R = √ (∆η) 2 + (∆φ) 2 between the two leptons to account for any residual discrepancy, which is found to be on average below 1%. The resulting efficiencies are then applied directly on simulated events.

Event selection and categorization
The analysis targets events in which a Higgs boson is produced via ggH, VBF, or VH processes, and subsequently decays to a pair of W bosons. Events are selected by requiring at least two charged leptons (electrons or muons) with high p T , high p miss T , and a varying number of hadronic jets. Throughout this paper, unless otherwise specified, only hadronic jets with p T > 30 GeV are considered. Categories targeting Higgs bosons produced via ggH, VBF, and VH with a hadronically decaying vector boson (VH2j) are subdivided in different-flavor (DF) and same-flavor (SF) by selecting eµ, and ee/µµ pairs, respectively. Categories targeting VH production with a leptonically decaying vector boson are subdivided in four subcategories based on the number of leptons and hadronic jets required: WHSS (same sign), WH3 , ZH3 , and ZH4 targeting the WH → ± ± 2νqq, WH → 3 3ν, ZH → 3 νqq, and ZH → 4 2ν processes, respectively. In all cases events containing additional leptons with p T > 10 GeV are rejected. A summary of the different categories is given in Table 2, with a more detailed breakdown given in Table 12. Across all categories, in the 2016 data set, events are required to pass single-or double-lepton triggers. An additional requirement is placed on the lepton p T to be above 10 GeV, and the highest p T (leading) lepton in the event is furthermore required to have p T > 25 GeV. In the 2017 and 2018 data sets the threshold for leptons is increased to 13 GeV because of a change in the trigger setup. Where yields suffice, events are further split according to the charge and p T ordering of the dilepton system, p T of the subleading lepton, and number of hadronic jets in the event, as detailed in following sections. The number of expected and observed events in each category are given in Section 11.

Gluon fusion categories
This section describes the categories targeting the ggH production mechanism, both in DF and SF final states. In DF final states, the main background processes are nonresonant WW, top quark production (both single and pair), DY production of a pair of τ leptons that subsequently decay to an eµ pair and associated neutrinos, and W+jets events when a jet is misidentified as a lepton. Subdominant backgrounds include WZ, ZZ, Vγ, Vγ * , and VVV production. In SF final states, the dominant background contribution is given by DY events, with subdominant components arising from top quark and WW production, as well as events with misidentified leptons.

Different-flavor ggH categories
On top of the common selection, the leading leptons are required to form an eµ pair with opposite charge. Contributions arising from top quark production are reduced by rejecting events containing any jet with p T > 20 GeV that is identified as originating from a bottom quark by the tagging algorithm. The dilepton invariant mass m is required to be above 12 GeV to suppress QCD events with multiple misidentified jets. Events with no genuine missing transverse momentum (arising from the presence of neutrinos in signal events), as well as ττ events, are suppressed by requiring p miss T > 20 GeV. The latter are further reduced by requiring the p T of the dilepton system p T to exceed 30 GeV, as leptons arising from a ττ pair are found to have on average lower p T than those coming from a WW pair. Finally, to further suppress contributions from ττ and W+jets events, where the subleading lepton does not arise from a W boson decay, the transverse mass built with p miss T and the momentum of the subleading lepton m T ( 2 , p miss T ) is required to be greater than 30 GeV, where m T for a collection of particles {P i } with transverse momenta p T,i is defined as: (1) Selected events are further split into subcategories in order to exploit the peculiar kinematics of the target final state. Events with zero, one, and more than one hadronic jets are separated into distinct categories. In order to better constrain the W+jets background, the 0-and 1-jet categories are subdivided into two categories each according to the charge and p T ordering of the dilepton pair. This subdivision exploits the fact that the signal is charge symmetric, while in W+jets events W + bosons are more abundant than W − bosons. Finally, these subcategories are further divided according to whether the p T of the subleading lepton (p T2 ) is above or below 20 GeV. This results in a four-fold partitioning of the 0-and 1-jet DF ggH categories.
In categories with more than one hadronic jet, a selection on the invariant mass of the leading dijet pair m jj is added to ensure that there is no overlap with the VBF and VH categories.
Given the presence of neutrinos in the final state, the mass of the Higgs boson candidate can not be reconstructed in the WW channel. Nevertheless, specific features of the channel make it possible to achieve good sensitivity. In particular, the scalar nature of the Higgs boson results in the two final-state leptons being preferentially emitted in the same hemisphere. This fact compresses the distribution of m for signal events to lower values with respect to the nonresonant WW process. This shape difference alone however is not sufficient to disentangle the signal from other background processes, such as DY production of ττ pairs and Vγ, that populate the low-m phase space. The Higgs boson transverse mass m H T = m T ( , p miss T ) is thus introduced as a second discriminating variable. A selection on m H T is applied by requiring its value to be above 60 GeV for signal events. It is found that signal and background events populate different regions of the (m , m H T ) plane. The signal extraction fit is therefore performed on a two-dimensional (m , m H T ) binned template, allowing for good signal-to-background discrimination.
In order to optimize background subtraction in the signal region (SR), two additional orthogonal selections are defined for each jet multiplicity category. These define two sets of control regions (CR), enriched in ττ and top quark events, respectively. They are defined by the same selection as the SR, but inverting the b jet veto for the top CR and the m H T requirement for the ττ CR. The full selection and categorization strategy is summarized in Table 3. Observed distributions for m and m    H T (lower) fit variables in the 0-jet ggH p T2 < 20 GeV (left) and p T2 > 20 GeV (right) DF categories. The uncertainty band corresponds to the total systematic uncertainty in the templates after the fit to the data. The signal template is shown both stacked on top of the backgrounds, as well as superimposed. The yields are shown with their best fit normalizations from the simultaneous fit. Vertical bars on data points represent the statistical uncertainty in the data. The overflow is included in the last bin. The lower panel in each figure shows the ratio of the number of events observed in data to that of the total SM MC as extracted from the fit. Syst.

Same-flavor ggH categories
The categories described in this section target the ggH production mechanism in final states with either two electrons or two muons. The two leading leptons in the event are required to form an oppositely charged ee or µµ pair. Events containing at least one b-tagged jet with p T > 20 GeV are discarded. Low-mass resonances are suppressed by requiring m > 12 GeV. The W+jets background is reduced by requiring the p T of the dilepton system to exceed 30 GeV. Events are also required to have p miss T > 20 GeV to enrich the selection in processes with genuine missing transverse momentum. Finally, to reduce the DY background, which is dominant in this channel, a veto is placed on events in which m is within 15 GeV of the nominal mass of the Z boson (m Z ).
Events are divided in subcategories based on the number of hadronic jets, and further selections on m H T , m , and the azimuthal angle between the two leading leptons (∆φ ) are applied depending on the subcategory. A dedicated multivariate discriminant based on a DNN, called DYMVA in the following, is built and trained with the TENSORFLOW package [69] to distinguish signal events from DY events. The DNN is trained separately for each jet multiplicity subcategory. The architecture of the DNN is that of a feed-forward multilayer perceptron, taking 21, 22, and 27 input variables in the 0-, 1-, and 2-jet categories, respectively. These include kinematic information from the dilepton system, p miss T , and jets where present. To better constrain the top quark and WW backgrounds, two CRs are defined in each jet multiplicity subcategory, enriched in the respective processes. The full selection is given in Table 4. The selection efficiency of the requirement on the DYMVA score in 0-jet categories is found to be approximately 50, 7, and 30% for signal, DY, and total background events, respectively. In 1-and 2-jet categories the corresponding efficiencies are ≈50, 1, and 10%. Once the selection is performed, the signal is extracted via a simultaneous fit to the number of events in each category.

Vector boson fusion categories
This section describes the categories targeting the VBF production mechanism, both in DF and SF final states. This mode involves the production of a Higgs boson in association with a pair Table 4: Summary of the selection used in same-flavor ggH categories. The DYMVA threshold is optimized separately in each subcategory and data set.
Top quark CR As SR but with m > 100 GeV, m T ( 2 , p miss T ) > 30 GeV At least one b-tagged jet with p T > 30 GeV of forward-backward jets. The dijet system is characterized by a large m jj , large pseudorapidity separation ∆η jj , and low hadronic activity in the pseudorapidity region between the tagging jets. The fully leptonic final state in the VBF category therefore consists of two isolated leptons, large p miss T from the two undetectable neutrinos, and a pair of forward-backward jets. The main background processes for the VBF categories are the same as for the ggH categories. An additional complication however arises in the entanglement of VBF and ggH events, given the identical decay mode and the fact that the ggH cross section is larger than the VBF one by one order of magnitude.

Different-flavor VBF categories
On top of the common global selection, the same requirements on leptons and p miss T used in the DF ggH categories are applied. In this case, however, there are no subcategories based on jet multiplicity. Instead, exactly two jets with p T > 30 GeV and m jj > 120 GeV are required, while still requiring the absence of b-tagged jets with p T > 20 GeV. In this category the DEEPFLAVOR tagger [32] is used. Finally, 60 < m H T < 125 GeV is required. In order to separate the signal from the background, a DNN approach has been followed. The DNN is constructed to perform a multiclass classification of an event as either signal (VBF) or any of the three main background processes, namely: WW, top quark production, and ggH. As a result, a vector o of four numbers is attributed to an event. Each number represents the degree of agreement of the event with the signal and the three background processes. Each of these outputs can be interpreted as a probability, since they are normalized to one. Therefore, for a given event, the process j with the highest output o j is interpreted as the most probable process. For this reason, the four outputs are referred to as classifiers: C VBF , C t , C WW , and C ggH . In the SR four orthogonal categories are made using the classifiers. If, for a given event, C j is higher than the other three, the event is classified in the j-like category, and C j is used as the discriminating variable. A shape-based analysis is hence performed in these categories. The DNN is trained on a set of 26 input variables, including kinematic information from the dilepton system, p miss T , and jets. The variables with the most discrimination power are found to be m jj , ∆η jj and m . As done in the DF ggH categories, in order to optimize background subtraction in the SR, two CRs are defined, enriched in ττ and top quark events, respectively. They are defined by the same selection as the SR, but inverting the b jet veto for the top quark CR and the m H T requirement for the ττ CR. The full selection and categorization strategy is summarized in Table 5. Observed distributions for the C VBF and C ggH classifiers in the VBF-like and ggH-like categories respectively are shown in Fig. 10.
In order to verify that the simulated background processes agree with data in the DNN classifiers, the distributions are also checked at the level of the VBF SR global selection, i.e., before the further event categorization based on the classifier outputs. The C VBF DNN output in the aforementioned global selection region is shown in Fig. 11.

Same-flavor VBF categories
On top of the common global selection, the same selection used in the SF ggH categories is applied. However, in this case, at least two jets with p T > 30 GeV are required, with m jj > 350 GeV, while also rejecting events that contain any b-tagged jets with p T > 20 GeV. To define a Higgs-boson-enriched phase space, a selection on the DYMVA DNN is added. The DNN is trained and optimized separately in each category. Two background CRs help in constraining the normalization of the top quark and WW backgrounds. These CRs consist in regions of phase space orthogonal but as close as possible to the signal phase space. This channel utilizes  a simple counting experiment analysis, thus the event requirements are chosen to maximize the expected signal significance. The full selection and categorization strategy is summarized in Table 6. Table 6: Selection used in the same-flavor VBF categories. The DYMVA threshold is optimized separately in each subcategory and data set.
Top quark CR As SR but with m > 100 GeV, m T ( 2 , p miss T ) > 30 GeV At least one of the leading jets b-tagged

Vector boson associated production categories
This section describes categories targeting the VH production mode. Four subcategories are defined (WHSS, WH3 , ZH3 , and ZH4 ) to target final states in which the vector boson V, produced in association with the Higgs boson, decays leptonically. Two more categories (VH2j DF/SF) select events in which the V boson decays into two resolved jets. An additional selection is applied in each category to reduce the background, as well as an event categorization, defining phase spaces more sensitive to either signal or specific backgrounds. Details on the event selection and categorization are given below.

WHSS categories
The WHSS category targets the WH → 2 2νqq final state, where the two charged leptons are required to have same sign to reduce DY background. Therefore, the final state contains two same-sign leptons, p miss T , and at least one jet. The analysis requires the leading (subleading) lepton to have p T > 25 (20) GeV. To remove contributions from low-mass resonances, m is required to be greater than 12 GeV. The two leptons must have a pseudorapidity separation (∆η ) of less than two. Events are also required to have p miss T > 30 GeV, as well as no b-tagged jet with p T > 20 GeV.
Signal region events are further categorized based on the number of jets and the lepton flavor composition. Events in the 1-jet category are required to contain exactly one jet with p T > 30 GeV and |η| < 4.7. Events in the 2-jet category are required to contain at least two jets with the same kinematic constraints. For events containing more than two jets, only the two jets with highest p T are considered for the analysis. These jets must have m jj < 100 GeV. The SRs One jet with p T > 30 GeV eµ(µµ) pair with same charge 2-jet eµ(µµ) At least two jets with p T > 30 GeV, m jj < 100 GeV eµ(µµ) pair with same charge are further divided into eµ and µµ categories. Events with two electrons are not considered, as this flavor category is less sensitive to signal.
To improve discrimination between signal and background, the variable m H is defined, which serves as a proxy for m H . It is computed as the invariant mass of the dijet pair four-momentum P jj = (E jj , p jj ) and twice the four-momentum of the lepton closest to the dijet pair P = (p , p ): The second lepton four-momentum serves as a proxy for the neutrino. If an event in the 1-jet category contains a second jet with 20 < p T < 30 GeV, this jet is included in the computation of this variable; otherwise the four-momentum of the single jet is used. Events in all categories are required to have m H > 50 GeV. A summary of the event selection is given in Table 7.
The main backgrounds in the WHSS category are WZ, W+jets, Vγ, and Vγ * production. Additional backgrounds are top quark, triboson, WW, and ZZ production. The W+jets events pass the selection when a nonprompt lepton passes the lepton selection. This nonprompt background is estimated from data, as described in Section 9. The remaining backgrounds are estimated using MC simulation. The WZ background normalization is estimated in the 1-and 2-jet CRs shared with the ZH3 category, described in Section 7.3.
To extract the Higgs boson production cross section, a binned fit is performed to the m H variable. Figure 12 shows the m H distribution after the fit to the data.

WH3 categories
The WH3 category targets the WH → 3 3ν decay. The final state therefore contains three leptons and p miss T . The analysis selects events containing three leptons with p T > 25, 20, and 15 GeV, respectively and total charge (Q 3 ) ±1. The invariant mass of any dilepton pair is required to be greater than 12 GeV to remove low-mass resonances. Events are rejected if they contain a jet with p T > 30 GeV, or any b-tagged jet with p T > 20 GeV.
Events in the SR are categorized based on the flavor composition of the lepton pairs. Events with at least one opposite-sign SF (OSSF) lepton pair are placed in the OSSF category, while all other events are placed in the same-sign SF (SSSF) category. To reject backgrounds containing Z bosons, events in the OSSF SR must pass a Z boson veto, where all lepton pairs must satisfy |m − m Z | > 20 GeV, as well as p miss T > 40 GeV. Syst.
The main backgrounds in the WH3 category are WZ, ZZ, Vγ, and Vγ * production, as well as backgrounds containing nonprompt leptons. Nonprompt backgrounds are estimated from data, as described in Section 9. The remaining backgrounds are estimated from simulated samples. The WZ and Zγ backgrounds are normalized using dedicated CRs, matching the OSSF SR with the exception of an inverted Z boson veto, a differing p miss T requirement, and an additional selection on the invariant mass of the full lepton system (m 3 ). A summary of the event selection and categorization is given in Table 8.
To discriminate between signal and background, two BDTs, trained separately for the OSSF and SSSF categories, are used. The BDTs are built using the TMVA [70] package and trained on events passing the OSSF and SSSF SR selections without the |m − m Z | requirement. The number of input variables used in the BDT training is 19 and 15 in the OSSF and SSSF regions, respectively. They include kinematic information on the leptons, p miss T , b tagging scores for the leading jets, and various invariant masses built from leptons and p miss T , with the minimum invariant mass and ∆R separation of the opposite sign lepton pairs giving the most discrimination power. To extract the Higgs boson production cross section, a binned fit is performed to the BDT score. Figure 13 shows the BDT discriminant distributions after the fit to the data.

ZH3 categories
The ZH3 category targets the ZH → 3 νqq decay. The final state therefore contains three leptons with total charge ±1. The invariant mass of any dilepton pair is required to be greater than 12 GeV to reject low-mass resonances. The event must contain an OSSF lepton pair with invariant mass |m − m Z | < 25 GeV. Events are rejected if any b-tagged jet with p T > 20 GeV passing the medium WP of the tagging algorithm is found.
Events are categorized based on the number of jets. Events in the 1-jet category contain exactly one jet with p T > 30 GeV and |η| < 4.7, while events in the 2-jet category contain at least two jets passing these requirements. Signal region events must also have an azimuthal separation between the two W bosons (∆φ( p miss T , j(j))), represented by the +p miss T and (di)jet systems respectively, below π/2, and pass a Z boson internal conversion veto |m 3 − m Z | > 20 GeV.
The main backgrounds in the ZH3 analysis are WZ, ZZ, and Z+jets events. The Zγ/γ * , VVV, Signal region 1-jet =1 jet with p T > 30 GeV, ∆φ( p miss T , j(j)) < π/2 2-jet ≥2 jets with p T > 30 GeV, ∆φ( p miss T , j(j)) < π/2 Control region 1-jet WZ =1 jet with p T > 30 GeV, ∆φ( p miss T , j(j)) > π/2 2-jet WZ ≥2 jets with p T > 30 GeV, ∆φ( p miss T , j(j)) > π/2 and tt+jets processes also contribute. The Z+jets events pass the selection when a nonprompt lepton passes the lepton selection. This background is estimated from data as described in Section 9. The remaining backgrounds are modeled using MC simulation. The WZ normalization as a function of the number of jets is extracted from dedicated CRs, which are categorized by the number of jets in the same way as the SRs. The WZ CRs are also used to constrain the WZ background in the WHSS category. A summary of the event selection and categorization is shown in Table 9.
To extract the Higgs boson production cross section, a binned fit is performed to the m H T = m T ( p miss T , j(j)) variable, defined in Eq. (1). Figure 14 shows the m H T distributions after the fit to the data.

ZH4 categories
The ZH4 category targets the ZH → 4 2ν decay. The final state therefore contains four leptons and p miss T . The analysis selects events containing four leptons with p T > 25, 15, 10, and 10 GeV, respectively, and null total charge (Q 4 ). The invariant mass of any dilepton pair is required to be greater than 12 GeV to reject low-mass resonances. The opposite-sign SF lepton pair with m closest to m Z is designated as the Z boson candidate, while the remaining lepton pair is referred to as the X candidate. The Z boson candidate mass is required to be within 15 GeV of m Z . Events are rejected if they contain any b-tagged jet with p T > 20 GeV.
Events are categorized based on the flavor of the lepton pair forming the X candidate. Events in the XSF category have an SF X lepton pair, while events in the XDF category have a DF X lepton pair. In the XSF category, events are required to satisfy m 4 > 140 GeV, 10 < m X < 60 GeV, and p miss T > 35 GeV. Events in the XDF category must have 10 < m X < 70 GeV and p miss T > 20 GeV.
Production of ZZ pairs is the main background in this category. Additional contributions arise from ttZ, VVV, and Vγ processes. These backgrounds are all modeled with MC simulation. The ZZ normalization is extracted from data in a dedicated CR defined by the requirements 75 < m X < 105 GeV and p miss T < 35 GeV. The event selection and categorization in the ZH4 category is summarized in Table 10.
A BDT approach is used to discriminate between signal and background. The BDT is trained on events passing the global selection, with p miss T > 20 GeV and 10 < m X < 70 GeV. The number of inputs used in the BDT is eight, and these include separation in the η-φ plane between the leptons in each dilepton pair, transverse masses of combinations of leptons and p miss T , as well as p miss T itself. The kinematic variables of the X candidate give the most discriminating power, along with p miss T . To extract the Higgs boson cross section, a binned fit is performed on the BDT score. Figure 15 shows the BDT score distributions after the fit to the data.

Different-flavor VH2j categories
This category targets VH events in which the vector boson decays into two resolved jets and the Higgs boson decays to an eµ pair and neutrinos. The final state, and therefore the selection, is analogous to that of the ggH DF 2-jet category, with the added requirement that the dijet invariant mass be close to that of the W and Z bosons.
The main backgrounds in this category are top quark and nonresonant WW pair production, as well as ττ pair production. The top quark and ττ backgrounds are normalized to the data in dedicated CRs. The full selection is summarized in Table 11. The VH production is found to contribute about 30% of the total signal in the VH2j DF SR.
The signal extraction fit is performed on a binned template shape of m , which has a different profile for the signal and the nonresonant WW background. The distribution of m after the fit to the data is shown in Fig. 16.

Same-flavor VH2j categories
This category targets VH events in which the vector boson decays into two jets and the Higgs boson decays to either an ee or a µµ pair and neutrinos. The selection is identical to the 2-jet ggH SF categories described in Section 5.2 and Table 4, with the following modifications: the additional requirement 65 < m jj < 105 GeV is imposed, the m threshold is moved to 70 GeV, a selection on m H T < 150 GeV is added, and the angle between the two leptons in the transverse plane (∆φ ) is required to be less than 1.6. The threshold on the DYMVA is tuned to achieve the highest signal-to-background ratio. The signal is extracted via a simultaneous fit to the number of events in each category.

The STXS measurement
Together with inclusive production cross sections, differential cross section measurements are also presented. These are performed within the STXS framework, using Stage 1.2 definitions [55]. In the STXS framework, the cross sections of different Higgs boson production mechanisms are measured in mutually exclusive regions of generator-level phase space, referred to as STXS bins, designed to enhance sensitivity to possible deviations from the SM. The full set of Stage 1.2 STXS bins is given in Fig. 17. The selections used in the STXS measurement match the ones described in the previous section, and the measurement is carried out by defining a set of analysis categories that target each STXS bin, as summarized in Fig. 18. The same CR setup as described in the previous section is maintained, and each CR is then subdivided to match the STXS categorization shown in Fig. 18. In all cases, the number of events is used as a fit variable in CRs. Results are then unfolded to the generator level, with the contribution from each STXS bin to each analysis category estimated from MC simulation, as shown in Fig. 19. Given the statistical power of the present data set, sensitivity to some of the Stage 1.2 bins is limited. Some bins are therefore measured together, by fixing the corresponding cross section ratios to the value predicted by the SM. We refer to this procedure as bin merging. Some STXS bins have been excluded, given the very low sensitivity. Groups of STXS bins merged with this procedure are highlighted in Fig. 17.
In the DF ggH and VBF categories, the discriminants of the same DNN explained in Section 6 are used for the categories which are common between VBF and ggH (m jj > 350 GeV and p H T < 200 GeV), and in the category exclusive to the VBF (m jj > 350 GeV and p T > 200 GeV). The signal extraction fit is performed on the two-dimensional (m , m jj ) template in the VH2j DF category (60 < m jj < 120 GeV), while either m or (m , m H T ) templates are used in the remaining DF categories, depending on the number of expected events in each. In the same flavor categories a similar approach is followed, but only the number of events is used for the fit.
In the VH categories with a leptonic decay of the V boson, to extract the cross section as a function of the vector boson p T , events are categorized into corresponding regions of reconstructed   vector boson p T . The reconstructed vector boson p T is defined differently depending on the vector boson type and decay channel. Because in the WHSS and WH3 categories the W boson p T ( p W T ) cannot be fully reconstructed due to the unobserved neutrino, proxies are defined in both cases. In the WHSS category, the four-momenta of the lepton and neutrino from the associated W boson decay can be designated W and ν W , while the four-momenta of the lepton and neutrino from the Higgs boson decay can be designated H and ν H . The lepton from the W boson decay is identified as the one with the largest azimuthal separation from the jet or dijet. The transverse momentum of the W boson is defined as W,T + ν W,T , where ν W is defined as: for events with two jets, or p miss T − H,T for events with fewer than two jets. Here jj indicates the dijet momentum. In the WH3 category, p W T is difficult to resolve given the ambiguities from the three neutrinos in the final state. Instead, p T ( W ) is used as a proxy for the W boson p T in the WH3 category. Here, W is defined as the lepton pointing away from the oppositesign dilepton pair with smallest angular separation ∆R. In the ZH3 and ZH4 categories, the reconstructed Z boson p T ( p Z T ) is defined as the p T of the OSSF dilepton pair with m closest to m Z . The variables used in the fit are the same as described in Section 7.
A summary of the expected signal fraction of the considered STXS signal processes in each category is shown in Fig. 20, together with the total number of expected H → WW signal events.

Nonprompt lepton background
The nonprompt lepton backgrounds originating from leptonic decays of heavy quarks, hadrons misidentified as leptons, and electrons from photon conversions are suppressed by the identification and isolation requirements imposed on electrons and muons. The nonprompt lepton background in the two-lepton final state primarily originates from W+jets events, while the nonprompt lepton background in the three-lepton final state primarily comes from Z+jets events. Top quark production with a jet misidentified as a lepton also contributes to the threelepton final state. The nonprompt lepton background gives a negligible contribution in the four-lepton final state. This background is estimated from data, as described in detail in Ref. [7]. The rate at which a nonprompt lepton passing a loose selection further passes a tight selection (misidentification rate) is measured in a data sample enriched in events composed uniquely of jets produced through the strong interaction, referred to as QCD multijet events. The corresponding rate for a prompt lepton to pass this selection (prompt rate) is measured using a tag-and-probe method [71] in a data sample enriched in DY events. The misidentification and prompt rates are used to construct a relation between the number of leptons passing the loose selection, the number of leptons passing the tight selection, and the number of true prompt leptons in an event. This relation is applied as a transfer function to a data sample containing leptons passing the loose selection, weighting the events by the probability for N leptons to pass the tight selection while fewer than N leptons are truly prompt. The nonprompt background with two leptons is validated with data in a CR enriched with W+jets events, in which a pair of same-sign leptons is required, while the nonprompt background with three leptons is validated in a CR enriched with top quark events or DY events. The systematic uncertainty   in the misidentification rate determination, which arises mainly from the different jet flavor composition between the events entering the QCD multijet and the analysis phase space, is estimated with a twofold approach. First, a validation check in the aforementioned CRs yields a normalization uncertainty of about 30% that fully covers any differences with respect to data in all the kinematic distributions of interest in this analysis. Second, a shape uncertainty is estimated by varying the jet p T threshold used in the calculation of the misidentification rate in the 15-25 GeV range, in bins of the lepton η and p T . For each threshold variation, the fake rate is recomputed and the difference with respect to the nominal fake rate is taken as a systematic uncertainty.

Top quark background
The background contributions from top quark processes are estimated using a combination of MC simulations and dedicated regions in data. A reweighting of the top quark and antiquark p T spectra at parton level is performed for the tt simulation in order to match the NNLO and next-to-next-to-leading logarithmic (NNLL) QCD predictions, including also the NLO EW contribution [72]. A shape uncertainty based on renormalization (µ R ) and factorization (µ F ) scale variations is taken into account. For the ggH, VBF, and VH2j categories, in which the contribution of top quark backgrounds is dominant, the normalization of the simulated templates is left unconstrained in the fit separately for 0-, 1-, 2-jet ggH, VH, and VBF categories. The normalizations in these phase spaces are therefore measured from the data, by constraining the free-floating normalization parameters in top quark enriched CRs.

Nonresonant WW background
The nonresonant WW background is estimated using a combination of MC simulations and dedicated regions in data, and the quark-induced WW simulated events are reweighted to match the diboson p T spectrum computed at NNLO+NNLL QCD accuracy [73,74]. The shape uncertainties related to the missing higher-order corrections are estimated by varying the µ R and µ F scales, as well as considering the independent variation of the resummation scale from its nominal value, taken as the mass of the W boson. For the ggH, VBF, and VH2j categories, the normalizations of the quark-induced and gluon-induced WW backgrounds are left unconstrained in the fit (the ratio between the two is kept fixed within the uncertainty), keeping a different parameter for each signal phase space as done for the top quark background. In the DF final states the normalization parameters are constrained directly in the SRs without the need of defining CRs, as the SRs span the high-m phase space enriched in WW events with a negligible Higgs boson signal contribution. Since in SF final states a counting analysis is performed, dedicated CRs enriched in WW events are defined selecting events with high m . The normalizations of the EW and QCD WW+2 jets backgrounds are instead fixed to the respective SM cross sections provided by the MC simulation, taking into account the theoretical uncertainties arising from the variation of the µ R and µ F scales.

Drell-Yan background
The backgrounds arising from DY+jets processes are estimated using a different approach depending on the signal category.
In the ggH, VBF, and VH2j DF categories, the only source of DY background arises from ττ production with subsequent leptonic decays of the τ leptons. This background process is estimated with a data-embedding technique [75], in which µ + µ − events with well-identified muons are selected in a data sample. In each event, the selected muons are removed and replaced with simulated τ leptons, keeping the same four-momentum of the initial muons. The embedded sample is then corrected using scale factors related to the simulation of τ leptons. The usage of the embedded sample allows for a better modeling of the observables that are sensitive to the detector response and calibration, such as p miss T and other variables related to the hadronic activity in the event. Since the embedded sample takes into account all processes with a ττ pair decaying to either electrons or muons, simulated tt, single top, and diboson background events that contain a ττ pair are not considered in the analysis to avoid any double counting. To correct for any additional discrepancy associated with the different acceptance of the H → WW signal phase space, the normalization of the embedded samples is left unconstrained in the fit as done for top quark and WW backgrounds. An orthogonal ττ enriched CR is defined for the 0-, 1-, 2-jet ggH-like, 2-jet VH-like, and 2-jet VBF-like phase spaces to help in constraining the free normalization parameters. The embedded samples cover the events that pass the eµ triggers, which represent the vast majority of the events selected in the DF final state. The contribution of the remaining ττ events that enter the analysis phase space thanks to the single-lepton triggers (≈5% of the total) is estimated using MC simulation.
In the ggH, VBF, and VH2j SF categories, the dominant background contribution arises from DY production of pairs and is estimated using a data-driven technique described in Ref. [7]. The background contribution for events with |m − m Z | > 7.5 GeV is estimated by counting the number of events in data passing a selection with an inverted m requirement (i.e., under the Z boson mass peak), subtracting the non-Z-boson contribution from it, and scaling the obtained yield by the fraction of events outside and inside the Z boson mass region in MC simulation. The contribution of processes such as top quark and WW production in the Z boson mass peak region, which have the same probability to decay into the ee, eµ, µe, and µµ final states, is estimated by counting the number of e ± µ ∓ events in data, and applying a correction factor that accounts for the differences in the detection efficiency between electrons and muons. Other minor processes in the Z boson mass peak region (mainly ZZ and ZW) are subtracted based on MC simulations. The yield obtained with this approach outside the Z boson mass peak is further corrected with a scale factor that takes into account the different acceptances between the estimation and SRs. The method is validated in orthogonal CRs enriched in DY events with a negligible signal contribution. The residual mismodeling between data and the estimated DY contribution arising from this validation is taken into account as a systematic uncertainty. The same procedure is repeated separately for estimating and validating the DY contribution in the e + e − and µ + µ − final states.
In the leptonic VH categories DY represents a minor background and is estimated using MC simulations.

Multiboson background
In categories with two charged leptons, the production of WZ and Wγ * contributes to the SRs whenever one of the three leptons is not identified. This background contribution is simulated as described in Section 3, and a data-to-simulation scale factor is derived in a three-lepton CR, orthogonal to the three-lepton SRs, as described in Ref. [7]. A normalization uncertainty of about 25% is associated to the scale factor determination. A different CR containing events with one pair of same-sign muons is also used as an additional validation of the Wγ * simulation. The contribution of the Wγ process may also be a background in two-lepton SRs due to photon conversions in the detector material when one of the three leptons is not identified. This process is estimated using MC simulation and validated using data in a two-lepton CR requesting events with a leading µ and a trailing e with same sign and a separation in ∆R smaller than 0.5. These requirements mainly select events arising from Wγ production where the W boson decays to µν µ and the photon is produced as final-state radiation from the muon. The theoretical uncertainties in Wγ and Wγ * processes estimated using µ R and µ F scale variations are taken into account.
The WZ process represents one of the main backgrounds in the leptonic VH categories and its normalization is left as a free parameter in the fit, separately for different jet multiplicity categories. Dedicated 0-, 1-and 2-jet CRs are included in the fit to help constraining the WZ normalization parameters.
The production of a Z boson pair is the main background in the ZH4 category and is estimated using MC simulation. The normalization of this background is left free to float and constrained using data in a ZZ-enriched CR.
Triple vector boson production is a minor background in all the considered categories and is estimated using MC simulation.

Statistical procedure and systematic uncertainties
The statistical approach used to interpret the selected data sets for this analysis and to combine the results from the independent categories has been developed by the ATLAS and CMS Collaborations in the context of the LHC Higgs Combination Group [76]. All selections have been optimized entirely on MC simulation and have been frozen before comparing the templates to data, in order to minimize possible biases. In all the categories considered, the signal extraction is performed using binned templates based on variables that allow for a good discrimination between signal and background, as summarized in Table 12. Therefore, the effect of each source of systematic uncertainty is either a change of the normalization of a given signal or background process, or a change of its template shape. The signal extraction is performed by a binned maximum likelihood fit, and each such change is modeled as a constrained nuisance parameter distributed according to a log-normal probability distribution function with standard deviation set to the size of the corresponding change. Where the change in shape of a template caused by a nuisance parameter is found to be negligible (i.e., its effect on the expected uncertainty on signal strength modifiers is well below 1%), only its effect on the normalization is considered. The systematic uncertainties in this analysis arise either from an experimental or a theoretical source. The experimental uncertainties in the signal and background processes, as well as the theoretical uncertainties in the background processes, are taken into account for all the results discussed in Section 11. The treatment of the theoretical uncertainties in the signal processes is instead dependent on the measurement and interpretation being made. As an example, when measuring production cross sections for the STXS measurements, the theoretical uncertainties affecting the signal cross section in a given STXS bin are dropped and only the shape component is kept.
The following experimental uncertainties are included in the signal extraction fit.
• The integrated luminosities for the 2016, 2017, and 2018 data-taking years have 1.2-2.5% individual uncertainties, while the overall uncertainty for the 2016-2018 period is 1.6% [35][36][37]. This uncertainty is partially correlated among the three data sets, and is applied to all samples that are purely based on simulation.
• The uncertainties in the trigger efficiency and lepton reconstruction and identification efficiencies are modeled in bins of the lepton p T and η, independently for electrons and muons. These uncertainties cause both a normalization and a shape change of the signal and background templates and are kept uncorrelated among the three data sets. Their effect is of ≈2% for electrons and ≈1% for muons. • The uncertainties in the determination of the lepton momentum scale, jet energy scale, and unclustered energy scale cause the migration of the simulated events inside or outside the analysis acceptance, as well as migrations across the bins of the signal and background templates. The impact of these sources in the template normalizations is 0.6-1.0% for the electron momentum scale, 0.2% for the muon momentum scale, and 1-10% for p miss T . The main contribution to these uncertainties arises from the limited data sample used for their estimation, and they are therefore treated as uncorrelated nuisance parameters among the three years. The jet energy scale uncertainty is modeled by implementing eleven independent nuisance parameters corresponding to different jet energy correction sources, six of which are correlated among the three data sets. Their effects vary in the range of 1-10%, according mainly to the jet multiplicity in the analysis phase space.
• The uncertainty in the jet energy resolution smearing applied to simulated samples to match the p T resolution measured in data causes both a normalization and a shape change of the templates. This uncertainty has a minor impact on all the analyzed categories (effect below ≈1%) and is uncorrelated among the three data sets. • The uncertainty in the pileup jet identification efficiency is modeled in bins of the jet p T and η. It is considered for jets with p T < 50 GeV, since pileup jet identification techniques are only used for low-p T jets. This uncertainty produces a change in both normalization and shape of the signal and background templates and is kept uncorrelated among the three data sets. The effect of this uncertainty on the measured quantities is found to be below 1%.
• The uncertainty in the b tagging efficiency is modeled by implementing seventeen nuisance parameters, five of which are related to the theoretical uncertainties involved in the measurements and are therefore correlated among the three data sets. The remaining four parameters per data set, which arise from the statistical accuracy of the efficiency measurement, are kept uncorrelated [31]. These uncertainties have an impact on both the shape of the templates and their normalization for all the simulated samples.
• The uncertainties in the nonprompt lepton background estimation affect both the normalization and shape of the templates of this process. They arise from the limited size of the data set used for the misidentification rate measurement and the difference in the flavor composition of jets mismeasured as leptons between the measurement region and the signal phase space. Both sources are implemented as uncorrelated nuisance parameters between electrons and muons, given the different mismeasurement probabilities for the two flavors, and are uncorrelated among the three data sets. Their effects vary between few percent to ≈10% depending on the SR. A further normalization uncertainty of 30% is assigned to cover any additional mismodeling of the jet flavor composition using data in control samples, as described in Section 9. The latter uncertainty is correlated among the data sets, but uncorrelated among SRs containing different lepton flavor combinations, for which the main mechanism of nonprompt lepton production arises from different processes.
• The statistical uncertainty due to the limited number of simulated events is associated with each bin of the simulated signal and background templates [77].
The theoretical uncertainties relevant to the simulated MC samples have different sources: the choice of the PDF set and the strong coupling constant α S , missing higher-order corrections in the perturbative expansion of the simulated matrix elements, and modeling of the pileup. Template variations, both in shape and normalization, associated with the aforementioned sources are treated as correlated nuisance parameters for the three data sets.
The uncertainties in the PDF set and α S choice are found to have a negligible effect on the simulated templates (the effect of the shape variation on the expected uncertainties was found to be below 1%), therefore only the normalization change is considered, taking into account the effect due to the cross section and acceptance variation. These uncertainties are not considered for backgrounds with normalization constrained through data in dedicated CRs. For the Higgs boson signal processes, these theoretical uncertainties are computed by the LHC Higgs Cross Section Working Group [55] for each production mechanism.
The effect of missing higher-order corrections for the background processes is estimated by reweighting the MC simulation events with alternative event weights, where the µ R and µ F scales are varied by a factor of 0.5 or 2, and the envelopes of the varied templates are taken as the one standard deviation variation. All the combinations of the µ R and µ F scale variations are considered for computing the envelope, except for the extreme case where µ R is varied by 0.5 and µ F by 2, or vice versa. For backgrounds with normalization constrained using data in dedicated CRs, only the shape variation of the simulated templates arising from this procedure is considered. For the WW background, an uncertainty in the higher-order reweighting described in Section 9 is derived by shifting µ R , µ F , and the resummation scale. For the ggH signal sample, the uncertainties are decomposed into several sources according to Ref. [55], to account for the overall cross section, migrations of events among jet multiplicity and p H T bins, choice of the resummation scale, and finite top quark mass effects. For the VBF signal sample, different sources of uncertainty are also decoupled to account for the overall normalization, migrations of events among Higgs boson p T , N jet , and m jj bins, and EW corrections to the production cross section. The uncertainties due to missing higher-order corrections for the other signal samples are taken from Ref. [55]. For both PDF and missing higher-order uncertainties, the nuisance parameters are correlated for the WH and ZH processes and uncorrelated for the other ones.
In order to assess the uncertainty in the pileup modeling, the total inelastic pp cross section of 69.2 mb [78,79] is varied within a 5% uncertainty, which includes the uncertainty in the inelas-tic cross section measurement, as well as the difference in the primary vertex reconstruction efficiency between data and simulation.
A theoretical uncertainty due to the modeling of the PS and UE is taken into account for all the simulated samples. The uncertainty in the PS modeling is evaluated by varying the PS weights computed by PYTHIA 8.212 on an event-by-event basis, keeping the variations of the weights related to initial-and final-state radiation contributions uncorrelated. The uncertainty in the UE modeling is evaluated by shifting the nominal templates according to alternative MC simulations generated with a variation of the UE tune within its uncertainty. The corresponding nuisance parameter is correlated among all samples and between 2017 and 2018 data sets. An uncorrelated nuisance parameter is used for the 2016 data set, as the corresponding simulations are based on a different UE tune. The PS uncertainty affects the shape of the templates mainly through the migration of events across jet multiplicity bins, while the UE uncertainty is found to have a negligible impact on the shape of the templates and a normalization effect of ≈1.5%.
Additional theoretical uncertainties in specific background processes are also taken into account. A 15% uncertainty is assigned to the relative fraction of the gluon-induced component in the WW background process [62]. An uncertainty of 8% is assigned to the relative fraction of single top quark and tt processes. A 30% uncertainty is assigned to the Wγ * process associated with the measurement of the scale factor in the trilepton CR.
For the measurement of the signal cross sections in the STXS framework, the effect of theoretical uncertainties in the template normalizations is removed for signal processes in each STXS bin being measured. In cases where two or more STXS bins are measured together because of the lack of statistical accuracy in measuring single bin cross sections, the shape effect of theoretical uncertainties causing event migrations among the merged bins is kept. In addition, residual theoretical uncertainties arising from µ R and µ F variations are taken into account to describe the acceptance effects that cause a shape variation of the signal templates within each STXS bin. The latter uncertainties are correlated among STXS bins that share a similar phase space definition, for example, ggH 0-jet bins, ggH 1-jet bins, ggH high-p T bins, and ggH in VBF topology bins. A similar approach is used for the VBF STXS bins. For the measurement of leptonic VH cross sections in STXS bins, the aforementioned theoretical uncertainties are found to have a marginal impact with respect to the measurement statistical accuracy and have been neglected.
The contributions of different sources of systematic uncertainty in the signal strength measurement are summarized in Table 13.

Results
Results are presented in terms of signal strength modifiers, STXS cross sections, and coupling modifiers. In all cases they are extracted via a simultaneous maximum likelihood fit to all the analysis categories, as explained in Section 10. The mass of the Higgs boson is assumed to be 125.38 GeV, as measured by the CMS Collaboration [56]. The effect on event yields of varying m H within its uncertainty is found to be below 1%. The number of expected and measured events for signal and background processes, as well as the number of observed events in each category, are reported in Tables 14-17. The normalization factors of the background contributions are found to be consistent with unity within their uncertainties. Figure 21 summarizes the full analysis template by showing the distribution of events as a function of the observed significance of the corresponding bins.       The H → WW selection is subject to some degree of contamination from events in which the Higgs boson decays to a pair of τ leptons that themselves decay leptonically. These events are included in the signal definition, and their contribution ranges from below 1% in the ggH and VBF categories up to ≈10% in some of the WH categories. As described in previous sections, CRs are used to fix the normalization of dominant backgrounds from data. This is achieved by scaling the corresponding background contributions jointly in the CR and SR. Given that the procedure effectively amounts to a measurement of the cross section of the background in question, the contributions from the 2017 and 2018 data sets are scaled together. The 2016 data set is kept separate in this regard because a different PYTHIA tune was used.
For inclusive measurements, results are extracted in the form of signal strength modifiers µ. These are defined as the product of the production cross section and the branching ratio to a W boson pair, normalized to the SM prediction (σB/(σB) SM ). Couplings of the Higgs boson to fermions and vector bosons are measured in the κ framework [80], while STXS results are provided as cross sections.

Signal strength modifiers
The global signal strength modifier is extracted by fitting the template to data leaving all contributions coming from the Higgs boson free to float, but keeping the relative importance of the different production modes fixed to the values predicted by the SM. As such, this measurement gives information on the compatibility of the SM with the LHC Run 2 data set. The observed signal strength modifier is: where the uncertainty has been broken down into its statistical and systematic components. The purely statistical component is extracted by fixing all nuisance parameters in the likelihood function to their best fit values and extracting the corresponding profile. The systematic component is obtained by the difference in quadrature between the total uncertainty and the statistical one. The observed and expected profile likelihood functions, both with the full set of uncertainty sources as well as with statistical ones only, are shown in Fig. 22.
Results are also extracted for individual production modes, by performing a 4-parameter fit  Figure 23: Observed signal strength modifiers for the main SM production modes.
in which contributions from the ggH, VBF, WH, and ZH modes are left free to float independently. Contributions from the ttH and bbH production modes are fixed to their SM expected values within uncertainties, given that this analysis has little sensitivity to them. Results are summarized in Fig. 23, where the separate contributions of statistical and systematic sources of uncertainty are also shown. Results correspond to observed (expected) significances of 10.5 (11.8)σ, 3.15 (4.74)σ, 3.61 (1.82)σ, and 3.73 (2.19)σ for the ggH, VBF, WH, and ZH modes, respectively. The correlation matrix among the signal strengths is given in Fig. 24. The compatibility of the result with the SM is found to be 7%.

Higgs boson couplings
Given its large branching fraction and relatively low background, the H → WW channel is a good candidate to measure the couplings of the Higgs boson to fermions and vector bosons. This is performed in the so-called κ framework. Two coupling modifiers κ V and κ f are defined, for couplings to vector bosons and fermions respectively. These scale the signal yield of the H → WW channel as follows: where κ H = κ H (κ V , κ f ) is the modifier to the total Higgs boson width, and X i are the different production modes. The corresponding coupling modifiers κ i equal κ f for the ggH, ttH, and bbH modes, and κ V for the VBF and VH modes. Possible contributions to the total width of the Higgs boson coming from outside of the SM are neglected. The best fit values for the coupling modifiers are found to be κ V = 0.99 ± 0.05 and κ f = 0.86 +0.14 −0.11 , where the better sensitivity to κ V is due to the H → WW decay vertex. The two-dimensional likelihood profile for the fit is shown in Fig. 25.

STXS
As explained in Section 8, the STXS measurement is carried out under the Stage 1.  sults are shown in Table 18 and in Fig. 26, for the signal strength modifiers and cross sections. The uncertainties are reported separately for statistical (stat), theoretical (theo), and experimental (exp) systematic sources. The correlation matrix for the measured STXS bins is shown in Fig. 27. Since final results are reported as cross sections, the effect of theoretical uncertainties in the normalization of signal templates is dropped, while uncertainties in the shape of the templates, such as STXS bin migration, are accounted for. In cases where cross sections are measured to be zero, an upper limit is reported instead of a symmetric confidence interval, so that all intervals reported correspond to a 68% confidence level. The compatibility of the STXS fit with the SM is found to be 1%.

Summary
A measurement of production cross sections for the Higgs boson has been performed targeting the gluon fusion, vector boson fusion, and Z or W associated production processes in the H → WW decay channel. Results are presented as signal strength modifiers, coupling modifiers, and differential cross sections in the simplified template cross section Stage 1.2 framework. The measurement has been performed on data from proton-proton collisions recorded by the CMS detector at a center-of-mass energy of 13 TeV in 2016-2018, corresponding to an integrated luminosity of 138 fb −1 . Specific event selections targeting different final states have been employed, and results have been extracted via a simultaneous maximum likelihood fit to all analysis categories. The overall signal strength for production of a Higgs boson is found to be µ = 0.95 +0.10 −0.09 . All results are in good agreement with the standard model expectation.        [8] CMS Collaboration, "Measurement of the transverse momentum spectrum of the Higgs boson produced in pp collisions at √ s = 8 TeV using H → WW decays", JHEP 03 (2017) 032, doi:10.1007/JHEP03(2017)032, arXiv:1606.01522.