Measurement of electroweak production of a W\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\mathrm{W} $$\end{document} boson in association with two jets in proton–proton collisions at s=13Te\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\sqrt{s}=13\,\text {Te}\text {V} $$\end{document}

A measurement is presented of electroweak (EW) production of a W\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\mathrm{W} $$\end{document} boson in association with two jets in proton–proton collisions at s=13Te\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\sqrt{s}=13\,\text {Te}\text {V} $$\end{document}. The data sample was recorded by the CMS Collaboration at the LHC and corresponds to an integrated luminosity of 35.9fb-1\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\,\text {fb}^{-1}$$\end{document}. The measurement is performed for the ℓν\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\ell \nu $$\end{document}jj final state (with ℓν\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\ell \nu $$\end{document} indicating a lepton–neutrino pair, and j representing the quarks produced in the hard interaction) in a kinematic region defined by invariant mass mjj>120Ge\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$m_\mathrm {jj} >120\,\text {Ge}\text {V} $$\end{document} and transverse momenta pTj>25Ge\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$p_\mathrm {T j} > 25\,\text {Ge}\text {V} $$\end{document}. The cross section of the process is measured in the electron and muon channels yielding σEW(Wjj)=6.23±0.12(stat)±0.61(syst)pb\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\sigma _\mathrm {EW}(\mathrm{W} \mathrm {jj})= 6.23 \pm 0.12 \,\text {(stat)} \pm 0.61 \,\text {(syst)} \,\text {pb} $$\end{document} per channel, in agreement with leading-order standard model predictions. The additional hadronic activity of events in a signal-enriched region is studied, and the measurements are compared with predictions. The final state is also used to perform a search for anomalous trilinear gauge couplings. Limits on anomalous trilinear gauge couplings associated with dimension-six operators are given in the framework of an effective field theory. The corresponding 95% confidence level intervals are -2.3<cWWW/Λ2<2.5Te-2\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$-2.3< c_{{\mathrm{W} \mathrm{W} \mathrm{W}}}/\varLambda ^2 < 2.5\,\text {Te}\text {V} ^{-2}$$\end{document}, -8.8<cW/Λ2<16Te-2\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$-8.8< c_{\mathrm{W}}/\varLambda ^2 < 16\,\text {Te}\text {V} ^{-2}$$\end{document}, and -45<cB/Λ2<46Te-2\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$-45< c_{\mathrm{B}}/\varLambda ^2 < 46\,\text {Te}\text {V} ^{-2}$$\end{document}. These results are combined with the CMS EW Zjj\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\mathrm{Zjj} $$\end{document} analysis, yielding the constraint on the cWWW\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$c_{{\mathrm{W} \mathrm{W} \mathrm{W}}}$$\end{document} coupling: -1.8<cWWW/Λ2<2.0Te-2\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$-1.8< c_{{\mathrm{W} \mathrm{W} \mathrm{W}}}/\varLambda ^2 < 2.0\,\text {Te}\text {V} ^{-2}$$\end{document}.


Introduction
In proton-proton (pp) collisions at the CERN LHC, the pure electroweak (EW) production of a lepton-neutrino pair ( ν) in association with two jets (jj) includes production via vector boson fusion (VBF). This process has a distinctive signature of two jets with large energy and separation in pseudorapidity (η), produced in association with a lepton-neutrino pair. This EW process is referred to as EW Wjj, and the two jets e-mail: cms-publication-committee-chair@cern.ch produced through the fragmentation of the outgoing quarks are referred to as "tagging jets". Figure 1 shows representative Feynman diagrams for the EW Wjj signal processes, namely VBF ( Fig. 1, left), bremsstrahlung-like ( Fig. 1, center), and multiperipheral ( Fig. 1, right) production. Gauge cancellations lead to a large negative interference between the VBF diagram and the other two diagrams, with the larger interference coming from bremsstrahlung-like production. Interference with multiperipheral production is limited to cases where the leptonneutrino pair mass is close to the W boson mass.
In addition to the purely EW signal diagrams described above, there are other, not purely EW processes, that lead to the same νjj final states and can interfere with the signal diagrams in Fig. 1. This interference effect between the signal production and the main Drell-Yan (DY) background processes (DY Wjj) is small compared to the interference effects among the EW production amplitudes, but needs to be included when measuring the signal contribution. Figure 2 (left) shows one example of W boson production in association with two jets that has the same initial and final states as those in Fig. 1. A process that does not interfere with the EW signal is shown in Fig. 2 (right).
The study of EW Wjj processes is part of a more general investigation of standard model (SM) VBF and scattering processes that includes the measurements of EW Zjj processes, Higgs boson production [1][2][3], and searches for physics beyond the SM [4]. The properties of EW Wjj events that are isolated from the backgrounds can be compared with SM predictions. Probing the additional hadronic activity in selected events can shed light on the modeling of the additional parton radiation [5,6], which is important for signal selection and the vetoing of background events.
Higher-dimensional operators outside the SM can generate anomalous trilinear gauge couplings (ATGCs) [7,8], so the measurement of the coupling strengths provides an indirect search for beyond-the-SM physics at mass scales not directly accessible at the LHC. Representative diagrams for W boson production in association with two jets (DY Wjj) that constitute the main background for the measurement At the LHC, the EW Wjj process was first measured by the CMS Collaboration using pp collisions at √ s = 8 TeV [9] and then by the ATLAS Collaboration at both √ s = 8 TeV and √ s = 7 TeV [10]. The closely related EW Zjj process was first measured during Run 1 by the CMS Collaboration using pp collisions at √ s = 7 TeV [11], and then at √ s = 8 TeV by both the CMS [12] and ATLAS [13] Collaborations. The EW Zjj measurements using data samples of pp collisions at √ s = 13 TeV have been performed by ATLAS [14] and by CMS [15]. Considering leptonic final states in the same kinematic region the EW Wjj cross section is about a factor 10 larger than the EW Zjj cross section. All results so far agree with the expectations of the SM within a precision of 10-20%.
This paper presents measurements of the EW Wjj process with the CMS detector using pp collisions collected at √ s =13 TeV during 2016, corresponding to an integrated luminosity of 35.9 fb −1 . A multivariate analysis (BDT), based on the methods developed for the EW Zjj measurement [11,12], is used to separate signal events from the large W+jets background. The analysis of the 13 TeV data offers the opportunity to measure the cross section at a higher energy than previously done and to reduce the uncertainties obtained with previous measurements, given both the larger integrated luminosity and the larger predicted total cross section. This paper is organized as follows: Sect. 2 describes the experimental apparatus and Sect. 3 the event simulations. Event selection procedures are described in Sect. 4, together with the selection efficiencies and background estimations using control regions (CRs). Section 5 describes an estimation of the multijet background from quantum chromodynamics (QCD), based on CRs in data. Section 6 discusses a correction applied to the simulation as a function of the invariant mass m jj . Section 7 presents distributions of the main discriminating variables in data. Section 8 details the strategy adopted to extract the signal from the data, and the corresponding systematic uncertainties are summarized in Sect. 9. The cross section and anomalous coupling results are presented in Sects. 10 and 11, respectively. Section 12 presents a study of the additional hadronic activity in an EW Wjj enriched region. Finally, a brief summary of the results is given in Sect. 13.

The CMS detector and physics objects
The central feature of the CMS apparatus is a superconducting solenoid of 6 m internal diameter, providing a magnetic field of 3.8 T. Within the solenoid volume are a silicon pixel and strip tracker, a lead tungstate crystal electromagnetic calorimeter (ECAL), and a brass and scintillator hadron calorimeter, each composed of a barrel and two endcap sections. Forward calorimeters extend the η coverage provided by the barrel and endcap detectors to |η| = 5.2. Muons are measured in gas-ionization detectors embedded in the steel flux-return yoke outside the solenoid.
The tracker measures charged particles within the range |η| < 2.5. It consists of 1440 pixel and 15,148 strip detector modules. For nonisolated particles with transverse momenta 1 < p T < 10 GeV and |η| < 1.4, the track resolutions are typically 1.5% in p T and 25-90  µm in the transverse (longitudinal) impact parameter [16].
The energy of electrons is measured after combining the information from the ECAL and the tracker, whereas their direction is measured by the tracker. The momentum res-olution for electrons with p T ≈ 45 GeV from Z → ee decays ranges from 1.7 to 4.5%. It is generally better in the barrel region than in the endcaps, and also depends on the bremsstrahlung energy emitted by the electron as it traverses the material in front of the ECAL [17].
Muons are measured in the range |η| < 2.4, with detection planes made using three technologies: drift tubes, cathode strip chambers, and resistive-plate chambers. Matching muons to tracks measured in the silicon tracker results in a relative transverse momentum resolution for muons with 20 < p T < 100 GeV of 1.3-2.0% in the barrel and better than 6% in the endcaps. The p T resolution in the barrel is better than 10% for muons with p T up to 1 TeV [18].
Events of interest are selected using a two-tiered trigger system [19]. The first level (L1), composed of custom hardware processors, uses information from the calorimeters and muon detectors to select events at a rate of around 100 kHz within a time interval of less than 4 µs. The second level, known as the high-level trigger (HLT), consists of a farm of processors running a version of the full-event reconstruction software optimized for fast processing, and reduces the event rate to around 1 kHz before data storage.
A more detailed description of the CMS detector, together with a definition of the coordinate system used and the relevant kinematic variables, can be found in Ref. [20].

Simulation of signal and background events
Signal events are simulated at leading order (LO) using the MadGraph5_amc@nlo (v2.3.3) Monte Carlo (MC) generator [21], interfaced with pythia (v8.212) [22] for parton showering (PS) and hadronization. The NNPDF30 [23] parton distribution functions (PDFs) are used to generate the events. The underlying event is modeled using the CUETP8M1 tune [24]. The simulation does not include extra partons at matrix element (ME) level. The signal is defined in the kinematic region with parton transverse momentum p Tj > 25 GeV, and diparton invariant mass m jj > 120 GeV. The simulated cross section for the νjj final state (with = e, μ or τ ), applying the above requirements, is σ LO (EW νjj) = 6.81 +0.03 −0.06 (scale) ± 0.26 (PDFs) pb, where the first uncertainty is obtained by changing simultaneously the factorization (μ F ) and renormalization (μ R ) scales by factors of 2 and 1/2, and the second one reflects the uncertainties in the NNPDF30 PDFs. The LO signal cross section and relevant kinematic distributions estimated with MadGraph5_amc@nlo are in agreement within 2-5% with the next-to-leading-order (NLO) predictions of the vbfnlo generator (v2.6.3) [25][26][27], which include QCD NLO corrections to the LO ME-level diagrams evaluated with MadGraph5_amc@nlo. For additional comparisons, signal events produced with MadGraph5_amc@nlo are also processed with the herwig++ (v2.7.1) [28] PS, using the EE5C [29] tune.
An additional signal sample that includes NLO QCD corrections but does not include the s-channel contributions to the final state has been generated with powheg (v2.0) [30][31][32], based on the vbfnlo ME calculations [33,34]. In the powheg sample the m jj > 120 GeV condition is applied on the two p T -leading parton-level jets, after clustering the ME final state partons with the k T -algorithm [35][36][37], with a distance parameter D = 0.8, as done in Ref. [33]. The powheg sample has also been processed alternatively with pythia and herwig++ parton showering (PS) and hadronization programs, as done for the MadGraph5_amc@nlo samples. In the following, results obtained with the powheg signal samples are given as a cross check of the main results obtained with the MadGraph5_amc@nlo signal samples.
Events coming from processes including ATGCs are generated with the same settings as the SM sample, but include additional information for reweighting in the threedimensional effective field theory (EFT) parameter space, which is described in more detail in Sect. 11. The 'EWdim6NLO' model [8,21] is used for the generation of anomalous couplings.
Background W boson events are also simulated with MadGraph5_amc@nlo using (1) an NLO ME calculation with up to three final-state partons generated from QCD interactions, and (2) an LO ME calculation with up to four partons from QCD interactions. The ME-PS matching is performed following the FxFx prescription [38] for the NLO case, and the MLM prescription [39,40] for the LO case. The NLO background simulation is used to extract the final results, while the independent LO samples are used to perform the multivariate discriminant training. The inclusive W boson production is normalized to σ th (W) = 61.5 nb, as computed at next-to-next-to-leading order (NNLO) with fewz (v3.1) [41].
The evaluation of the interference between EW Wjj and DY Wjj processes relies on the predictions obtained with MadGraph5_amc@nlo. A dedicated sample of events arising from the interference terms is generated directly by selecting the contributions of order α s α 3 EW , and passed through the full detector simulation to estimate the expected interference contribution.
The contribution from QCD multijet processes is derived via an extrapolation from a QCD data CR with the lepton relative isolation selection inverted. All background simulations make use of the pythia PS model with the CUETP8M1 tune.
A detector simulation based on Geant4 (v9.4p03) [48,49] is applied to all the generated signal and background samples. The presence of multiple pp interactions is incorporated by simulating additional interactions (pileup), both in-time and out-of-time with respect to the hard interaction, with a multiplicity that matches the distribution observed in data. The average pileup is measured to be about 23 additional interactions per bunch crossing.

Reconstruction and selection of events
Events containing exactly one isolated, highp T lepton and at least two highp T jets are selected. Isolated single-lepton triggers are used to acquire the data, where the lepton is required to have p T > 27 GeV for the electron trigger and p T > 24 GeV for the muon trigger.
The offline analysis uses candidates reconstructed by the particle-flow (PF) algorithm [50]. In the PF event reconstruction, all stable particles in the event -i.e., electrons, muons, photons, charged and neutral hadrons -are reconstructed as PF candidates using information from all subdetectors to obtain an optimal determination of their direction, energy, and type. The PF candidates are used to reconstruct the jets and the missing transverse momentum.
The reconstructed primary vertex (PV) with the largest value of summed physics-object p 2 T is the primary pp interaction vertex. The physics objects are the objects returned by a jet finding algorithm [51,52] applied to all charged particle tracks associated with the vertex, along with the corresponding associated missing transverse momentum. Charged tracks identified as hadrons from pileup vertices are omitted in the subsequent PF event reconstruction [50].
Offline electrons are reconstructed from clusters of energy deposits in the ECAL that match tracks extrapolated from the silicon tracker [17]. Offline muons are reconstructed by fitting trajectories based on hits in the silicon tracker and in the muon system [53]. Reconstructed electron or muon candidates are required to have p T > 20 GeV. Electron candidates are required to be reconstructed within |η| ≤ 2.4, excluding the barrel-to-endcap transitional region 1.444 < |η| < 1.566 of the ECAL [20]. Muon candidates are required to be reconstructed in the fiducial region |η| ≤ 2.4. The track associated with a lepton candidate is required to have both its trans-verse and longitudinal impact parameters compatible with the position of the PV of the event.
The leptons are required to be isolated; the isolation (I ) variable is calculated from PF candidates and is corrected for pileup on an event-by-event basis [54]. The scalar p T sum of all PF candidates reconstructed in an isolation cone with radius ΔR = √ (Δη) 2 + (Δφ) 2 = 0.4 around the lepton's momentum vector, excluding the lepton itself, is required to be less than 6% of the electron or muon p T value. For additional offline analysis, the isolated lepton is required to have p T > 25 GeV for the muon channel and p T > 30 GeV for the electron channel. Events with more than one lepton satisfying the above requirements are rejected. The lepton flavor samples are exclusive and precedence is given to the selection of muons.
The missing transverse momentum vector, p miss T , is calculated offline as the negative of the vector sum of transverse momenta of all PF objects identified in the event [55], and the magnitude of this vector is denoted p miss T . Events are required to have p miss T in excess of 20 GeV in the muon channel and 40 GeV in the electron channel. The tighter requirement for the electron channel reduces the corresponding higher background of QCD multijet events. The transverse mass (m T ) of the lepton and p miss T four-vector sum is then required to exceed 40 GeV in both channels.
Jets are reconstructed by clustering PF candidates with the anti-k T algorithm [51,56] using a distance parameter of 0.4. The jet momentum is the vector sum of all particle momenta in the jet and is typically within 5-10% of the true momentum over the whole p T spectrum and detector acceptance.
An offset correction is applied to jet energies because of the contribution from pileup. Jet energy corrections are derived from simulation, and are confirmed with in situ measurements of the energy balance in dijet, multijet, pho-ton+jet, and Z+jets events with leptonic Z boson decays [57]. Loose jet identification criteria are applied to reject misreconstructed jets resulting from detector noise [58]. Loose criteria are also applied to remove jets heavily contaminated with pileup energy (clustering of energy deposits not associated with a parton from the primary pp interaction) [58,59]. The efficiency of the jet identification is greater than 99%, with a rejection of 90% of background pileup jets with p T 50 GeV and |η| ≤ 2.5. For jets with |η| > 2.5 and 30 < p T < 50 GeV, the efficiency is approximately 90% and the pileup jet rejection is approximately 50%. The jet energy resolution (JER) is typically ≈15% at 10 GeV, 8% at 100 GeV, and 4% at 1 TeV for jets with |η| ≤ 1 [57]. Jets reconstructed with p T ≥ 15 GeV and |η| ≤ 4.7 are used in the analysis.
The two highest p T jets are defined as the tagging jets, and are required to have p T > 50 GeV and p T > 30 GeV for the leading and subleading (in p T ) jet, respectively. The Table 1 Event yields expected for background and signal processes using the initial selections and with a selection on the multivariate analysis output (BDT) that provides similar signal and background yields. The yields are compared to the data observed in the different channels and categories. The total uncertainties quoted for signal, DY Wjj and diboson backgrounds, and processes with top quarks (tt and single top quarks) include the systematic uncertainties invariant mass of the two tagging jets is required to satisfy m jj > 200 GeV. The transverse momentum of the W boson ( p TW ) is evaluated as the vector sum of the lepton p T and p miss T . The event p T balance (R( p T )) is then defined as where p Tj 1 and p Tj 2 are the transverse momenta of the two tagging jets. Finally, events are required to have R( p T ) < 0.2. This has a negligible effect on the analysis sensitivity and allows the definition of a nonoverlapping control sample with R( p T ) > 0.2 that is used to derive a correction to the invariant mass based on a CR in data, as described in Sect. 6.
A multivariate analysis technique, described in Sect. 8, is used to provide an optimal separation of the DY Wjj and EW Wjj components of the inclusive νjj spectrum. The main discriminating variables are the dijet invariant mass m jj and pseudorapidity separation Δη jj . Angular variables useful for signal discrimination include the y * Zeppenfeld variable [6], defined as the difference between the rapidity of the W boson y W and the average rapidity of the two tagging jets, i.e., and the z * Zeppenfeld variable [6] defined as where Δy jj is the dijet rapidity separation. Table 1 reports the expected and observed event yields after the initial selection and after imposing a minimum value for the final multivariate discriminant output applied to define the signal-enriched region used for the studies of additional hadronic activity described in Sect. 12.

Discriminating quarks from gluons
Jets in signal events are expected to originate from quarks, whereas for background events it is more probable that jets are initiated by a gluon. A quark-gluon likelihood (QGL) discriminant [11] is evaluated for the two tagging jets with the intent of distinguishing the nature of each jet.
The QGL discriminant exploits differences in the showering and fragmentation of quarks and gluons, making use of the following internal jet composition observables: (1) the particle multiplicity of the jet, (2) the minor root-meansquare of distance between the jet constituents in the η-φ plane, and (3) the p T distribution function of the jet constituents, as defined in Ref. [60].
The variables are used as inputs to a likelihood discriminant on gluon and quark jets constructed from simulated dijet events. The performance of the QGL discriminant is evaluated and validated using independent, exclusive samples of Z+jet and dijet data [60]. Corrections to the simulated QGL distributions and related systematic uncertainties are derived from a comparison of simulation and data distributions.

The QCD multijet background
The QCD multijet contribution is estimated by defining a multijet-enriched CR with inverted lepton isolation criteria for both the muon and electron channels. In the nominal selection both lepton types are required to pass the relative isolation requirement I < 0.06, whereas the multijetenriched CRs are defined with the same event selection but with isolation requirements 0.06 < I < 0.12 and 0.06 < I < 0.15, for the muon and electron channel respectively. It is then assumed that the p miss T distribution of QCD events has the same shape in both the nominal and the multijet-enriched CR.
The various components, with floating W+jets and QCD multijet background scale factors, are simultaneously fitted to the p miss T data distributions, independently in the muon and electron channels, and the expected QCD multijet yields in the nominal regions are derived.
The contribution of QCD multijet processes in any other observable (x) used in the analysis is then normalized to the yields obtained above from the fit to the p miss T distribution, and the shape for the distribution x is taken as the difference between data and all simulated background contributions in the x distribution in the multijet-enriched CR.
The estimation of the QCD multijet contribution based on a CR in data is validated by checking the modeling of other variables that discriminate QCD multijets from W+jets such as the W transverse mass and the minimum difference in φ between the missing transerse energy and the jets. Good agreement with the data is observed in all distributions. The stability of the W+jets fitted normalization is checked by varying the selection requirements for the fitted region and repeating the QCD extraction fit. The observed variations in fitted normalization when varying the m T (W) and p miss T selection requirements with respect to the fit region definition are much smaller than systematic uncertainties.
Although b tagging is not used in this analysis, a b-tagging discriminant output [61] is used to check the fitted W+jets background normalization as well as the tt normalization from simulation, and they agree with data within the uncertainties. Finally, the selections on m jj , p miss T , and m T (W) are also loosened in order to verify that the W+jets background scale factor is not biased by these requirements.

The m jj correction
A systematic overestimation of the simulation yields is caused by a partial mistiming of the signals in the forward region of the ECAL endcaps (2.5 < |η| < 3.0). This effect, which increases with increasing m jj , is observed in both electron and muon channels. A correction for this effect is derived in the nonoverlapping signal-depleted CR obtained by requiring that the event transverse momentum balance R( p T ), defined in Sect. 4, exceeds 0.2. A third-order polynomial correction is first applied to the W+jets simulation separately in the muon and electron channels in order to match the R( p T ) distribution in data. The magnitude of the applied R( p T ) corrections is about 10%. The uncertainty in this correction due to the limited statistical precision of the simulation as well as data is propagated to the fitted W+jets templates.
A correction to the m jj prediction from simulation is derived in the signal-depleted R( p T ) > 0.2 CR via a thirdorder polynomial fit to the ratio of data to the overall prediction from simulation for signal and background as a function of ln(m jj / GeV). The electron and muon channels are combined when deriving the m jj correction. The uncertainty in the correction includes the data statistical component as well as the systematic uncertainty due to the limited statistical precision of the simulation. Figure 3 shows the fitted correction including the uncertainty. This correction is applied to all simulated results, including the signal, and the corresponding uncertainty is propagated to the signal extraction fits.   Distribution of the missing transverse momentum (upper) and the leptonp miss T system transverse mass (lower) after the event preselection for the selected leading lepton in the event, in the muon (left) and electron (right) channels. In all plots the last bin contains overflow events the data agree within total uncertainties for all discriminating variables.

Signal discriminants and extraction procedure
The EW Wjj signal is characterized by a large pseudorapidity separation between the tagging jets, due to the smallangle scattering of the two initial partons. Because of both the topological configuration and the large energy of the outgoing partons, m jj is also expected to be large, and can be used to distinguish the EW Wjj and DY Wjj processes. The correlation between Δη jj and m jj is expected to be different in signal and background events, therefore these characteristics are expected to yield a high separation power between EW Wjj and DY Wjj production. In addition, in signal events it is expected that the W boson candidate is produced centrally in the rapidity region defined by the two tagging jets. As a consequence, signal events are expected to yield lower values of z * compared to the DY background. Other variables that are used to enhance the signal-to-background separation are related to the kinematics of the event or to the properties of the jets that are expected to be initiated by quarks.
The variables that are used in the multivariate analysis are: (1) m jj , (2) Δη jj , (3) z * , and (4) the QGL values of the two tagging jets. The output is built by training a boosted decision tree (BDT) discriminator with the tmva package [62] to achieve an optimal separation between the EW Wjj and DY Wjj processes. The simulated events that are used for the BDT training are not used for the signal extraction.
To improve the sensitivity for the extraction of the signal component, the transformation that originally projects the BDT output value in the [−1,+1] interval is changed to BDT = tanh −1 ((BDT +1)/2). This allows the purest signal region of the BDT output to be better sampled while keeping an equal-width binning of the BDT variable.   . 6 Distributions of the "Zeppenfeld" variables y (W) (upper) and z (W) (lower) after event preselection in the muon (left) and electron (right) channels. In all plots the first and last bins contain overflow events Figure 8 shows the distributions of the discriminants for the two leptonic channels. Good overall agreement between simulation and data is observed in all distributions, and the signal presence is visible at high BDT' values.
A binned maximum likelihood is built from the expected rates for each process, as a function of the value of the discriminant, which is fit to extract the strength modifiers for the EW Wjj and DY Wjj processes, μ = σ (EW Wjj)/σ LO (EW νjj) and υ = σ (W)/σ NNLO (W). Nuisance parameters are added to modify the expected rates and shapes according to the estimate of the systematic uncertainties affecting the measurement.
The interference between the EW Wjj and DY Wjj processes is included in the fit procedure, and its strength scales as √ μυ. The interference model is derived from the Mad-Graph5_amc@nlo simulation described in Sect. 3.
The parameters of the model (μ and υ) are determined by maximizing the likelihood. The statistical methodology follows the one used in other analyses [63] using asymptotic formulas [64]. In this procedure the systematic uncertain-

Systematic uncertainties
The main systematic uncertainties affecting the measurement are classified into experimental and theoretical according to their sources. Some uncertainties affect only normalizations, whereas others affect both the normalization and shape of the BDT output distribution.

Experimental uncertainties
The following experimental uncertainties are considered.
Integrated luminosity. A 2.5% uncertainty is assigned to the value of the integrated luminosity [65]. Trigger and selection efficiencies. Uncertainties in the efficiency corrections based on control samples in data for the leptonic trigger and offline selections are included and amount to a total of 2-3% depending on the lepton p T and η, for both the e and μ channels. These uncertainties are estimated by comparing the lepton efficiencies expected in simulation and measured in data with a "tagand-probe" method [66].
Jet energy scale and resolution. The uncertainty in the energy of the jets affects the event selection and the computation of the kinematic variables used to calculate the discriminants. Therefore, the uncertainty in the jet energy scale (JES) affects both the expected event yields and the final shapes. The effect of the JES uncertainty is studied by rescaling up and down the reconstructed jet energy by p T -and η-dependent scale factors [57]. An analogous approach is used for the JER. QGL discriminator. The uncertainty in the performance of the QGL discriminator is measured using independent Z+jet and dijet data, after comparing with the corresponding simulation predictions [60]. Shape variations corresponding to the full differences between the data and the simulation are used as estimates of the uncertainty. Pileup. Pileup can affect the identification and isolation of the leptons or the corrected energy of the jets. When the jet clustering algorithm is run, pileup can distort the reconstructed dijet system because of the contamination of tracks and calorimetric deposits. This uncertainty is evaluated by generating alternative distributions of the number of pileup interactions, corresponding to a 4.6% uncertainty in the total inelastic pp cross section at √ s = 13 TeV [67]. Limited number of simulated events. For each signal and background simulation, shape variations for the distributions are considered by shifting the content of each bin up or down by its statistical uncertainty [68]. This generates alternatives to the nominal shape that are used to describe the uncertainty from the limited number of simulated events. m jj correction. As described in Sect. 6, the m jj prediction from simulation is corrected to match the distribution in data in a signal-depleted R( p T ) > 0.2 control region. The uncertainty in this correction is derived by varying the fitted points within the statistical uncertainty from data and simulation combined and refitting the correction. QCD multijet background template. As described in Sect. 5, the QCD multijet prediction is extrapolated from the data in a nonoverlapping CR. The uncertainty in the QCD multijet background template shape is derived by taking the envelope of the shape obtained when varying the lepton isolation requirement used to define the multijet-enriched CR. A 50% uncertainty in the QCD multijet background normalization is also included.

Theoretical uncertainties
The following theoretical uncertainties are considered in the analysis.
PDF. The PDF uncertainties are evaluated by comparing the nominal distributions to those obtained when using the alternative PDFs of the NNPDF set, including α s variations. Factorization and renormalization scales. To account for theoretical uncertainties, signal and background shape variations are built by changing the values of μ F and μ R from their defaults by factors of 2 or 1/2 in the ME calculation, simultaneously for μ F and μ R , but independently for each simulated sample. Signal acceptance. A 5% uncertainty on the signal yield is assigned to account for differences between the prediction for the LO signal with respect to the NLO predictions of the vbfnlo generator (v2.6.3). Normalization of top quark and diboson backgrounds. Diboson and top quark production processes are modeled with MC simulations. An uncertainty in the normalization of these backgrounds is assigned based on the PDF and μ F , μ R uncertainties, following calculations in Refs. [42,43,47]. Interference between EW Wjj and DY Wjj. An overall normalization and a shape uncertainty are assigned to the interference term in the fit, based on an envelope of predictions with different μ F , μ R scales. Parton showering model. The uncertainty in the PS model and the event tune is assessed as the full difference of the acceptance and shape predictions using pythia and herwig++. R( p T ) correction. As described in Sect. 6, the R( p T ) prediction from W+jets simulation is corrected to match the distribution in data with all expected contributions other than W+jets subtracted. The uncertainty in this correction is derived by varying the fitted points within the statistical uncertainty from data and simulation combined and refitting the correction.

Measurement of the EW Wjj production cross section
The signal strength, defined with the νjj final state in the kinematic region described in Sect. 3, is extracted from the fit to the BDT output distribution as discussed in Sect. 8. Figure 9 shows the BDT distribution in the muon and electron channels for data and simulation after the fit, where the grey uncertainty band includes all systematic uncertainties. Good agreement is observed between the data and simulation within the uncertainties.
In the muon channel, the signal strength is measured to be In the electron channel, the signal strength is measured to be μ = 0.92 ± 0.03 (stat) ± 0.13 (syst) = 0.92 ± 0.13 (total), corresponding to a measured signal cross section σ (EW νjj) = 6.27 ± 0.19 (stat) ± 0.80 (syst) pb = 6.27 ± 0.82 (total) pb. The results obtained for the different lepton channels are compatible with each other, and in agreement with the SM predictions.
From the combined fit of the two channels, the signal strength is measured to be μ = 0.91 ± 0.02 (stat) ± 0.10 (syst) = 0.91 ± 0.10 (total), corresponding to a measured signal cross section σ (EW νjj) = 6.23 ± 0.12 (stat) ± 0.61 (syst) pb = 6.23 ± 0.62 (total) pb, in agreement with the MadGraph5_amc@nlo LO prediction σ LO (EW νjj) = 6.81 +0.03 −0.06 (scale) ± 0.26 (PDF) pb. In the combined fit, the DY strength is ν = 0.88 ± 0.07. Using the statistical methodology described in Sect. 8, the background-only hypotheses in the electron, muon, and combined channels are all excluded with significance above five standard deviations. Table 2 lists the major sources of uncertainty and their impact on the measured precision of μ. The largest sources of experimental uncertainty are the m jj correction, the JES, and the limited number of simulated events, while the largest sources of theoretical uncertainty are the μ F , μ R scale uncertainties and the uncertainty in the signal acceptance, derived by comparing the LO signal prediction with the prediction from the vbfnlo generator.
The signal strength is also measured with respect to the NLO signal prediction, as described in Sect. 3. In the muon channel, the signal strength is measured to be μ NLO = 0.91 ± 0.02 (stat) ± 0.12 (syst) = 0.91 ± 0.12 (total).

Limits on anomalous gauge couplings
It is useful to look for signs of new physics via a modelindependent EFT framework. In the framework of EFT, new physics can be described as an infinite series of new interaction terms organized as an expansion in the mass dimension of the operators.
In the EW sector of the SM, the first higher-dimensional operators containing bosons are six-dimensional [8]: where, as is customary, group indices are suppressed and the mass scale Λ is factorized from the coupling constants c. In Eq. (4), W μν is the SU(2) field strength, B μν is the U(1) field strength, Φ is the Higgs doublet, and operators with a tilde are the magnetic duals of the field strengths. The first three operators are charge and parity conserving, whereas the last two are not. Models with operators that preserve charge conjugation and parity symmetries can be included in the calculation either individually or in pairs. With these assumptions, the values of coupling constants divided by the mass scale c/Λ 2 are measured. show the ratio between data and prediction minus one with the statistical uncertainty from simulation (grey hatched band) as well as the leading systematic uncertainties in the shape of the p T distribution These operators have a rich phenomenology since they contribute to many multiboson scattering processes at tree level. The operator O W W W modifies vertices with three or six vector bosons, whereas the operators O W and O B modify both the HVV vertices and vertices with three or four vector bosons. A more detailed description of the phenomenology of these operators can be found in Ref. [69]. Modifications to the ZWW and γ WW vertices are investigated in this analysis, since these modify the pp → Wjj cross section.

Statistical analysis
The measurement of the coupling constants uses templates in the p T of the lepton from the W → ν decay. Because this is well measured and longitudinally Lorentz invariant, this variable is robust against mismodeling and ideal for this purpose. An additional requirement of BDT > 0.5 has been applied, which is optimized based on the expected sensitivity to the ATGC signal. The expected limits are subsequently improved by 20-25% with respect to the expected limits without a BDT selection. In each channel, four bins from 0 < p T < 1.2 TeV are used, where the last bin contains overflow and its lower bin edge boundary has been optimized separately for each channel.
For each signal MC event, 125 weights are assigned that correspond to a 5×5×5 grid in To construct the p T templates, the associated weights calculated for each event are used to construct a parametrized model of the expected yield in each bin as a function of the values of the dimension-six operators' coupling constants. For each bin, the ratios of the expected signal yield with dimension-six operators to the one without (leaving only the SM contribution) are fitted at each point of the grid to a quadratic polynomial. The highest p T bin has the largest statistical power to detect the presence of higher-dimensional operators. Figure 10 shows examples of the final templates, with the expected signal overlaid on the background expectation, for three different hypotheses of dimension-six operators. The SM distribution is normalized to the expected cross section.
A simultaneous binned fit for the values of the ATGCs is performed in the two lepton channels. A profile likelihood method, the Wald Gaussian approximation, and Wilks' theorem [78] are used to derive confidence intervals at 95% confidence level (CL). One-dimensional and two-dimensional limits are derived on each of the three ATGC parameters and each combination of two ATGC parameters while all other parameters are set to their SM values. Systematic and theoret-  ical uncertainties are represented by the individual nuisance parameters with log-normal distributions and are profiled in the fit.

Results
No significant deviation from the SM expectation is observed. Limits on the EFT parameters are reported and also translated into the equivalent parameters defined in an effective Lagrangian (LEP parametrization) in Ref.
Results for the one-dimensional limits are listed in Table 3 for c W W W , c W and c B , and in Table 4 for λ, Δg Z 1 and Δκ Z 1 ; two-dimensions limits are shown in Figs. 11 and 12. The results are dominated by the sensitivity in the muon channel due to the larger acceptance for muons. An ATGC signal is not included in the interference between EW and DY production. The effect on the limits is small (<3%). The LHC semileptonic WZ analysis using 13 TeV data currently sets the most stringent limits on c W W W /Λ 2 and c W /Λ 2 , while the WW analysis using 8 TeV data currently sets the tightest limits on c B /Λ 2 . This analysis is most sensitive to c W W W /Λ 2 , where the limit is slightly less restrictive but comparable.

Combination with the VBF Z boson production analysis
As mentioned in Sect.   This result included constraints on ATGC EFT parameters obtained via a fit to the p T (Z) distribution, an experimentally clean observable sensitive to deviations from zero in the ATGC parameters. Both the EW Zjj and EW Wjj analyses are sensitive to anomalous couplings related to the WWZ vertex. A simultaneous binned likelihood fit for the ATGC parameters is performed to the p T (Z) distribution in the EW Zjj production and and p T in the EW Wjj production. In the combined fit, the primary uncertainty sources are correlated including the JES and JER uncertainties. Results for the onedimensional limits are listed in Table 5 for c W W W , c W and c B , and in Table 6 for λ, Δg Z 1 , and Δκ Z 1 ; two-dimensions limits are shown in Figs. 13 and 14.

Study of the hadronic and jet activity in W+jet events
Having established the presence of the SM signal, the properties of the hadronic activity in the selected events can be examined, in particular in the the region in rapidity between the two tagging jets, with low expected hadron activity (rapidity gap). The production of additional jets in the rapidity gap, in a region with a larger contribution of EW Wjj processes is explored in Sect. 12. 1

CMS
(13 TeV) The comparison reveals a deficit in the simulation predictions with pythia parton showering for the rate of events with lower additional jet activity, whereas the tail of higher additional activity is generally in better agreement.
A suppression of additional jets is observed in data compared with the background-only simulation shapes. In

Study of charged hadron activity
For this study, a collection is formed of high-purity tracks [80] with p T > 0.3 GeV, uniquely associated with the main PV in the event. Tracks associated with the lepton or with the tagging jets are excluded from the selection. The association between the selected tracks and the reconstructed PVs is carried out by minimizing the longitudinal impact parameter, which is defined as the z-distance between the PV and the point of closest approach of the track helix to the PV, labeled d PV z . The association is required to satisfy the conditions d PV A collection of "soft-track" jets is defined by clustering the selected tracks using the anti-k T clustering algorithm [51] with a distance parameter of R = 0.4. The use of track jets represents a clean and well-understood method [81] to reconstruct jets with energy as low as a few GeV. These jets are not affected by pileup because of the association of the constituent tracks with the hard scattering vertex [82].
Track jets of low p T and within η tag jet min < η < η tag jet max are considered for the study of the hadronic activity between the tagging jets, and referred to as "soft activity" (SA). For each event, the scalar p T sum of the soft-track jets with p T > 1 GeV is computed, and referred to as the "soft H T " variable. Figures 17 and 18 show the distribution of the leading soft-track jet p T and soft H T in the signal-enriched region (BDT > 0.95), for the electron and muon channels, compared to predictions from pythia and herwig++ PS models. The plots show some disagreement between the data and the predictions, in particular in the regions of small additional activity, when compared with the pythia predictions.

Study of hadronic activity vetoes
The efficiency of a hadronic activity veto corresponds to the fraction of events with a measured gap activity below a given threshold. This efficiency is studied as a function of the applied threshold for various gap activity observables. The veto thresholds studied here start at 15 GeV for gap activities measured with standard PF jets, while they go down to 1 GeV for gap activities measured with soft-track jets. Figure 19 shows the gap activity veto efficiency of combined muon and electron events in the signal-enriched region when placing an upper threshold on the p T of the additional third jet, on the H T of all additional jets, on the leading soft-activity jet p T , or on the soft-activity H T . The observed efficiency in data is compared to expected efficiencies for background-only events, and efficiencies for background plus signal events where the signal is modeled with pythia or herwig++. Data points clearly disfavor the backgroundonly predictions and are in reasonable agreement with the presence of the signal with the herwig++ PS predictions for gap activities above 20 GeV, while the signal with pythia PS seems to generally overestimate the gap activity. In the events with very low gap activity, in particular below 10 GeV as measured with the soft track jets, the data indicates gap activities also below the herwig++ PS predictions. In addition, the expected efficiencies are included for background plus signal events where the signal is modeled with powheg (Sect. 3) with herwig++ PS. The powheg plus herwig++ prediction is in good agreement with the LO plus herwig++ prediction.

Summary
The cross section of the electroweak production of a W boson in association with two jets is measured in the kinematic region defined as invariant mass m jj > 120 GeV and transverse momenta p Tj > 25 GeV. The data sample corresponds to an integrated luminosity of 35.9 fb −1 of proton-proton collisions at centre-of-mass energy √ s = 13 TeV recorded by the CMS Collaboration at the LHC. The measured cross section σ EW (Wjj) = 6.23 ± 0.12 (stat) ± 0.61 (syst) pb agrees with the leading order standard model prediction. This is the first observation of this process at √ s = 13 TeV. A search is performed for anomalous trilinear gauge couplings associated with dimension-six operators as given in the framework of an effective field theory. No evidence for ATGCs is found, and the corresponding 95% confidence level intervals on the dimension-six operators are −2.3 < c WWW /Λ 2 < 2.5 TeV −2 , −8.8 < c W /Λ 2 < 16 TeV −2 , and −45 < c B /Λ 2 < 46 TeV −2 . These results are combined with previous results on the electroweak production of a Z boson in association with two jets, yielding the limit on the c WWW coupling −1 The additional hadronic activity, as well as the efficiencies for gap activity vetos, are studied in a signalenriched region. Generally reasonable agreement is found between the data and the quantum chromodynamics predictions with the herwig++ parton shower and hadronization model, while the pythia model predictions typically show greater activity in the rapidity gap between the two tagging jets.
Acknowledgements We congratulate our colleagues in the CERN accelerator departments for the excellent performance of the LHC and thank the technical and administrative staffs at CERN and at other CMS institutes for their contributions to the success of the CMS effort. In addition, we gratefully acknowledge the computing centers and personnel of the Worldwide LHC Computing Grid for delivering so effectively the computing infrastructure essential to our analyses. Finally, we acknowledge the enduring support for the construction and operation of the LHC and the CMS detector provided by the following funding agencies: BMBWF

Data Availability Statement
This manuscript has no associated data or the data will not be deposited. [Authors' comment: Release and preservation of data used by the CMS Collaboration as the basis for publications is guided by the CMS policy as written in its document "CMS data preservation, re-use and open access policy" (https://cms-docdb.cern. ch/cgi-bin/PublicDocDB/RetrieveFile?docid=6032&filename=CMSD ataPolicyV1.2.pdf&version=2).] Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecomm ons.org/licenses/by/4.0/. Funded by SCOAP 3 .

A Additional rapidity gap observables
A set of rapidity gap observables in the high signal purity region BDT > 0.95 is studied in addition to the results described in Sect. 12. The number of soft activity jets, defined in Sect. 12.2, in the rapidity gap between the two tag jets is shown for soft activity jet p T > 10, 5, and 2 GeV in Figures 20, 21, and 22, respectively. These distributions are consistent with the general underestimation of the simulation with respect to data at low activity values, particularly for the pythia parton showering.

B Jet activity in signal-depleted region
Section 12 shows a comparison of the data with simulation with pythia and herwig++ parton showering separately in a high purity signal region with BDT > 0.95. The agreement of the simulation with data for the background prediction is validated for the rapidity gap observables in the signal-depleted region BDT < 0.95, where the signal purity is less than 2%. Figures 23, 24, 25, and 26 show the leading additional jet p T , the total H T of the additional jets, the leading soft activity jet p T , and the total soft activity jet H T , respectively, in the region BDT < 0.95. Good agreement is observed between the background prediction and the data for all observables.  The efficiency of a hadronic activity veto, as described in Sect. 12.3, is studied in the signal-depleted BDT < 0.95 region. Figure 27 shows the gap activity veto efficiency of combined muon and electron events in the signal-depleted region when placing an upper threshold on the p T of the additional third jet, on the H T of all additional jets, on the leading soft-activity jet p T , or on the soft-activity H T . There is very little difference between the background-only prediction and the predictions including signal with either pythia or herwig++ parton showering due to the very small fraction