Abstract
The purely electroweak (EW) cross section for the production of two jets in association with a Z boson, in proton–proton collisions at \(\sqrt{s}=8\,\text {TeV}\), is measured using data recorded by the CMS experiment at the CERN LHC, corresponding to an integrated luminosity of 19.7\(\,\text {fb}^\text {-1}\). The electroweak cross section for the \(\ell \ell \mathrm {jj}\) final state (with \(\ell = \mathrm {e}\) or \(\mu \) and j representing the quarks produced in the hard interaction) in the kinematic region defined by \(M_{\ell \ell } >50\) \(\,\text {GeV}\), \(M_\mathrm {jj} >120\) \(\,\text {GeV}\), transverse momentum \(p_\mathrm {T j}> 25\) \(\,\text {GeV}\), and pseudorapidity \(|\eta _\mathrm {j} |< 5\), is found to be \(\sigma _\mathrm {EW}(\ell \ell \mathrm {jj})=174 \pm 15\,\text {(stat)}\pm 40\,\text {(syst)}\text {\,fb}\), in agreement with the standard model prediction. The associated jet activity of the selected events is studied, in particular in a signal-enriched region of phase space, and the measurements are found to be in agreement with QCD predictions.
Similar content being viewed by others
1 Introduction
The production of a \({\mathrm{Z}}\) boson in association with two jets in proton–proton (pp) collisions is dominated by a mixture of electroweak (EW) and strong processes of order \(\alpha _\mathrm {EW}^2\alpha _\mathrm {S}^2\). For \({\mathrm{Z}}\rightarrow \ell \ell \) leptonic decays, such events are referred to as “Drell–Yan (DY) + jets” or \(\mathrm {DY}\,{\mathrm{Z}}\mathrm {jj}\) events.
Purely electroweak \(\ell \ell \mathrm {jj}\) production contributing to the same final state is expected at order \(\alpha _\mathrm {EW}^4\), resulting in a comparatively small cross section [1]. This process is however predicted to have a distinctive signature of two jets of very high energy and large jj invariant mass, \(M_\mathrm {jj}\), separated by a large rapidity interval that can be occupied by the two charged leptons and where extra gluon emission is suppressed [2, 3]. We refer to jets produced through the fragmentation of the outgoing quarks in pure EW processes as “tagging jets”, and to the process from which they originate as “\(\mathrm {EW}\,{\mathrm{Z}}\mathrm {jj}\) ”. Figure 1 shows representative Feynman diagrams for the \(\mathrm {EW}\,{\mathrm{Z}}\mathrm {jj}\) processes, namely (left) vector boson fusion (VBF), (middle) bremsstrahlung-like, and (right) multiperipheral production. Detailed calculations reveal the presence of a large negative interference between the pure VBF process and the two other categories [1, 3]. These diagrams represent the signal (S) in the data.
For inclusive \(\ell \ell \mathrm {jj}\) final states, some of the diagrams with same initial- and final-state particles and quantum numbers can interfere, even if they do not involve exclusively EW interactions. Figure 2 (left) shows one example of order \(\alpha _\mathrm {S}^2\) corrections to DY production that have the same initial and final state as those in Fig. 1. A different order \(\alpha _\mathrm {S}^2\) correction that does not interfere with the EW signal, is shown in Fig. 2 (right).
The study of \(\mathrm {EW}\,{\mathrm{Z}}\mathrm {jj}\) processes is part of a more general investigation of standard model (SM) vector boson fusion and scattering processes that include the Higgs boson [4–6] and searches for physics beyond the standard model [7, 8]. When isolated from the backgrounds, the properties of \(\mathrm {EW}\,{\mathrm{Z}}\mathrm {jj}\) events can be compared with SM predictions. Probing the jet activity in the selected events in particular can shed light on the selection (or vetoing) of additional parton radiation to the tagging jets [9, 10].
At the CERN LHC, the \(\mathrm {EW}\,{\mathrm{Z}}\mathrm {jj}\) process was first measured by the CMS experiment using pp collisions at \(\sqrt{s}=7\,\text {TeV}\) [11], and more recently by the ATLAS experiment at \(\sqrt{s}=8\,\text {TeV}\) [12]. Both results have been found to agree with the expectations of the SM. Our present work reflects the measurement at CMS using pp collision data collected at \(\sqrt{s}=8\) \(\,\text {TeV}\)during 2012 that correspond to an integrated luminosity of 19.7\(\,\text {fb}^\text {-1}\). As the signal-to-background ratio for the measurement is small, different methods are used to enhance the signal fraction, to confirm the presence of the signal, and to measure the cross section. Besides the two multivariate analyses, based on the methods developed for the 7\(\,\text {TeV}\) analysis [11], a new method is presented, using a model of the main background based on real pp collisions. The analysis of the 8\(\,\text {TeV}\) data, offers the opportunity of reducing the uncertainties of the 7\(\,\text {TeV}\) measurements, given the larger integrated luminosity, and to add robustness to the results with the new data-based method.
This paper is organised as follows: Sect. 2 describes the experimental apparatus and Sect. 3 the simulations. Event selection procedures are described in Sect. 4, and Sect. 5 discusses the selection efficiencies and background models in control regions. Section 6 details the strategies adopted in our analysis to extract the signal from the data, and the corresponding systematic uncertainties are summarised in Sect. 7. The results obtained are presented in Sect. 8, and we conclude with a study of jet properties in a \(\mathrm {DY}\,{\mathrm{Z}}\mathrm {jj}\)-dominated control region, as well as in a high-purity, \(\mathrm {EW}\,{\mathrm{Z}}\mathrm {jj}\)-enriched region in Sect. 9. Finally, a brief summary of the results is given in Sect. 10.
2 The CMS detector
The central feature of the CMS apparatus is a superconducting solenoid of 6\(\text {\,m}\) internal diameter, providing a magnetic field of 3.8\(\text {\,T}\). The solenoid volume contains a silicon pixel and strip tracker, a lead tungstate crystal electromagnetic calorimeter (ECAL), and a brass/scintillator hadron calorimeter (HCAL), each composed of a barrel and two endcap sections. Muons are measured in gas-ionisation tracking detectors embedded in the steel flux-return yoke outside the solenoid. Extensive forward calorimetry complements the coverage provided by the barrel and endcap detectors.
The silicon tracker consists of 1440 silicon pixel modules and 15 148 silicon strip detector modules, located in the field of the superconducting solenoid. It measures charged particles within \(|\eta |< 2.5\), providing an impact parameter resolution of \({\approx }15\upmu \) and a transverse momentum (\(p_{\mathrm {T}}\)) resolution of about 1.5 % for \(p_{\mathrm {T}} =100\,\text {GeV}\) particles.
The energy of electrons is measured after combining the information from the ECAL and the tracker, whereas their direction is measured by the tracker. The invariant mass resolution for \({\mathrm{Z}}\rightarrow \mathrm {e}\mathrm {e}\) decays is 1.6 % when both electrons are in the ECAL barrel, and 2.6 % when both electrons are in the ECAL endcap [13]. Matching muons to tracks measured in the silicon tracker yields a \(p_{\mathrm {T}}\) resolution between \(1\) and 10 %, for \(p_{\mathrm {T}}\) values up to 1\(\,\text {TeV}\). The jet energy resolution (JER) is typically \({\approx }15\,\%\) at 50\(\,\text {GeV}\), 8 % at 100\(\,\text {GeV}\), and 4 % at 1\(\,\text {TeV}\) [14].
3 Simulation of signal and background events
Signal events are simulated at leading order (LO) using the MadGraph (v5.1.3.30) Monte Carlo (MC) generator [15, 16], interfaced to pythia (v6.4.26) [17] for parton showering (PS) and hadronisation. The CTEQ6L1 [18] parton distribution functions (PDF) are used to generate the event, the factorisation (\(\mu _F\)) and renormalisation (\(\mu _R\)) scales being both fixed to be equal to the \({\mathrm{Z}}\)-boson mass [19]. The underlying event is modelled with the so-called \(Z2^{*}\) tune [20]. The simulation does not include the generation of extra partons at matrix-element level. In the kinematic region defined by dilepton mass \(M_{\ell \ell } >50\,\text {GeV}\), parton transverse momentum \(p_\mathrm {T j}> 25\,\text {GeV}\), parton pseudorapidity \(\vert \eta _\mathrm{j}\vert < 5\), diparton mass \(M_\mathrm{jj} > 120\,\text {GeV}\), and angular separation \(\Delta R_\mathrm{jj}=\sqrt{{(\Delta \eta _\mathrm{jj})^2+(\Delta \phi _\mathrm{jj})^2}}>0.5\), where \(\Delta \eta _\mathrm {jj}\) and \(\Delta \phi _\mathrm {jj}\) are the differences in pseudorapidity and azimuthal angle between the tagging partons, the cross section in the \(\ell \ell \)jj final state (with \(\ell \) = e or \(\mu \)) is expected to be \(\sigma _\mathrm {LO}(\mathrm {EW}~\ell \ell \mathrm {jj})=208^{+8}_{-9}\,\text {(scale)}\pm 7\,\text {(PDF)}\text {\,fb}\), where the first uncertainty is obtained by changing simultaneously \(\mu _F\) and \(\mu _R\) by factors of \(2\) and \(1/2\), and the second from the uncertainties in the PDFs which has been estimated following the pdf4lhc prescription [18, 21–24]. The LO signal cross section and kinematic distributions estimated with MadGraph are found to be in good agreement with the LO predictions of the vbfnlo generator (v.2.6.3) [25–27].
Background DY events are also generated with MadGraph using a LO matrix element (ME) calculation that includes up to four partons generated from quantum chromodynamics (QCD) interactions. The ME-PS matching is performed following the ktMLM prescription [28, 29]. The dilepton DY production for \(M_{\ell \ell }>50\,\text {GeV}\) is normalised to \(\sigma _\text {th}(\mathrm {DY})=3.504\text {\,nb}\), as computed at next-to-next-leading order (NNLO) with fewz [30].
The evaluation of the interference between \(\mathrm {EW}\,{\mathrm{Z}}\mathrm {jj}\) and \(\mathrm {DY}\,{\mathrm{Z}}\mathrm {jj}\) processes, relies on the predictions obtained with MadGraph. Three samples, one of pure signal, one pure background, and one including both \(\alpha _\mathrm {EW}^4\) and \(\alpha _\mathrm {EW}^2\alpha _\mathrm {S}^2\) contributions are generated for this purpose. The differential cross sections are compared and used to estimate the expected interference contributions at the parton level.
Other residual background is expected from events with two leptons of same flavour with accompanying jets in the final state. Production of \({\mathrm{t}}\overline{{\mathrm{t}}}\) events is generated with MadGraph, including up to three extra partons, and normalised to the NNLO with next-to-next-to-leading-logarithmic corrections to an inclusive cross section of 245.8 \(\text {\,pb}\) [31]. Single-top-quark processes are modelled at next-to-leading order (NLO) with powheg [32–36] and normalised, respectively, to cross sections of \(22\pm 2\), \(86\pm 3\), and \(5.6\pm 0.2\,\text {\,pb}\) for the tW, \(t\)-, and \(s\)- channel production [37, 38]. Diboson production processes \(\mathrm {W}\mathrm {W}\), \(\mathrm {W}{\mathrm{Z}}\), and \({\mathrm{Z}}{\mathrm{Z}}\) are generated with MadGraph and normalised, respectively, to the cross sections of 59.8, 33.2, and 17.7 \(\text {\,pb}\), computed at NNLO [39] and with mcfm [40]. Throughout this paper we use the abbreviation VV when referring to the sum of the processes which yield two vector bosons.
The production of a \(\mathrm {W}\) boson in association with jets, where the \(\mathrm {W}\) decays to a charged lepton and a neutrino, is generated with MadGraph, and normalised to a total cross section of 36.3 nb, computed at NNLO with Fewz. Multijet QCD processes are also studied in simulation, but are found to yield negligible contributions to the selected events.
A detector simulation based on Geant4 (v.9.4p03) [41, 42] is applied to all the generated signal and background samples. The presence of multiple pp interactions in the same beam crossing (pileup) is incorporated by simulating additional interactions (both in-time and out-of-time with the collision) with a multiplicity that matches the one observed in data. The average number of pileup events is estimated as \(\approx \)21 interactions per bunch crossing.
4 Reconstruction and selection of events
The event selection is optimised to identify dilepton final states with two isolated, high-\(p_{\mathrm {T}}\) leptons, and at least two high-\(p_{\mathrm {T}}\) jets. Dilepton triggers are used to acquire the data, where one lepton is required to have \(p_{\mathrm {T}} >17 \,\text {GeV}\) and the other to have \(p_{\mathrm {T}} >8 \,\text {GeV}\). Electron-based triggers include additional isolation requirements, both in the tracker detectors and in the calorimeters. A single-isolated-muon trigger, with a requirement of \(p_{\mathrm {T}} >24 \,\text {GeV}\), is used to complement the dimuon trigger and increase the efficiency of the selection.
Electrons are reconstructed from clusters of energy depositions in the ECAL that match tracks extrapolated from the silicon tracker [43]. Muons are reconstructed by fitting trajectories based on hits in the silicon tracker and in the outer muon system [44]. Reconstructed electron or muon candidates are required to have \(p_{\mathrm {T}} >20\,\text {GeV}\). Electron candidates are required to be reconstructed within \(|\eta |\le 2.5\), excluding the CMS barrel-to-endcap transition region of the ECAL [45], and muon candidates are required to be reconstructed in the fiducial region \(|\eta |\le 2.4\) of the tracker system. The track associated to a lepton candidate is required to have both its transverse and longitudinal impact parameters compatible with the position of the primary vertex (PV) of the event. The PV for each event is defined as the one with the largest \(\sum p_{\mathrm {T}} ^2\), where the sum runs over all the tracks used to fit the vertex. A particle-based relative isolation parameter is computed for each lepton, and corrected on an event-by-event basis for contributions from pileup. The particle candidates used to compute the isolation variable are reconstructed with the particle flow algorithm which will be detailed later. We require that the sum of the scalar \(p_{\mathrm {T}}\) of all particle candidates reconstructed in an isolation cone with radius \(R=\sqrt{{(\Delta \eta )^{2}+(\Delta \phi )^{2}}}<0.4\) around the lepton’s momentum vector is \(<\)10 or \(<\)12 % of the electron or muon \(p_{\mathrm {T}}\) value, respectively. The two leptons with opposite electric charge and with highest \(p_{\mathrm {T}}\) are chosen to form the dilepton pair. Same-flavour dileptons (ee or \(\mu \mu \)) compatible with \({\mathrm{Z}}\rightarrow \ell \ell \) decays are then selected by requiring \(|M_{\mathrm{Z}}-M_{\ell \ell }|<15 \,\text {GeV}\), where \(M_{\mathrm{Z}}\) is the mass of the \({\mathrm{Z}}\) boson [19].
Two types of jets are used in the analysis: “jet-plus-track” (JPT) [46] and particle-flow (PF) [14] jets. Both cases use the anti-\(k_{\mathrm {T}}\) algorithm [47, 48] with a distance parameter of 0.5 to define jets. The information from the ECAL, HCAL and tracker are used by both algorithms in distinct ways. The JPT algorithm improves the energy response and resolution of calorimeter jets by incorporating additional tracking information. For JPT jets the associated tracks are classified as in-cone or out-of-cone if they point to within or outside the jet cone around the jet axis at the surface of the calorimeter. The momenta of both in-cone and out-of-cone tracks are then added to the energy of the associated calorimeter jet and for in-cone tracks the expected average energy deposition in the calorimeters is subtracted based on the momentum of the track. The direction of the jet axis is also corrected by the algorithm. As a result, the JPT algorithm improves both the energy and the direction of the jet. The PF algorithm [49, 50] combines the information from all relevant CMS sub-detectors to identify and reconstruct particle candidates in the event: muons, electrons, photons, charged hadrons, and neutral hadrons. The PF jets are constructed by clustering these particle candidates and the jet momentum is defined as the vectorial sum of the momenta of all particle candidates. An area-based correction is applied to both JPT and PF jets, to account for the extra energy that is clustered through in-time pileup [51, 52]. Jet energy scale (JES) and resolution (JER) for JPT and PF jets are derived from simulation and confirmed with in situ measurements of the \(p_{\mathrm {T}}\) balance observed in exclusive dijet and \({\mathrm{Z}}\)/photon+jet events. The simulation is corrected so that it describes the JER from real data. Additional selection criteria are applied to each event to remove spurious jet-like features originating from isolated noise patterns in certain HCAL regions. Jet identification criteria are furthermore applied to remove contributions from jets clustered from pileup events. These criteria are described in more detail in Ref. [53]. As will be detailed in Sect. 5.1, the efficiency of these algorithms has been measured in data and it is observed to be compatible with the expectations from simulation across the full pseudorapidity range used in the analysis.
In the preselection of events we require at least two jets with \(p_{\mathrm {T}} >30\,\text {GeV}\) and \({|\eta |\le 4.7}\). The two jets of highest \(p_{\mathrm {T}}\) jets are defined as the tagging jets. For the measurement of the cross section, we require the leading jet to have \(p_{\mathrm {T}} >50\,\text {GeV}\) and the dijet invariant mass \(M_\mathrm {jj}>200\,\text {GeV}\). Other selection requirements will be described below, as they depend on the analysis.
5 Control regions for jets and modelling of background
In our analysis, we select control regions for different purposes: to validate the calibrated jet energy response and efficiencies of jet-identification criteria, to estimate the backgrounds and to verify the agreement between data and estimates of background. The following details the result of these cross-checks.
5.1 Jet identification and response
Events with either a \({\mathrm{Z}}\rightarrow \mu \mu \) or a photon candidate, produced in association with a single jet with \(p_{\mathrm {T}}\) \(>30\,\text {GeV}\), are used as one of the control samples in this analysis. The \({\mathrm{Z}}\) candidate or the photon, and the associated jet are required to have \(|\Delta \phi (\text {jet},{\mathrm{Z}}\text { or }\gamma ) |>2.7\text {\,rad}\). These events enable a measure of the efficiency of the algorithms used to reject calorimeter noise and pileup-induced jets, and to check the jet energy response.
The jet identification criteria are based on the fractions of the jet energy deposited in different calorimeter elements [14]. Besides calorimetric noise, pileup events result in additional reconstructed jets. Such pileup jets can be rejected through a multivariate analysis based on the kinematics of the jet, on the topological configuration of its constituents, and on the fraction of tracks in the jet, associated to other reconstructed PVs in the same event [53]. The efficiency of both jet identification and pileup rejection is measured in the control sample, and determined to be >\(98\,\%\) for both JPT and PF jets. The dependence of this efficiency on \(\eta \) agrees with that predicted in MC simulation. The residual \(\eta \)-dependent difference is used to assign a systematic uncertainty in the selected signal.
The same control sample is also used to verify the jet energy response [14], which is defined from the ratio \(\left[ p_{\mathrm {T}} (\text {jet})/p_{\mathrm {T}} ({\mathrm{Z}}\text { or }\gamma )\right] \). The double ratio of the response in data and in simulation, i.e. \(\big [p_{\mathrm {T}} (\text {jet})/p_{\mathrm {T}} ({\mathrm{Z}}\text { or }\gamma )\big ]_\text {data}/ \big [p_{\mathrm {T}} (\text {jet})/p_{\mathrm {T}} ({\mathrm{Z}}\text { or }\gamma )\big ]_\mathrm {MC}\), provides a residual uncertainty that is assigned as a systematic source of uncertainty to the measurement. Although partially covered by the JES uncertainties, this procedure considers possible residual uncertainties in the particular phase-space regions selected in our analysis. This evaluation is crucial for the most forward region of \(\eta \), where the uncertainties in response are large. The double ratio defined above is observed to be close to unity except for a small loss in response (\(\approx \)5 %) observed in the region where the tracker has no acceptance and where there is a transition from the endcap to the forward hadron calorimeters of CMS (\(2.7<|\eta |<3.2\)).
5.2 Discriminating gluons from quarks
Jets in signal events are expected to originate from quarks while for background events it is more probable that jets are initiated by a gluon emitted from a radiative QCD process. A quark–gluon (q/g) discriminant [11] is evaluated for the two tagging jets with the intent of distinguishing the nature of each jet.
The q/g discriminant exploits differences in the showering and fragmentation of gluons and quarks, making use of the internal jet-composition and structure observables. The jet particle multiplicity and the maximum energy fraction carried by a particle inside the jet are used. In addition the q/g discriminant makes use of the following variables, computed using the weighted \(p_{\mathrm {T}} ^2\)-sum of the particles inside a jet: the jet constituents’ major root-mean-square (RMS) distance in the \(\eta \)–\(\phi \) plane, the jet constituents’ minor RMS distance in the \(\eta \)–\(\phi \) plane, and the jet asymmetry pull. Further details can be found in [54, 55].
The variables are used as an input to a likelihood-ratio discriminant that is trained using the tmva package [56] on gluon and quark jets from simulated dijet events. To improve the separation power, all variables are corrected for their pileup contamination using the same estimator for the average energy density from pileup interactions [51, 52], as previously defined in Sect. 4. The performance of the q/g discriminant has been evaluated and validated using independent, exclusive samples of \({\mathrm{Z}}\)+jet and dijet data [54]. The use of the gluon–quark likelihood discriminator leads to a decrease of the statistical uncertainty of the measured signal by about 5 %.
5.3 Modeling background
Alternative background models are explored for the dominant \(\mathrm {DY}\,{\mathrm{Z}}\mathrm {jj}\) background. Given that the majority of the \(\ell \ell \mathrm {jj}\) final states are produced through \(\mathrm {DY}\,{\mathrm{Z}}\mathrm {jj}\) processes it is crucial to have different handles on the behavior of this process, in particular, in the signal phase space region.
Simulation-based prediction for background
The effect of virtual corrections to the MadGraph-based (Born-level) description of \(\mathrm {DY}\,{\mathrm{Z}}\mathrm {jj}\) is studied using mcfm. Comparisons are made between the predictions of mcfm parton-level distributions with NLO and LO calculations and these studies provide a dynamic NLO to LO scale factor (K-factor) as a function of \(M_\mathrm {jj}\) and of the difference between the rapidity of the \({\mathrm{Z}}\) boson and the average rapidity of the two tagging jets, i.e.
The K-factor is observed to have a minor dependence on \(M_\mathrm {jj}\), but to increase steeply with \(|y^* |\), and a correction greater than 10 %, relative to the signal, is obtained for \(|y^* |>1.2\). As a consequence, an event selection of \(|y^* |<1.2\) is introduced in the \(\mathrm {DY}\,{\mathrm{Z}}\mathrm {jj}\) simulation-based analyses. Finally, the difference between the nominal MadGraph prediction and the one obtained after reweighting it with the dynamic K-factor, on an event-by-event basis, is assigned as a systematic uncertainty for the \(\mathrm {DY}\,{\mathrm{Z}}\mathrm {jj}\) background prediction from simulation.
For the selection of the signal-region in the analysis where \(\mathrm {DY}\,{\mathrm{Z}}\mathrm {jj}\) is based on simulation we make use of an event balance variable, \(Rp_{\mathrm {T}} ^\text {hard}\), defined as
where the numerator is the estimator of the \(p_{\mathrm {T}}\) for the hard process, i.e. \(p_{\mathrm {T}} ^\text {hard}\). The distribution of the \(Rp_{\mathrm {T}} ^\text {hard}\) variable is shown in Fig. 3 (left), where data and simulation are found to be in agreement with each other. It can be seen, from the same figure, that the variable is robust against the variation of JES according to its uncertainty. We apply a requirement of \(Rp_{\mathrm {T}} ^\text {hard}<0.14\) to select the signal region and the events failing this requirement are used as a control region for the analyses. The cut is motivated by the fact that the signal is expected to have the \({\mathrm{Z}}\) boson balanced with respect to the dijet system in the transverse plane. The events which fail this requirement are used as control region for the modelling of the background. The \(M_\mathrm {jj}\) distribution in dimuon events for the signal and control regions is shown in Fig. 3, (middle) and (right), correspondingly. The reweighting of the \(\mathrm {DY}\,{\mathrm{Z}}\mathrm {jj}\) background is applied to the simulation, as described above. Data and predictions are found to be in agreement with each other.
Figure 4 shows distributions for angle-related variables. Fair agreement is observed for the absolute differences in the azimuthal angle (\(\Delta \phi _\mathrm {jj}\)) and in the pseudorapidity (\(\Delta \eta _\mathrm {jj}\)) of the tagging jets which are shown on the left and middle, respectively. The \(z^*\) variable [10] is shown in Fig. 4 (right), and it is defined as
Data is verified to be in good agreement with the prediction for the distribution in \( z^*\) variable.
Data-based prediction for background
The diagrams contributing to the production of a photon and two jets (\(\gamma \mathrm {jj}\)) are expected to resemble those involved in the production of \(\mathrm {DY}\,{\mathrm{Z}}\mathrm {jj}\) (see Fig. 2). Thus, we build a data-based model for the shapes of the distributions of the kinematic observables of the tagging jets from \(\gamma \mathrm {jj}\) events selected in a similar way as the \({\mathrm{Z}}\mathrm {jj}\) ones. The differences, specific to the \({\mathrm{Z}}\) or photon-sample, are expected to be mitigated by reweighting the \(p_{\mathrm {T}}\) of the photons to the \(p_{\mathrm {T}}\) of the \({\mathrm{Z}}\) candidates. From simulation, we expect that the differences between the \(\gamma \) and \({\mathrm{Z}}\) masses do not contribute significantly when matching the dijet kinematics between the two samples after \(M_\mathrm {jj}>2M_{{\mathrm{Z}}}\) is required. Given that the photon sample is affected by multijet production, and that the selection of the low-\(p_{\mathrm {T}}\) region in data is also affected by very large prescaling at the trigger stages, we impose tighter kinematic constraints on the reconstructed boson, with respect to the ones applied at pre-selection (Sect. 4). To match effectively the \({\mathrm{Z}}\) and photon kinematics, we require \(p_{\mathrm {T}} ({\mathrm{Z}}\text { or }\gamma )>50\,\text {GeV}\) and rapidity \(\vert y({\mathrm{Z}}\text { or }\gamma )\vert <1.44\). The rapidity requirement corresponds to the physical boundary of the central (barrel) region of the CMS ECAL [45].
The method is checked in simulation by characterising the \(\mathrm {DY}\,{\mathrm{Z}}\mathrm {jj}\) or direct photon events in different physical regions defined according to the reconstructed \(M_\mathrm {jj}\) and comparing both distributions. Figure 5 illustrates the compatibility of simulated events with a high dijet invariant mass. Good agreement is found for the \(\eta \) of the most forward jet, the \(\Delta \eta _\mathrm {jj}\) variable and the ratio between the \(p_{\mathrm {T}}\) of the dijet system to the scalar sum of the tagging jets’ \(p_{\mathrm {T}}\),
The smallest of the quark/gluon discriminant value among the tagging jets is also found to be in agreement — Fig. 5 (top right). In general, the kinematics of the tagging jets predicted from the photon sample are found to be in agreement with those observed in DY \({\mathrm{Z}}\) events also for lower \(M_\mathrm {jj}\) values. A similar conclusion holds for other global event observables inspected in the simulation, such as energy fluxes and angular correlations.
The result of the compatibility tests described above have the potential to yield a correction factor to be applied to the \(\mathrm {DY}\,{\mathrm{Z}}\mathrm {jj}\) prediction from the photon data. However due to the limited statistics in our simulation and due to uncertainties in handling the simulation of residual background from multijet events in data, we have opted to use the simulation-based compatibility test results to assign, instead, an uncertainty in the final shape. We assign the difference in the compatibility tests relative to a pure prompt-photon possibility as one of the systematic uncertainties. The changes observed in the compatibility test, obtained after varying the PDF by its uncertainties synchronously in the two samples is also assigned as a source of uncertainty. In data, the difference between a “tight” and a “loose” photon selections is, furthermore, assigned as an extra source of systematic uncertainty. The selection is tightened by applying stricter requirements on the photon identification and isolation requirements. This prescription is adopted to cover possible effects from the contamination of multijet processes.
The final distributions for \(\mathrm {DY}\,{\mathrm{Z}}\mathrm {jj}\) events are obtained after subtracting a residual contamination from pure EW production of a photon in association with two jets (\(\mathrm {EW}\,\gamma \mathrm {jj}\)) [57]. The diagrams for the latter process are similar to the ones of Fig. 1 (left) and (middle), where the \({\mathrm{Z}}/\gamma ^*\) is now a real photon. For a fiducial phase space defined by \(M_\mathrm {jj} >120\,\text {GeV}\), \(p_\mathrm {T j}> 30\,\text {GeV}\), \(|\eta _\mathrm {j} |< 5\), \(p_{\mathrm {T} \gamma }>50\,\text {GeV}\) and \(|\eta _\gamma |<1.5\), the production cross section of \(\mathrm {EW}\,\gamma \mathrm {jj}\) process is expected to be 2.72 \(\text {\,pb}\), based on the MadGraph generator. After event reconstruction and selection, we estimate the ratio of the number of \(\mathrm {EW}\,\gamma \mathrm {jj}\) candidate events to the total number of photon events selected in data to be a factor of \(\approx \)5 times smaller than the ratio between the expected \(\mathrm {EW}\,{\mathrm{Z}}\mathrm {jj}\) and \(\mathrm {DY}\,{\mathrm{Z}}\mathrm {jj}\) yields. From simulations this ratio is expected to be independent of \(M_\mathrm {jj} \). In the subtraction procedure, a 30 % normalisation uncertainty is assigned to this residual process, which corresponds to approximately twice the envelope of variations obtained for the cross section at NLO with vbfnlo, after tightening the selection criteria and changing the factorisation and renormalisation scales.
The results obtained when the data-based prediction, used to characterise the \(\mathrm {DY}\,{\mathrm{Z}}\mathrm {jj}\) contribution to the reconstructed kinematics of the tagging jets in data, show a good agreement for different dijet invariant mass categories. Figure 6 illustrates the agreement observed for \(M_\mathrm {jj}>750\,\text {GeV}\) in the distribution of different variables: (upper left) \(p_{\mathrm {T}}\) of the leading jet, (upper right) \(p_{\mathrm {T}}\) of the sub-leading jet, (middle left) hard process \(p_{\mathrm {T}}\) (dijet+\({\mathrm{Z}}\) system), (middle right) \(\eta \) of the most forward jet, (lower left) \(\eta \) of the most central jet and (lower right) \(\Delta \eta _\mathrm {jj}\) of the tagging jets.
6 Signal discriminants and extraction procedure
We use a multivariate analysis technique that provides separation of the \(\mathrm {DY}\,{\mathrm{Z}}\mathrm {jj}\) and \(\mathrm {EW}\,{\mathrm{Z}}\mathrm {jj}\) components of the inclusive \(\ell \ell \mathrm {jj}\) spectrum. As discussed previously, the \(\mathrm {EW}\,{\mathrm{Z}}\mathrm {jj}\) signal is characterised by a large \(\Delta \eta _\mathrm {jj}\) jet separation that stems from the small-angle scattering of the two initial partons. Owing to both the topological configuration and the large \(p_{\mathrm {T}}\) of the outgoing partons, the \(M_\mathrm {jj}\) variable is also expected to be large. The evolution of \(\Delta \eta _\mathrm {jj}\) with \(M_\mathrm {jj}\) is expected to be different in signal and background events and therefore these characteristics are expected to yield the best separation power between the \(\mathrm {EW}\,{\mathrm{Z}}\mathrm {jj}\) and the \(\mathrm {DY}\,{\mathrm{Z}}\mathrm {jj}\) productions. In addition, one can exploit the fact that the \({\mathrm{Z}}\)-boson candidate is expected to be produced centrally in the rapidity region defined by the two tagging jets and that the \({\mathrm{Z}}\mathrm {jj}\) system is approximately balanced in the transverse plane. As a consequence, we expect the signal to be found with lower values of both \(y^*\) and \(p_{\mathrm {T}} ^\text {hard}\), compared to the DY background. Other variables which can be used to enhance the separation are related to the kinematics of the event (\(p_{\mathrm {T}}\), rapidity, and distance between the jets and/or the \({\mathrm{Z}}\) boson) or to the properties of the jets that are expected to be initiated by quarks. We combine these variables using three alternative multivariate analyses with the goal of cross-checking the final result. All three analyses make use of boosted decision tree (BDT) discriminators implemented using tmva package [56] to achieve the best expected separation between the \(\mathrm {EW}\,{\mathrm{Z}}\mathrm {jj}\) and \(\mathrm {DY}\,{\mathrm{Z}}\mathrm {jj}\) processes.
-
Analysis A expands one of the procedures previously adopted for the 7\(\,\text {TeV}\)measurement [11]. It uses both dimuon and dielectron final states and PF jet reconstruction. A multivariate discriminator making use of the dijet and \({\mathrm{Z}}\) boson kinematics is built. A choice is made for variables which are robust against JES uncertainties. Extra discrimination information, related to the q/g nature of the jet, is included. All processes are modelled from simulation, and the description of each variable is verified by comparing data with the simulation-based expectations in control regions.
-
Analysis B uses only the dimuon final state and the JPT jet reconstruction approach. It builds a discriminator which tries to profit from the full kinematics of the event including the tagging jets and the \({\mathrm{Z}}\) boson. Similarly to analysis A it expands one of the cross-check procedures previously adopted for the 7\(\,\text {TeV}\)measurement [11] and relies on simulation-based prediction of the backgrounds.
-
Analysis C uses solely dijet-related variables in the multivariate discriminator and selects both the dimuon and dielectron final states with PF jets. Lepton-related selection variables are not used as the main background is derived from the photon control sample. In this analysis events are split in four categories for \(M_\mathrm {jj}\) values in the intervals 450–550\(\,\text {GeV}\), 550–750\(\,\text {GeV}\), 750–1,000\(\,\text {GeV}\), and above 1,000\(\,\text {GeV}\), which have been chosen to have similar numbers of expected signal events.
Table 1 compares in more detail the three independent analyses A, B and C. From simulation, the statistical correlation between the analyses, if performed with the same final state, is estimated to be \(\approx \)60 %.
Figures 7, 8 and 9 show the distributions of the discriminants for the three analyses. Good agreement is observed overall in both the signal and in the control regions which are defined according to the value of the \(Rp_{\mathrm {T}} ^\text {hard}\) or \(M_{\mathrm {jj}}\) variables (see Sect. 5.3).
Each analysis has a binned maximum likelihood formed from the expected rates for each process, as function of the value of the discriminant, which is used to fit simultaneously across the control and signal categories the strength modifiers for the \(\mathrm {EW}\,{\mathrm{Z}}\mathrm {jj}\) and \(\mathrm {DY}\,{\mathrm{Z}}\mathrm {jj}\) processes, \(\mu = \sigma ({\mathrm {EW}~{\mathrm{Z}}\mathrm {jj}}) / \sigma _\mathrm {LO}({\mathrm {EW}~\ell \ell \mathrm {jj}})\) and \(\upsilon = \sigma ({\mathrm {DY}})/\sigma _\text {th}({\mathrm {DY}})\). Nuisance parameters are added to modify the expected rates and shapes according to the estimate of the systematic uncertainties affecting the analysis and are mostly assumed to have a log-normal distribution.
The interference between the \(\mathrm {EW}\,{\mathrm{Z}}\mathrm {jj}\) and the \(\mathrm {DY}\,{\mathrm{Z}}\mathrm {jj}\) processes is taken into account in the fitting procedure. A parameterisation of the interference effects, as a function of the parton-level \(M_\mathrm {jj}\) variable, is derived from the MadGraph simulation described in Sect. 3. The matrix elements for the \(\mathrm {EW}\,{\mathrm{Z}}\mathrm {jj}\) and \(\mathrm {DY}\,{\mathrm{Z}}\mathrm {jj}\) processes provide the total yields for the \(\ell \ell \mathrm {jj}\) final state as
where \(N_{\mathrm {EW}~{\mathrm{Z}}\mathrm {jj}}\), \(N_{\mathrm {DY}~{\mathrm{Z}}\mathrm {jj}}\) are the yields for the \(\mathrm {EW}\,{\mathrm{Z}}\mathrm {jj}\) and \(\mathrm {DY}\,{\mathrm{Z}}\mathrm {jj}\) processes, \(N_\mathrm {I}\) is the expected contribution from the interference to the total yield, and \(\mu \) and \(\upsilon \) are the strength factors that modify the SM predictions. In the absence of signal (or background) the contribution from the interference term vanishes in Eq. (5).
The parameters of the model (\(\mu \) and \(\upsilon \)) are determined maximising a likelihood (\(\mathcal {L}\)). Systematic uncertainties are incorporated in the fit by scanning the profile likelihood ratio \(\lambda \), defined as
where the denominator has estimators \(\hat{\mu }\),\(\hat{\nu }\) and \(\hat{\theta }\) that maximise the likelihood, and the numerator has estimators \(\hat{\hat{\theta }}\) that maximise the likelihood for the specified \(\mu \) and \(\nu \) strengths. The statistical methodology used is similar to the one used in the CMS Higgs analysis [5] using asymptotic formulas [58]. In this procedure some of the systematic uncertainties affecting the measurement of the signal strength are partially constrained. The \(\mathrm {DY}\,{\mathrm{Z}}\mathrm {jj}\) strength is constrained by the uncertainties in analyses A and B and is free to change in C. In all cases the difference of the result relative to the one that would have been obtained without taking the interference term into account, is assigned as a systematic uncertainty of the measurement. This shall be discussed in more detail in the next section where the systematic uncertainties affecting our analysis are summarised.
7 Systematic uncertainties
The main systematic uncertainties affecting our measurement are classified into experimental and theoretical sources.
7.1 Experimental uncertainties
The following experimental uncertainties are considered:
-
Luminosity—A 2.6 % uncertainty is assigned to the value of the integrated luminosity [59].
-
Trigger and selection efficiencies—We assign total 2 and 3 % uncertainties on the total trigger and selection efficiencies in the ee and \(\mu \mu \) channels, respectively. These uncertainties have been estimated by comparing the lepton efficiencies expected in simulation and measured in data with a “tag-and-probe” method [60].
-
Jet energy scale and resolution—The energy of the jets enters in our analysis not only at the selection level but also in the computation of the kinematic variables used in forming discriminants. The uncertainty on JES affects therefore both the expected event yields, through the migration of events to different bins, and the final distributions. In addition to the standard JES uncertainty, the residual difference in the response observed in the balancing of a \({\mathrm{Z}}\) or \(\gamma \) candidate with a jet, discussed in Sect. 5, is assigned as a systematic uncertainty. The effect of the JES uncertainty is studied by rescaling up and down the reconstructed jet energy by a \(p_{\mathrm {T}}\)- and \(\eta \)-dependent scale factor [14]. An analogous approach is used for the JER. In both cases the uncertainties are derived separately of PF and JPT jets.
-
q/g discriminator—The uncertainty on the performance of the q/g discriminator has been measured using independent \({\mathrm{Z}}\)+jet and dijet data, after comparing with the corresponding simulation predictions [54]. The parametrization of the estimated uncertainty is used on an event-per-event basis to derive alternative predictions for the signal and background which are profiled in the fit for the signal.
-
Pileup—Pileup is not expected to affect the identification and isolation of the leptons or the corrected energy of the jets. When the jet clustering algorithm is run, pileup can, however, induce a distortion of the reconstructed dijet system due to the contamination of tracks and calorimetric deposits. We evaluate this uncertainty by generating two alternative distributions after changing the number of pileup interactions by \(\pm \)5 %, according to the uncertainty on the inelastic pp cross section at \(\sqrt{s}=8~\,\text {TeV}\).
-
Statistics of simulation—For signal and backgrounds which are estimated from simulation we form envelopes for the distributions by shifting all bin contents simultaneously up or down by its statistical uncertainty. This generates two alternatives to the nominal shape to be analysed. However, when a bin has an uncertainty which is >\(10\,\%\), we assign an additional, independent uncertainty to it in the fit in order to avoid overconstraining a specific background from a single bin in the fit.
7.2 Theoretical uncertainties
We have considered the following theoretical uncertainties in the analysis:
-
PDF—The PDF uncertainties are evaluated by considering the pdf4lhc prescription [18, 21–24], where for each source a new weight is extracted event-by-event and used to generate an alternative signal distribution. The up and down changes relative to the nominal prediction for each independent variable and are added in quadrature to estimate the final uncertainty.
-
Factorisation and renormalisation scales—In contrast to the main background, the two signal process partons originate from electroweak vertices. Changing the QCD factorisation and renormalisation scales is therefore not expected to have a large impact on the final cross section. The renormalisation scale, in particular, is not expected to have any impact at LO. Changing the values of \(\mu _F\) and \(\mu _R\) from their defaults by 2 or 1/2 we find a variation of \(\approx \) \(4\,\%\) in MadGraph and in vbfnlo. As the change in the scales can also affect the expected kinematics, we use the altered \(\mu _R/\mu _F\) samples to extract a weight that is applied at the generator level on an event-by-event basis. The parameterisation is done as function of the dilepton \(p_{\mathrm {T}}\). The changes induced in the form of the discriminant at the reconstruction level are assigned as systematic uncertainties.
-
DY Zjj prediction—For the modelling of the \(\mathrm {DY}\,{\mathrm{Z}}\mathrm {jj}\) background from simulation, as we indicated previously, we consider the full difference between the Born-level MadGraph prediction and the NLO prediction based on mcfm as a systematic uncertainty. The differences are particularly noticeable at very large \(M_\mathrm {jj}\) and at large \(y^*\). For the data-based modelling of \(\mathrm {DY}\,{\mathrm{Z}}\mathrm {jj}\) we consider the effect induced on the discriminant functions from five distinct sources. Not all are of theoretical nature, nevertheless, we list them here for simplicity. We consider not only the statistical size of the photon sample but also the difference observed in data selected with a loose-photon selection relative to the data selected with a tight-photon selection. From simulation, the expected difference, between the tight-photon selection and a pure photon sample is also considered, and added in quadrature to the previous. Furthermore, we consider the envelope of the PDF changes induced in the simulated compatibility tests, and the contamination from residual \(\mathrm {EW}\,\gamma \mathrm {jj}\) events in the photon sample. For the latter, we assign a 30 % uncertainty to the \(\mathrm {EW}\,\gamma \mathrm {jj}\) contribution, which is added in quadrature to the statistical uncertainty in the simulated events for this process.
-
Normalisation of residual backgrounds—Diboson and top-quark processes are modelled with a MC simulation. Thus, we assign an intrinsic uncertainty in their normalisation according to their uncertainty which arises from the PDF and factorisation/renormalisation scales. The uncertainties are assigned based on [31, 37, 40].
-
Interference between \(\mathrm {EW}\,{\mathrm{Z}}\mathrm {jj}\) and \(\mathrm {DY}\,{\mathrm{Z}}\mathrm {jj}\)–The difference observed in the fit when the interference term is neglected relative to the nominal result is used to estimate the uncertainty due to the interference of the signal and the background.
7.3 Summary of systematic uncertainties
Table 2 summarises the systematic uncertainties described above. We give their magnitudes at the input level, and whether they are treated as normalisation uncertainties or uncertainties in the distributions used to fit the data. The uncertainties are organised according to their experimental or theoretical nature.
8 Measurement of the \(\mathrm {EW}\,{\mathrm{Z}}\mathrm {jj}\) production cross section
Table 3 reports the expected and observed event yields after imposing a minimum value for the discriminators used in methods A and B such that \({S/B}>10\,\%\). Table 4 reports the event yields obtained in each category for method C. Fair agreement is observed between data and expectations for the sum of signal and background, for both methods, in all categories.
The signal strength is extracted from the fit to the discriminator shapes as discussed in Sect. 6. Table 5 summarises the results obtained for the fits to the signal strengths in each method. The results obtained are compatible among the dilepton channels and different methods, and in agreement with the SM prediction of unity. Methods A and B are dominated by the systematic uncertainty stemming from the modelling of the \(\mathrm {DY}\,{\mathrm{Z}}\mathrm {jj}\) background and the interference with the \(\mathrm {EW}\,{\mathrm{Z}}\mathrm {jj}\) signal. Method C is dominated by the statistical uncertainty in the fit and, due to tighter selection criteria, is expected to be less affected by the modelling of the interference. In method C, the \(\mathrm {DY}\,{\mathrm{Z}}\mathrm {jj}\) modelling uncertainty is partially due to the statistics of the photon sample. With the exception of jet energy resolution, which has a larger impact in method C due to its tighter \(M_\mathrm {jj}\) selection, all other uncertainties are of similar magnitude for the different methods.
For the results from method C, the 68 and 95 % confidence levels (CL) obtained for the combined fit of the \(\mathrm {EW}\,{\mathrm{Z}}\mathrm {jj}\) and \(\mathrm {DY}\,{\mathrm{Z}}\mathrm {jj}\) strengths are shown in Fig. 10. Good agreement is found with the SM prediction for both components, as well as with the expected magnitude of the CL intervals. The \(\mathrm {DY}\,{\mathrm{Z}}\mathrm {jj}\) strength is measured to be \(0.978\pm 0.013\,\text {(stat)}\pm 0.036\,\text {(syst)}\) in the ee channel, \(1.016\pm 0.011\,\text {(stat)}\pm 0.034\,\text {(syst)}\) in the \(\mu \mu \) channel, and \(0.996\pm 0.008\,\text {(stat)}\pm 0.025\,\text {(syst)}\) after the combination of the previous two.
From the combined fit of the two channels in analysis A we obtain the signal strength
corresponding to a measured signal cross section
in agreement with the SM prediction \(\sigma _\mathrm {LO}(\mathrm {EW}\,\ell \ell \mathrm {jj})=208\pm 18\text {\,fb}\). Using the same statistical methodology, as described in Sect. 6, the background-only hypothesis is excluded with a significance greater than 5\(\sigma \).
9 Study of the hadronic and jet activity in \({\mathrm{Z}}\)+jet events
After establishing the signal, we examine the properties of the hadronic activity in the selected events. Radiation patterns and the profile of the charged hadronic activity as a function of several kinematic variables are explored in a region dominated by the main background, \(\mathrm {DY}\,{\mathrm{Z}}\mathrm {jj}\); these studies are presented in Sects. 9.1 and 9.2. The production of additional jets in a region with a larger contribution of \(\mathrm {EW}\,{\mathrm{Z}}\mathrm {jj}\) processes is furthermore pursued in Sect. 9.3. We expect a significant suppression of the hadronic activity in signal events because the final-state objects have origin in purely electroweak interactions, in contrast with the radiative QCD production of jets in \(\mathrm {DY}\,{\mathrm{Z}}\mathrm {jj}\) events. The reconstructed distributions are compared directly to the prediction obtained with a full simulation of the CMS detector (see Sect. 3) and extends the studies reported in [61] to the phase space region of interest for the study of the \(\mathrm {EW}\,{\mathrm{Z}}\mathrm {jj}\) process.
9.1 Jet radiation patterns
For the \({\mathrm{Z}}\)+jets events, the observables referred to as “radiation patterns” correspond to: (i) the number of jets, \(N_\mathrm {j}\), (ii) the total scalar sum of the transverse momenta of jets reconstructed within \(|\eta |<4.7\), \(H_{\mathrm {T}} \), (iii) \(\Delta \eta _\mathrm {jj}\) between the two jets with \(p_{\mathrm {T}} >40\,\text {GeV}\) which span the largest pseudorapidity gap in the event (not required to be the two leading-\(p_{\mathrm {T}}\) jets), and (iv) the cosine of the azimuthal angle difference, \(\cos |\phi _{\mathrm {j}_1} - \phi _{\mathrm {j}_2} | = \cos \Delta \phi _\mathrm {jj}\), for the two jets with criterion (iii). These observables are measured using events that are required to satisfy the \({\mathrm{Z}}\rightarrow \mu \mu \) and \({\mathrm{Z}}\rightarrow \mathrm {e}\mathrm {e}\) selection criteria of analyses A and B. These observables are investigated following the prescriptions and suggestions from Ref. [62], where the model dependence is estimated by comparing different generators.
Figures 11 and 12 show the average number of jets and the average \(\cos \Delta \phi _\mathrm {jj}\) as a function of the total \(H_{\mathrm {T}} \) and \(\Delta \eta _\mathrm {jj}\). The MadGraph + pythia (ME-PS) predictions are in good agreement with the data, even in the regions of largest \(H_{\mathrm {T}} \) and \(\Delta \eta _\mathrm {jj}\). In both cases we estimate that the contribution from \(\mathrm {EW}\,{\mathrm{Z}}\mathrm {jj}\) is \(<1\,\%\). Jet multiplicity increases both as function of \(H_{\mathrm {T}} \) and \(\Delta \eta _\mathrm {jj}\). The increase of \(H_{\mathrm {T}} \) and \(\Delta \eta _\mathrm {jj}\) induces, in average, an increase of jet multiplicity and leads to different dijet configurations in the azimuthal plane. In average the two selected jets are separated by \(120^0\deg \), independently of \(H_{\mathrm {T}} \). This separation tends to decrease for larger \(\Delta \eta _\mathrm {jj}\) separation. The behavior observed for \(\cos \Delta \phi _\mathrm {jj}\) when \(\Delta \eta _\mathrm {jj}<0.5\) is related to the jet distance parameter used in the reconstruction (\({R}=0.5\)). In data, the separation of the jets in the \(\cos \Delta \phi _\mathrm {jj}\) variable, is observed to be \(<\)5 % smaller with respect to the simulation.
9.2 Study of the charged hadronic activity
For this study, a collection is formed of high-purity tracks [63] with \(p_{\mathrm {T}} > 0.3\,\text {GeV}\), uniquely associated with the main PV in the event. Tracks associated with the two leptons or with the tagging jets are excluded from the selection. The association between the selected tracks and the reconstructed PVs is carried out by minimising the longitudinal impact parameter which is defined as the \(z\)-distance between the PV and the point of closest approach of the track helix to the PV, labeled \(d_z^\mathrm {PV}\). The association is required to satisfy the conditions \(d_z^\mathrm {PV}<2\text {\,mm}\) and \(d_z^\mathrm {PV}<3\delta d_z^\mathrm {PV}\), where \(\delta d_z^\mathrm {PV}\) is the uncertainty on \(d_z^\mathrm {PV}\).
A collection of “soft track-jets” is defined by clustering the selected tracks using the anti-\(k_{\mathrm {T}}\) clustering algorithm [47] with a distance parameter of \(R=0.5\). The use of track jets represents a clean and well-understood method [64] to reconstruct jets with energy as low as a few \(\text {GeV}\) . These jets are not affected by pileup, because of the association of their tracks with the hard-scattering vertex [65].
To study the central hadronic activity between the tagging jets, only track jets of low \(p_{\mathrm {T}}\), and within \(\eta ^\text {tag jet}_\text {min}+0.5 < \eta < \eta ^\text {tag jet}_\text {max}-0.5 \) are considered. For each event, we compute the scalar sum of the \(p_{\mathrm {T}}\) of up to three leading-\(p_{\mathrm {T}}\) soft-track jets, and define it as the soft \(H_{\mathrm {T}} \) variable. This variable is chosen to monitor the hadronic activity in the rapidity interval between the two jets.
The dependence of the average soft \(H_{\mathrm {T}} \) for the \({\mathrm{Z}}\mathrm {jj}\) events as a function of \(M_\mathrm {jj}\) and \(\Delta \eta _\mathrm {jj}\) is shown in Fig. 13. Inclusively, the contribution from \(\mathrm {EW}\,{\mathrm{Z}}\mathrm {jj}\) is estimated to be at the level of 1 %, but it is expected to evolve as function of the different variables, being 5 % (20 %) for \(\vert \Delta \eta _\mathrm {jj}\vert >4\) (\(M_\mathrm {jj}>1\,\text {TeV}\)). Overall, good agreement is observed between data and the simulation. The average value of the soft \(H_{\mathrm {T}} \) is observed to increase linearly with \(M_\mathrm {jj}\), and to saturate its value for \(\Delta \eta _\mathrm {jj}>5\), as a consequence of the limited acceptance of the CMS tracker.
9.3 Jet activity studies in a high-purity region
The evidence for EW production of \(\ell \ell \mathrm {jj}\) final states can also be supported through a study of the emission of a third and other extra jets in a region of high signal purity, i.e. for large \(M_{jj}\). In this study, we compare two regions, one with \(M_\mathrm {jj}>750\,\text {GeV}\) and another with \(M_\mathrm {jj}>1{,}250\,\text {GeV}\). Aside from the two tagging jets used in the preselection, we use all PF-based jets with a \(p_{\mathrm {T}} >15\,\text {GeV}\) found within the \(\Delta \eta _\mathrm {jj}\) of the tagging jets. The background is modelled from the photon control sample (analysis C), and uses the normalisations obtained from the fit discussed in Sect. 8. Where relevant we also compare the results using the MC-based modelling of the background.
The number of extra jets, as well as their scalar \(p_{\mathrm {T}}\) sum (\(H_{\mathrm {T}} \)), are shown in Fig. 14. Data and expectations are generally in good agreement for both distributions in the two \(M_\mathrm {jj}\) regions. A clear suppression of the emission of a third jet is observed in data, when we take into account the background-only predictions. After subtraction of the background, which is shown as an inset in the different figures, we observe that slightly less extra jets tend to be counted in data with respect to the simulated signal. Notice that in the simulation of the signal, the extra jets have their origin in a parton-shower approach (see Sect. 3).
The \(p_{\mathrm {T}}\) values and the pseudorapidities relative to the average of the two tagging jets, i.e. \(\eta ^*_{\mathrm {j}3}=\eta _{\mathrm {j}3}-(\eta _{\mathrm {j}1}+\eta _{\mathrm {j}2})/2\), of the third leading-\(p_{\mathrm {T}}\) jet in the event, are shown in Fig. 15. There are some deviations of the data observed relative to the predictions. In particular, the third jet is observed to be slightly more central than expected. The poor statistical and other uncertainties prevent us, however, from drawing further conclusions.
The above distributions can be used to compute gap fractions. We define a gap fraction as the fraction of events which do not have reconstructed kinematics above a given threshold. The most interesting gap fractions can be computed for the \(p_{\mathrm {T}}\) of the leading additional jet, and the \(H_{\mathrm {T}} \) variable. These gap fractions are, in practice, measurements of the efficiency of extra jet veto in VBF-like topologies. By comparing different expectations with the observed data we can quantify how reliable is the modelling of the extra jet activity, in particular in a signal-enriched region. Figure 16 shows the gap fractions expected and observed in data. Two expectations are compared: the one using a full MC approach and the one where the \(\mathrm {DY}\,{\mathrm{Z}}\mathrm {jj}\) background is predicted from the \(\gamma \mathrm {jj}\) data. Both predictions are found to be in agreement with the data for the \(p_{\mathrm {T}}\) of the leading additional jet and the soft \(H_{\mathrm {T}} \) variable.
10 Summary
The cross section for the purely electroweak production of a Z boson in association with two jets in the \(\ell \ell \mathrm {jj}\) final state, in proton–proton collisions at \(\sqrt{s}=8\,\text {TeV}\) has been measured to be
in agreement with the SM prediction. Aside from the two analyses previously used to determine the cross section of this process at 7\(\,\text {TeV}\) [11], a new analysis has been implemented using a data-based model for the main background. The increased integrated luminosity recorded at 8\(\,\text {TeV}\), an improved selection method, and more precise modelling of signal and background processes have allowed us to obtain a more precise measurement of the \(\mathrm {EW}\,{\mathrm{Z}}\mathrm {jj}\) process relative to the 7\(\,\text {TeV}\)result.
Studies of the jet activity in the selected events show generally good agreement with the MadGraph +pythia predictions. In events with high signal purity, the additional hadron activity has also been characterised, as well as the gap fractions. Good agreement has been found between data and QCD predictions.
References
C. Oleari, D. Zeppenfeld, QCD corrections to electroweak \(\ell \nu _\ell \)jj and \(\ell ^+\ell ^-\)jj production. Phys. Rev. D 69, 093004 (2004). doi: 10.1103/PhysRevD.69.093004. arXiv:hep-ph/0310156
D.L. Rainwater, R. Szalapski, D. Zeppenfeld, Probing color-singlet exchange in Z + 2 jet events at the CERN LHC. Phys. Rev. D 54, 6680 (1996). doi:10.1103/PhysRevD.54.6680. arXiv:hep-ph/9605444
V.A. Khoze, M.G. Ryskin, W.J. Stirling, P.H. Williams, A Z-monitor to calibrate Higgs production via vector boson fusion with rapidity gaps at the LHC. Eur. Phys. J. C 26, 429 (2003). doi:10.1140/epjc/s2002-01069-2. arXiv:hep-ph/0207365
ATLAS Collaboration Observation of a new particle in the search for the Standard Model Higgs boson with the ATLAS detector at the LHC. Phys. Lett. B 716, 1 (2012). doi:10.1016/j.physletb.2012.08.020. arXiv:1207.7214
CMS Collaboration, Observation of a new boson at a mass of 125 GeV with the CMS experiment at the LHC. Phys. Lett. B 716, 30 (2012). doi:10.1016/j.physletb.2012.08.021. arXiv:1207.7235
CMS Collaboration, Observation of a new boson with mass near 125 GeV in pp collisions at \({\sqrt{s}}= 7\) and 8 TeV. JHEP 06, 081 (2013). doi:10.1007/JHEP06(2013)081. arXiv:1303.4571
G.-C. Cho et al., Weak boson fusion production of supersymmetric particles at the CERN LHC. Phys. Rev. D 73, 054002 (2006). doi:10.1103/PhysRevD.73.054002. arXiv:hep-ph/0601063
B. Dutta et al., Vector boson fusion processes as a probe of supersymmetric electroweak sectors at the LHC. Phys. Rev. D 87, 035029 (2013). doi:10.1103/PhysRevD.87.035029. arXiv:1210.0964
J.D. Bjorken, Rapidity gaps and jets as a new physics signature in very high-energy hadron hadron collisions. Phys. Rev. D 47, 101 (1993). doi:10.1103/PhysRevD.47.101
F. Schissler, D. Zeppenfeld, Parton shower effects on W and Z production via vector boson fusion at NLO QCD. JHEP 04, 057 (2013). doi:10.1007/JHEP04(2013)057. arXiv:1302.2884
CMS Collaboration, Measurement of the hadronic activity in events with a Z and two jets and extraction of the cross section for the electroweak production of a Z with two jets in pp collisions at \(\sqrt{s}\) = 7 TeV. JHEP 10, 062 (2013). doi:10.1007/JHEP10(2013)062. arXiv:1305.7389
ATLAS Collaboration, Measurement of the electroweak production of dijets in association with a Z-boson and distributions sensitive to vector boson fusion in proton–proton collisions at \(\sqrt{s}\) = 8 TeV using the ATLAS detector. JHEP 04, 031 (2014). doi:10.1007/JHEP04(2014)031. arXiv:1401.7610
CMS Collaboration, Energy calibration and resolution of the CMS electromagnetic calorimeter in pp collisions at \(\sqrt{s}\) = 7 TeV. JINST 8, P09009 (2013). doi:10.1088/1748-0221/8/09/P09009. arXiv:1306.2016
CMS Collaboration, Determination of jet energy calibration and transverse momentum resolution in CMS. JINST 6, P11002 (2011). doi:10.1088/1748-0221/6/11/P11002. arXiv:1107.4277
J. Alwall et al., MadGraph 5: going beyond. JHEP 06, 128 (2011). doi:10.1007/JHEP06(2011)128. arXiv:1106.0522
J. Alwall et al., The automated computation of tree-level and next-to-leading order differential cross sections, and their matching to parton shower simulations. JHEP 07, 079 (2014). doi:10.1007/JHEP07(2014)079. arXiv:1405.0301
T. Sjöstrand, S. Mrenna, P.Z. Skands, PYTHIA 6.4 physics and manual. JHEP 05, 026 (2006). doi:10.1088/1126-6708/2006/05/026. arXiv:hep-ph/0603175
J. Pumplin et al., New generation of parton distributions with uncertainties from global QCD analysis. JHEP 07, 012 (2002). doi:10.1088/1126-6708/2002/07/012. arXiv:hep-ph/0201195
Review of particle physics. Phys. Rev. D 86, 010001 (2012). doi:10.1103/PhysRevD.86.010001
CMS Collaboration, Measurement of the underlying event activity at the LHC with \(\sqrt{s}\) = 7 TeV and comparison with \(\sqrt{s}\) = 0.9 TeV. JHEP 09, 109 (2011). doi:10.1007/JHEP09(2011)109. arXiv:1107.0330
S. Alekhin et al., The PDF4LHC Working Group Interim Report (2011). arXiv:1101.0536
M. Botje et al., The PDF4LHC Working Group Interim Recommendations (2011). arXiv:1101.0538
NNPDF Collaboration, A first unbiased global NLO determination of parton distributions and their uncertainties. Nucl. Phys. B 838, 136 (2010). doi:10.1016/j.nuclphysb.2010.05.008. arXiv:1002.4407
A.D. Martin, W.J. Stirling, R.S. Thorne, G. Watt, Parton distributions for the LHC. Eur. Phys. J. C 63, 189 (2009). doi:10.1140/epjc/s10052-009-1072-5. arXiv:0901.0002
K. Arnold et al., VBFNLO: a parton level Monte Carlo for processes with electroweak bosons. Comput. Phys. Commun. 180, 1661 (2009). doi:10.1016/j.cpc.2009.03.006. arXiv:0811.4559
J. Baglio et al., VBFNLO: a parton level Monte Carlo for processes with electroweak bosons—manual for version 2.7.0 (2011). arXiv:1107.4038
K. Arnold et al., Release Note—VBFNLO-2.6.0 (2012). arXiv:1207.4975
M.L. Mangano, M. Moretti, F. Piccinini, M. Treccani, Matching matrix elements and shower evolution for top-quark production in hadronic collisions. JHEP 01, 013 (2007). doi:10.1088/1126-6708/2007/01/013. arXiv:hep-ph/0611129
J. Alwall et al., Comparative study of various algorithms for the merging of parton showers and matrix elements in hadronic collisions. Eur. Phys. J. C 53, 473 (2008). doi:10.1140/epjc/s10052-007-0490-5. arXiv:0706.2569
K. Melnikov, F. Petriello, Electroweak gauge boson production at hadron colliders through \(O(\alpha _S^2)\). Phys. Rev. D 74, 114017 (2006). doi: 10.1103/PhysRevD.74.114017. arXiv:hep-ph/0609070
M. Czakon, P. Fiedler, A. Mitov, The total top quark pair production cross-section at hadron colliders through \(\cal O(\alpha _S^4)\). Phys. Rev. Lett. 110, 252004 (2013). doi: 10.1103/PhysRevLett.110.252004. arXiv:1303.6254
S. Alioli, P. Nason, C. Oleari, E. Re, A general framework for implementing NLO calculations in shower Monte Carlo programs: the POWHEG BOX. JHEP 06, 043 (2010). doi:10.1007/JHEP06(2010)043. arXiv:1002.2581
P. Nason, A new method for combining NLO QCD with shower Monte Carlo algorithms. JHEP 11, 040 (2004). doi:10.1088/1126-6708/2004/11/040. arXiv:hep-ph/0409146
S. Frixione, P. Nason, C. Oleari, Matching NLO QCD computations with parton shower simulations: the POWHEG method. JHEP 11, 070 (2007). doi:10.1088/1126-6708/2007/11/070. arXiv:0709.2092
S. Alioli, P. Nason, C. Oleari, E. Re, NLO single-top production matched with shower in POWHEG: s- and t-channel contributions. JHEP 09, 111 (2009). doi:10.1088/1126-6708/2009/09/111. arXiv:0907.4076. [Erratum: doi:10.1007/JHEP02(2010)011]
E. Re, Single-top W t-channel production matched with parton showers using the POWHEG method. Eur. Phys. J. C 71, 1547 (2011). doi:10.1140/epjc/s10052-011-1547-z. arXiv:1009.2450
N. Kidonakis, Differential and total cross sections for top pair and single top production. in Proceedings of the XX International Workshop on Deep-Inelastic Scattering and Related Subjects, Bonn, Germany (2012). arXiv:1205.3453. doi:10.3204/DESY-PROC-2012-02/251
N. Kidonakis, Top Quark Production (2013). arXiv:1311.0283
T. Gehrmann et al., \({\rm {W}}^+{\rm {W}}^-\) production at hadron colliders in NNLO QCD (2014). arXiv:1408.5243
J.M. Campbell, R.K. Ellis, MCFM for the Tevatron and the LHC. Nucl. Phys. B Proc. Suppl. 10, 205–206 (2010). doi:10.1016/j.nuclphysbps.2010.08.011. arXiv:1007.3492
J. Allison et al., Geant4 developments and applications. IEEE Trans. Nucl. Sci. 53, 270 (2006). doi:10.1109/TNS.2006.869826
Nucl. Instrum. Meth. A GEANT4—a simulation toolkit. 506, 250 (2003). doi:10.1016/S0168-9002(03)01368-8
CMS Collaboration, Electron reconstruction and identification at \(\sqrt{s}\) = 7 TeV. CMS Physics Analysis Summary, CMS-PAS-EGM-10-004 (2010)
CMS Collaboration, Performance of CMS muon reconstruction in pp collision events at \(\sqrt{s}\) = 7 TeV. JINST 7, P10002 (2012). doi:10.1088/1748-0221/7/10/P10002. arXiv:1206.4071
CMS Collaboration, The CMS experiment at the CERN LHC. JINST 3, S08004 (2008). doi:10.1088/1748-0221/3/08/S08004
CMS Collaboration, Jet plus tracks algorithm for calorimeter jet energy corrections in CMS. CMS Physics Analysis Summary, CMS-PAS-JME-09-002 (2009)
M. Cacciari, G.P. Salam, G. Soyez, The anti-\(k_t\) jet clustering algorithm. JHEP 04, 063 (2008). doi: 10.1088/1126-6708/2008/04/063. arXiv:0802.1189
M. Cacciari, G.P. Salam, G. Soyez, FastJet user manual. Eur. Phys. J. C 72, 1896 (2012). doi:10.1140/epjc/s10052-012-1896-2. arXiv:1111.6097
CMS Collaboration, Particle-flow event reconstruction in CMS and performance for jets, taus, and \(E_{{\rm T}}^{{{\rm miss}}}\). CMS Physics Analysis Summary, CMS-PAS-PFT-09-001 (2009)
CMS Collaboration, Commissioning of the particle-flow event reconstruction with the first LHC collisions recorded in the CMS detector. CMS Physics Analysis Summary, CMS-PAS-PFT-10-001 (2010)
M. Cacciari, G.P. Salam, Pileup subtraction using jet areas. Phys. Lett. B 659, 119 (2008). doi:10.1016/j.physletb.2007.09.077. arXiv:0707.1378
M. Cacciari, G.P. Salam, G. Soyez, The catchment area of jets. JHEP 04, 005 (2008). doi:10.1088/1126-6708/2008/04/005. arXiv:0802.1188
CMS Collaboration, Pileup jet identification. CMS Physics Analysis Summary CMS-PAS-JME-13-005 (2013)
CMS Collaboration, Performance of quark/gluon discrimination using pp collision data at \(\sqrt{s}\) = 8 TeV. CMS Physics Analysis Summary CMS-PAS-JME-13-002 (2013)
J. Gallicchio, M.D. Schwartz, Seeing in color: jet superstructure. Phys. Rev. Lett. 105, 022001 (2010). doi:10.1103/PhysRevLett.105.022001. arXiv:1001.5027
H. Voss, A. Höcker, J. Stelzer, F. Tegenfeldt, TMVA, the toolkit for multivariate data analysis with ROOT. in XIth International Workshop on Advanced Computing and Analysis Techniques in Physics Research (ACAT), p. 40 (2007). arXiv:physics/0703039
B. Jager, Next-to-leading order QCD corrections to photon production via weak-boson fusion. Phys. Rev. D 81, 114016 (2010). doi:10.1103/PhysRevD.81.114016. arXiv:1004.0825
G. Cowan, K. Cranmer, E. Gross, O. Vitells, Asymptotic formulae for likelihood-based tests of new physics. Eur. Phys. J. C 71, 1554 (2011). doi:10.1140/epjc/s10052-011-1554-0. arXiv:1007.1727
CMS Collaboration, CMS luminosity based on pixel cluster counting—Summer 2013 update. CMS Physics Analysis Summary CMS-PAS-LUM-13-001 (2013)
CMS Collaboration, Measurements of inclusive W and Z cross sections in pp collisions at \(\sqrt{s}\) = 7 TeV. JHEP 01, 080 (2011). doi:10.1007/JHEP01(2011)080. arXiv:1012.2466
C.M.S. Collaboration, Jet production rates in association with \(W\) and \(Z\) bosons in pp collisions at \(\sqrt{s}\) = 7 TeV. JHEP 01, 010 (2012). doi: 10.1007/JHEP01(2012)010. arXiv:1110.3226
SM and NLO Multileg Working Group, The SM and NLO Multileg Working Group: Summary report (2010). arXiv:1003.1241
CMS Collaboration, Tracking and primary vertex results in first 7 TeV collisions. CMS Physics Analysis Summary CMS-PAS-TRK-10-005 (2010)
CMS Collaboration, Commissioning of trackJets in pp collisions at \(\sqrt{s}\) = 7 TeV. CMS Physics Analysis Summary CMS-PAS-JME-10-006 (2010)
CMS Collaboration, Performance of jet reconstruction with charged tracks only. CMS Physics Analysis Summary CMS-PAS-JME-08-001 (2009)
Acknowledgments
We congratulate our colleagues in the CERN accelerator departments for the excellent performance of the LHC and thank the technical and administrative staffs at CERN and at other CMS institutes for their contributions to the success of the CMS effort. In addition, we gratefully acknowledge the computing centres and personnel of the Worldwide LHC Computing Grid for delivering so effectively the computing infrastructure essential to our analyses. Finally, we acknowledge the enduring support for the construction and operation of the LHC and the CMS detector provided by the following funding agencies: BMWFW and FWF (Austria); FNRS and FWO (Belgium); CNPq, CAPES, FAPERJ, and FAPESP (Brazil); MES (Bulgaria); CERN; CAS, MoST, and NSFC (China); COLCIENCIAS (Colombia); MSES and CSF (Croatia); RPF (Cyprus); MoER, ERC IUT and ERDF (Estonia); Academy of Finland, MEC, and HIP (Finland); CEA and CNRS/IN2P3 (France); BMBF, DFG, and HGF (Germany); GSRT (Greece); OTKA and NIH (Hungary); DAE and DST (India); IPM (Iran); SFI (Ireland); INFN (Italy); NRF and WCU (Republic of Korea); LAS (Lithuania); MOE and UM (Malaysia); CINVESTAV, CONACYT, SEP, and UASLP-FAI (Mexico); MBIE (New Zealand); PAEC (Pakistan); MSHE and NSC (Poland); FCT (Portugal); JINR (Dubna); MON, RosAtom, RAS and RFBR (Russia); MESTD (Serbia); SEIDI and CPAN (Spain); Swiss Funding Agencies (Switzerland); MST (Taipei); ThEPCenter, IPST, STAR and NSTDA (Thailand); TUBITAK and TAEK (Turkey); NASU and SFFR (Ukraine); STFC (United Kingdom); DOE and NSF (USA). Individuals have received support from the Marie-Curie programme and the European Research Council and EPLANET (European Union); the Leventis Foundation; the A. P. Sloan Foundation; the Alexander von Humboldt Foundation; the Belgian Federal Science Policy Office; the Fonds pour la Formation à la Recherche dans l’Industrie et dans l’Agriculture (FRIA-Belgium); the Agentschap voor Innovatie door Wetenschap en Technologie (IWT-Belgium); the Ministry of Education, Youth and Sports (MEYS) of the Czech Republic; the Council of Science and Industrial Research, India; the HOMING PLUS programme of Foundation for Polish Science, cofinanced from European Union, Regional Development Fund; the Compagnia di San Paolo (Torino); the Consorzio per la Fisica (Trieste); MIUR project 20108T4XTM (Italy); the Thalis and Aristeia programmes cofinanced by EU-ESF and the Greek NSRF; and the National Priorities Research Program by Qatar National Research Fund.
Author information
Authors and Affiliations
Consortia
Rights and permissions
Open Access This article is distributed under the terms of the Creative Commons Attribution License which permits any use, distribution, and reproduction in any medium, provided the original author(s) and the source are credited.
Funded by SCOAP3 / License Version CC BY 4.0.
About this article
Cite this article
Khachatryan, V., Sirunyan, A.M., Tumasyan, A. et al. Measurement of electroweak production of two jets in association with a Z boson in proton–proton collisions at \(\sqrt{s}=8\,\text {TeV}\) . Eur. Phys. J. C 75, 66 (2015). https://doi.org/10.1140/epjc/s10052-014-3232-5
Received:
Accepted:
Published:
DOI: https://doi.org/10.1140/epjc/s10052-014-3232-5