Measurement of top quark-antiquark pair production in association with a W or Z boson in pp collisions at sqrt(s) = 8 TeV

A measurement of the cross section for the production of top quark-antiquark pairs (t tbar) in association with a vector boson V (W or Z) in proton-proton collisions at sqrt(s) = 8 TeV is presented. The results are based on a dataset corresponding to an integrated luminosity of 19.5 inverse femtobarns recorded with the CMS detector at the LHC. The measurement is performed in three leptonic (e and mu) channels: a same-sign dilepton analysis targeting t tbar W events, and trilepton and four-lepton analyses designed for t tbar Z events. In the same-sign dilepton channel, the t tbar W cross section is measured as sigma[t tbar W] = 170 +90/-80 (stat) +/- 70 (syst) fb, corresponding to a significance of 1.6 standard deviations over the background-only hypothesis. Combining the trilepton and four-lepton channels, a direct measurement of the t tbar Z cross section, sigma[t tbar Z] = 200 +80/-70 (stat) +40/-30 (syst) fb, is obtained with a significance of 3.1 standard deviations. The measured cross sections are compatible with standard model predictions within their experimental uncertainties. The inclusive t tbar V process is observed with a significance of 3.7 standard deviations from the combination of all three leptonic channels.


Introduction
Two decades after the discovery of the top quark [1,2], many of its properties are still to be determined or are only loosely constrained by experimental data.Among these properties are the couplings between the top quark and the vector bosons.
The existence of non-zero couplings between the top quark and the neutral vector bosons can be inferred through the analysis of direct production of tt pairs in association with a γ or a Z boson.The CERN LHC allows these two processes to be disentangled and the corresponding couplings to be measured.The associated production of tt pairs with a W boson, the ttW process, has a cross section similar to ttZ and ttγ production.All three processes can be used to test the internal consistency of the standard model (SM) [3][4][5] and search for the presence of new physics.Despite their small cross sections, they are significant backgrounds to analyses that probe phenomena with even smaller, or comparable, cross sections.Examples are searches for supersymmetry [6][7][8] in same-sign dilepton [9] and in multilepton [10] final states, and the analysis of the SM ttH process with the Higgs boson and the top quarks decaying leptonically.
The measurement of the ttγ process has been documented by the CDF Collaboration [11] for proton-antiproton collisions at a centre-of-mass energy √ s = 1.96TeV.This article presents instead the measurement of cross sections for the ttW and ttZ processes in proton-proton (pp) collisions at √ s = 8 TeV.The analysis is based on data corresponding to an integrated luminosity of 19.5 fb −1 collected with the CMS detector at the LHC in 2012.Unlike the previous observation of the ttV process (V equal to Wor Z) at √ s = 7 TeV [12], here the ttW process is treated separately.
Three leptonic (e and µ) final states are considered: same-sign dilepton events, trilepton events, and four-lepton events.The same-sign dilepton events are used for the measurement of the ttW process, where one lepton originates from the leptonic decay of one of the two top quarks and the other like-sign lepton is produced in the decay of the prompt vector boson.The trilepton events are used for the identification of ttZ events in which one lepton is again produced from the leptonic decay of one of the two top quarks, and the two other opposite-sign and sameflavour leptons stem from the decay of the Z boson.The four-lepton events are used to identify ttZ events in which both the top quarks and the Z boson decay leptonically.For all three signatures, signal events containing leptonic τ decays are implicitly included.Figure 1 shows the most important leading-order Feynman diagrams for ttW and ttZ production in pp collisions.For pp collisions at √ s = 8 TeV, the current best estimates of the cross sections for these processes are based on quantum chromodynamics (QCD) calculations at nextto-leading-order (NLO) in α s .Using CT10 NLO [13] parton distribution functions (PDF) and a top-quark mass of 173 GeV, the software framework MADGRAPH5 aMC@NLO [14,15] provides a cross section of 206 +21 −23 fb for ttW production and of 197 +22 −25 fb for ttZ production, in agreement with independent NLO calculations [16,17].
As the number of selected signal events is expected to be comparable to that of the background processes, the estimation of the background is a key aspect of the analysis.The strategy is to use background-dominated control samples in data to the maximum extent possible.Many contributions to the background, in particular those caused by detector misreconstruction, are estimated in this way, while the remaining irreducible backgrounds are estimated using Monte Carlo (MC) simulations and the most precise calculations of cross sections that are available.
For the three separate channels and also for their combination, the yields of events found in excess of the expected backgrounds are used to measure the corresponding signal cross sections.

The CMS detector
The central feature of the CMS apparatus is a superconducting solenoid of 6 m internal diameter, providing a magnetic field of 3.8 T. Within the superconducting solenoid volume are a silicon pixel and strip tracker, a lead tungstate crystal electromagnetic calorimeter, and a brass/scintillator hadron calorimeter.Muons are measured in gas-ionization detectors embedded in the steel flux-return yoke outside the solenoid.A forward calorimeter extends the coverage provided by the barrel and endcap detectors.CMS uses a right-handed coordinate system, with the origin at the nominal interaction point, the x axis pointing to the centre of the LHC, the y axis pointing up (perpendicular to the LHC plane), and the z axis along the anticlockwise-beam direction.The polar angle θ is measured from the positive z axis and the azimuthal angle φ is measured in the x-y plane in radians.Events are selected by a two-stage trigger system: a hardware-based trigger followed by a software-based high-level trigger running on the data acquisition computer farm.A more detailed description of the CMS apparatus can be found in Ref. [18].

Event selection and Monte Carlo simulation
For all the channels considered in this analysis, the data are selected online by dilepton (ee, eµ, and µµ) triggers that demand a transverse momentum (p T ) larger than 17 GeV for the highest p T lepton and 8 GeV for the second-highest.The online selection involves loose identification for both flavours and isolation requirements on electrons.Other channel-specific triggers, which are described in detail later, select control regions that are used for the estimation of specific backgrounds and the assessment of the signal selection efficiency.After the online selection, data and simulated events are reconstructed offline using the same software.
Each event is processed using a global event reconstruction approach [19,20].This consists in reconstructing and identifying particles using an optimized combination of the information from all subdetectors.In this process, the identification of the particle type (photon, electron, muon, charged hadron, and neutral hadron) plays an important role in the determination of the particle direction and energy.
The tracks reconstructed in the silicon tracker are clustered in several primary vertices corresponding to the different pp interactions occurring within the same LHC bunch crossing.The vertex that has the largest ∑ i p 2 T i , where i runs over all tracks of the vertex, is assumed to identify the signal primary vertex.Its position is used to discriminate against particles originating from the other interactions (pileup) and to distinguish between prompt and non-prompt particles stemming from the signal interaction.
For each event, hadronic jets are clustered from the reconstructed particles using the anti-k T algorithm [21], operated with a distance parameter of 0.5.The jet momentum is determined as the vector sum of all particle momenta in the jet.In this analysis the jets used for the definition of the signal regions (signal jets) are required to be inside the tracker acceptance, i.e. |η| < 2.4 where η ≡ − ln[tan(θ/2)], to reduce the uncertainty in the jet reconstruction efficiency and improve the precision of the energy measurement.Jet energy corrections are applied to account for the non-linear response of the calorimeters and other instrumental effects.These corrections are based on in situ measurements using dijet and γ + jet data samples [22].A two-fold approach is employed to reduce the effect of pileup during jet reconstruction.Firstly, charged particles whose trajectories point to pileup vertices are excluded from the set of particles that are used for the reconstruction of signal jets.Secondly, the average energy density due to neutral pileup particles is evaluated in each event, and the corresponding energy inside the jet is subtracted [23].Then a jet identification requirement [24], primarily based on the energy balance between charged and neutral hadrons in a jet, is applied to remove jets that are misreconstructed or originate from instrumental noise.Finally, the trajectories of all the charged particles of a jet are used to calculate a p T -averaged longitudinal impact parameter for each signal jet [25].This variable is then employed as a discriminator against jets from pileup.Unless otherwise specified, signal jets are required to have p T > 30 GeV.
To identify (tag) jets originating from the hadronization of bottom quarks, the combined secondary vertex (CSV) algorithm [26] is used here with two operating points.The loose working point corresponds to a b-tagging efficiency for jets originating from b quarks of about 85% and a misidentification probability for jets from light quarks and gluons of 10%.The medium working point provides an efficiency of about 70% and a misidentification probability of 1.5%.
Muons and electrons are identified using standard quality criteria [27,28] and are required to have p T > 20 GeV and |η| < 2.4.For the four-lepton channel only, identified leptons with p T between 10 and 20 GeV are also employed for the event selection.To reduce the contamination caused by leptons from heavy-flavour decays or misidentified hadrons in jets, leptons are required to be isolated and to pass a selection on the impact parameter, which is calculated with respect to the position of the signal primary vertex.Candidates are considered isolated when the ratio of the scalar sum of the transverse momenta of all the other reconstructed particles in a cone of ∆R = √ (∆η) 2 + (∆φ) 2 = 0.3 around the candidate, relative to the lepton p T value, is less than 5-10%, the exact value of the threshold depending on the flavour of the lepton and on the final state.This relative isolation is corrected for the expected contribution from pileup using an approach that is similar to the one employed for the reconstruction of jets [29].The leptons are required to originate from the primary interaction demanding that their transverse and longitudinal impact parameters are smaller than 50-200 µm and 0.1-1.0cm, respectively.The tightest selections in these ranges are used for the lepton flavour and final states that are most affected by backgrounds due to non-prompt leptons.
Finally, the observables E miss T and H T are used, respectively, to identify the presence of neutrinos and to measure the hadronic activity in the analysed events.The former is defined as the magnitude of the vector sum of the transverse momenta of all reconstructed particles, the latter is the scalar sum of the transverse momenta of all signal jets.Simulations, which include pileup effects, are used to estimate some of the backgrounds, as well as to calculate the selection efficiency for the ttW and ttZ signal events.Simulated samples are generated with the MADGRAPH5 [30] program, with the exception of the ttH background process that is generated using PYTHIA 6 [31].All simulated samples are processed using a GEANT4-based model [32] of the CMS detector.Signal samples are produced with MADGRAPH5, which is used with the CTEQ6L1 [33] PDF and is interfaced to PYTHIA 6.424 to

Same-sign dilepton analysis
The aim of the same-sign dilepton analysis is to search for ttW events where one lepton is produced in the leptonic decay chain of one of the two top quarks, and the other like-sign lepton stems directly from the decay of the prompt vector boson: where corresponds to an electron or a muon.By requiring that the two selected leptons have the same sign, only half of the signal produced in the dilepton final state can be selected.However, the requirement significantly improves the signal-over-background ratio.The main background is caused by misidentification and misreconstruction effects: decay products of heavy-flavour mesons that give rise to non-prompt leptons and pions in jets misidentified as prompt leptons.A second, smaller, source of background is also caused by misreconstruction and consists of opposite-sign dilepton events where the charge of one of the two leptons is wrongly assigned.
The selection for the dilepton channel is conducted through the following steps: 1.Each event must contain two isolated leptons of the same charge and p T > 40 GeV.Both leptons are required to be compatible with the signal primary vertex and have a relative isolation smaller than 5%.The invariant mass of the dilepton pair is required to be larger than 8 GeV.
2. Three or more signal jets must be reconstructed, and at least one of these has to be btagged using the medium working point of the CSV algorithm.
3. Events are rejected if they contain a third lepton forming, with one of the other two leptons, a same-flavour opposite-sign pair whose invariant mass is within 15 GeV of the known Z-boson mass [34].For the third lepton, the relative isolation must be less than 9 (10)% if it is an electron (muon), and the transverse momentum requirement is loosened to p T > 10 GeV.
4. The H T value is required to be greater than 155 GeV.
5. Selected events are grouped in three categories depending on the lepton flavour: ee, eµ, and µµ dilepton pairs.Each of these categories is further split into two separate sets of dileptons with either positive or negative charges, for a total of six signal regions.
The tight-lepton selection (1) reduces the background from misidentified leptons, while the invariant mass requirement rejects events with pairs of energetic leptons from decays of heavy hadrons.The requirement (2) on the general number of jets and on the number of b-tagged jets present in the event decreases the background from electroweak processes, e.g.WZ production, that can have same-sign leptons in the final state, but are accompanied by little hadronic activity.The WZ background is also significantly reduced by the third-lepton veto (3).The H T requirement (4) as well as the threshold on the lepton p T (1) have been optimized for the best signal significance.This selection also minimizes the expected uncertainty in the measured cross section.The splitting (5) of the signal candidates in six categories is done for two reasons: exploiting the smaller background from lepton and charge misidentification in signal regions with muons and benefitting from the greater signal cross section in the plus-plus dilepton final states, which is caused by the abundance of quarks, instead of antiquarks, within the colliding protons at the LHC.Finally, the Z-boson veto is necessary to make the dilepton analysis statistically independent from the trilepton one described later.Events with three leptons are not rejected if they pass the Z-boson veto, since these can stem from a fully-leptonic decay of the tt pair in ttW signal events.

Background estimation
After the full same-sign dilepton selection is applied, there are three general categories of background processes that are selected together with ttW signal events: background from non-prompt or misidentified leptons (misidentified lepton background); background from lepton charge misidentification (mismeasured charge background); WZ and ttZ production, as well as other rare SM processes that contain genuine pairs of prompt, isolated and same-sign leptons.The subset of these processes that do not contain a Z boson in the final state forms the irreducible component of the background.This includes the production of like-sign WW and the production of the Higgs boson in association with a pair of top quarks.The production of a tt pair in association with a W boson by means of double parton scattering is expected to have a cross section two orders of magnitude smaller than the ttW production through single scattering [35].This source of background is therefore considered negligible and is ignored in the rest of the analysis.
The first background consists mostly of tt events, with a second important contribution coming from W+jets events.In both cases, one prompt lepton originates from the leptonic decay of a W boson, and another same-sign lepton is caused by the misidentification of a non-prompt lepton stemming from the decay of a heavy-flavour hadron.In W+jets events, smaller sources of misreconstructed leptons affecting this category of background are given by the misreconstruction of hadrons, the production of muons from light-meson decays, and the reconstruction of electrons from unidentified photon conversions.The background yield is estimated from data using a sample of events that satisfy the full analysis selection, except that one of the two leptons is required to pass a looser lepton selection and fail the full selection (sideband region).The background rate is then obtained weighting the events in this sideband region by the "tight-to-loose" ratio, i.e. the probability for a loosely identified lepton to pass the full set of requirements.This tight-to-loose ratio is measured as a function of lepton p T and η in a control sample of dijet events, which is depleted of prompt leptons and is selected by dedicated singlemuon and single-electron triggers.The systematic uncertainty in the background estimate is due to the differences in the various sources of non-prompt or misidentified leptons, between the dijet events where the tight-to-loose ratio is measured and the sideband region where the ratio is applied.Among the most important differences are the p T spectrum and the flavour of the jets containing the misidentified leptons.These two quantities have been varied in the control sample using appropriate selections and then the effects on the tight-to-loose ratio, and on the background estimate itself, have been quantified.The range of variation for these two quantities has been guided by a simulation of the background processes.The full systematic uncertainty in the background is estimated to be 50%.The statistical part of the uncertainty is driven by the number of events in the sideband region and it is significantly smaller than the systematic uncertainty for all six signal regions.
The probability to misidentify the charge of muons is about an order of magnitude smaller than for electrons.Therefore the magnitude of the background caused by charge misidentification, mostly in Drell-Yan and tt events, is driven only by electrons.This background is estimated by selecting opposite-sign ee or eµ events that pass the full analysis selection, except the same-sign requirement, and then weighting them by the p T -and η-dependent probability for electron charge misassignment.This probability and its variation as a function of the lepton p T and η are determined by combining information from simulation and a control data sample of Z → ee events.For the electron selection used in this analysis, the probability of charge misidentification is about 10 −4 and 10 −3 for electrons reconstructed in the barrel and endcap detectors, respectively.The background estimate has an uncertainty of 30% (15%) for the ee (eµ) signal regions.This uncertainty accounts for differences between data and simulation, and the limited momentum range of electrons in the Z-boson control sample.
Production of WZ and ttZ events, and the irreducible backgrounds, are all estimated from simulation as done when calculating the signal selection efficiencies.For each SM process contributing to this category of background, the dominant systematic uncertainty is the one in the theoretical cross section prediction.Depending on the process, we use an uncertainty of 15-50% and consider it as fully correlated across all signal regions.

Same-sign dilepton results
After the full analysis selection is applied, 36 events are observed in data, to be compared with 25.2 ± 3.4 (syst ⊕ stat) events expected from background processes and 39.7 ± 3.5 (syst ⊕ stat) events from the sum of background and ttW signal with the SM cross section.For both predictions, the statistical and systematic uncertainties are added in quadrature.
The event yields, along with the corresponding uncertainties for each background component, are reported in Table 1.The top left panel of Fig. 2 shows the distribution of the expected and observed events across the six different signal regions, and for all dilepton channels added together.As already anticipated, the positively charged channels are expected to collect a larger quantity of signal than the negatively charged channels, for a comparable quantity of background.The first three channels therefore drive the sensitivity of this analysis.
Table 1: Expected signal, estimated backgrounds, the sum of the two, and observed number of events for the µ ± µ ± , e ± µ ± , and e ± e ± channels.Uncertainties include both the statistical and the systematic components.The systematic uncertainty in the signal contribution does not include the theoretical uncertainty in the signal production cross section.The other three panels of Fig. 2 show the distributions for the invariant mass m bjj of the three jets expected to originate from the hadronic top-quark decay (top right), H T (bottom left), and the leading-lepton p T (bottom right) for all six signal regions combined together.For each event, the three signal jets used for the m bjj distribution are selected as follows: one, and only one, of the three jets is b-tagged; among the possible three-jet combinations the one chosen minimizes , where ∆R j i ,t is the ∆R distance between the direction of the i-th jet and the direction of the reconstructed hadronic top-quark candidate.In all four  distributions data and simulation are found in agreement.In particular, the m bjj distribution confirms that most of the background from misidentified leptons is originating from top-quark events.
Based on the observed number of events, the background estimates, and the signal acceptance (including the leptonic branching fractions), the inclusive ttW production cross section is measured, through the combination of the six dilepton channels, as including statistical and systematic uncertainties, compared to the SM expectation of 206 +21 −23 fb.The significance of the result over the background-only hypothesis is equivalent to 1.6 standard deviations (2.0 standard deviations expected).
The systematic uncertainty in the signal selection efficiency is 8%.It is treated in a common way with the three-and four-lepton channels and is discussed in detail in Section 7. Additionally, for all channels there is a 2.6% uncertainty in the expected yield of signal and simulation-derived background events because of the uncertainty in the luminosity normalization [36].However, together with the low yield of signal events, the main factor dominating the uncertainty in the cross section measurement is the uncertainty in the largest background component, i.e. the 50% uncertainty in the background from misidentified leptons.

Trilepton analysis
The production of a tt pair in association with a Z boson is analysed in the final state with three high-energy, isolated, and prompt leptons.The trilepton analysis targets final states with only one W boson decaying leptonically: The event selection, described in more detail below, focuses on the main features of this final state: two oppositely charged leptons of the same flavour, consistent with the Z-boson decay; an additional lepton; and at least four jets, at least two of which are b-tagged.The isolation of the leptons has additionally been loosened to reflect the diminished contribution of misidentified leptons to the background.
The selection for the trilepton channel is conducted through the following steps: 1.Each event must contain three isolated leptons of p T > 20 GeV and pass identification requirements described in Section 3.All three leptons are required to be compatible with the signal primary vertex and have a relative isolation smaller than 9% (10%) for electrons (muons).
2. Two of the leptons must be of the same flavour, be oppositely charged, and form an invariant mass between 81 and 101 GeV to be consistent with a Z-boson decay.If multiple pairs pass this selection, the one with the mass closest to the known Z-boson mass is selected as the Z boson candidate.
3. To match the final-state signal topology, four or more signal jets must be reconstructed with at least three of these jets having p T > 30 GeV, and the fourth jet is required to have p T > 15 GeV.Additional identification and pileup suppression selections are applied as described in Section 3.
4. At least two of the jets with p T > 30 GeV must be b-tagged, the first using the medium working point of the CSV algorithm, and the second using the loose working point.
5. Events are rejected if they contain a fourth lepton with a loosened transverse momentum requirement of p T > 10 GeV, in order not to overlap with the four-lepton analysis.
These event selections have been optimized for the best precision on the expected measured cross section.A broad range of variations to the applied requirements has been considered in the optimization: including in the event selections a minimum number of jets, minimum jet p T , as well as H T ; changing the number of jets required to be b-tagged; and varying the lepton momentum and isolation thresholds.Estimates of the expected backgrounds used in the optimization of the final requirements have been made both with initial estimates from simulation alone as well as with events in data control samples using the methods described below.

Background estimation
Backgrounds passing the analysis selections are separated into three components: irreducible contributions from events with three prompt leptons and two b-quark jets (irreducible component), primarily with at least one top quark in the process; those with three prompt leptons and b-tagged jets without top-quark contributions (non-top-quark component); and contributions with at least one misidentified lepton (misidentified lepton component).This categorization is driven by the choice of methods used to estimate the backgrounds.
The irreducible component is split evenly among single-top-quark production in association with a Z boson (tbZ), ttH, and ttW production; additional contributions from production of three bosons and tt associated with an isolated photon or two additional vector bosons are much smaller, but are still considered.Since the ttW contribution is constrained by measurements in other (primarily the same-sign dilepton) final states, its expected SM contribution of 0.2 ± 0.1 (stat) events is quoted separately.The remaining irreducible background contributions are estimated directly from simulation: 0.77 ± 0.04 (stat) ± 0.39 (syst) events are expected.The systematic uncertainty in this background is conservatively estimated to be 50%, dominated by the uncertainty in the cross section, in accordance with corresponding values used in Section 4.1.This systematic uncertainty is applied also to the ttW contribution and serves as an initial constraint to the combined measurement, as discussed in Section 8.
The non-top-quark component contributions are primarily from events with three prompt leptons and b-tagged jets from misidentified light-flavour jets or b-quark jets arising from initialor final-state radiation.In simulation, this contribution is dominated by WZ events.Because neither the absolute rate of extra jet production from radiation and higher-order diagrams, nor the flavour composition of additional jets are well simulated [37], we rely on data to predict this background.
A sideband sample with three leptons and no b-tagged jets, with all other selections applied, is dominated by non-top-quark backgrounds and is used to normalize the non-top-quark component prediction.The method to predict the non-top-quark backgrounds works by assuming the ratio R b of the number of events passing the analysis b-tagging requirements relative to those not having b-tagged jets, is the same as for inclusive Z+jets production (Z boson decaying leptonically) for events passing the same jet selections.We derive the R b in a sample of events with opposite-sign same-flavour leptons passing the same identification requirements as in the trilepton sample.The contribution of tt and other flavour-symmetric backgrounds is subtracted using opposite-flavour dilepton events after a correction for a difference in the lepton selection efficiency.For the final prediction of the non-top-quark component, an additional correction C b = 1.4 ± 0.2 (stat) is applied based on the difference between the prediction and observation in simulation.This is done to account for residual differences in the kinematic properties of jets between Z+jets events and the trilepton non-top-quark background.The R b measured in dilepton events in data is 0.160 ± 0.003 (stat).The non-top-quark component is predicted to contribute 2.3 ± 0.5 (stat) ± 1.1 (syst) events.The systematic uncertainty of approximately 50% is estimated as a combination of observed difference of R b in the dilepton events between data and simulation and the deviation of C b from unity.
Finally, the misidentified-lepton background component is estimated with a method similar to that of the same-sign dilepton analysis, described in Section 4.1.In each of the four final states the control sample is culled from events passing the trilepton signal event selections except that only one of the leptons is required to fail the isolation and identification requirements, still passing looser requirements.Similar to the same-sign dilepton analysis, the ratio of misidentified leptons passing full identification and isolation selections relative to the loosened requirements (the tight-to-loose ratio) is modelled to be the same in the trilepton events as in a sample with one lepton candidate and a jet.The modelling is tested in simulation, where the tight-to-loose ratio is measured in simulated multijet events and is then applied to the dominant background sample, i.e. tt production.The level of agreement between predicted and observed background in simulation gives the leading source of systematic uncertainty in the method, estimated to be roughly 50%.Combined in all trilepton final states, the misidentified lepton component is estimated to be 1.2 ± 0.5 (stat) ± 0.6 (syst) events.

Trilepton results
The 12 events observed in data are consistent with the sum of the estimated backgrounds, 4.4 ± 1.6 (syst ⊕ stat) events, and the expected signal, 7.8 ± 0.9 (syst ⊕ stat) events.These results are summarized in Table 2 and illustrated in Fig. 3, which shows corresponding contributions in separate channels as well as several characteristic distributions.The trijet mass for the hadronic top-quark candidate is calculated with the same method as in Section 4.2.
The systematic uncertainty in the cross section measurement arises from uncertainties in the background yields and in the estimate of the signal selection efficiency.For the signal event selection, the dominant sources of systematic uncertainty are the modelling of the lepton selection and the uncertainty in the jet energy scale.They produce 6% and 5% uncertainty in the signal selection efficiency, respectively, and sum to a total of 10% systematic uncertainty together with the other sources of uncertainty described in Section 7.
Based on the observed number of events, the background estimates, and the signal acceptance, of 0.0021 ± 0.0001 (stat) ± 0.0002 (syst), the inclusive ttZ production cross section in the trilepton analysis is measured as σ ttZ,3 = 190 +100 −80 (stat) ± 40 (syst) fb, including statistical and systematic uncertainties, compared to the SM expectation of 197 +22 −25 fb.The significance of the result over the background-only hypothesis is equivalent to 2.3 standard deviations, compared to the expected value of 2.4.This result is combined with the four-lepton analysis and the same-sign dilepton analysis, as described in Section 8.

Four-lepton analysis
The aim of the four-lepton analysis is to select events originating from the process: These events are characterized by a pair of same-flavour, opposite-sign leptons (e and µ) with an invariant mass that is close to the nominal Z-boson mass and two additional prompt leptons.
Since the branching fraction of ttZ to four leptons is very low, it is a challenge to maintain high signal efficiency and at the same time reject as much background as possible.To that end, the events are separated into two categories, one of which has a significantly higher signalto-background ratio than the other.The event selection has been optimized using the signal Table 2: Expected signal, estimated backgrounds, the sum of the two, and observed number of events for the trilepton channel.Uncertainties include both the statistical and the systematic components.The systematic uncertainty in the signal contribution does not include the theoretical uncertainty in the signal production cross section.significance from simulated events and is summarized in the following:

Yield
1. Events must have a total of four leptons passing the lepton identification criteria described in Section 3.Each electron (muon) is required to have relative isolation smaller than 9 (10)%.
2. The highest lepton p T must be greater than 20 GeV.The remaining leptons must have p T > 10 GeV.
3. Two of the leptons must form an opposite-sign same-flavour pair with the dilepton mass between 76 and 106 GeV.
4. The remaining two opposite-sign leptons must not form a same-flavour pair with the dilepton mass between 76 and 106 GeV.
5. At least one jet must pass the medium CSV b-tagging selection.
6.At least one other jet must pass the loose CSV b-tagging selection.
The high signal-to-background signal region requires that events pass all of the criteria above.
A second signal region requires that they pass the first five conditions and fail the sixth.These two four-lepton channels are exclusive.

Background estimation
The standard model can produce four genuine, prompt leptons through multiboson+jets production where at least two bosons decay leptonically.Backgrounds to this search include ZZ, WWZ, WZZ, ZZZ, and rarer processes.They can prove irreducible if the multiboson production is accompanied by b-tagged jets arising from the underlying event or initial-state radiation (irreducible background).
The contribution from irreducible background processes is estimated using MC simulations.
The process with the largest contribution in the four-lepton signal regions comes from the ZZ process.The main concern with taking this background estimate solely from a simulation is how well the rate at which bottom quarks are produced is modelled.Since these bottom quarks mainly originate from initial-state radiation, this rate is estimated in a data sample of leptonically-decaying Z bosons with two additional jets.For events in this sample the probability to pass the two b-tagging criteria is found to be about 4%.Rescaling by this number the events in the appropriate ZZ enhanced region measured in data, the background estimate is found to agree very well with the estimate from simulations.Therefore, the latter estimate is used in the analysis.
Another source of background arises when electrons and muons are incorrectly identified as prompt and isolated (misidentified lepton background).These can either result from misreconstruction of hadrons or from non-prompt or non-isolated leptons passing the selection criteria.Isolated tracks are used as a proxy for misidentified leptons and to calculate a "track-to-lepton" ratio, which depends on the heavy-flavour content and jet activity.The track-to-lepton ratio is determined by measuring the number of prompt, isolated tracks and the number of prompt, isolated leptons after the contribution to the leptons from electroweak processes has been subtracted.It is calculated in two control regions in data: a region with leptonic decays of Z bosons and a region with semi-leptonic decays of tt pairs.The two regions cover the extremes of how much heavy-flavour content is expected in different event samples.The ratio is then interpolated between these two regions using a linear mixing of the two control samples and parameterized as a function of the variable R n-p/p , which is the ratio of non-isolated, nonprompt tracks to non-isolated, prompt tracks in the sample.A track is defined as prompt when its transverse impact parameter is less than 200 µm, and non-prompt otherwise.The variable R n-p/p is used in the parameterization of the track-to-lepton ratio since it quantifies the amount of heavy-flavour content in the events of a given sample.The validity of the parameterization is checked in a third control region that requires one dilepton pair consistent with the Z boson and at least one b-tagged jet: for this sample, whose heavy-flavour content is expected to be in between those of the two previous control regions, R n-p/p is calculated, and the predicted and observed track-to-lepton ratios are compared and found in agreement.Finally, two sideband regions with one dilepton pair consistent with the Z boson and a third lepton, and which also satisfy the two b-tagging categorizations are defined.By calculating R n-p/p and using the track-to-lepton parameterization, the probability for isolated, prompt tracks to be misidentified as electrons (muons) is found equal to 7.4 ± 2.2% (1.6 ± 0.5%) in these two samples.To determine the number of background events in the signal regions, the yields in the sideband regions are then multiplied by the track-to-lepton ratios and the relevant combinatoric factors depending on the number of isolated tracks present in the events.A background yield of 0.1 ± 0.1 (0.5 ± 0.2) in the 2 b-jet (1 b-jet) signal region is calculated in this way.

Four-lepton results
Applying the full event selection, the event yields shown in Table 3 are obtained.A total of 4 events are observed, compared to a background expectation of 1.4 ± 0.3 events, where the uncertainty in the background prediction contains both the contributions from the limited number of simulated events and from the uncertainties related to the rescaling procedure based on control samples in data.The results are shown in Fig. 4 (left).A comparison of the E miss T distributions for the background, signal, and observed data, combining the two signal regions, is shown in Fig. 4 (right).
The systematic uncertainties in the selection efficiencies for signal and irreducible background are derived in the same way as for the dilepton and trilepton channels and are described in Section 7.For the four-lepton analysis, the dominant source of uncertainty in the signal acceptance is the 8% uncertainty in the modelling of the lepton selection.Together with the other systematic uncertainties, it sums to a total uncertainty of 11% in the signal selection efficiency.
By performing a simultaneous fit to the two exclusive four-lepton signal regions, the following Table 3: Expected signal, estimated backgrounds, the sum of the two, and observed number of events for the four-lepton channel.Uncertainties include both the statistical and the systematic components.The systematic uncertainty in the signal contribution does not include the theoretical uncertainty in the signal production cross section.The ZZ component of the background is shown separately from the rest of the irreducible processes.

Systematic uncertainties in signal selection efficiency
Along with the corresponding techniques for the background estimation, the uncertainties in the estimates of the backgrounds affecting the three channels have been presented respectively in Sections 4, 5, and 6.Here are illustrated the uncertainties in the selection efficiency of signal events.
Except for the component due to trigger, simulation is used to estimate the selection efficiency for signal.Control samples in data are used to correct the selection acceptance estimated in simulation and to assess the corresponding uncertainty.A similar approach is employed for all three analysis channels and therefore a common list of systematic uncertainties in signal acceptance can be summarized as in Table 4.The total uncertainty in the computed acceptance varies between 8 and 11% depending on the channel.Table 4: Systematic uncertainties in the signal selection efficiency for the three considered channels: ttW in dilepton (2 ) final state; ttZ in trilepton (3 ) and four-lepton (4 ) final states.The trigger efficiency is directly measured in data using control samples selected by H T triggers that are orthogonal to the dilepton triggers employed by the three analyses to select signal event candidates [9].Trigger inefficiencies are then applied to all acceptances calculated from simulation, for both signal and the background processes derived from simulation.
The offline lepton selection efficiencies in data and simulation are measured using Z-boson events to derive simulation-to-data correction factors.The correction factors applied to simulation are about 0.94 (0.98) for p T > 20 GeV for electrons (muons).The uncertainty in the per-lepton selection efficiency is about 1.5% (0.3%) for electrons (muons) with p T > 20 GeV.An additional systematic uncertainty is assigned to account for potential mismodelling of the lepton isolation efficiency due to the larger hadronic activity in signal events than in Z-boson events.This uncertainty is in the 2-3% range.These per-lepton uncertainties are propagated to calculate the uncertainties in the selection efficiency of signal events, which are found to be in the 4-8% range depending on the leptonic final state.
Another source of systematic uncertainty is associated with the jet energy scale correction.This systematic uncertainty varies between 5% and 2% in the p T range 40-100 GeV for jets with |η| < 2.4 [22].It is evaluated on a single-jet basis, and its effect is propagated to H T , the number of jets, and the number of b-tagged jets.In addition, there is a contribution to the total uncertainty arising from limited knowledge of the resolution of the jet energy, but this effect is generally of less importance than the contribution from the jet energy scale.
The b-tagging efficiency for b-quark jets, and the mistagging probabilities for charm-quark jets and for jets originating from light-flavour quarks or gluons, are estimated from data [38].
The corresponding correction factors, dependent on jet flavour and kinematic properties, are applied to simulated jets to account for the differences in the tagging efficiency between simulation and data.The total uncertainty in the signal acceptance caused by the b-tagging selection is determined by varying the correction factors up and down by their uncertainties.
In the simulation of signal events, different pileup conditions have been probed varying the cross section for inelastic pp collisions by ±5%.Comparing the signal selection efficiency for these different conditions, the uncertainty associated to pileup effects is found to be approximately 1%.The uncertainty in the signal acceptance due to the PDF choice [13,[39][40][41][42]] is found to be 1.5%.An uncertainty of the order of 5% in the selection acceptance is also assigned to the finite-order calculation employed to generate signal events.This last uncertainty includes the effects of varying the matrix-element/parton-shower matching scale, and renormalization and factorization scales from their nominal values set to Q 2 in the event.A nominal matching threshold of 20 GeV is chosen, while for the up and down variations, thresholds of 40 and 10 GeV are used, respectively.Renormalization and factorization scales are varied between 4Q 2 and Q 2 /4.The generator uncertainty also includes the difference in acceptance between signal events simulated with MADGRAPH5 and aMC@NLO [15] generators.

Results
To extract the cross sections for the ttW and ttZ processes, the nine different channels are combined to maximize their sensitivity.Cross section central values and corresponding uncertainties are evaluated from a scan of the profile likelihood ratio.The adopted statistical procedure is the same that was used for the observation of the Higgs boson candidate in CMS, and is described in detail in Ref. [29].
The results of the measurements are summarized in Table 5.Two one-dimensional fits are performed to measure ttW and ttZ separately using the channels most sensitive to each process.Using only the same-sign dilepton channels, the extracted ttW cross section is measured to be 170 +90 −80 (stat) ± 70 (syst) fb, corresponding to a significance of 1.6 standard deviations over the background-only hypothesis.The three and four lepton channels are combined to extract a ttZ cross section of 200 +80 −70 (stat) +40 −30 (syst) fb, with a significance of 3.1 standard deviations.When calculating the one-dimensional fit of one process, the cross section of the other process is constrained to have the theoretical SM value with a systematic uncertainty of 50%.
As visible from Fig. 2 and Table 1, less than 10% of the events selected by the same-sign dilepton channels are expected to stem from ttZ production.The extracted ttW cross section varies by approximately 10% when the used ttZ cross section is altered to as much as 0.5-1.5 times its nominal theoretical value.For an equivalent modification of the ttW production rate, the variation of the extracted ttZ cross section is less than 2%.The dependence of the measured cross section on the assumed cross section of the other ttV process is solved by performing a simultaneous fit of the cross sections of the two processes using all dilepton, trilepton, and four-lepton channels at the same time.The result of the fit is shown visually in Fig. 5 and the cross sections are summarized numerically in Table 6.The cross sections extracted from this two-dimensional fit are identical to those obtained from the two one-dimensional fits.The result of the two-dimensional best fit for ttW and ttZ cross sections (cross symbol) is shown along with its 68 and 95% confidence level contours.The result of this fit is superimposed with the separate ttW and ttZ cross section measurements, and the corresponding 1 standard deviation (1σ) bands, obtained from the dilepton, and the trilepton/four-lepton channels, respectively.The figure also shows the predictions from theory and the corresponding uncertainties.
Finally, a one-dimensional fit of all channels is performed to extract a combined cross section σ ttV = 380 +100 −90 (stat) +80 −70 (syst) fb with a significance of 3.7 standard deviations.

Summary
A measurement with the CMS detector of the cross section of top quark-antiquark pair production in association with a W or Z boson at √ s = 8 TeV has been presented.Results from three independent channels, and their combination, have been reported.In the same-sign dilepton channel, the ttW cross section has been measured to be σ ttW = 170 +90 −80 (stat) ± 70 (syst) fb, corresponding to a significance of 1.6 standard deviations over the background-only hypothesis.In the trilepton and four-lepton channels the ttZ signal has been established with a significance of 2.3 and 2.2 standard deviations, respectively.From the combination of these two channels, a significance of 3.2 standard deviations has been obtained and the cross section bas been measured to be σ ttZ = 200 +80 −70 (stat) +40 −30 (syst) fb.Combining the total of nine sub-channels from the three lepton decay modes, a ttV cross section (V equal W or Z) of σ ttV = 380 +100 −90 (stat) +80 −70 (syst) fb has been obtained, corresponding to a combined significance of 3.7 standard deviations.The measured values are compatible within their uncertainties with standard model predictions.

Figure 1 :
Figure 1: The dominant leading-order Feynman diagrams for ttW and ttZ production in pp collisions.The charge conjugate of the diagrams shown is implied.

Figure 2 :
Figure 2: Event yields in data after final dilepton selection requirements, compared to the background estimates and signal expectations.Contributions separated by final states (top left), trijets mass distribution for the hadronic top-quark candidate (top right), H T distribution (bottom left), and the leading-lepton p T distribution (bottom right).The combination of statistical and systematic uncertainties is denoted by the shaded area.

(Figure 3 :
Figure 3: Event yields in data after final trilepton selection requirements, compared to the background estimates and signal expectations.Contributions separated by final states where the two leptons consistent with the Z boson are indicated inside parenthesis on the bin labels (top left), trijets mass distribution for the hadronic top-quark candidate (top right), Z-boson candidate dilepton mass distribution (bottom left), and the distribution of the number of btagged jets passing medium operating point of the b-tagger (bottom right).The combination of statistical and systematic uncertainties is denoted by the shaded area.

Figure 4 :
Figure 4: Event yields in data after final four-lepton selection requirements, compared to the background estimates and signal expectations.Contributions in the 1 b-tagged jet and 2 btagged jets signal regions (left) and inclusive E miss T distribution (right).The ZZ component of the background is shown separately from the rest of the irreducible processes.The combination of statistical and systematic uncertainties is denoted by the shaded area.

Figure 5 :
Figure5: The result of the two-dimensional best fit for ttW and ttZ cross sections (cross symbol) is shown along with its 68 and 95% confidence level contours.The result of this fit is superimposed with the separate ttW and ttZ cross section measurements, and the corresponding 1 standard deviation (1σ) bands, obtained from the dilepton, and the trilepton/four-lepton channels, respectively.The figure also shows the predictions from theory and the corresponding uncertainties.

Table 5 :
Results of the extraction of cross sections, from single and combined channels.The significance is expressed in terms of standard deviations.

Table 6 :
Results for the two dimensional fit of the ttW and ttZ cross sections.