Measurement of the differential cross section for top quark pair production in pp collisions at s=8TeV\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\sqrt{s} = 8\,\text {TeV} $$\end{document}

The normalized differential cross section for top quark pair (tt¯\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${\mathrm{t}}\overline{{\mathrm{t}}}$$\end{document}) production is measured in pp collisions at a centre-of-mass energy of 8TeV\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\,\text {TeV}$$\end{document} at the CERN LHC using the CMS detector in data corresponding to an integrated luminosity of 19.7fb-1\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\,\text {fb}^{-1}$$\end{document}. The measurements are performed in the lepton+\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$+$$\end{document}jets (e/μ\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\mathrm {e}/\mu $$\end{document}+\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$+$$\end{document}jets) and in the dilepton (e+e-\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\mathrm {e}^+\mathrm {e}^-$$\end{document}, μ+μ-\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\mu ^+ \mu ^- $$\end{document}, and e±μ∓\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\mathrm {e}^\pm \mu ^{\mp }$$\end{document}) decay channels. The tt¯\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${\mathrm{t}}\overline{{\mathrm{t}}}$$\end{document} cross section is measured as a function of the kinematic properties of the charged leptons, the jets associated to b quarks, the top quarks, and the tt¯\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${\mathrm{t}}\overline{{\mathrm{t}}}$$\end{document} system. The data are compared with several predictions from perturbative quantum chromodynamic up to approximate next-to-next-to-leading-order precision. No significant deviations are observed relative to the standard model predictions.


Introduction
Understanding the production and properties of top quarks is fundamental for testing the quality of the standard model (SM) and for searching for new physical phenomena beyond its scope. The large top quark data samples produced in proton-proton (pp) collisions at the CERN LHC provide access to precision measurements that are crucial for checking the internal consistency of the SM at the LHC energy scale. In particular, measurements of the top quark pair (tt) production cross section as a function of tt kinematic observables are important for comparing with the state-of-the-art quantum chromodynamic (QCD) predictions within the SM, and thereby constrain QCD parameters. In addition, the top quark plays a relevant role in theories beyond the SM, and such differential measurements are therefore expected to be sensitive to new phenomena [1].

Electronic supplementary material
The online version of this article (doi: 10.1007/s10052-015-3709-x) contains supplementary material, which is available to authorized users. * e-mail: cms-publication-committee-chair@cern.ch Differential tt production cross sections have been measured previously at the Fermilab pp Tevatron [2,3], and at the LHC at a centre-of-mass energy √ s = 7 TeV [4][5][6]. We present here the first measurement of the normalized differential tt production cross section with the CMS detector at √ s = 8 TeV. The analysis uses data recorded in 2012 corresponding to an integrated luminosity of 19.7 ± 0.5 fb −1 , which is about a factor of four larger than the sample used in the measurement performed by the CMS Collaboration at 7 TeV [5]. The analysis largely follows the procedures of Ref. [5] and benefits from the increase in statistical precision together with improvements in kinematic reconstruction algorithms and extended systematic studies, leading to a significant reduction of the total uncertainties.
The measurements are performed in +jets channels ( = e or μ), which contain a single isolated charged lepton and at least four jets in the final state, and in dilepton channels, with two oppositely charged leptons (e + e − , μ + μ − , e ± μ ∓ ) and at least two jets. The tt cross section is determined as a function of the kinematic properties of the top quarks and of the tt system, as well as of the leptons and jets associated with bottom (b) quarks (b jets) from top quark decays.
The kinematic properties of top quarks are obtained through kinematic-fitting and reconstruction algorithms. The normalized differential tt cross section is determined by counting the number of tt signal events in each bin of a given observable, correcting for detector effects and acceptance, and dividing by the measured total inclusive tt event rate. The latter is evaluated by integrating over all bins in each observable.
The results for directly measured quantities, such as kinematic properties of leptons and b jets, are presented in a fiducial phase space defined by the kinematic and geometric acceptance of all selected final-state objects. This avoids extrapolating the measured cross section into regions that are not experimentally accessible. In addition, the top quark and tt distributions are determined in the full phase space, in order to facilitate the comparison with higher-order perturbative QCD calculations. The results are compared to sev-eral predictions obtained with the leading-order (LO) Mad-Graph [7] generator interfaced to pythia [8] for parton evolution and hadronization, the next-to-leading-order (NLO) generators powheg [9][10][11], interfaced to both pythia and herwig [12], and mc@nlo [13] interfaced to herwig, and the latest NLO calculations with next-to-next-to-leadinglogarithm (NNLL) corrections [14,15], and approximate next-to-next-to-leading-order (NNLO) predictions [16]. The approximate NNLO predictions can be computed with the DiffTop [17] program.
This document is structured as follows. A brief description of the CMS detector is provided in Sect. 2. Details of the event simulation are given in Sect. 3, and event reconstruction and selection are discussed in Sect. 4. The estimated systematic uncertainties on the measurements of the cross section are described in Sect. 5. The results of the measurement are discussed in Sect. 6, followed by a summary in Sect. 7.

CMS detector
The central feature of the CMS apparatus is a superconducting solenoid of 13 m length and 6 m inner diameter, which provides an axial magnetic field of 3.8 T. Within the field volume are a silicon-pixel and strip tracker, a lead tungstate crystal electromagnetic calorimeter (ECAL), and a brass and scintillator hadron calorimeter (HCAL), each composed of a barrel and two endcap sections. Charged particle trajectories are measured by the inner tracking system, covering a pseudorapidity range of |η| < 2.5. The ECAL and the HCAL surround the tracking volume, providing highresolution energy and direction measurements of electrons, photons, and hadronic jets up to |η| < 3. Muons are measured in gas-ionization detectors embedded in the steel flux return yoke outside the solenoid covering the region |η| < 2.4. Extensive forward calorimetry complements the coverage provided by the barrel and endcap detectors up to |η| < 5.2. The detector is nearly hermetic, allowing for energy balance measurements in the plane transverse to the beam directions. A two-tier trigger system selects the pp collisions for use in the analysis. A more detailed description of the CMS detector, together with a definition of the coordinate system and the relevant kinematic variables, can be found in Ref. [18].

Event simulation and theoretical calculations
Event generators, interfaced with a detailed detector simulation, are used to model experimental effects, such as consequences of event reconstruction and choice of selection criteria, as well as detector resolution. The tt sample is simulated using the LO MadGraph event generator (v. 5.1.5.11), which implements the relevant matrix elements with up to three additional partons. The MadSpin [19] package is used to incorporate spin correlation effects with matrix elements for up to three additional partons. The value of the top quark mass is fixed to m t = 172.5 GeV and the proton structure is described by the parton distribution functions (PDF) CTEQ6L1 [20]. The generated events are subsequently processed with pythia (v. 6.426, referred to as pythia6 in the following) for parton showering and hadronization, and the MLM prescription [21] is used for matching of matrixelement jets to parton showers. The CMS detector response is simulated using Geant4 (v. 9.4) [22].
In addition to the MadGraph prediction, calculations obtained with the NLO generators mc@nlo (v. 3.41) and powheg (v. 1.0 r1380) are compared to the results presented in Sect. 6. While powheg and mc@nlo are formally equivalent up to the NLO accuracy, they differ in the techniques used to avoid double counting of radiative corrections that can arise from interfacing with the parton showering generators. Two powheg samples are used: one is processed through pythia6 and the other through herwig (v. 6.520, referred to as herwig6 in the following) for the subsequent parton showering and hadronization. The parton showering in pythia6 is based on a transverse-momentum-ordered evolution scale, whereas in herwig6 it is angular-ordered. The events generated with mc@nlo are interfaced with herwig6. The herwig6 AUET2 tune [23] is used to model the underlying event in the powheg+herwig6 sample, while the default tune is used in the mc@nlo+herwig6 sample. The proton structure is described by the PDF sets CT10 [24] and CTEQ6M [20] for powheg and mc@nlo, respectively. In addition, the latest available NLO+NNLL [14,15] and approximate NNLO QCD predictions [16] are also used to compare with the data. The NNLO MSTW2008 [25] PDF set is used for both the NLO+NNLL and the approximate NNLO calculations.
Standard model background samples are simulated with MadGraph (without the MadSpin package), powheg, or pythia6, depending on the process. The main background contributions originate from the production of W and Z/γ * bosons with additional jets (referred to as W+jets and Z+jets, respectively, in the following), single top quark (s-, t-, and tW channels), diboson (WW, WZ, and ZZ), tt production in association with a Z, W, or γ boson (referred to as tt+Z/W/γ in the following), and QCD multijet events. The W+jets, Z+jets, and tt+Z/W/γ samples are simulated with MadGraph with up to two additional partons in the final state. The powheg generator is used for simulating single top quark production, while pythia6 is used to simulate diboson and QCD multijet events. Parton showering and hadronization are also simulated with pythia6 in all the background samples. The pythia6 Z2* tune [26] is used to characterize the underlying event in both the tt and the background samples.
For comparison with the measured distributions, the event yields in the simulated samples are normalized to an integrated luminosity of 19.7 fb −1 , according to their predicted cross sections. These are taken from NNLO (W+jets [27,28] and Z+jets [27]), NLO+NNLL (single top quark s-, t-, and tW channels [16]

Event reconstruction and selection
The event selection is similar to that described in Ref. [5] for the measurement of normalized differential tt cross sections at √ s = 7 TeV, and is based on the final-state topology of tt events. The top quark decays almost exclusively into a W boson and a b quark, and only the subsequent decays of one or two of the W bosons into a charged lepton (electron or muon) and a neutrino are considered. These signatures imply the presence of isolated leptons with high transverse momentum p T , large p T imbalance caused by the neutrinos that escape detection, and highly energetic jets. The identification of b jets through b-tagging techniques is used to increase the purity of the selected sample. The event selection in each channel is optimized to maximize the content of tt signal events and background rejection. The candidates are required to have p T > 33 GeV within the pseudorapidity interval |η| < 2.1 for the +jets channels, while electron candidates in the dilepton channels are required to have p T > 20 GeV and |η| < 2.4. As an additional quality criterion, a relative isolation I rel (0.3) < 0.10 in the +jets channels and I rel (0.3) < 0.15 in the dilepton channels is required, where I rel (x) is defined as the sum of the p T of all neutral and charged reconstructed particle candidates inside a cone of R ≡ ( η) 2 + ( φ) 2 < x around the electron (excluding the electron itself) in η-φ space, divided by the p T of the electron.
Muon candidates are reconstructed using the track information from the silicon tracker and the muon system. They are required to have p T > 33 GeV and |η| < 2.1 in the +jets channels, while in the dilepton channels the corresponding selection requires p T > 20 GeV and |η| < 2.4. Isolated muon candidates are selected if they fulfill I rel (0.4) < 0.12 and I rel (0.3) < 0.15 in the +jets and dilepton channels, respectively. The same definition of relative isolation described above is also used for muon candidates.
Jets are reconstructed by clustering the particle-flow candidates [38] using the anti-k T clustering algorithm with a distance parameter of R = 0.5 [39]. Electrons and muons passing less stringent selections on lepton kinematic quantities and isolation, relative to the ones specified above, are identified but excluded from clustering. A jet is selected if it has p T > 30 GeV and |η| < 2.4 for both the +jets and dilepton channels. Jets originating from b quarks are identified through a "combined secondary vertex" algorithm [40], which provides a b-tagging discriminant by combining secondary vertices and track-based lifetime information. The chosen working point in the +jets channels has an efficiency for tagging a b jet of ≈60 %, while the probability to misidentify light-flavour jets as b jets (mistag rate) is only ≈1.5 %. In the dilepton channels, the working point is selected to provide b-tagging efficiency and mistag rate of ≈80-85 and ≈10 %, respectively [40]. These requirements are chosen to reduce the background contribution in the corresponding channels while keeping a large fraction of the tt signal.
The missing transverse energy E T / is defined as the magnitude of the imbalance in the transverse momentum p T / in the event, which is the negative of the vectorial sum of the momenta in the transverse plane of all the particles reconstructed with the particle-flow algorithm [41]. To mitigate the effect of contributions from pileup on the resolution in E T / , we use a multivariate correction where the input is separated into components that originate from the primary and other collision vertices [42]. This correction improves the E T / resolution by ≈5 %.

Event selection
Events in the +jets channels that are triggered by the presence of a single electron (muon) with p T > 27 GeV ( p T > 24 GeV, |η| < 2.1), are selected if they contain exactly one reconstructed lepton fulfilling the requirements  Jets / 10 GeV Additionally, an event must contain at least four reconstructed jets satisfying the criteria described in Sect. 4.1. To suppress background contribution mainly from W+jets events, at least two of these jets are required to be tagged as b jets, and at least two must not be tagged as b jets, as they are used to reconstruct W → qq decays. In the dilepton channels, events are triggered using combinations of two leptons with p T thresholds of 8 and 17 GeV, and are selected if they contain at least two isolated leptons of opposite electric charge and at least two jets. At least one of the jets is required to be b-tagged. In events with more than two leptons, we choose the lepton pair with opposite charge and largest value in the sum of    The QCD multijet background is negligible and not shown. The Z/γ * +jets background is determined from data [5,43]. The hatched regions correspond to the shape uncertainties for the signal and backgrounds (cf. Sect. 5). The lower part of each plot shows the ratio of data to the predictions their scalar p T . Events with an invariant mass of the lepton pair smaller than 20 GeV are removed to suppress events from decays of heavy-flavour resonances and low-mass Drell-Yan processes. Backgrounds from Z+jets processes in the e + e − and μ + μ − channels are also suppressed by requiring the dilepton invariant mass to be outside a Z boson mass window of 91 ± 15 GeV, and to have E T / > 40 GeV.
After these selection steps, several basic distributions in +jets and dilepton events are shown in Figs. 1 and 2, respectively. The hatched regions correspond to the shape uncertainties for the signal and background (cf. Sect. 5), and are dominated by the former. The data are reasonably well described by the simulation, as shown in the lower part of each plot, where the ratio of data to simulation is presented to better indicate the level of agreement between data and the default tt signal (MadGraph+pythia6) and background samples used in the analysis. For both channels, however, data tend to have lower p T values than predicted by the simulation. It has been verified that the results presented in Sect. 6 are not affected by these remaining differences between data and simulation. A better data-to-simulation agreement in the lepton and jet p T distributions is obtained by scaling the top quark p T spectrum in simulation to match the data. However, the impact on the measurement of the cross sections is negligible.

Kinematic reconstruction of the tt system
The kinematic properties of the top quark pair are determined from the four-momenta of all final-state objects through kinematic reconstruction algorithms. These algorithms are improved versions of those described in Ref. [5].
In the +jets channels, a constrained kinematic fitting algorithm is applied [5,44] to the four-momenta of the selected lepton and up to five leading jets, and the p T / representing the transverse momentum of the neutrino, which are changed according to their resolutions. The fit is constrained to reconstruct two W bosons, each with a mass of 80.4 GeV. In addition, the reconstructed top quark and antiquark masses are required to be equal. To reduce the number of permutations in the association of jets to quarks, only btagged jets are considered as b quarks, and only untagged jets are considered as light quarks. In events with several combinatorial solutions, only the one with the minimum χ 2 in the fit is accepted. The main improvement relative to the method described in Ref. [5] is the increase in the number of correct assignments of b jets to b quarks. This is achieved by applying the kinematic fit twice, sequentially, in each event. In the first fit, the top quark mass is fixed to a value of 172.5 GeV. The jet combination that provides the minimum χ 2 in the fit is then used as input to the second kinematic fit, in which the top quark mass is not fixed, and the solution to this fit is retained. A further improvement in the method is to require the χ 2 -probability of the second kinematic fit to be >2 %. This criterion is chosen to optimize the fraction of correctly reconstructed signal events, without increasing significantly the statistical uncertainty in the data. The efficiency of this requirement is about 87 % for signal events with the correct jet assignment. As a result, the number of correctly reconstructed events is increased by almost a factor of two relative to the method used in Ref. [5], and effects from migration of events across bins, which are relevant for the measurements of the cross section, are reduced. It has been checked that any possible bias in the results that could be introduced by fixing the top quark mass to a specific value in the first kinematic fit is within the assigned systematic uncertainty on the dependence of the measurement on the top quark mass (cf. Sect. 5.2).
The dilepton channels use an algebraic kinematic reconstruction method [5,45]. The only unknowns are the threemomenta of the two neutrinos, which are reconstructed imposing the following kinematic constraints: p T conservation in the event; the W bosons, and top quark and antiquark masses. In contrast to the method of Ref. [5], the top quark mass is fixed to a value of 172.5 GeV. Each suitable pair of b jet candidates in the event, and both possible assignments of these two jets to the two selected leptons, are considered in the kinematic reconstruction. Combinations with two btagged jets are preferred to using single b-tagged jets. In the new method, events are reconstructed 100 times, each time randomly smearing the measured energies and directions of the reconstructed lepton and b jet candidates by their respective detector resolutions. This smearing recovers events that yielded no solution of the equations for the neutrino momenta, because of measurement fluctuations. The equations for the neutrino momenta can have up to four solutions. For a given smearing, the solution is identified by the one yielding the smallest invariant mass of the tt system. For each solution, a weight is calculated based on the expected true lepton-b-jet invariant mass spectrum. The weights are summed over the 100 reconstruction attempts, and the kinematic quantities associated to the top quark and antiquark are calculated as a weighted average. Finally, the two jet and lepton-jet assignments that yield the maximum sum of weights are chosen for analysis. It has been checked that any bias introduced through the use of the lepton-b-jet and tt invariant masses is negligible. This method yields on average a reconstruction efficiency of ≈94 %, which is 6 % higher than the one described in Ref. [5], and reduces systematic migration effects.
Distributions of the top quark or antiquark and tt kinematic observables (the transverse momenta p t T , p tt T , and the rapidities y t and y tt ) are presented in Figs. 3 and 4 for the +jets and dilepton channels, respectively. The hatched regions correspond to the shape uncertainties for the signal and background (cf. Sect. 5), and are dominated by the former. The lower panel in each plot also shows the ratio of data relative to the simulated signal and background samples.
In general, the data are reasonably well described by the simulation within the uncertainties. For both channels, the measured p T distributions, in particular p t T , are somewhat softer than the simulated distributions: the data lie above the simulation for p t T < 60(65) GeV in the +jets (dilepton) channels, while they lie below for p t T > 200 GeV. This pattern was also observed at 7 TeV [5]. To ensure that the results presented in Sect. 6 are not affected by such small remaining differences between data and simulation, the analysis has been repeated in different kinematic regions, with different selection requirements, and after scaling the top  regions correspond to the shape uncertainties for the signal and backgrounds (cf. Sect. 5). The lower part of each plot shows the ratio of data to the predictions quark p T spectrum in simulation to match the data. However, the impact on the measurement of the cross sections is negligible.
Following the event selection described in Sect. 4.2 and the kinematic reconstruction of the tt system, the main contributions to the background in the +jets channels arise from tt decays into channel other than +jets (including tt decays into τ leptons originating from the primary interaction) and single top quark events. The contribution from W+jets and QCD multijet events are well suppressed after the b-tagging requirement, while other tt events are somewhat reduced after the χ 2 -probability requirement. A total of 24,927 events are found in the e+jets channel and 26,843 events in the μ+jets channel. The contribution from tt signal to the final event sample is 89.0 %. The remaining fraction of events contains 7.3 % tt decays other than the +jets channels, 2.4 % single top quark events, 0.9 % W+jets and tt+Z/W/γ events, and negligible fractions of Z+jets, diboson, and QCD multijet       Z/γ * +jets background is determined from data [5,43]. The hatched regions correspond to the shape uncertainties for the signal and backgrounds (cf. Sect. 5). The lower part of each plot shows the ratio of data to the predictions events. All background contributions are determined from simulation.
In the dilepton channels, 10,678 events are found in the e + e − channel, 14,403 in the μ + μ − channel, and 39,640 in the e ± μ ∓ channel. Only tt events containing at least two leptons (electrons or muons) from W decays in the final state are considered as signal, and constitute 79.0 % of the final event sample. All other tt candidate events, specifi-cally those originating from decays via τ leptons, are considered as background and amount to 13.3 % of the final event sample. The fraction of Z+jets events is found to be 2.4 %. This background, which is dominant to the e + e − and μ + μ − channels, is estimated from data using the number of events observed within the Z-peak region (which is removed from the candidate sample), and a correction needed for non-Z+jets backgrounds in this same control Table 1 Breakdown of typical systematic uncertainties for the normalized differential cross sections. The uncertainty in the jet-parton matching threshold is indicated as "ME-PS threshold"; "PS" refers to "parton shower". The medians of the distribution of uncertainties over all bins of the measurement are quoted. For the +jets channels, the background from Z+jets is negligible and included in the "Background (all other)" category Source Relative systematic uncertainty (%)   uncertainties. The measurements are compared to predictions from MadGraph+pythia6, powheg+pythia6, powheg+herwig6, and mc@nlo+herwig6. The lower part of each plot shows the ratio of the predictions to data region is obtained from data in the e ± μ ∓ channel [5,43]. Other sources of background, including single top quark production (3.4 %), tt+Z/W/γ production (1 %), the contri-bution arising from misidentified or genuine leptons within jets (0.6 %), or diboson events (0.3 %), are estimated from simulation.  error bars indicate the statistical (combined statistical and systematic) uncertainties. The measurements are compared to predictions from MadGraph+pythia6, powheg+pythia6, powheg+herwig6, and mc@nlo+herwig6. The lower part of each plot shows the ratio of the predictions to data

Systematic uncertainties
The measurement is affected by systematic uncertainties that originate from detector effects and from theoretical assumptions. Each source of systematic uncertainty is assessed individually by changing the corresponding efficiency, resolution, or scale by its uncertainty, using a prescription similar to the one followed in Ref. [5]. For each change made, the mea-sured normalized differential cross section is recalculated, and the difference of the changed result relative to its nominal value in each bin is taken as the systematic uncertainty. The overall uncertainty on the measurement is obtained by adding all the contributions in quadrature, and is of the order of 3-   Table 1.

Experimental uncertainties
The efficiencies of the single-electron and single-muon triggers in the +jets channels are determined using the "tagand-probe" method of Ref.
[46] using Z boson event sam-ples. Scale factors close to unity within a few percent are extracted to account for the observed dependence on the η and p T of the lepton. The lepton identification and isolation efficiencies for the +jets channels obtained with the tagand-probe method agree well between data and simulation, so that the applied corrections are very close to unity. The systematic uncertainties are determined by shape-dependent changes in trigger and selection efficiencies by their uncer-  error bars indicate the statistical (combined statistical and systematic) uncertainties. The measurements are compared to predictions from MadGraph+pythia6, powheg+pythia6, powheg+herwig6, and mc@nlo+herwig6. The lower part of each plot shows the ratio of the predictions to data tainties. Lepton trigger efficiencies in the dilepton channels are measured using triggers that are only weakly correlated to the dilepton triggers used in the analysis. A dependence on η of a few percent is observed, and scale factors are extracted. The lepton identification and isolation uncertainties in the dilepton channels are also determined using the tag-and-probe method, and are again found to be described very well by the simulation for both electrons and muons.
The overall difference between data and simulation in bins of η and p T is estimated to be <2 % for electrons, and scale factors for muons are found to be close to unity within 1.0 %.

Fig. 10
Normalized differential tt production cross section in the +jets channels as a function of the p T of the leading (left) and trailing (right) top quarks or antiquarks. The data points are placed at the midpoint of the bins. The inner (outer) error bars indicate the sta-tistical (combined statistical and systematic) uncertainties. The measurements are compared to predictions from MadGraph+pythia6, powheg+pythia6, powheg+herwig6, and mc@nlo+herwig6. The lower part of each plot shows the ratio of the predictions to data and light (s, u, d, and gluon) jets, assuming that they are all uncorrelated.
The uncertainty in background normalization is determined by changing the background yields. In the +jets channels, the background normalization for the diboson, QCD multijet, W+jets, and Z+jets samples is conservatively varied by ±50 % [5], since these backgrounds, being very small, are determined from simulation rather than from data. The normalization of the tt+Z/W/γ samples is changed by ±30 %. For the single top quark sample, the uncertainty is covered by changing the normalization by ±30 %, and the kinematic scales of the event process (renormalization and factorization scales) as described in Sect. 5.2. In the e + e − and μ + μ − channels, the dominant background from Z+jets determined from data [5,43] is changed in normalization by ±30 %. In addition, changes in the background contributions from single top quark, diboson, QCD multijet, tt+Z/W/γ , and W+jets events of ±30 % are used in dilepton channels [5].
The kinematic reconstruction of top quarks is well described by the simulation, and the resulting uncertainties are small. In the case of the +jets analysis, the uncertainty of the kinematic fit is included in the changes in jet energy scales and resolutions, and in the uncertainty on the dependence on the top quark mass (cf. Sect. 5.2). In the dilepton analysis, the bin-to-bin uncertainty is determined from the small remaining difference in efficiency between simulation and data.
The pileup model estimates the mean number of additional pp interactions to be about 20 events per bunch crossing for the analyzed data. This estimate is based on the total inelastic proton-proton cross section, which is determined to be 69.4 mb following the measurement described in Ref. [47]. The systematic uncertainty is determined by changing this cross section within its uncertainty of ±5 %.

Uncertainties in modelling
The impact of theoretical assumptions on the measurement is determined, as indicated previously, by repeating the analysis and replacing the standard MadGraph tt simulation by dedicated simulation samples with altered parameters.
The uncertainty in modelling of the hard-production process is assessed through changes in the renormalization and factorization scales in the MadGraph sample by factors of two and 0.5 relative to their common nominal value, which is set to the Q of the hard process. In MadGraph, Q is defined by Q 2 = m 2 t + p 2 T , where the sum is over all additional final state partons in the matrix element. The impact of the choice of the scale that separates the description of jet production through matrix elements (ME) or parton shower (PS) in MadGraph is studied by changing its reference value of 20 GeV to 40 and 10 GeV. In the +jets channels, changes in the renormalization and factorization scales are also applied to single top quark events to deter- Syst. ⊕ Stat.

Fig. 11
Normalized differential tt production cross section in the +jets channels as a function of the p tt T (top left), y tt (top right), and m tt (bottom) of the tt system. The data points are placed at the midpoint of the bins. The inner (outer) error bars indicate the statistical (combined statistical and systematic) uncertainties. The measurements are com-pared to predictions from MadGraph+pythia6, powheg+pythia6, powheg+herwig6, mc@nlo+herwig6, and to NLO+NNLL [14,15] calculations, when available. The lower part of each plot shows the ratio of the predictions to data mine an uncertainty on the shape of this background contribution. The dependence of the measurement on the top quark mass is also estimated from dedicated MadGraph simulation samples in which the top quark mass is changed by ±1 GeV relative to the value used in the default simulation. The uncertainty from hadronization and parton showering is assessed by comparing the results obtained from samples simulated with powheg and mc@nlo interfaced with pythia6 and herwig6, respectively. The uncertainty from the choice of PDF is determined by reweighting the sample of simulated tt signal events according to the 52 CT10 PDF error sets [24], at a 90 % confidence level. The maximum variation is taken as uncertainty. As mentioned in Sects. 4.2 and 4.3, the effect of scaling the top quark p T spectrum in simulation to match the data has negligible impact on the measured cross sections, therefore no systematic uncertainty is taken into account for this effect. Syst. ⊕ Stat.
Stat. Stat.  The measurements are compared to predictions from MadGraph+pythia6, powheg+pythia6, powheg+herwig6, mc@nlo+herwig6, and to approximate NNLO [16] calculations, when available. The lower part of each plot shows the ratio of the predictions to data

Normalized differential cross sections
The normalized tt cross section in each bin i of each observable X is determined as a function of the kinematic properties of the leptons, the lepton pair, the b jets, the b jet system, the top quarks, and the tt system through the relation [5]: where x i represents the number of signal events measured in data in bin i after background subtraction and corrected for detector efficiencies, acceptances, and migrations, and X i is the bin width. The differential cross section is normalized by the sum of x i over all bins, as indicated in Eq. (1). The integrated luminosity is omitted, as it cancels in the ratio. Because of the normalization, sources of systematic uncertainty that are correlated across all bins of the measurement, e.g. the uncertainty in the integrated luminosity, also can- Syst. ⊕ Stat.

Fig. 13
Normalized differential tt production cross section in the dilepton channels as a function of the p T of the leading (left) and trailing (right) top quarks or antiquarks. The data points are placed at the midpoint of the bins. The inner (outer) error bars indicate the statistical (combined statistical and systematic) uncertainties. The measurements are compared to predictions from MadGraph+pythia6, powheg+pythia6, powheg+herwig6, and mc@nlo+herwig6. The lower part of each plot shows the ratio of the predictions to data cel. The contribution to the background from other tt decays is taken into account, after subtracting all other background components, by correcting the number of signal events in data using the expected signal fraction. The expected signal fraction is defined as the ratio of the number of selected tt signal events to the total number of selected tt events (i.e. signal and all other tt events) in simulation. This procedure avoids the dependence on the total inclusive tt cross section used in the normalization of the simulated signal sample.
Effects from trigger and detector efficiencies and resolutions leading to the migration of events across bin boundaries, and therefore to statistical correlations among neighbouring bins, are corrected by using a regularized unfolding method [5,48,49]. For each measured distribution, a response matrix is defined that accounts for migrations and efficiencies using the simulated MadGraph+pythia6 tt signal sample. The generalized inverse of the response matrix is used to obtain the unfolded distribution from the measured distribution by applying a χ 2 minimization technique. A smoothing prescription (regularization) is applied to prevent large unphysical fluctuations that can be introduced when directly inverting the response matrix. The strength of the regularization is determined and optimized individually for each distribution using the averaged global correlation method [50].
To keep the bin-to-bin migrations small, the widths of bins in the measurement are chosen according to their purity (ratio of the number of events generated and reconstructed in a particular bin to the total number of events reconstructed in that bin; this quantity is sensitive to migrations into the bin) and stability (ratio of the number of events generated and reconstructed in a particular bin to the number of events generated in that bin; this is sensitive to migrations out of the bin). The purity and stability of the bins in this analysis are typically 60 % or larger, mainly due to the improvements in the kinematic reconstruction methods discussed in Sect. 4.3.
The performance of the unfolding procedure is tested for possible biases from the choice of the input model (the Mad-Graph+pythia6 tt signal simulation). It is verified that, either by reweighting the signal simulation or injecting a resonant tt peak into the simulation of the signal, the unfolding procedure based on the nominal response matrices still recovers these altered shapes within statistical uncertainties. Moreover, tt samples simulated with powheg+pythia6 and mc@nlo+herwig6 are used to obtain the response matrices applied in the unfolding when determining the systematic uncertainties of the model (cf. Sect. 5.2). Therefore, possible effects from the unfolding procedure are already taken into account in the systematic uncertainties. The unfolded results are found to be consistent with those obtained using other regularization techniques [49].
The measurement of the normalized differential cross sections proceeds as follows. For each kinematic distribution, the event yields in the separate channels are added together, the background is subtracted, and the unfolding is performed. It is verified that the measurements in separate channels yield results consistent within their uncertainties. The systematic

Stat.
Fig. 14 Normalized differential tt production cross section in the dilepton channels as a function of the p tt T (top left), y tt (top right), and m tt (bottom) of the tt system. The data points are placed at the midpoint of the bins. The inner (outer) error bars indicate the statistical (combined statistical and systematic) uncertainties. The measurements are com-pared to predictions from MadGraph+pythia6, powheg+pythia6, powheg+herwig6, mc@nlo+herwig6, and to NLO+NNLL [14,15] calculations, when available. The lower part of each plot shows the ratio of the predictions to data uncertainties in each bin are determined from the changes in the combined cross sections. This requires the full analysis to be repeated for every systematic change, and the difference relative to the nominal combined value is taken as the systematic uncertainty for each bin of each observable. This method therefore takes into account the correlation among systematic uncertainties in different channels and bins.
The normalized differential cross sections of leptons and b jets are unfolded to the particle level and determined in a fiducial phase space defined by the kinematic and geometric region in which the final-state leptons and jets are produced within the detector acceptance (cf. Sect. 6.1). This minimizes model uncertainties from the extrapolation of the measurement outside of the experimentally welldescribed regions of phase space. In addition, the top quark   Fig. 15 Comparison of normalized differential tt production cross section in the dilepton and +jets channels as a function of the p t T (top left), the tt rest frame p t * T (top right), and the rapidity y t (bottom left) of the top quarks or antiquarks, and the difference in the azimuthal angle between the top quark and the antiquark φ(t,t) (bottom right). The measurements are presented relative to the Mad-Graph+pythia6 prediction. A horizontal bin-centre correction is applied to all data points (cf. Sect. 6.2). The inner (outer) error bars indicate the statistical (combined statistical and systematic) uncertainties. The predictions from powheg+pythia6, powheg+herwig6, and mc@nlo+herwig6, also presented relative to MadGraph+pythia6, are shown for comparison and tt-system quantities are unfolded to the parton level and presented in the full phase space (cf. Sect. 6.2) to provide easier comparisons with recent QCD calculations. The measurements are compared to predictions from Mad-Graph+pythia6, powheg+pythia6, powheg+herwig6, and mc@nlo+herwig6. The top quark and tt results are also compared to the latest calculations at NLO+NNLL [14,15] and approximate NNLO [16] precision, when available.
In addition to the measurements discussed in Ref. [5], results for the p T and invariant mass of the b jet pair, the p T of the top quarks or antiquarks in the tt rest frame, the p T of the highest (leading) and second-highest (trailing) p T of the top quark or antiquark, and the difference in the azimuthal angle between the top quark and antiquark are also presented.
[GeV]  16 Comparison of normalized differential tt production cross section in the dilepton and +jets channels as a function of the p T of the leading (left) and trailing (right) top quarks or antiquarks. The measurements are presented relative to the Mad-Graph+pythia6 prediction. A horizontal bin-centre correction is applied to all data points (cf. Sect. 6.2). The inner (outer) error bars indicate the statistical (combined statistical and systematic) uncertainties. The predictions from powheg+pythia6, powheg+herwig6, and mc@nlo+herwig6, also presented relative to MadGraph+pythia6, are shown for comparison All values of normalized differential cross sections, including bin boundaries, are provided in tables in the supplemental material (URL will be inserted by publisher) 6.1 Lepton and b jet differential cross sections The normalized differential tt cross section as a function of the lepton and b jet kinematic properties is measured at the particle level, where the objects are defined as follows. Leptons from W boson decays are defined after final-state radiation. A jet is defined at the particle level, following a procedure similar to that described in Sect. 4.1 for reconstructed jets, by applying the anti-k T clustering algorithm with a distance parameter of 0.5 to all stable particles (excluding the decay products from W boson decays into eν, μν, and final states with leptonic τ decays). A jet is defined as a b jet if it contains any of the decay products of a B hadron. Only the two b jets of highest p T originating from different B hadrons are considered as arising from the top quark decays.
The measurements are presented in a fiducial phase space defined by geometric and kinematic requirements on these particle-level objects as follows. The charged leptons from the W boson decays must have |η| < 2.1 and p T > 33 GeV in the +jets channels, and |η| < 2.4 and p T > 20 GeV in the dilepton channels. Exactly one and two leptons are required, respectively, in the +jets and the dilepton channels. At least four jets with |η| < 2.4 and p T > 30 GeV, two of which are b jets, are required in the +jets channels. In the dilepton channels, both b jets from the top quark decays must satisfy |η| < 2.4 and p T > 30 GeV. The fiducial particle-level corrections are determined using simulated tt events that fulfill these requirements; all other tt events are classified as background and are removed. Figure 5 presents the normalized differential cross section in the +jets channels as a function of the lepton transverse momentum p T and pseudorapidity η . The distributions of the transverse momentum of the b jets p b T and their pseudorapidity η b are given in Fig. 6, together with the transverse momentum p bb T and invariant mass m bb of the b jet pair. Also shown are predictions from Mad-Graph+pythia6, powheg+pythia6, powheg+herwig6, and mc@nlo+herwig6. The lower panel in each plot shows the ratio of each of these predictions to data, in order to quantify their level of agreement relative to data. Figure 7 presents the normalized differential cross sections for the dilepton channels: the transverse momentum p T and the pseudorapidity η of the leptons, and the transverse momentum p    In general, none of the examined predictions provides an accurate description of data for all measured lepton and b jet distributions. A steeper p T spectrum is observed in data for the lepton and the b jet distributions compared to the predictions in both decay channels, which is best described by powheg+herwig6. The lepton p T in data is above the predictions for p T < 40 GeV, while it is below for p T > 100 GeV. A similar behaviour is observed for p T , and p bb T . The m + − distribution in data is below all predictions for m + − > 30 GeV. Worse agreement is found for powheg+pythia6. The η distributions in data are described by the predictions within the experimental uncertainties. The η b distributions are slightly less central in data than in the predictions, and are worse described by MadGraph+pythia6. The remaining distributions are described by the predictions within experimental uncertainties.

Top quark and tt differential cross sections
The normalized differential tt cross section as a function of the kinematic properties of the top quarks and the tt system is defined with respect to the top quarks or antiquarks before the decay (parton level) and after QCD radiation, and extrapolated to the full phase space using the MadGraph+pythia6 prediction for the +jets and dilepton channels.
In Figs. 9, 10 and 11, the following distributions are presented for the +jets channels: the transverse momentum p t T and the rapidity y t of the top quarks or antiquarks, the transverse momentum p t * T of the top quarks or antiquarks in the tt rest frame, the difference in the azimuthal angle between the top quark and antiquark φ(t,t), the transverse momentum of the leading ( p t1 T ) and trailing ( p t2 T ) top quark or antiquark, and the transverse momentum p tt T , the rapidity y tt , and the invariant mass m tt of the tt system. The data are compared to predictions from MadGraph+pythia6, powheg+pythia6, powheg+herwig6, and mc@nlo+herwig6. In addition, the approximate NNLO calculation [16] is also shown for the top quark p T and rapidity results, while the m tt and the p tt T distributions are compared to the NLO+NNLL predictions from Refs. [14,15], respectively. Figures 12, 13 and 14 show the corresponding distributions in the dilepton channels. The lower panel in each plot also shows the ratio of each prediction relative to data.
In general, the powheg+herwig6 prediction provides a good description of data for all measured distributions. The shape of the top quark p T spectrum is softer in data than in the predictions from MadGraph+pythia6, powheg+pythia6, and mc@nlo+herwig6 in both channels. The data lie above the predictions for p t T < 60 (65) GeV in the +jets (dilepton) channels, while they lie below for p t T > 200 GeV. This effect was also observed at 7 TeV [5]. The disagreement between data and predictions in the tail of the distributions is also observed in a measurement by the ATLAS Collaboration [6]. In contrast, the prediction from powheg+herwig6 and the approximate NNLO calculation provide a better description of the data, as they predict a slightly softer top quark p T distribution than the three other simulations. The difference between the powheg+pythia6 and powheg+herwig6 distributions is attributed to different treatment of the hardest initial state radiation in pythia6 and herwig6. The same pattern is observed for p t * T , indicating that the softer spectrum in data is not caused by the boost of the tt system. It is also present in the p t1 T , and particularly, in the p t2 T distributions. For all these distributions, the powheg+herwig6 prediction provides a better description of the data. The difference in the shape of the top quark p T spectrum between data and simulation is  is applied to all data points (cf. Sect. 6.2). The inner (outer) error bars indicate the statistical (combined statistical and systematic) uncertainties. For better visibility, data points with identical bin centres (cf. Supplemental Tables 9, 12) are shifted horizontally by a negligible amount observed consistently in the analyses using different event selection requirements or different pileup conditions. The y t distribution is found to be slightly less central in data than in the predictions, particularly in the case of Mad-Graph+pythia6 and the approximate NNLO calculation, which are more central than the other predictions. On the contrary, y tt is more central in data, and it is slightly better described by MadGraph+pythia6. The m tt distribution in data tends to be lower than the predictions for large m tt values, and is better described by MadGraph+pythia6 and powheg+herwig6. The p tt T spectrum is well described by all the considered predictions, except for the NLO+NNLL calculation, which fails to describe the data for all p tt T values. The results from the +jets and dilepton channels are compared to each other in Figs. 15, 16 and 17. This is only feasible for the top quark and tt quantities, since they are measured in the same phase space (i.e. the full parton level phase space) for both channels. The results are presented relative to the MadGraph+pythia6 prediction to highlight the level of agreement between data and the default tt simulation. To facilitate the comparison of measurements that are performed using different size and number of bins, a horizontal bin-centre correction is applied to all data points from both channels. In each bin, the measured data points are presented at the horizontal position in the bin where the predicted bin-averaged cross section equals the cross section of the unbinned MadGraph+pythia6 calculation (cf. [51]), which is common for both channels. The data are also compared to the predictions from powheg+pythia6, powheg+herwig6, mc@nlo+herwig6 relative to Mad-Graph+pythia6. The results are consistent between the channels for all quantities, in particular, for all measurements related to the top quark p T distribution. The softer spectrum in data relative to MadGraph+pythia6 is clearly visible.
In addition, a comparison between results obtained at √ s = 7 [5] and 8 TeV is also performed for both the +jets and dilepton channels, and presented in Figs. 18 and 19 for p t T , y t , p tt T , y tt , and m tt . Since the fiducial phase space definition for the normalized differential cross sections is also different for each value of √ s, the comparison is again only possible for top quark and tt quantities. The measurements are presented relative to the corresponding default MadGraph+pythia6 predictions at 7 and 8 TeV. A horizontal bin-centre correction with respect to the Mad-Graph+pythia6 predictions is applied to all data points from both channels and √ s values. The results are consistent between the channels for all quantities, both at 7 and 8 TeV. The uncertainties in almost all bins of the distributions are reduced for the 8 TeV results relative to 7 TeV, mainly due to the improvements discussed in Sect. 4.3. The softer p t T in data relative to MadGraph+pythia6 is also visible at 7 TeV.

Summary
First measurements are presented of normalized differential tt production cross sections in pp collisions at √ s = 8 TeV. The measurements are performed with the CMS detector in the +jets ( = e or μ) and dilepton (e + e − , μ + μ − , and e ± μ ∓ ) tt decay channels. The normalized tt cross section is measured as a function of the transverse momentum, rapidity, and invariant mass of the final-state leptons and b jets in the fiducial phase space, and the top quarks and tt system in the full phase space. The measurements in the different decay channels are in agreement with each other. In general, the data are in agreement with standard model predictions up to approximate NNLO precision. Among the examined predictions, powheg+herwig6 provides the best overall description of the data. However, the p T spectrum in data for leptons, jets, and top quarks is softer than expected, particularly for MadGraph+pythia6, powheg+pythia6, and mc@nlo+herwig6. The calculation at approximate NNLO precision also provides a good description of the top quark p T spectrum. The m tt distribution in data tends to be lower than the predictions for large m tt values. The p tt T spectrum is well described by all the considered predictions, except for the NLO+NNLL calculation, which fails to describe the data for all p tt T values. The results show the same behaviour as the corresponding CMS measurements at √ s = 7 TeV. and ttZ hadroproduction at NLO accuracy in QCD with parton shower and hadronization effects.