1 Introduction

Measurements of top-quark production cross section and properties have played a major role in testing the standard model (SM) and in searches for new physics beyond it. The large top-quark production rates at the Large Hadron Collider (LHC) give access to a new realm of precision measurements. For the first time, the \(\mathrm {t}\bar {\mathrm {t}}\) pair production rate is sufficiently high to perform a detailed and precise measurement of the \(\mathrm {t}\bar {\mathrm {t}}\) production cross section differentially as a function of various kinematic observables in \(\mathrm {t}\bar {\mathrm {t}}\) events [1]. These measurements are crucial to verify the top-quark production mechanism at the LHC energy scale in the context of SM predictions with various levels of perturbative quantum chromodynamics (QCD) approximations. Furthermore, scenarios beyond the SM, for example decays of massive Z-like bosons into top-quark pairs, could be revealed in such measurements, most prominently as resonances in the invariant \(\mathrm {t}\bar {\mathrm {t}}\) mass spectrum [24].

Here, measurements of the normalised differential \(\mathrm {t}\bar {\mathrm {t}}\) production cross section in proton–proton (pp) collisions at a centre-of-mass energy \(\sqrt{s}\) of 7 TeV with the Compact Muon Solenoid (CMS) detector are presented. These results complement the recent CMS measurements of the \(\mathrm {t}\bar {\mathrm {t}}\) production cross section [59]. The analysis makes use of the full set of data recorded in 2011, corresponding to an integrated luminosity of 5.0±0.1 fb−1. The cross section is determined as a function of the kinematic properties of the leptons and of the jets associated to b quarks or antiquarks (b jets) from top-quark decays, of the top quarks themselves, as well as of the \(\mathrm {t}\bar {\mathrm {t}}\) system. The results are compared to several theoretical predictions obtained with MadGraph [10], mc@nlo [11], powheg [1214], and to the latest next-to-leading-order (NLO) plus next-to-next-to-leading-logarithm (NNLL) [15] and approximate next-to-next-to-leading-order (NNLO) [16, 17] calculations.

The measurements are performed in several decay channels of the \(\mathrm {t}\bar {\mathrm {t}}\) system, both in the +jets channels (=e or μ), with a single isolated lepton and at least four jets in the final state, and in the dilepton channels, with two oppositely charged leptons (e+e, μ + μ , μ ±e) and at least two jets. The top-quark-pair candidate events are selected by requiring isolated leptons and jets with high transverse momenta. Backgrounds to \(\mathrm {t}\bar {\mathrm {t}}\) production are suppressed by use of b-tagging techniques. The top-quark kinematic properties are obtained through kinematic fitting and reconstruction algorithms. The normalised differential \(\mathrm {t}\bar {\mathrm {t}}\) production cross section is determined by counting the number of \(\mathrm {t}\bar {\mathrm {t}}\) signal events in each bin of the measurement, correcting for the detector effects and dividing by the measured total cross section. Correlations between the bins of the measurement are taken into account by using regularised unfolding techniques.

The measurement performed here refers to kinematical distributions. To remove systematic uncertainties on the normalisation, the absolute differential cross section is normalised to the in-situ measured inclusive cross section. The inclusive cross section, as obtained in this analysis, is consistent with the results from dedicated CMS measurements [59]. To avoid additional model uncertainties due to the extrapolation of the measured cross section into experimentally inaccessible phase space regions, the results for directly measurable quantities, such as the kinematic properties of leptons and b jets, are reported in a visible phase space. This phase space is defined as the kinematic region in which all selected final state objects are produced within the detector acceptance and are thus measurable experimentally. For top-quark and \(\mathrm {t}\bar {\mathrm {t}}\) distributions, the measurements are performed in the full phase space, allowing for comparison with calculations up to the approximate NNLO precision.

This document is structured as follows. A brief description of the CMS detector is provided in Sect. 2, followed by details of the event simulation in Sect. 3, and the event selection and reconstruction in Sect. 4. The estimated systematic uncertainties on the measurements of the cross section are described in Sect. 5. The result of the differential cross section measurements are presented in Sect. 6, followed by a summary in Sect. 7.

2 The CMS detector

The central feature of the CMS apparatus is a superconducting solenoid of 13 m length and 6 m internal diameter, which provides an axial magnetic field of 3.8 T. Within the field volume are the silicon pixel and strip trackers, the crystal electromagnetic calorimeter (ECAL), and the brass/scintillator hadron calorimeter (HCAL). Charged particle trajectories are measured by the inner tracker, covering 0<ϕ<2π in azimuth and |η|<2.5, where the pseudorapidity η is defined as η=−ln[tanθ/2], and θ is the polar angle of the trajectory of the particle with respect to the anticlockwise-beam direction. The ECAL and the HCAL surround the tracking volume, providing high-resolution energy and direction measurements of electrons, photons, and hadronic jets. Muons are measured in gas-ionisation detectors embedded in the steel field return yoke. Extensive forward calorimetry complements the coverage provided by the barrel and endcap detectors. The detector is nearly hermetic, allowing for energy balance measurements in the plane transverse to the beam directions. A two-tier trigger system selects the pp collision events for use in physics analysis. A more detailed description of the CMS detector can be found in Ref. [18].

3 Event simulation and theoretical calculations

Event generators, interfaced with detailed detector simulations, are used to model experimental effects, such as reconstruction and selection efficiencies as well as detector resolutions. For the simulation of the \(\mathrm {t}\bar {\mathrm {t}}\) signal sample, the MadGraph event generator (v. 5.1.1.0) is used, which implements the relevant matrix elements up to three additional partons. The value of the top-quark mass is fixed to m t=172.5 GeV and the proton structure is described by the parton density functions (PDF) CTEQ6L1 [19]. The generated events are subsequently processed with pythia (v. 6.424) [20] for parton showering and hadronisation, and the MLM prescription [21] is used for the matching of the jets with parton showers.

Standard-model background samples are simulated with MadGraph, powheg (r1380) [22], or pythia, depending on the process. For the +jets channels, W- and Z/γ -boson production with additional jets (referred to as W+jets and Z+jets, respectively, in the following), single-top-quark production (s-, t-, and tW-channel), diboson (WW, WZ, and ZZ), and QCD multijet events are considered as background processes and listed according to their importance. For the dilepton channels, the main background contributions (in decreasing order of importance) stem from Z+jets, single-top-quark, W+jets, diboson, and QCD multijet events. The W+jets and Z+jets samples, including the \(\mathrm{W}/\mathrm{Z}+\mathrm{c} \bar{\mathrm{c}}/\mathrm{b} \bar{\mathrm{b}}\) processes, are simulated with MadGraph with up to four partons in the final state. powheg is used for single-top-quark production, while pythia is used to simulate diboson and QCD multijet events. Parton showering and hadronisation are also simulated with pythia in all the background samples. The pythia Z2 tune [23] is used to characterise the underlying event in both the \(\mathrm {t}\bar {\mathrm {t}}\) signal and the background samples. The CMS detector response is simulated using Geant4 (v. 9.4) [24].

For comparison with the measured distributions, the events in the simulated samples are normalised to an integrated luminosity of 5.0 fb−1 according to their predicted cross sections. The latter are taken from NNLO (W+jets, Z+jets), NLO+NNLL (single-top-quark s- [25], t- [26], and tW- [27] channels), NLO (diboson [28]), and leading-order (LO) (QCD multijet [20]) calculations. Correction factors described in Sect. 5 are applied where necessary to improve the description of the data by the simulation. The \(\mathrm {t}\bar {\mathrm {t}}\) simulation sample is normalised to the data to present expected rates in figures in Sect. 4.

In addition to the MadGraph prediction, theoretical calculations obtained with the NLO generators powheg and mc@nlo (v. 3.41), and the latest NLO+NNLL [15] and approximate NNLO [16, 17] predictions are compared, when available, to the final results presented in Sect. 6. The proton structure is described by the PDF sets CTEQ6M [19] both for powheg and mc@nlo, while the NNLO MSTW2008 [29] PDF set is used for the NLO+NNLL and for the approximate NNLO calculations. The events generated with powheg and mc@nlo are further processed with pythia and herwig (v. 6.520) [30], respectively, for the subsequent parton showering and hadronisation. While powheg and mc@nlo are formally equivalent up to the NLO accuracy, they differ in the techniques used to avoid double counting of radiative corrections that may arise from interfacing with the parton showering generators. Furthermore, the parton showering in pythia is based on a transverse-momentum-ordered evolution scale, whereas in herwig it is angular-ordered.

4 Event reconstruction and selection

The event selection is based on the decay topology of the top quark, where each top quark decays into a W boson and a b quark. The +jets channels refer to events with only one leptonic W-boson decay, whereas in the dilepton channels each of the two W bosons decays leptonically (muon or electron). These signatures imply the identification of isolated leptons with high transverse momentum p T, large missing transverse momentum due to neutrinos from W-boson decays escaping the detector, and highly energetic jets. The heavy-quark content of the jets is identified through b-tagging techniques.

4.1 Lepton and jet reconstruction

Events are reconstructed using a particle-flow technique [31, 32], which combines signals from all sub-detectors to enhance the reconstruction performance by identifying individual particle candidates in pp collisions. Charged hadrons from pileup events, i.e. those originating from a vertex other than the one of the hard interaction, are subtracted event-by-event. Subsequently, the remaining neutral-hadron pileup component is subtracted at the level of jet energy corrections [33].

Electron candidates are reconstructed from a combination of the track momentum at the main interaction vertex, the corresponding energy deposition in the ECAL, and the energy sum of all bremsstrahlung photons attached to the track. They are required to have a transverse energy E T>30 GeV within the pseudorapidity interval |η|<2.1 for the +jets channels, while electrons in the dilepton channels have to fulfil E T>20 GeV and |η|<2.4. As an additional quality criterion, a relative isolation I rel is computed. It is defined by the sum of the transverse momenta of all neutral and charged reconstructed particle candidates inside a cone around the electron in ηϕ space of \(\Delta R\equiv\sqrt{(\Delta\eta)^{2} + (\Delta\phi)^{2}} < 0.4\) for the +jets channels and ΔR<0.3 for the dilepton channels, divided by the p T of the electron. The transverse momentum associated with the electron is excluded from the sum. A relative isolation of the electron I rel<0.125 is demanded for the +jets channels and I rel<0.17 for the dilepton channels. In addition, electrons from photon conversions, identified by missing hits in the silicon tracker, or being close to a second electron track, are rejected.

Muon candidates are reconstructed by matching the track information from the silicon tracker and the muon system. They are required to have p T>30 GeV and |η|<2.1 for the +jets channels, while in the dilepton channels the corresponding selections require p T>20 GeV and |η|<2.4. Isolated muon candidates are selected if they fulfil I rel<0.125 for the +jets channels and I rel<0.20 for the dilepton channels. To further increase the purity of muons originating from the primary interaction and to suppress misidentified muons or muons from decay-in-flight processes, additional quality criteria, such as a minimal number of hits associated with the muon track, are required in both the silicon tracker and the muon system.

Jets are reconstructed by clustering the particle-flow candidates [34] using the anti-k t clustering algorithm with a distance parameter of 0.5 [35]. Electrons and muons passing less stringent selections on lepton kinematic quantities and isolation compared to the ones mentioned above have been identified and are excluded from the clustering process. A jet is selected if it has p T>30 GeV and |η|<2.4 for both the +jets and dilepton channels. In addition, jets originating from b quarks are identified in each decay channel by a “combined secondary-vertex” (CSV) algorithm [36], which provides a b-tagging discriminant by combining secondary vertices and track-based lifetime information. The chosen working point in the +jets channels results in an efficiency for tagging a b jet of about 60 %, while the probability to misidentify light-flavour jets as b jets (mistag rate) is only about 1.5 %. In the dilepton channels, the working point is selected such that the b-tagging efficiency and mistag rate are about 80–85 % and around 10 %, respectively [36].

The missing transverse energy is defined as the magnitude of the transverse momentum imbalance , which is the negative of the vectorial sum of the transverse momenta of all the particles reconstructed with the particle-flow algorithm [37].

4.2 Event selection

The event selection in the +jets channels proceeds as follows. In the e+jets channel, events are triggered by an isolated electron and three or more jets fulfilling transverse momentum thresholds. The trigger efficiency within the acceptance of this analysis is above 96 %. Events in the μ+jets channel are triggered by the presence of an isolated muon fulfilling p T thresholds and geometrical acceptance requirements. In this channel, the trigger efficiency is above 87 %. For the final analysis, only triggered events that have exactly one isolated lepton (leading lepton) according to the lepton identification criteria described in Sect. 4.1 are retained. Events with additional muons with p T>10 GeV, |η|<2.5, and relative isolation I rel<0.2 are rejected. Furthermore, in the e+jets channel, events are rejected if additional electrons have E T>20 GeV, |η|<2.5, and I rel<0.2, and form a dielectron mass within 15 GeV of the mass of the Z boson. In the μ+jets channel, events are rejected if they contain electron candidates with E T>15 GeV, |η|<2.5, and I rel<0.2. These lepton vetoes are meant to suppress background events from Z-boson and diboson production. An event must contain at least four reconstructed jets satisfying the criteria mentioned in Sect. 4.1. At least two of them are required to be tagged as b jets in order to suppress the background contribution mainly from W+jets events. After this selection, the remaining backgrounds are dominantly single-top-quark and top-quark-pair events from other decay channels, i.e. events with missing transverse energy signature. Therefore, no requirement on missing transverse energy is imposed.

In the dilepton channels, at least two isolated leptons of opposite charge are required. These events are triggered using combinations of two leptons fulfilling transverse momentum thresholds and isolation requirements. The trigger efficiency is greater than 95 % in the μ + μ channel and greater than 97 % in the μ ±e and e+e channels. For the final analysis, only triggered events passing the lepton identification criteria described above are retained. In events with more than two leptons, only the lepton pair with the highest p T sum is considered. Events with an invariant mass of the lepton pair smaller than 12 GeV are removed in order to suppress events from heavy-flavour resonance decays. Dilepton events are required to have at least two jets. At least one of the jets is required to be identified as a b jet to reduce the background contribution. In addition, backgrounds from Z+jets processes in the μ + μ and e+e channels are further suppressed by requiring the dilepton invariant mass to be outside a Z-boson mass window of 91±15 GeV and to be larger than 30 GeV.

Basic distributions of the +jets and dilepton event samples are shown in Figs. 1 and 2, respectively, for different steps of the selection. The data are well described by the simulation. It has been verified that the result of the measurement is unaffected by the small remaining differences.

Fig. 1
figure 1

Basic kinematic distributions after event selection for the +jets channels. The top left plot shows the multiplicity of the reconstructed b-tagged jets. The multiplicity of the reconstructed jets (top right), the p T of the selected isolated leptons (bottom left), and the p T of the reconstructed jets (bottom right) are shown after the b-tagging requirement

Fig. 2
figure 2

Basic kinematic distributions after event selection for the dilepton channels. The top left plot shows the multiplicity of the reconstructed b-tagged jets. The multiplicity of the reconstructed jets (top right), the p T of the selected isolated leptons (bottom left), and the p T of the reconstructed jets (bottom right) are shown after the b-tagging requirement. The Z/γ +jets background is determined from data (cf. Sect. 4.2)

In the +jets channels, the main contributions to the background arise from W+jets and QCD multijet events, which are efficiently suppressed after the b-tagging requirement. After performing the full event selection, including the kinematic top-quark-pair reconstruction described in Sect. 4.3, 9 076 events are found in the e+jets channel and 10 766 events in the μ+jets channel. In both decay channels, the +jets signal contribution to the final event sample is about 80 %. The remaining fraction of events contains around 13 % \(\mathrm {t}\bar {\mathrm {t}}\) decays other than the +jets channels, including \(\mathrm {t}\bar {\mathrm {t}}\) decays into τ leptons originating from the primary interaction, about 4 % single-top-quark events, around 3 % W+jets events, and negligible fractions of Z+jets, diboson, and QCD multijet events. The background contributions are all estimated from simulation, normalised as described in Sect. 3 and subtracted from the data in each bin of the measurement.

In the dilepton channels, after performing the full event selection, including the kinematic top-quark-pair reconstruction (cf. Sect. 4.3), 2 632 events are found in the e+e channel, 3 014 in the μ + μ channel, and 7 498 in the μ ±e channel. Only \(\mathrm {t}\bar {\mathrm {t}}\) events with two leptons (electrons or muons) in the final state are considered as signal and constitute about 70–80 % of the final event sample, depending on the decay channel. All other \(\mathrm {t}\bar {\mathrm {t}}\) events, specifically those originating from decays via τ leptons, are considered as background and amount to 12–14 % of the final event sample. Dominant backgrounds to the e+e and μ + μ channels originate from Z+jets processes. Their contribution is estimated from data following the procedure described in Ref. [38]. The background normalisation is determined using the number of events inside the Z-peak region (removed from the candidate sample), and a correction needed for non-Z+jets backgrounds in this control region is derived from the μ ±e channel. The fraction of Z+jets events is found to be around 13 %. Other sources of background, including single-top-quark production and diboson events, are estimated from simulation and found to be about 6 %. The contribution arising from misidentified or genuine leptons within jets is estimated from data using like-sign events in a non-isolated region and found to be smaller than 1 %, consistent with the simulation. The background contributions are subtracted from the data in each bin of the measurement.

4.3 Kinematic top-quark-pair reconstruction

For both the +jets and dilepton channels, the kinematic properties of the top-quark pair are determined from the four-momenta of all final-state objects by means of kinematic reconstruction algorithms.

In the +jets channels, a constrained kinematic fitting algorithm is applied [39]. In the fit, the four-momenta of the selected lepton, up to five leading jets, and the representing the transverse momentum of the neutrino, are varied according to their resolutions. The longitudinal component of the neutrino is treated as a free parameter. Moreover, the fit is constrained to reconstruct two W bosons, each with a mass of 80.4 GeV, and top quark and antiquark with identical masses. In events with several combinatorial solutions, only the one with the minimum χ 2 of the fit is accepted.

In the dilepton channels, an alternative kinematic reconstruction method is used [40]. In these channels, due to the presence of two neutrinos, the kinematic reconstruction is underconstrained, even after imposing a transverse-momentum balance of the two neutrinos, a W-boson invariant mass of 80.4 GeV, and equality of the top-quark and antiquark masses. The top-quark mass can be reconstructed in a broad mass range due to detector resolution effects. To account for this, the top-quark mass for each lepton-jet combination is assumed between 100 GeV and 300 GeV in steps of 1 GeV. In the case that an event produces more than one physical solution, those using two b-tagged jets are preferred to the ones using one b-tagged jet, and solutions using one b-tagged jets are preferred to those using no b-tagged jets. After this selection, if an event has more than one solution with the preferred b-tagging, these are ranked according to how the neutrino energies match with a simulated neutrino energy spectrum, and the highest ranked one is chosen.

For both decay channels, the kinematic reconstruction yields no physical solution for about 11 % of the events. These events are excluded from further analysis. The simulation provides a good description of the data before and after this requirement.

Distributions of the top-quark or antiquark and \(\mathrm {t}\bar {\mathrm {t}}\) kinematic observables (\(p_{\mathrm {T}} ^{\text{t}}\), y t, \(p_{\mathrm {T}} ^{ \mathrm {t}\bar {\mathrm {t}} }\), and \(y^{ \mathrm {t}\bar {\mathrm {t}} }\), where y is the rapidity defined as y=1/2⋅ln[(E+p z )/(Ep z )], with E and p z denoting the particle energy and the momentum along the anticlockwise-beam axis, respectively) as obtained from the kinematic reconstruction, are presented in Fig. 3 for the +jets event sample and in Fig. 4 for the dilepton event sample. In general, the data are well described by the simulation within uncertainties. As in Figs. 1 and 2, the final results are not affected by the small remaining differences in normalisation between data and simulation. For both channels, the measured p T distributions show a trend of being shifted to lower transverse momenta compared to the simulated distributions.

Fig. 3
figure 3

Distribution of top-quark and \(\mathrm {t}\bar {\mathrm {t}}\) quantities as obtained from the kinematic reconstruction in the +jets channels. The left plots show the distributions for the top quarks or antiquarks; the right plots show the \(\mathrm {t}\bar {\mathrm {t}}\) system. The top row shows the transverse momenta, and the bottom row shows the rapidities

Fig. 4
figure 4

Distribution of top-quark and \(\mathrm {t}\bar {\mathrm {t}}\) quantities as obtained from the kinematic reconstruction in the dilepton channels. The left plots show the distributions for the top quarks or antiquarks; the right plots show the \(\mathrm {t}\bar {\mathrm {t}}\) system. The top row shows the transverse momenta, and the bottom row shows the rapidities. The Z/γ +jets background is determined from data (cf. Sect. 4.2)

5 Systematic uncertainties

Systematic uncertainties on the measurement arise from detector effects as well as from theoretical uncertainties. Each systematic uncertainty is investigated separately, and determined individually in each bin of the measurement, by variation of the corresponding efficiency, resolution, or scale within its uncertainty. Correction factors, subsequently referred to as scale factors, are applied where necessary to improve the description of the data by the simulation. For each variation, the measured normalised differential cross section is recalculated, and the difference of the varied result to the nominal result in each bin is taken as the systematic uncertainty. The overall uncertainty on the measurement is then derived by adding the individual contributions in quadrature. The dominant uncertainties on the normalised differential cross section originate from the lepton selection, the b tagging, and from model uncertainties. A summary of the typical systematic uncertainties of the normalised differential cross section, obtained by averaging over all quantities and bins, is given in Table 1 and a detailed description is given in Sects. 5.1 and 5.2.

Table 1 Breakdown of typical systematic uncertainties for the normalised differential cross section in the +jets and dilepton channels. The background uncertainty for the +jets channels includes normalisation uncertainties as well as uncertainties due to variations of the kinematic scales in W/Z-boson events

5.1 Experimental uncertainties

The efficiency of the single-muon trigger in μ+jets events is determined using the “tag-and-probe” method [41] with Z-boson event samples. A dependence on the pseudorapidity of the muon of a few percent is observed and scale factors are derived. In order to determine the efficiency of the electron–trijet trigger in e+jets events, the tag-and-probe method is also applied to the electron branch, while independent control triggers are used for the hadronic part. Good agreement is observed between data and simulation, and scale factors very close to unity are applied. The lepton identification and isolation efficiencies for the +jets channels obtained with the tag-and-probe method agree well between data and simulation, so that corrections very close or equal to unity are applied. The systematic uncertainties are determined by shape-dependent variations of trigger and selection efficiencies within their uncertainties. Lepton trigger efficiencies in the dilepton channels are measured using triggers that are only weakly correlated to the dilepton triggers. The lepton identification and isolation uncertainties in the dilepton channels are also determined using the tag-and-probe method, and are found to be described very well by the simulation for both electrons and muons. The overall difference between data and simulation in bins of pseudorapidity and transverse momentum is estimated to be less than 2 % for electrons, while scale factors for muons are found to be close to unity.

To estimate the uncertainty on the jet energy scale, the reconstructed jet energy is varied as a function of the transverse momentum and the pseudorapidity of the jet (typically by a few percent) [34]. The uncertainty on the jet energy resolution (JER) is determined by variation of the simulated JER up and down by about ±6 %, ±9 %, and ±20 %, for the pseudorapidity regions |η|<1.7, 1.7<|η|<2.3, and |η|>2.3, respectively [34].

The uncertainty due to background normalisation is determined by variation of the background yields. For the +jets channels, the background normalisation is varied by ±30 % for the single-top-quark and diboson samples, and by ±50 % for the QCD samples [5, 6]. For the W/Z-boson samples, this uncertainty is covered by variations of the kinematic scales of the event process (renormalisation and factorisation scales and jet–parton matching), as described in Sect. 5.2. In the e+e and μ + μ channels, the dominant background from Z+jets processes as determined from data (cf. Sect. 4) is varied in normalisation by ±30 %. In addition, variations of the background contributions from single-top-quark and diboson events up and down by ±30 % are performed [9, 42].

The uncertainty on the b-tagging efficiency is determined by dividing the b-jet distributions for transverse momentum and pseudorapidity into two bins at the median of the respective distributions. These are p T=65 GeV and |η|=0.7 for the +jets and p T=65 GeV and |η|=0.75 for the dilepton channels. The b-tagging scale factors for the b jets in the first bin are scaled up by half of the uncertainties quoted in Ref. [36], while those in the second bin are scaled down and vice versa, so that a maximum variation is assumed and the difference between the scale factors in the two bins amounts to the full uncertainty. The variations are performed separately for the transverse-momentum and pseudorapidity distributions.

The kinematic reconstruction of the top quarks is generally found to be very well described by the simulation, and the resulting uncertainties are small. In the case of the +jets analysis, the uncertainty of the kinematic fit is included in the variations of jet energy scales and resolutions. In the dilepton analysis, the bin-to-bin uncertainty is determined from the small remaining difference between the simulation and the data.

The pileup model estimates the mean number of additional pp interactions to be about 9.5 events for the analysed data. This estimate is based on the total inelastic proton–proton cross section, which is determined to be 73.5 mb [43]. The systematic uncertainty is determined by varying this cross section within its uncertainty of ±8 %.

5.2 Model uncertainties

The impact of theoretical assumptions on the measurement is determined by repeating the analysis, replacing the standard MadGraph signal simulation by dedicated simulation samples, as described below.

The uncertainty on the modeling of the hard-production process is assessed by varying the renormalisation and factorisation scale in the MadGraph signal samples up and down by a factor of two with respect to its nominal value, equal to the Q of the hard process (\(Q^{2} = m^{2}_{\text{t}} + \varSigma p^{2}_{\mathrm{T}}\)). Furthermore, the effect of additional jet production in MadGraph is studied by varying the threshold between jet production at the matrix-element level and via parton showering up and down by a factor of two with respect to the nominal value of 20 GeV. In the +jets channels, variations of the renormalisation and factorisation scale are also applied to single-top-quark events to determine a shape uncertainty for this background contribution. Additionally, both kinematic scales are varied for W- and Z-boson background events to associate a shape and background normalisation uncertainty to these samples. Each type of variation is applied simultaneously for the W- and Z-boson samples.

The uncertainty due to the hadronisation model is determined by comparing samples simulated with powheg and mc@nlo using pythia and herwig, respectively, for hadronisation. The dependence of the measurement on the top-quark mass is estimated from dedicated MadGraph simulation samples in which the top-quark mass is varied with respect to the value used for the default simulation. The resulting variations are scaled linearly according to the present world average uncertainty of 0.9 GeV. The effect of the uncertainty from parton density functions on the measurement is assessed by reweighting the sample of simulated \(\mathrm {t}\bar {\mathrm {t}}\) signal events. For this reweighting, the minimum and maximum variations with respect to the nominal value is obtained by following the PDF4LHC prescription [44] using the NLO PDF sets CT10 [45], MSTW2008NLO, and NNPDF2.1 [46].

6 Normalised differential cross section

The normalised cross section in each bin i of each observable X is determined through the relation:

$$ \frac{1}{\sigma}\frac{ \mathrm {d} \sigma_{i}}{ \mathrm {d} X}=\frac{1}{\sigma}\frac{x_{i}}{\Delta^{\text{X}}_{i} \mathcal{L}} $$
(1)

In each bin of the measurement, x i represents the number of signal events in data determined after background subtraction and corrected for detector efficiencies, acceptances, and migrations, as described below. The normalised differential cross section is then derived by scaling to the integrated luminosity \(\mathcal{L}\) and by dividing the corrected number of events by the width \(\Delta_{i}^{\text{X}}\) of the bin and by the measured total cross section σ in the same phase space. Due to the normalisation, those systematic uncertainties that are correlated across all bins of the measurement, and therefore only affect the normalisation, cancel out.

Effects from trigger and detector efficiencies and resolutions, leading to the migration of events across bin boundaries and statistical correlations among neighbouring bins, are corrected by using a regularised unfolding method [47, 48]. For each measured distribution, a response matrix that accounts for migrations and efficiencies is calculated from the simulated MadGraph \(\mathrm {t}\bar {\mathrm {t}}\) signal sample. The generalised inverse of the response matrix is used to obtain the unfolded distribution from the measured distribution by applying a χ 2 technique. To avoid non-physical fluctuations, a smoothing prescription (regularisation) is applied. The regularisation level is determined individually for each distribution using the averaged global correlation method [49]. To keep the bin-to-bin migrations small, the width of the bins of the measurement are chosen according to their purity and stability. For a certain bin i, the number of particles generated and correctly reconstructed \(N^{\text{gen\&rec}}_{i}\) is determined. The purity p i is then this number divided by the total number of reconstructed particles in the same bin \(N^{\text{rec}}_{i}\): \(p_{i}=N^{\text{gen\&rec}}_{i}/N^{\text{rec}}_{i}\). Similarly, the stability s i is defined as that number scaled to the total number of generated particles in the particular bin \(N^{\text{gen}}_{i}\), yielding \(s_{i}=N^{\text{gen\&rec}}_{i}/N^{\text{gen}}_{i}\). In this analysis, the purity and stability of the bins are typically 50 % or larger. The performance of the unfolding procedure is tested for a possible bias due to the choice of the input model (the \(\mathrm {t}\bar {\mathrm {t}}\) MadGraph signal simulation). It has been verified that, by either reweighting the signal simulation or injecting a resonant \(\mathrm {t}\bar {\mathrm {t}}\) signal into the signal simulation, the unfolding procedure still reproduces the results correctly when using the default MadGraph \(\mathrm {t}\bar {\mathrm {t}}\) signal simulation to account for migrations and efficiencies.

The analysis proceeds by measuring the normalised differential cross section in the +jets channels and dilepton channels. For each kinematic distribution, the event yields in the separate channels are added up, the background is subtracted, and the unfolding is performed. As a cross-check, it has been verified that the measurements in the individual channels are in agreement with each other within the uncertainties.

The systematic uncertainties in each bin are assessed from the variations of the combined cross sections. This means that the full analysis is repeated for every systematic variation and the difference with respect to the nominal combined value is taken as the systematic uncertainty for each bin and each measured observable. By using this method, the possible correlations of the systematic uncertainties between the different channels and bins are taken into account.

The normalised differential \(\mathrm {t}\bar {\mathrm {t}}\) cross section 1/σ⋅dσ/dX is determined as a function of the kinematic properties of the leading leptons, the lepton pair, the b jets, the top quarks, and the top-quark pair, and presented in the following sections. In order to avoid additional model uncertainties due to the extrapolation of the measurement outside experimentally well-described phase space regions, the normalised differential cross sections for the measured leptons and b jets are determined in a visible phase space defined by the kinematic and geometrical acceptance of the final state leptons and jets. In contrast, the top-quark and the top-quark-pair quantities are presented in the full phase space in order to allow for comparisons with recent QCD calculations up to approximate NNLO precision. To facilitate comparison with theory curves independently of the binning, a horizontal bin-centre correction is applied. In each bin, the measured data points are presented at the horizontal position in the bin where the predicted bin-averaged cross section equals the differential cross section according to the MadGraph calculation (cf. [50]). The measurement is compared to the predictions from MadGraph, powheg, and mc@nlo. For the latter, uncertainty bands corresponding to the PDF (following the PDF4LHC prescription [44]), the top-quark mass, and renormalisation and factorisation scale variations are also given. The top-quark and \(\mathrm {t}\bar {\mathrm {t}}\) results are also compared to the latest approximate NNLO [16, 17] and NLO+NNLL [15] predictions, respectively. All measured normalised differential cross section values, including bin boundaries and centres, are available in tabular form in the Supplemental Material.

6.1 Lepton and b-jet differential cross sections

For the +jets channels, the normalised differential \(\mathrm {t}\bar {\mathrm {t}}\) cross section as a function of the lepton and b-jet kinematic properties is defined at the particle level for the visible phase space where the lepton from the W-boson decay has a pseudorapidity |η |<2.1 and a transverse momentum \(p_{\mathrm {T}} ^{\ell} > 30 ~\text {GeV} \), and at least four jets with |η|<2.4 and p T>30 GeV, out of which two are b jets. A jet is defined at the particle level as a b jet if it contains the decay products of a B hadron. For this analysis, the two highest transverse momentum b jets originating from different B hadrons are selected.

In Fig. 5, the normalised differential cross section is presented as a function of the lepton transverse momentum \(p_{\mathrm {T}} ^{\ell}\) and pseudorapidity η . In Fig. 6, the distributions for the transverse momentum of the b jets, \(p_{\mathrm {T}} ^{\text{b}}\), and their pseudorapidity, η b, are shown. Also shown are predictions from MadGraph, powheg, and mc@nlo. Good agreement is observed between the data and the theoretical predictions within experimental uncertainties.

Fig. 5
figure 5

Normalised differential \(\mathrm {t}\bar {\mathrm {t}}\) production cross section in the +jets channels as a function of the \(p_{\mathrm {T}} ^{\ell}\) (top) and η (bottom) of the lepton. The superscript ‘’ refers to both + and . The inner (outer) error bars indicate the statistical (combined statistical and systematic) uncertainty. The measurements are compared to predictions from MadGraph, powheg, and mc@nlo. The MadGraph prediction is shown both as a curve and as a binned histogram

Fig. 6
figure 6

Normalised differential \(\mathrm {t}\bar {\mathrm {t}}\) production cross section in the +jets channels as a function of the \(p_{\mathrm {T}} ^{\text{b}}\) (top) and η b (bottom) of the b jets. The superscript ‘b’ refers to both b and \(\mathrm {b} \bar {\mathrm {b}} \) jets. The inner (outer) error bars indicate the statistical (combined statistical and systematic) uncertainty. The measurements are compared to predictions from MadGraph, powheg, and mc@nlo. The MadGraph prediction is shown both as a curve and as a binned histogram

For the dilepton channels, the normalised \(\mathrm {t}\bar {\mathrm {t}}\) differential cross section as a function of the lepton and b jet kinematic properties is defined at the particle level for the visible phase space where the leptons have |η |<2.4 and \(p_{\mathrm {T}} ^{\ell} > 20 ~\text {GeV} \), and the b jets from the top-quark decays both lie within the range |η|<2.4 and p T>30 GeV. The b jet at the particle level is defined as described above for the +jets analysis.

In Fig. 7, the normalised differential cross section for the following lepton and lepton-pair observables are presented: the transverse momentum of the leptons \(p_{\mathrm {T}} ^{\ell}\), the pseudorapidity η of the leptons, the transverse momentum of the lepton pair \(p_{\mathrm {T}} ^{\ell^{+}\ell^{-}}\), and the invariant mass of the lepton pair \(m^{\ell^{+}\ell^{-}}\). The distributions for the transverse momentum of the b jets, \(p_{\mathrm {T}} ^{\text{b}}\), and their pseudorapidity, η b, are shown in Fig. 8. Predictions from MadGraph, powheg, and mc@nlo are also shown. Good agreement is observed between data and theoretical predictions within experimental uncertainties. The mc@nlo and powheg predictions, which take into account \(\mathrm {t}\bar {\mathrm {t}}\) spin correlations, suggest a better description of the lepton-pair observables in those bins in which there exists some discrepancy between the different generators.

Fig. 7
figure 7

Normalised differential \(\mathrm {t}\bar {\mathrm {t}}\) production cross section in the dilepton channels as a function of the \(p_{\mathrm {T}} ^{\ell}\) (top left) and η (top right) of the leptons, and the \(p_{\mathrm {T}} ^{\ell^{+}\ell^{-}}\) (bottom left), and \(m^{\ell^{+}\ell^{-}}\) (bottom right) of the lepton pair. The superscript ‘’ refers to both + and . The inner (outer) error bars indicate the statistical (combined statistical and systematic) uncertainty. The measurements are compared to predictions from MadGraph, powheg, and mc@nlo. The MadGraph prediction is shown both as a curve and as a binned histogram

Fig. 8
figure 8

Normalised differential \(\mathrm {t}\bar {\mathrm {t}}\) production cross section in the dilepton channels as a function of the \(p_{\mathrm {T}} ^{\text{b}}\) (top) and η b (bottom) of the b jets. The superscript ‘b’ refers to both b and \(\mathrm {b} \bar {\mathrm {b}} \) jets. The inner (outer) error bars indicate the statistical (combined statistical and systematic) uncertainty. The measurements are compared to predictions from MadGraph, powheg, and mc@nlo. The MadGraph prediction is shown both as a curve and as a binned histogram

6.2 Top-quark and \(\mathrm {t}\bar {\mathrm {t}}\) differential cross sections

The normalised differential \(\mathrm {t}\bar {\mathrm {t}}\) cross section as a function of the kinematic properties of the top quarks and the top-quark pair is presented at parton level and extrapolated to the full phase space using the MadGraph prediction for both the +jets and the dilepton channels.

In Figs. 9 and 10, the distributions for the top-quark and the top-quark-pair observables in the +jets channels and the dilepton channels are presented. Those are the transverse momentum \(p_{\mathrm {T}} ^{\text{t}}\) and the rapidity y t of the top quarks and antiquarks, and the transverse momentum \(p_{\mathrm {T}} ^{ \mathrm {t}\bar {\mathrm {t}} }\), the rapidity \(y^{ \mathrm {t}\bar {\mathrm {t}} }\), and the invariant mass \(m^{ \mathrm {t}\bar {\mathrm {t}} }\) of the top-quark pair. Also shown are predictions from MadGraph, powheg, and mc@nlo. In addition, the top-quark results are compared to the approximate NNLO calculations from Refs. [16, 17], while the \(m^{ \mathrm {t}\bar {\mathrm {t}} }\) distribution is compared to the NLO+NNLL prediction in Ref. [15].

Fig. 9
figure 9

Normalised differential \(\mathrm {t}\bar {\mathrm {t}}\) production cross section in the +jets channels as a function of the \(p_{\mathrm {T}} ^{\text{t}}\) (top left) and y t (top right) of the top quarks, and the \(p_{\mathrm {T}} ^{ \mathrm {t}\bar {\mathrm {t}} }\) (middle left), \(y^{ \mathrm {t}\bar {\mathrm {t}} }\) (middle right), and \(m^{ \mathrm {t}\bar {\mathrm {t}} }\) (bottom) of the top-quark pairs. The superscript ‘t’ refers to both top quarks and antiquarks. The inner (outer) error bars indicate the statistical (combined statistical and systematic) uncertainty. The measurements are compared to predictions from MadGraph, powheg, and mc@nlo, and to NLO+NNLL [15] and approximate NNLO [16, 17] calculations, when available. The MadGraph prediction is shown both as a curve and as a binned histogram

Fig. 10
figure 10

Normalised differential \(\mathrm {t}\bar {\mathrm {t}}\) production cross section in the dilepton channels as a function of the \(p_{\mathrm {T}} ^{\text{t}}\) (top left) and y t (top right) of the top quarks, and the \(p_{\mathrm {T}} ^{ \mathrm {t}\bar {\mathrm {t}} }\) (middle left), \(y^{ \mathrm {t}\bar {\mathrm {t}} }\) (middle right), and \(m^{ \mathrm {t}\bar {\mathrm {t}} }\) (bottom) of the top-quark pairs. The superscript ‘t’ refers to both top quarks and antiquarks. The inner (outer) error bars indicate the statistical (combined statistical and systematic) uncertainty. The measurements are compared to predictions from MadGraph, powheg, and mc@nlo, and to NLO+NNLL [15] and approximate NNLO [16, 17] calculations, when available. The MadGraph prediction is shown both as a curve and as a binned histogram

For both +jets and dilepton channels, good agreement is observed between data and theoretical predictions within experimental uncertainties. Among the various predictions, the approximate NNLO calculation provides a better description of the data, as it predicts a slightly softer top-quark transverse momentum spectrum than the other three predictions.

7 Summary

First measurements of normalised differential top-quark-pair production cross sections in pp collisions at \(\sqrt{s}=7 ~\mathrm {TeV} \) with the CMS detector are presented. The measurements are performed in the +jets (e+jets and μ+jets) and the dilepton (e+e, μ + μ , and μ ±e) \(\mathrm {t}\bar {\mathrm {t}}\) decay channels. The normalised \(\mathrm {t}\bar {\mathrm {t}}\) cross section is measured as a function of the transverse momentum, (pseudo)rapidity, and invariant mass of the final-state leptons and b jets in the visible phase space, and of the top quarks and \(\mathrm {t}\bar {\mathrm {t}}\) system in the full phase space. The measurements among the different decay channels are in agreement with each other and with standard model predictions up to approximate next-to-next-to-leading-order precision. The prediction at approximate NNLO precision is found to give a particularly good description of the top-quark transverse momentum.