Measurement of the top-quark mass in $t\bar{t}+1$-jet events collected with the ATLAS detector in $pp$ collisions at $\sqrt{s}=8$ TeV

A determination of the top-quark mass is presented using 20.2 $\text{fb}^{-1}$ of 8 TeV proton-proton collision data produced by the Large Hadron Collider and collected by the ATLAS experiment. The normalised differential cross section of top-quark pair production in association with an energetic jet is measured in the lepton+jets final state and unfolded to parton and particle levels. The unfolded distribution at parton level can be described using next-to-leading-order QCD predictions in terms of either the top-quark pole mass or the running mass as defined in the (modified) minimal subtraction scheme. A comparison between the experimental distribution and the theoretical prediction allows the top-quark mass to be extracted in the two schemes. The value obtained for the pole-mass scheme is: \[ m_t^{\text{pole}} = 171.1 \pm 0.4 (\text{stat}) \pm 0.9 (\text{syst}) \substack{+0.7\\ -0.3} (\text{theo}) \text{ GeV}. \] The extracted value in the running-mass scheme is: \[ m_t(m_t) = 162.9 \pm 0.5 (\text{stat}) \pm 1.0 (\text{syst}) \substack{+2.1\\ -1.2} (\text{theo}) \text{ GeV}. \] The results for the top-quark mass using the two schemes are consistent, when translated from one scheme to the other.


Introduction
The mass of the top quark, the heaviest known elementary particle, is a key parameter of the Standard Model (SM) of particle physics and must be determined experimentally. In the SM, the gauge structure of the interaction of the top quark with other particles establishes a relation between the top-quark, Higgs-boson and W-boson masses. A precise determination of these three parameters forms an important check of the internal consistency of the SM [1][2][3][4][5]. Precise measurements of the top-quark mass are also required in order to accurately predict the evolution of the Higgs quartic coupling at high scales, which affects the shape of the Higgs potential and is associated in the SM with the stability of the quantum vacuum [6,7]. In this article the top-quark mass is inferred from the shape of a differential cross-section distribution.
Any quantitative statement about the value of a quark mass requires a precise reference to the mass scheme in which the mass is defined. The mass scheme which is used most often in top-quark mass measurements is the pole-mass scheme [8][9][10][11][12][13][14], where the renormalised top-quark mass (the pole mass, $m_t^{\text{pole}}$) coincides with the pole of the top-quark propagator. Several groups have extracted the running top-quark mass in the modified minimal subtraction scheme ($\overline{\text{MS}}$) from the total top-quark pair ($t\bar{t}$) production cross section [15,16] or the differential cross section [17]. The two mass schemes can be related precisely, with up to four-loop accuracy [18].
Direct top-quark mass measurements at hadron colliders, based on the reconstruction of the top-quark decay products and using Monte Carlo (MC) event generators in the fit to extract the mass, are frequently interpreted as the pole mass. Recent works estimate that such an interpretation is affected by a 0.  uncertainty due to non-perturbative effects from below the MC lower scale at which perturbative quark and gluon radiation is terminated in the parton shower. With direct top-quark mass measurements reaching sub-percent precision [20] it becomes important to evaluate uncertainties associated with the interpretation of the measured mass at the same level of accuracy.
It is therefore of paramount importance to extract the top-quark mass by comparing data with predictions computed in a well-defined mass scheme. In this case the ambiguity related to the top-quark mass interpretation is avoided, allowing a precise evaluation of the uncertainty associated with the mass scheme chosen. In such measurements the Monte Carlo event generator is only used to correct distributions obtained from measured data for effects originating from the detector and the modelling of non-perturbative physics. The uncertainty associated with such effects can be estimated by comparing Monte Carlo simulations produced with different sets of parameters, without specific assumptions on the top-quark mass interpretation. The theory uncertainty can then be estimated using the conventional techniques (scale variations and error sets of the parton distribution functions). Such mass measurements also yield greater flexibility in choosing the mass scheme [8,9].
In this article, results are presented in both the pole-mass and $\overline{\text{MS}}$ schemes. The measurement reported in this study follows the approach developed in Refs. [17,25,26], which takes advantage of the sensitivity to the top-quark mass of the differential cross section of $t\bar{t}$ production in association with at least one energetic jet. The presence of the additional jet enhances the sensitivity to the top-quark mass in comparison with similar observables defined for the $t\bar{t}$ system only [25]. In particular, the observable used to extract the top-quark mass, R, is defined as the normalised differential $t\bar{t}$ + 1-jet cross section: \[ R(m_t^{\text{pole}}, \rho_s) = \frac{1}{\sigma_{t\bar{t}+1\text{-jet}}} \frac{d\sigma_{t\bar{t}+1\text{-jet}}}{d\rho_s}(m_t^{\text{pole}}, \rho_s), \qquad \rho_s = \frac{2 m_0}{m_{t\bar{t}+1\text{-jet}}}, \] with $m_0$ representing a constant fixed to 170 GeV and $m_{t\bar{t}+1\text{-jet}}$ being the invariant mass of the $t\bar{t}$ + 1-jet system. The normalised differential cross sections are presented at the so-called particle level in which data are only unfolded for detector effects and at the parton level where R can be directly compared with available fixed-order calculations [17,25]. The particle-level distribution is provided to allow comparisons with possible future calculations. A measurement of the top-quark pole mass [11] with this method using 4.6 $\text{fb}^{-1}$ of 7 TeV pp collisions collected by ATLAS yielded an uncertainty of 2.3 GeV (1.3%) in the top-quark pole mass. In the current analysis the top-quark mass is determined using a sample of 8 TeV pp collisions collected in 2012. The large statistics of the 8 TeV dataset make it possible to achieve a high precision in the measurement of the R distribution, in particular in the region where it is most sensitive to the top-quark mass, ultimately allowing the top-quark mass to be extracted with high accuracy.
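As a rough illustration of the observable, the sketch below computes $\rho_s = 2m_0/m_{t\bar{t}+1\text{-jet}}$ for a set of events and builds a unit-normalised binned distribution. The function names and the bin edges used in the example are hypothetical and are not taken from the analysis; only the definition of $\rho_s$ and $m_0$ = 170 GeV come from the text.

```python
M0_GEV = 170.0  # constant m_0 quoted in the text


def rho_s(m_ttj_gev: float, m0_gev: float = M0_GEV) -> float:
    """Return rho_s = 2 m_0 / m_{tt+1-jet} for one event."""
    if m_ttj_gev <= 0.0:
        raise ValueError("invariant mass must be positive")
    return 2.0 * m0_gev / m_ttj_gev


def normalised_distribution(rho_values, bin_edges):
    """Return (1/N) dN/d(rho_s) per bin; values outside the edges are ignored."""
    counts = [0] * (len(bin_edges) - 1)
    for rho in rho_values:
        for k in range(len(counts)):
            if bin_edges[k] <= rho < bin_edges[k + 1]:
                counts[k] += 1
                break
    total = sum(counts)
    if total == 0:
        raise ValueError("no entries inside the binning")
    return [c / (total * (bin_edges[k + 1] - bin_edges[k]))
            for k, c in enumerate(counts)]
```

Dividing by the bin width makes the histogram integrate to one, which mirrors the normalisation of R; the actual analysis normalises after background subtraction and uses its own binning.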
Several $t\bar{t}$ event samples with different choices of the value of the Monte Carlo top-quark mass, but otherwise the same settings as the nominal sample, are used to validate the analysis. Alternative samples are used to evaluate uncertainties in modelling the $t\bar{t}$ signal. These include samples produced with MC@NLO 4.01 [51] interfaced with Herwig 6.520 [52] and Jimmy 4.31 [53] and samples generated with Powheg + Herwig, with the ATLAS AUET2 tune [54] and Jimmy [53] for multiple parton interactions. Some Powheg samples were generated with $h_{\text{damp}} = \infty$ and reweighted to $h_{\text{damp}} = m_t$, following the strategy presented in Ref. [13]. Two samples with variations of the renormalisation and factorisation scales, the value of the $h_{\text{damp}}$ factor and the choice of parton-shower tune are used to estimate the uncertainty in modelling of initial- and final-state radiation [55].
Electroweak single-top-quark production was simulated with Powheg matched with Pythia 6.425, with the CTEQ6L1 PDF set and the Perugia 2011C [50] tune. The cross sections are normalised to NNLO+NNLL calculations for t-channel [56], Wt [57], and s-channel production [58].
Leptonic decays of vector bosons produced in association with several high-$p_{\text{T}}$ jets, referred to as W+jets and Z+jets events, with up to five additional final-state partons in the leading-order (LO) matrix elements, were produced with the Alpgen generator [59] interfaced with Herwig for parton fragmentation using the MLM matching scheme [60]. Samples corresponding to the production of a W boson in association with heavy-flavour quarks (b- and c-quarks) were generated separately, at leading order and including effects from the value of the mass of the heavy quarks. Overlap between heavy-flavour quarks that originate from matrix-element production and those that originate from the parton shower was removed. The W+jets samples are normalised to the inclusive W-boson NNLO cross section [61,62].
Diboson events were generated with Herwig with the CTEQ6L1 PDF. The multijet background is estimated using a data-driven matrix method described in Ref. [63].
At the LHC, multiple, simultaneous pp interactions occur in each bunch crossing. The average number of additional pp interactions was 21 during the 2012 run. These pile-up collisions were simulated using Pythia 8.1 [64] with the MSTW2008 leading-order PDF set [44] and the A2M tune [65]. The number of simulated pile-up events superimposed on each hard-scatter event was reweighted to match the distribution of the number of interactions per bunch crossing in data.
The response of the detector and trigger was simulated [66] using a detailed model implemented in GEANT4 [67]. For some samples used to evaluate systematic uncertainties, the detailed description of the calorimeter response was parameterised using the ATLFAST-II simulation [66]. For all the non-$t\bar{t}$ samples the top-quark mass was set to $m_t$ = 172.5 GeV. Simulated events are reconstructed with the same software as the data.
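The pile-up reweighting described above can be sketched as a ratio of normalised histograms. This is a minimal illustration under stated assumptions: the function name and the toy histograms are hypothetical, and the real procedure relies on experiment-specific tools not described in the text.

```python
def pileup_weights(data_hist, mc_hist):
    """Per-bin weights that bring the simulated distribution of the number
    of interactions per bunch crossing into agreement with data.

    Both inputs are lists of bin contents with identical binning.
    Bins that are empty in simulation get weight 0.
    """
    if len(data_hist) != len(mc_hist):
        raise ValueError("histograms must share the same binning")
    n_data, n_mc = sum(data_hist), sum(mc_hist)
    if n_data <= 0 or n_mc <= 0:
        raise ValueError("histograms must not be empty")
    weights = []
    for d, m in zip(data_hist, mc_hist):
        weights.append((d / n_data) / (m / n_mc) if m > 0 else 0.0)
    return weights
```

Each simulated event would then be weighted by the entry for its own number of interactions, so that the weighted simulation has the same shape as the data distribution.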

Lepton and jet reconstruction
Electron candidates are reconstructed from clusters of energy deposits in the electromagnetic calorimeter, matched with a reconstructed inner-detector track [68]. Electrons are required to fulfill the tight identification requirement of Ref. [68]. The calorimeter cluster is required to have transverse energy $E_{\text{T}}$ > 25 GeV and pseudorapidity |η| < 2.47. Clusters in the transition region between the barrel and endcaps with 1.37 < |η| < 1.52 are excluded. Non-prompt electrons are suppressed by cuts on the sum of transverse energy deposited in a cone of size ∆R = 0.2 around the calorimeter cells associated with the electron and on the sum of track $p_{\text{T}}$ in a cone of size ∆R = 0.3. The longitudinal impact parameter ($z_0$) of the electron track relative to the selected event primary vertex$^{4}$ is required to be smaller than 2 mm [69].
Muon candidate reconstruction is based on track segments in the muon spectrometer combined with inner-detector tracks [70]. The combined track must satisfy $p_{\text{T}}$ > 25 GeV and |η| < 2.5. Muon candidates have to be separated from any jet by ∆R > 0.4 and the sum of the transverse momenta of tracks within a cone of size ∆R = 10 GeV/$p_{\text{T}}^{\mu}$ around the muon candidate is required to be less than 5% of the muon transverse momentum, $p_{\text{T}}^{\mu}$. The muon longitudinal impact parameter ($z_0$) relative to the primary vertex is required to be smaller than 2 mm.
Jet reconstruction starts from topological clusters [71] of energy deposits in the calorimeters. A local calibration scheme [72] corrects for the non-compensating response of the calorimeter, dead material and out-of-cluster leakage. Jets are reconstructed from these topological clusters using the anti-$k_t$ algorithm [73, 74] with a radius parameter of R = 0.4. Jets are calibrated to the level of stable-particle jets using Monte Carlo simulation and the response is verified in situ [75]. Jet reconstruction is implemented in the FastJet package [76]. Jets are accepted if $p_{\text{T}}$ > 25 GeV and |η| < 2.5 after the calibration. To reduce the contribution from pile-up, jets with $p_{\text{T}}$ < 50 GeV and |η| < 2.4 must have a jet-vertex-fraction ($p_{\text{T}}$-weighted fraction of tracks associated with the jet that point to the primary vertex) greater than 0.5 [77]. The closest jet within ∆R = 0.2 of selected electrons is discarded to avoid double-counting of the electron candidate as a jet.
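The jet acceptance and the angular distance used in the overlap requirements can be illustrated as follows. This is a sketch only: the function names are hypothetical, ∆R is assumed to be the usual distance in the (η, φ) plane, and the thresholds are the ones quoted in the text.

```python
import math


def delta_r(eta1, phi1, eta2, phi2):
    """Angular distance sqrt(d_eta^2 + d_phi^2), with d_phi wrapped to [-pi, pi]."""
    dphi = math.remainder(phi1 - phi2, 2.0 * math.pi)
    return math.hypot(eta1 - eta2, dphi)


def passes_jet_selection(pt_gev, eta, jvf):
    """Apply the jet requirements quoted in the text.

    pt_gev : calibrated jet transverse momentum in GeV
    eta    : jet pseudorapidity
    jvf    : jet-vertex-fraction of the jet
    """
    if pt_gev <= 25.0 or abs(eta) >= 2.5:
        return False
    # The JVF requirement only applies to low-pT, central jets.
    if pt_gev < 50.0 and abs(eta) < 2.4 and jvf <= 0.5:
        return False
    return True
```

For example, a 30 GeV jet at η = 1.0 is kept only if its jet-vertex-fraction exceeds 0.5, while a 60 GeV jet is kept regardless of that fraction.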
Jets with b-hadrons (b-jets) are tagged with the MV1 algorithm, based on multivariate techniques exploiting impact parameter and secondary vertex information [78]. The efficiency to tag b-jets in tt events is 70%, with a light-parton jet rejection factor of 130 and a c-jet rejection factor of 5. The simulated b-tagging efficiency is corrected to match the efficiency measured in data.
The missing transverse momentum (and its magnitude $E_{\text{T}}^{\text{miss}}$) is reconstructed from the vector sum of the transverse momenta of the reconstructed calibrated leptons, jets and the transverse energy deposited in the calorimeter cells not associated with these objects [79].

Event selection and reconstruction
Events are selected (preselection) if they pass several quality cuts and requirements to select final states with one reconstructed electron or muon and five or more jets [80,81]. A reconstructed primary vertex with at least five associated tracks is required. Exactly one high-quality, isolated lepton with $p_{\text{T}}$ > 25 GeV must be present. It must match the lepton that triggered the event within ∆R < 0.15. At least five jets are required, exactly two of which must be b-jets. The magnitude of the missing transverse momentum $E_{\text{T}}^{\text{miss}}$ and the W-boson transverse mass$^{5}$ must both be greater than 30 GeV. After these requirements the data sample contains 12419 events in the electron channel and 15495 events in the muon channel. Of these events ∼ 93% are expected to be $t\bar{t}$ events.
$^{4}$ A primary vertex candidate is defined as a vertex with at least two associated tracks, consistent with the beam collision region. The vertex candidate with the largest sum of squared transverse momenta of its associated tracks is taken as the primary vertex.
$^{5}$ The transverse mass of the W boson is determined as $m_{\text{T}}^{W} = \sqrt{2\, p_{\text{T}}^{\ell}\, E_{\text{T}}^{\text{miss}} (1 - \cos\Delta\phi)}$, where $\Delta\phi$ is the azimuthal angle between the lepton and the missing transverse momentum.
Table 1: Summary of the event yield after the final selection. The observed event yield is compared with the prediction of the Monte Carlo simulation for top-quark pair production and the most important SM background processes. The estimate of the uncertainty in the normalisation of the expected signal and background yields includes the theoretical uncertainty in the cross section, as well as experimental systematic uncertainties as discussed in Section 8. The contribution from diboson production is negligible.
The reconstruction of the $t\bar{t}$ + 1-jet system follows that of Ref. [11]. Candidates for the hadronically decaying W boson are formed by pairing all jets not tagged as b-jets and selecting pairs $i, j$ that satisfy two requirements, expressed in terms of the following quantities: $p_{\text{T}}^{i}$ is the transverse momentum of the jet $i$, $m_{ij}$ is the invariant mass of the jet pair, $\Delta R_{ij}$ their angular distance and $m_W$ is the value of the W-boson mass reported by the Particle Data Group [5]. The application of these two requirements reduces the multijet and combinatorial backgrounds.
The neutrino momentum is reconstructed, up to a twofold ambiguity, by identifying the $E_{\text{T}}^{\text{miss}}$ with its transverse momentum and using the W-boson mass constraint to infer its longitudinal momentum [11]. Only events where at least one neutrino candidate exists are considered. If there are two solutions, each of the neutrino candidates is added to the charged lepton, leading to two W-boson candidates.
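The twofold ambiguity comes from solving a quadratic equation for the neutrino longitudinal momentum. The sketch below is one common way to write that solution, assuming massless lepton and neutrino; the function name and the default W-boson mass of 80.4 GeV are assumptions made for illustration, and the exact treatment in Ref. [11] may differ.

```python
import math

M_W_GEV = 80.4  # assumed W-boson mass in GeV (the text uses the PDG value)


def neutrino_pz_solutions(lep_px, lep_py, lep_pz, met_x, met_y, m_w=M_W_GEV):
    """Return the real solutions (0, 1 or 2) for the neutrino p_z.

    The neutrino transverse momentum is identified with the missing
    transverse momentum, and (p_lepton + p_neutrino)^2 = m_w^2 is imposed.
    """
    lep_pt2 = lep_px ** 2 + lep_py ** 2
    if lep_pt2 == 0.0:
        return []  # constraint is degenerate without transverse lepton momentum
    lep_e = math.sqrt(lep_pt2 + lep_pz ** 2)
    met2 = met_x ** 2 + met_y ** 2
    mu = 0.5 * m_w ** 2 + lep_px * met_x + lep_py * met_y
    disc = mu ** 2 - lep_pt2 * met2
    if disc < 0.0:
        return []  # no real solution: no neutrino candidate
    root = lep_e * math.sqrt(disc)
    if root == 0.0:
        return [mu * lep_pz / lep_pt2]
    return [(mu * lep_pz - root) / lep_pt2, (mu * lep_pz + root) / lep_pt2]
```

An empty list corresponds to an event without a neutrino candidate, which the selection described above would reject; two entries correspond to the two W-boson candidates.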
Pairs of hadronic and semileptonic top-quark candidates are formed by combining all the hadronic and leptonic W-boson candidates with the two b-tagged jets. Among all possible combinations the one selected is that which minimises the absolute difference between the masses of the reconstructed hadronic top ($m_t^{\text{had}}$) and the semileptonic top ($m_t^{\text{lept}}$) candidates, divided by their sum: \[ \frac{\left| m_t^{\text{had}} - m_t^{\text{lept}} \right|}{m_t^{\text{had}} + m_t^{\text{lept}}}. \] The $t\bar{t}$ candidates must satisfy $m_t^{\text{lept}} / m_t^{\text{had}}$ > 0.9.
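The choice of the best combination can be sketched as a minimisation over candidate mass pairs. The function names and the representation of a candidate as a pair of masses are hypothetical simplifications; the real analysis works with full four-momenta.

```python
def mass_asymmetry(m_had, m_lep):
    """|m_had - m_lep| / (m_had + m_lep) for one pair of top-quark candidates."""
    return abs(m_had - m_lep) / (m_had + m_lep)


def select_ttbar_candidate(candidates):
    """Pick the (m_t_had, m_t_lept) pair with the smallest mass asymmetry.

    Returns None if there is no candidate, or if the selected pair fails
    the requirement m_t_lept / m_t_had > 0.9 quoted in the text.
    """
    best = min(candidates, key=lambda c: mass_asymmetry(c[0], c[1]), default=None)
    if best is None:
        return None
    m_had, m_lep = best
    return best if m_lep / m_had > 0.9 else None
```

For instance, among the pairs (180, 150), (170, 172) and (200, 120) GeV, the second is selected because its two masses are closest relative to their sum.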
The four-momenta of the jets which are identified with the hadronic decay of the W boson are corrected by the factor $m_W / m_{ij}$. Among the jets not used in either top-quark candidate, the leading jet in $p_{\text{T}}$ is taken as the jet produced in association with the top quarks, before their decay. Only events where this extra jet has a transverse momentum larger than 50 GeV are considered. Due to this requirement the selected $t\bar{t}$ + 1-jet events are reconstructed with a topology similar to the one used in the theoretical NLO calculation, where a similar $p_{\text{T}}$ cut is applied [25].
In Table 1 the event yield after the final selection cuts is presented. The contribution from diboson production is negligible and is hence not reported. The efficiency of the signal selection, relative to the events that passed the preselection cuts, is ∼ 51%. The purity of the sample is 94.3% for the electron channel and 95.2% for the muon channel. The yield predicted by the Monte Carlo simulation is lower than the observed yield in both channels, but is compatible within the MC normalisation uncertainty.
Figure 1: Distribution of $\rho_s$ at detector level. Statistical uncertainties in the observed event counts are indicated with error bars. The band estimates the uncertainty on the expected yields. It includes the uncertainty on the luminosity, effects from the cross-section normalisation computed as 8.5% $\sigma_{t\bar{t}}$ ⊕ 7.8% $\sigma_{\text{single-}t}$ ⊕ 33% $\sigma_{V+\text{jets}}$ ⊕ 50% $\sigma_{\text{multijet}}$ and detector plus $t\bar{t}$ modelling uncertainties, as described in Section 8. Sub-leading background contributions have been merged into the "Others" category to improve their visibility and reduce statistical fluctuations in the plot. The bin of the highest $\rho_s$ interval includes events reconstructed with $\rho_s$ > 1.
The $t\bar{t}$ + 1-jet system is reconstructed by adding the four-vectors corresponding to the b-jets, the selected W-boson candidates and the additional jet. The inclusive quantity $\rho_s$ defined in Section 1 is insensitive to ambiguities in the combinatorics and is not affected by an incorrect pairing of b-jets with W-boson candidates. The observed $\rho_s$ detector-level distribution is presented in Figure 1.

Data unfolding
This analysis follows the approach of Ref. [11] in which the measured R distribution is unfolded for detector, hadronisation and top-quark decay effects to the parton level where top quarks are on-shell. The distribution obtained at this level is then compared with theoretical predictions at fixed order, allowing the determination of the top-quark mass in a well-defined theoretical framework. In addition, in this paper, the R distribution is also presented at particle level, where data are unfolded for detector effects only. This will allow direct comparisons with possible future theoretical calculations which include top-quark decay and hadronisation effects.
The parton level is defined using on-shell top quarks and including initial- and final-state radiation from quarks and gluons before the top-quark decay. Jets are reconstructed by clustering u-, d-, c-, s-, b-quarks and gluons, via the anti-$k_t$ jet algorithm with R = 0.4. The $t\bar{t}$ + 1-jet fixed-order calculation at NLO is defined for a jet with $p_{\text{T}}$ larger than 50 GeV and with absolute pseudorapidity smaller than 2.5, ensuring the observable is infrared-safe for calculation purposes. The same definition is also applied to MC reconstructed events.
The particle level is constructed from the collection of stable particles$^{6}$ from full matrix-element plus parton-shower generators, including top-quark decay and final-state radiation effects. Particles produced from interactions with the detector components or from pile-up of additional pp collisions are not considered at this level. The leptons' four-momenta are defined by clustering photons and the leptons with the anti-$k_t$ jet algorithm, using a jet-radius parameter of R = 0.1. No isolation condition is imposed. In order to choose prompt leptons from W/Z-boson decay, the parent of the lepton is required not to be a hadron.
Leptons from τ decay are considered as valid final-state particles. The neutrino from the W/Z decay is treated as a detectable particle and is selected for consideration in the same way as electrons or muons, i.e. the parent is required not to be a hadron. Jets are defined by clustering all the stable particles which have not been used in the definitions of electrons, muons and neutrinos with the anti-$k_t$ algorithm. The value of the jet-radius parameter is chosen to be R = 0.4. A jet is tagged as a b-jet if any rescaled b-hadron$^{7}$ is included in the jet. Events where the leptons overlap with the selected jets are discarded. The fiducial volume at particle level is defined by applying the detector-level selection algorithm to the aforementioned particles as for data, the only difference being that the neutrino four-momentum is known. This choice minimises the magnitude of the correction to the data.
The unfolding procedure is detailed in the following. First, the detector-level distribution of $\rho_s$ in Figure 1 is re-binned as in Figure 2 to maximise the sensitivity of the observable to the top-quark mass while keeping enough statistics in each bin. This is achieved by choosing a fine binning in the region $\rho_s \gtrsim 0.6$, where the observable is most sensitive to the top-quark mass [25]. Second, the predicted background contribution is subtracted and the distribution is normalised to unity. Finally, the distribution is unfolded with a procedure known as iterative Bayesian unfolding [82].
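For the parton level the unfolding procedure takes the following form: \[ R^{t\bar{t}+1\text{-jet}}(\rho_s) = \left[ M^{-1} \otimes R^{\text{det}}(\rho_s) \right] \cdot f(\rho_s) \cdot f^{\text{ph.sp.}}\left( R^{t\bar{t}+1\text{-jet}}_{\text{ACC}} \right). \]
The unfolded distribution is denoted by $R^{t\bar{t}+1\text{-jet}}(\rho_s)$ and the detector-level distribution by $R^{\text{det}}(\rho_s)$.
$^{6}$ A particle is considered stable if its lifetime is greater than $3 \times 10^{-11}$ s.
$^{7}$ Intermediate b-hadrons with $p_{\text{T}}$ > 5 GeV in the MC decay chain history are clustered in the stable-particle jets with their energies set to zero.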
Migrations between the parton level and the detector level are described by the unfolding matrix $M$. The matrix is built from the nominal ATLAS MC $t\bar{t}$ sample, using events which pass both the parton-level and detector-level selection cuts. The matrix is inverted and regularised with the Bayesian unfolding method of Ref. [82]. The bin-by-bin correction factor $f$ accounts for the acceptance and for the difference between the $t\bar{t}$ + g system in the nominal ATLAS MC sample (the first emission level of Ref. [11]) and the $t\bar{t}$ + 1-jet system at parton level. It has a residual dependence on the value of the mass used in the MC generator for the correction, near the threshold production of $t\bar{t}$ + 1-jet events. This is due to the available phase space in this region, which depends on the top-quark mass. This effect is taken into account by a second factor $f^{\text{ph.sp.}}$, which is parameterised in each bin as a function of the unfolded observable before acceptance correction, $R^{t\bar{t}+1\text{-jet}}_{\text{ACC}}$, removing any explicit dependence on the value of the top-quark mass. The $f^{\text{ph.sp.}}$ factor is very close to one and only affects those bins close to the $t\bar{t}$ + 1-jet production threshold region ($\rho_s$ > 0.775). The unfolding to particle level is performed using the same tools, but is simpler in two ways: $f$ is a pure acceptance correction in this case and the phase-space correction $f^{\text{ph.sp.}}$ is equal to one as the same event topologies are considered at detector and particle level.
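The idea behind iterative Bayesian unfolding can be sketched in a few lines. This is a generic illustration of the iterative update, not the implementation used in the analysis: the function name, the number of iterations and the response-matrix layout are assumptions, and the correction factors $f$ and $f^{\text{ph.sp.}}$ are not included.

```python
def iterative_bayes_unfold(response, measured, prior, n_iter=4):
    """Unfold a measured histogram with an iterative Bayesian update.

    response[r][t] : probability that an event in true bin t is
                     reconstructed in bin r (assumed layout)
    measured[r]    : background-subtracted detector-level bin contents
    prior[t]       : starting guess for the true distribution
    """
    n_reco, n_true = len(response), len(prior)
    # Efficiency of each true bin: probability to be reconstructed anywhere.
    eff = [sum(response[r][t] for r in range(n_reco)) for t in range(n_true)]
    estimate = list(prior)
    for _ in range(n_iter):
        norm = sum(estimate)
        p_true = [e / norm for e in estimate]
        updated = [0.0] * n_true
        for r in range(n_reco):
            denom = sum(response[r][t] * p_true[t] for t in range(n_true))
            if denom == 0.0:
                continue
            for t in range(n_true):
                # Bayes' theorem: share of reco bin r attributed to true bin t.
                updated[t] += response[r][t] * p_true[t] / denom * measured[r]
        estimate = [updated[t] / eff[t] if eff[t] > 0.0 else 0.0
                    for t in range(n_true)]
    return estimate
```

The number of iterations acts as the regularisation: few iterations stay close to the prior, while many iterations approach the result of a plain matrix inversion together with its statistical fluctuations.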
The unfolded, normalised differential cross section at particle level is presented in Figure 2, where it is compared with the prediction of the Powheg + Pythia 6 generator with the top-quark mass parameter set to 172.5 GeV. The distributions obtained from the electron and muon channels separately, unfolded following the nominal procedure, are also presented in the same figure to show their compatibility with the combined result.
In Figure 3 the same measurement is presented after unfolding to parton level. The result is compared with the prediction for tt + 1-jet production of Refs. [25,83]. The fixed-order calculation at NLO accuracy in QCD is interfaced to the parton shower and is labelled as "NLO+PS" in the following. The prediction is shown for two values of the top-quark pole mass, to demonstrate the sensitivity of the observable to the top-quark mass.

Extraction of the top-quark mass
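The top-quark pole mass is extracted from the parton-level result with an NLO+PS calculation of $t\bar{t}$ + 1-jet production [25,83]. The fit finds the optimal value of $m_t^{\text{pole}}$ by minimising the following expression with the least-squares method: \[ \chi^2 = \sum_{i,j} \left[ R^{t\bar{t}+1\text{-jet}}_{\text{data},i} - R^{\text{NLO+PS}}_{i}(m_t^{\text{pole}}) \right] V^{-1}_{ij} \left[ R^{t\bar{t}+1\text{-jet}}_{\text{data},j} - R^{\text{NLO+PS}}_{j}(m_t^{\text{pole}}) \right], \] where indices $i, j \in \{1, 2, \ldots, 8\}$ refer to the bin number of the unfolded observable. $V$ is the covariance matrix, of which the diagonal terms correspond to the experimental statistical uncertainties in the measured observable, bin-by-bin. Per-bin uncertainties are assumed to be Gaussian. Correlations between bins are taken into account via off-diagonal entries in $V$. The term $R^{t\bar{t}+1\text{-jet}}_{\text{data}}$ represents the measured differential cross section. In each bin $i$ a continuous parameterisation $R^{\text{NLO+PS}}_{i}(m_t^{\text{pole}})$ of the theoretical prediction is used. The uncertainties in the extracted mass are summarised in Table 2. A detailed description of the systematic uncertainties evaluated is given in the following.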
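A least-squares fit of this kind can be sketched as follows. Everything specific here is an assumption made for illustration: the function names, the use of a simple ternary search, and the idea that the per-bin prediction is supplied as a callable of the mass. The inverse covariance matrix is taken as an input.

```python
def chi2(data, theory, v_inv):
    """Quadratic form sum_ij (d_i - t_i) V^-1_ij (d_j - t_j)."""
    diff = [d - t for d, t in zip(data, theory)]
    n = len(diff)
    return sum(diff[i] * v_inv[i][j] * diff[j]
               for i in range(n) for j in range(n))


def fit_mass(data, v_inv, prediction, m_lo, m_hi, tol=1e-4):
    """Minimise chi2 over the mass with a ternary search on [m_lo, m_hi].

    prediction(m) must return the list of per-bin predicted values for
    mass m. The search assumes chi2 has a single minimum in the interval.
    """
    while m_hi - m_lo > tol:
        m1 = m_lo + (m_hi - m_lo) / 3.0
        m2 = m_hi - (m_hi - m_lo) / 3.0
        if chi2(data, prediction(m1), v_inv) < chi2(data, prediction(m2), v_inv):
            m_hi = m2
        else:
            m_lo = m1
    return 0.5 * (m_lo + m_hi)
```

With a per-bin prediction that is linear in the mass, the $\chi^2$ is a parabola, so the single-minimum assumption holds and the search recovers the mass used to generate toy data.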
Uncertainties in the modelling of the jet energy response are taken into account by varying the jet energy scale (JES) within its uncertainty for a number of uncorrelated components [84][85][86]. A separate uncertainty is assigned to the b-quark jet energy scale (bJES), which is uncorrelated with the JES. Systematic effects that affect the jet energy resolution (JER) and jet reconstruction efficiency are taken into account by smearing the jet energy and by randomly removing a fraction of the jets, respectively. Uncertainties originating from b-jet tagging/mistagging efficiency are also considered (b-tagging efficiency and mistag). Scale factors are applied to correct for the difference between efficiencies measured in data and in simulated events. The $E_{\text{T}}^{\text{miss}}$ is affected by uncertainties in the jet energy and lepton momentum scales, as well as the response for the soft-term and pile-up modelling [79].
Modelling uncertainties cover a possible bias of the measurement due to imperfections in the description of signal and background processes in Monte Carlo generators. Several alternative models are used for $t\bar{t}$ production as introduced in Section 4. Monte Carlo simulations produced with a different matrix-element generator (Powheg and aMC@NLO) are compared to evaluate the uncertainties in the calculation of the matrix elements (Signal MC generator). Uncertainties in $t\bar{t}$ modelling coming from the parton shower and hadronisation model (Shower and hadronisation) are evaluated by comparing Pythia 6 with Herwig, both interfaced with Powheg. Uncertainties due to the choice of proton PDF (Proton PDF) are evaluated following the prescriptions of Ref. [87]. Uncertainties coming from the choice of parameter values that control initial- and final-state radiation (ISR/FSR), the colour reconnection (Colour reconnection) and underlying-event modelling (Underlying event) are estimated following the scheme of Ref. [88]. The uncertainty in the modelling of background processes is evaluated by varying the normalisation and shape of several sources (Background). The normalisation is varied within the cross-section uncertainty for single-top (±7.8%) and V+jets (±33%) backgrounds, while the data-driven multijet contribution is scaled by ±50% [63]. Background shape uncertainties and luminosity uncertainty are found to be negligible.
The uncertainty due to the limited size of the Monte Carlo sample used to unfold the data (MC statistics) is evaluated by repeating the unfolding procedure 5000 times, varying the unfolding matrix within its uncertainties.
Finally, additional systematic uncertainties are assigned to the top-quark mass extraction procedure. The top-quark mass value is obtained by fitting the $R(m_t)$ prediction to the data unfolded at parton level. One uncertainty is assigned to the fit procedure (Fit parameterisation) to account for a possible bias from the continuous parameterisation of the theoretical prediction and for non-closure effects. Another uncertainty is assigned to the phase-space correction factor $f^{\text{ph.sp.}}$ (Unfolding modelling). It is evaluated as half the difference between top-quark mass results obtained with and without the phase-space correction.
The theoretical uncertainty in the mass consists of two contributions. The uncertainty due to the truncation of the perturbative series is evaluated with the conventional procedure of varying the factorisation ($\mu_F$) and renormalisation ($\mu_R$) scales by factors of 2 and then 1/2 from the nominal scale $\mu_F = \mu_R = m_t$ (Scale variations). The scale uncertainty is taken as the mass shift for the alternative scale choices and is typically asymmetric. A positive (negative) shift of the extracted top-quark mass is found when decreasing (increasing) the renormalisation scale. Additional tests were performed in order to gain confidence in the values presented in Table 2. The scale variation has a larger impact in the $\overline{\text{MS}}$ mass scheme, as already observed in Ref. [17]. A redundancy exists between the theoretical uncertainty obtained from the scale variations and the one considered by the initial/final radiation, which is not subtracted. The uncertainty due to PDFs and the parametric uncertainty in the strong coupling constant, $\alpha_s$, is evaluated by generating the prediction for three consistent choices of PDF set and $\alpha_s$: CT10nlo with $\alpha_s(m_Z)$ = 0.118 (nominal), MSTW2008nlo90cl with $\alpha_s(m_Z)$ = 0.120 and NNPDF23 with $\alpha_s(m_Z)$ = 0.119. The uncertainty is taken as half the envelope of the mass values extracted from the three choices mentioned above (Theory PDF⊕$\alpha_s$). The total theory uncertainty is obtained by adding the scale and PDF uncertainties in quadrature. The parton shower barely affects the theoretical prediction [11] and its associated uncertainty is negligible.
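The two combination rules mentioned above, half the envelope and the sum in quadrature, can be written down directly. The function names are hypothetical and the numbers in the usage note are purely illustrative; they are not values from Table 2.

```python
import math


def half_envelope(mass_values):
    """Half of the spread between the largest and smallest extracted mass."""
    return 0.5 * (max(mass_values) - min(mass_values))


def combine_in_quadrature(*components):
    """Square root of the sum of squares of independent uncertainty components."""
    return math.sqrt(sum(c * c for c in components))
```

For example, three illustrative mass values of 171.0, 171.2 and 171.4 GeV give a half-envelope of 0.2 GeV, and illustrative components of 0.3 GeV and 0.4 GeV combine to 0.5 GeV. For an asymmetric scale uncertainty the quadrature sum would be applied separately to the upward and downward shifts.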

Results
The fit to the parton-level differential cross section yields a top-quark pole mass of \[ m_t^{\text{pole}} = 171.1 \pm 0.4 (\text{stat}) \pm 0.9 (\text{syst}) \substack{+0.7\\ -0.3} (\text{theo}) \text{ GeV}. \] The procedure for the extraction of the $\overline{\text{MS}}$ mass with the calculation from Ref. [17] is completely analogous to the pole-mass fit described above. The result for the running mass in the $\overline{\text{MS}}$ scheme is: \[ m_t(m_t) = 162.9 \pm 0.5 (\text{stat}) \pm 1.0 (\text{syst}) \substack{+2.1\\ -1.2} (\text{theo}) \text{ GeV}. \] The statistical uncertainty in the mass is evaluated by repeating the unfolding and fit procedure on pseudo-data samples, where the number of events in each bin is varied within the statistical uncertainty. The experimental systematic uncertainty is evaluated as described in Section 8 and corresponds to those values quoted in Table 2.
Several tests were performed to verify the consistency and robustness of the result.
The measured value of the top-quark mass is stable with respect to variations of the $p_{\text{T}}$ cut on the additional jet and the choice of binning of the $\rho_s$ variable. The analysis is repeated using six bins, eight bins and ten bins. The extracted masses agree within 0.3 GeV with the default setup. A higher number of bins increases the sensitivity and leads to a slightly reduced uncertainty of the measurement. However, the use of ten bins is affected by fluctuations in the unfolding procedure and the $\chi^2$, originating from the limited statistics of the available simulations. The result obtained with eight bins is very stable in all aspects of the analysis and therefore this choice is finally adopted. The fits are also repeated excluding different bins in the $\chi^2$ sum, with an agreement of the results obtained within 0.1 GeV.
Correlations between the extracted top-quark mass and the assumed value of m W used in event selection are negligible.
The masses extracted from the electron channel and the muon channel separately are compatible. Effects associated with the top-quark finite width, off-shell effects and non-resonant contributions are small and covered by the $t\bar{t}$ MC modelling uncertainties. In addition, the measured top-quark mass is independent of the assumed top-quark mass in the MC simulation that is used to unfold the data. The fit is repeated for MC samples using different top-quark masses between 165 GeV and 180 GeV. For all samples the unfolding is based on MC simulation with a top mass of 172.5 GeV. The difference between simulated top-quark mass and the fit result is found to be compatible with zero over the entire range of top-quark masses tested.
The unfolding procedure is validated using pseudo-data samples which were generated by varying the bin contents of the observable at detector level according to their statistical errors. Pull distributions are produced using these samples. In addition, stress tests are performed to demonstrate that the unfolding procedure is independent of the input distribution. All these tests demonstrate that the analysis procedure is unbiased and correctly estimates the statistical uncertainties.
The theoretical uncertainty assigned to the measured top-quark pole mass is cross-checked in two alternative ways, following the approach applied to the measurement based on the data set at 7 TeV centre-of-mass energy [11]:
• The value of the top-quark mass is evaluated based on a LO calculation and compared to the default, which is based on an NLO calculation. The difference is found to be 0.3 GeV and is covered by the assigned uncertainties due to the scale choice.
• An expansion of R in powers of $\alpha_s$ is performed and the theoretical uncertainty is re-evaluated performing scale variations on the new expression for R. In this way, potential cancellations are avoided which may occur when expanding the numerator and the denominator of R separately as a function of $\alpha_s$ and can lead to a too optimistic uncertainty. Only the case of $m_t^{\text{pole}}$ is considered in this test and the result obtained (+0.4, -0.2) GeV is found to be compatible with that expressed in Table 2.
All the above considerations and cross-checks suggest that the error assigned to unknown higher orders gives a reliable estimation of its value.
The top-quark pole mass result obtained from data unfolded to parton level and reported in Eq. (1) is compatible with previous measurements of the pole mass [8][9][10][11][12][13][14], as is shown in Figure 4. Compared with the result obtained by ATLAS with the same method at 7 TeV [11] the statistical and systematic uncertainties of the new result are reduced by more than a factor of two.
The $\overline{\text{MS}}$ mass result is translated to the pole-mass scheme using the NLO QCD relationship [18] between the top-quark masses in the two schemes.$^{9}$ When converting $m_t(m_t)$ to m

Conclusions
In this paper, the normalised differential cross section, R, of top-quark pair production in association with an energetic jet is presented as a function of the inverse of the invariant mass of the $t\bar{t}$ + 1-jet system, $\rho_s = 2m_0/m_{t\bar{t}+1\text{-jet}}$. The measurement is performed using pp collision data at a centre-of-mass energy of $\sqrt{s} = 8$ TeV.
$^{9}$ The QCD relation between the two schemes is known to four loops, but here the series is truncated at two loops to match the precision of the $t\bar{t}$ + 1-jet cross section that was used to extract the mass in both schemes. The relationship between the two masses then takes the simple form: \[ m_t^{\text{pole}} = m_t(m_t) \left[ 1 + \frac{4}{3} \frac{\alpha_s(m_t)}{\pi} \right] + \mathcal{O}(\alpha_s^2). \] The pole mass result quoted in the text is obtained for $\alpha_s$(163 GeV) ∼ 0.116.
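As a worked illustration of the scheme conversion, the sketch below applies the standard first-order QCD relation between the running and pole masses. That this is the "simple form" referred to in the footnote is an assumption, and the function name is hypothetical; the inputs are the running mass and the $\alpha_s$ value quoted in the text.

```python
import math


def msbar_to_pole_first_order(m_msbar_gev, alpha_s):
    """Convert m_t(m_t) to the pole mass using the first-order relation
    m_pole = m(m) * [1 + (4/3) * alpha_s / pi] (assumed form)."""
    return m_msbar_gev * (1.0 + 4.0 * alpha_s / (3.0 * math.pi))


# Inputs quoted in the text: m_t(m_t) = 162.9 GeV, alpha_s(163 GeV) ~ 0.116.
m_pole_gev = msbar_to_pole_first_order(162.9, 0.116)
```

With these inputs the correction factor is about 1.049, so the central value shifts upwards by roughly 8 GeV. Uncertainties are not propagated in this sketch.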
The result for $m_t(m_t)$ suffers from a larger theoretical uncertainty as compared with the pole mass. This is due to a larger dependence on the renormalisation and factorisation scales of the $\overline{\text{MS}}$ scheme in the most sensitive region close to the $t\bar{t}$ + 1-jet threshold.
[12] D0 Collaboration, Measurement of the inclusive tt production cross section in pp collisions at