The measurement of top-quark pair production cross sections at hadron colliders has entered the era of precision physics with the analysis of data available from the Large Hadron Collider (LHC) in the runs at center-of-mass energies \(\sqrt{S}=7\) and \(8\) TeV. Measurements of the total cross section for \(t{\bar{t}}\)-production from ATLAS and CMS reach by now an accuracy of typically better than \({\fancyscript{O}} (10\,\%)\), with the systematic and luminosity uncertainties already dominating over the small statistical uncertainty, see, e.g., [13]. First results of differential distributions for \(t{\bar{t}}\)-production from the LHC are appearing as well [4, 5]. Thus, given the present experimental accuracy hadro-production of \(t{\bar{t}}\)-pairs is currently being established as a standard model (SM) benchmark process.

This has motivated tremendous activity on the theory side to match the experimental precision by computing higher order corrections in quantum chromodynamics (QCD) and we briefly recapitulate the status for inclusive \(t{\bar{t}}\)-pair production, i.e., no tagged final states. Predictions for the total cross section are complete to next-to-next-to-leading order (NNLO) [69] while differential distributions are known to next-to-leading order (NLO) [10, 11], including top-quark decay [1216], though. Additional corrections beyond NLO based on threshold logarithms have been obtained for distributions in the top-quark’s transverse momentum and rapidity, \(p_\mathrm{T}^t\) and \(y^t\), as well as in the invariant mass \(m^{t{\bar{t}}}\) of the top-quark pair [1720].

Comparison of these theory predictions to experimental data can be used to determine parameters such as the strong coupling constant, the parton luminosity and the top-quark mass and to study their correlations. Of these parameters, the top-quark mass is certainly the most interesting one with prominent implications for the electro-weak vacuum of the SM, see, e.g., [21, 22]. It is a particularly attractive feature of cross sections measurements that they offer the opportunity for an unambiguous and theoretically well-defined determination of the top-quark mass in a particular renormalization scheme [23, 24].

The conventional scheme choice for the quark mass renormalization is the pole mass, which has its short-comings [25, 26], though, since it is based on the idea of quarks appearing as asymptotic states. It exhibits poor convergence of the perturbative series and due to the renormalon ambiguity it carries an intrinsic uncertainty of the order of \(\Lambda _\mathrm{QCD}\). As an alternative, one can consider top-quark hadro-production with a running mass, which has the advantages of improved convergence and scale stability of the perturbative expansion. For \(t{\bar{t}}\) hadro-production, these features have been demonstrated for the total cross section [23].

In the present letter, we study the dependence of single differential distributions in \(p_\mathrm{T}^t\), \(y^t\) and \(m^{t{\bar{t}}}\) on the definition of the mass parameter. Specifically, we will compare the conventional pole mass \(m_t^\mathrm{pole}\) with the scale dependent \(\overline{\hbox {MS}}\) mass by means of the well-known relation in perturbation theory,

$$\begin{aligned} m_t^\mathrm{pole} = m(\mu _\mathrm{r})\left( 1 + \frac{\alpha _\mathrm{s}}{\pi } d_1 + \left( \frac{\alpha _\mathrm{s}}{\pi }\right) ^2 d_2 + \cdots \right) , \end{aligned}$$
(1)

for the scheme change from \(m_t^\mathrm{pole}\) to the running \(\overline{\hbox {MS}}\) mass \(m(\mu _\mathrm{r})\) taken at the renormalization scale \(\mu _\mathrm{r}\). To NNLO the coefficients \(d_1\) and \(d_2\) are given by [27] (see also [28, 29])

$$\begin{aligned}&\!\!\!d_1 = \frac{4}{3} + \ell \end{aligned}$$
(2)
$$\begin{aligned}&\!\!\!d_2 = \frac{307}{32} + \frac{\pi ^2}{3} + \frac{\pi ^2}{9}\ln (2) - \frac{1}{6}\zeta _3 + \frac{509}{72}\ell + \frac{47}{24}\ell ^2\nonumber \\&\qquad - n_\mathrm{f} \left( \frac{71}{144} + \frac{\pi ^2}{18} + \frac{13}{36}\ell + \frac{1}{12}\ell ^2 \right) , \end{aligned}$$
(3)

with \(\ell = \ln \left( \frac{\mu _\mathrm{r}^2}{m(\mu _\mathrm{r})^2}\right) \) and assuming vanishing masses for all lighter quarks.

Let us briefly illustrate the advantages of the running \(\overline{\hbox {MS}}\) mass \(m(\mu _\mathrm{r})\) for the total \(t{\bar{t}}\) cross section. The recently completed exact NNLO QCD result [69] turned out to provide a value for the total cross section which is very close, i.e., within \({\fancyscript{O}}\)(1–5 %), to previous approximations based on the combined threshold and high-energy asymptotics [31] and has been presented as a function of the pole mass \(m_t^\mathrm{pole}\). The necessary scheme transformation from \(m_t^\mathrm{pole}\) to \(m(\mu _\mathrm{r})\), i.e., the application of Eq. (1), has been discussed in [23] and is implemented in the program Hathor (version 1.5) [32], a tool for the calculation of the total \(t{\bar{t}}\) cross section in hadronic collisions.

The much improved apparent convergence of the perturbative expansion with the running mass as well as the scale stability are illustrated in Figs. 1 and 2 where we compare theory predictions for the total \(t{\bar{t}}\) cross section as a function of the pole and the \(\overline{\hbox {MS}}\) mass, respectively. Figure 1 displays the increase in the cross section values from LO to NNLO, where we have taken the parton distribution functions (PDFs) to be order independent. For an on-shell mass \(m_t^\mathrm{pole}=173\) GeV, for instance, the relative increase is \(\sigma _\mathrm{NLO}/\sigma _\mathrm{LO} = 1.46\) and \(\sigma _\mathrm{NNLO}/\sigma _\mathrm{NLO} = 1.12\) at the scale \(\mu _\mathrm{r} = \mu _\mathrm{f} = m_t^\mathrm{pole}\). This is to be compared with a much reduced increase of only \(\sigma _\mathrm{NLO}/\sigma _\mathrm{LO} = 1.26\) and \(\sigma _\mathrm{NNLO}/\sigma _\mathrm{NLO} = 1.03\) for \(m(m)=163\) GeV in the \(\overline{\hbox {MS}}\) scheme at the scale \(\mu _\mathrm{r} = \mu _\mathrm{f} = m(m)\). These findings can be understood by noting that the scheme transformation of Eq. (1) applied to the total \(t{\bar{t}}\) cross section effectively shifts all corrections of the Wilson coefficients of the perturbative cross-section to the threshold region thereby improving the apparent convergence of the perturbation series, see, e.g.,  [33].

Fig. 1
figure 1

The LO, NLO and NNLO QCD predictions for the total cross section at LHC (\(\sqrt{S} = 8\) TeV) as a function of the top-quark mass in the on-shell scheme \(m_t^\mathrm{pole}\) at the scale \(\mu = m_t^\mathrm{pole}\) (left) and, respectively, in the \(\overline{\hbox {MS}}\) scheme \(m(m)\) at the scale \(\mu = m(m)\) (right) using the PDF set ABM11 [30] and \(\mu = \mu _\mathrm{r} = \mu _\mathrm{f}\)

Fig. 2
figure 2

The scale dependence of the LO, NLO and NNLO QCD predictions for the total cross section at LHC (\(\sqrt{S} = 8\) TeV) for the top-quark mass \(m_t^\mathrm{pole}=173\) GeV in the on-shell scheme (left) and for \(m(m)=163\) GeV in the \(\overline{\hbox {MS}}\) scheme (right) with the choice \(\mu = \mu _\mathrm{r} = \mu _\mathrm{f}\) using the PDF set ABM11 [30]. The vertical bars indicate the size of the scale variation in the standard range \(\mu /m_t^\mathrm{pole} \in [1/2, 2]\) and \(\mu /m(m) \in [1/2, 2]\), respectively

Figure 2 shows the scale stability for the LHC predictions confirming earlier findings for the Tevatron, cf. [23]. The scale variation for the cross section in the on-shell scheme in the standard range \(\mu /m_t^\mathrm{pole} \in [1/2, 2]\) amounts to \(\Delta \sigma _\mathrm{NNLO} =^{+3.8\,\%}_{-6.\,0\,\%}\), whereas for the running mass we only find \(\Delta \sigma _\mathrm{NNLO} = ^{+0.1\,\%}_{-3.0\,\%}\) for the range \(\mu /m(m) \in [1/2, 2]\). Interestingly, for an on-shell mass the point of minimal sensitivity where \(\sigma _\mathrm{LO} \simeq \sigma _\mathrm{NLO} \simeq \sigma _\mathrm{NNLO}\) is located at fairly low scales, \(\mu \simeq m_t^\mathrm{pole}/4 \simeq 45\) GeV, whereas for a running mass it resides at the scale \(\mu = {\fancyscript{O}}(m(m))\), i.e., it coincides with the natural hard scale of the process. The observation of improved scale stability for predictions with the \(\overline{\hbox {MS}}\) mass is further corroborated by studying alternative scale choices for \(m(\mu )\) as shown in Fig. 3. Cross section predictions with the running mass \(m(\mu )\) display a comparable uncertainty of \(\Delta \sigma _\mathrm{NNLO} = ^{+0.0\,\%}_{-4.8\,\%}\) for the range \(81.5\) GeV \(\le \mu \le 326\) GeV, which corresponds to the range \(\mu /m(m) \in [1/2, 2]\) if normalized to \(m(m)=163\) GeV.

Fig. 3
figure 3

The scale dependence of the total cross section at LHC (\(\sqrt{S} = 8\) TeV) as a function of \(m(\mu )\) in the \(\overline{\hbox {MS}}\) scheme for the NLO (blue, dashed) and NNLO (red, solid) QCD predictions with the choice \(\mu = \mu _\mathrm{r} = \mu _\mathrm{f}\) and using the PDF set ABM11 [30]. The scale variation is normalized to \(m(m)=163\) GeV. The vertical bars indicate the size of the scale variation in the standard range \(\mu /m_t^\mathrm{pole} \in [1/2, 2]\) and \(\mu /m(m) \in [1/2, 2]\), respectively. The dotted curves display the results for the mass at the fixed scale \(m(m)=163\) GeV as shown in Fig. 2 (right) for comparison

If instead, only the scale of the mass \(m(\mu )\) is varied within \(m(81.5\) GeV) \(\le m(\mu )\) \(\le m(326\) GeV), while the scales \(\mu _\mathrm{r}\) and \(\mu _\mathrm{f}\) are kept fixed, one obtains shifts of \(\Delta \sigma _\mathrm{NNLO} = ^{+0.0\,\%}_{-5.6\,\%}\) relative to the cross section value at \(m(m)\) for the choice \(\mu _\mathrm{r} = \mu _\mathrm{f} = 163\) GeV. Again, these variations are similar to the uncertainty quoted in Fig. 2 (right). These results imply, that experimental determinations of the running mass from the measured cross section are feasible with very good accuracy and a small residual theoretical uncertainty. For Tevatron data such analyses have already been performed in the past [22, 34]. Recently, the top-quark cross section data from Tevatron and LHC has been analyzed in a global fit at NNLO [35] with full account of all correlations between the top-quark mass, the gluon PDF and the strong coupling \(\alpha _\mathrm{s}\).

For completeness, we include in Tables 1 and 2 the values for the full NNLO cross sections at the Tevatron (\(\sqrt{S}=1.96\) TeV) and at the LHC for various energies of interest.

Table 1 The total cross section for top-quark pair-production at NNLO using a pole mass \(m_t^\mathrm{pole} = 173~\mathrm{GeV}\) and the PDF set ABM11 [30] and CT10 [36] and with the errors shown as \(\sigma + \Delta \sigma _\mathrm{scale} + \Delta \sigma _\mathrm{PDF}\). The NLO numbers in brackets are obtained by using the respective NLO PDF sets. The scale uncertainty \( \Delta \sigma _\mathrm{scale}\) is based on maximal and minimal shifts for the choices \(\mu =m_t^\mathrm{pole}/2\) and \(\mu = 2m_t^\mathrm{pole}\) and \(\Delta \sigma _\mathrm{PDF}\) is the 1\(\sigma \) combined PDF+\(\alpha _\mathrm{s}\) error. All rates are in pb
Table 2 Same as Table 1 for a running mass \(m(m) = 163~\mathrm{GeV}\) in the \(\overline{\hbox {MS}}\) scheme

We also briefly comment here on a related study, that has appeared in the literature. Ahrens et al. [37] has performed a study of total \(t{\bar{t}}\) cross section in the \(\overline{\hbox {MS}}\) scheme compared to the pole scheme for the top-quark mass and did not find any strong improvement of the perturbative series when using the running mass. This is in contradiction to the numbers reported in Tables 1 and 2. Note however, that the authors of [37] have used an approximation for the NNLO QCD corrections, which was appropriate at the time of writing, but has turned out to be inaccurate if compared to the exact NNLO QCD result [69]. In fact, [37] has predicted \(t{\bar{t}}\) cross sections at approximate NNLO QCD accuracy, which are systematically too low by \({\fancyscript{O}}\)(6–13 %) depending on the collider and the center-of-mass energy.

Next we discuss the single-differential distributions in the top-quark’s transverse momentum \(p_\mathrm{T}^t\) and rapidity \(y^t\) and in the invariant mass \(m^{t{\bar{t}}}\) of the \(t{\bar{t}}\)-pair, which are all known to NLO in QCD [10, 11] in the conventional pole mass scheme. As we are interested in the differential cross sections with the mass in the \(\overline{\hbox {MS}}\) scheme, we briefly recall the kinematics of heavy-quark hadro-production,

$$\begin{aligned} h_1(P_1) +\, h_2(P_2) \longrightarrow \mathrm{Q}(p_1) + X[{\bar{\mathrm{Q}}}](p_X), \end{aligned}$$
(4)

where \(h_1\) and \(h_2\) are hadrons, \(X[\overline{\mathrm{Q}}]\) denotes any allowed hadronic final state containing at least the heavy anti-quark, and \(\mathrm{Q}(p_1)\) is the identified heavy-quark with mass \(m\). The hadronic invariants in this reaction are

$$\begin{aligned}&\!\!\! S = (P_1+P_2)^2,\quad T_1 = (P_2-p_1)^2-m^2, \nonumber \\&\!\!\! U_1 = (P_1-p_1)^2-m^2. \end{aligned}$$
(5)

The double differential cross section for Eq. (5) in terms of the hard parton cross section \(\sigma _{ij}\) and PDFs \(f_i\) at the factorization scale \(\mu ^2\) reads

$$\begin{aligned} S^2\frac{d^2\sigma (S,T_1,U_1)}{\mathrm{d}T_1 \mathrm{d}U_1}&= \int _{x_1^-}^1 \frac{dx_1}{x_1}\! \int _{x_2^-}^1\! \frac{\mathrm{d}x_2}{x_2} f_i(x_1,\mu ^2) f_j(x_2,\mu ^2) \nonumber \\&\times s^2 \frac{d^2\sigma _{ij}(s,t_1,u_1,\mu ^2)}{\mathrm{d}t_1 \mathrm{d}u_1}, \end{aligned}$$
(6)

and the partonic invariants are related to their hadronic counterparts through

$$\begin{aligned} t_1 = x_1 T_1, \quad u_1 = x_2U_1, \quad s = x_1 x_2 S, \end{aligned}$$
(7)

with the limits on \(x_1\) and \(x_2\),

$$\begin{aligned} x_1^- = -\frac{U_1}{S+T_1} \le x_1 \le 1\, ,\quad x_2^- = \frac{x_1 T_1}{x_1S + U_1} \le x_2 \le 1.\nonumber \\ \end{aligned}$$
(8)

In order to write the differential cross section in terms of \(p_\mathrm{T}^t\), \(y^t\) and \(m^{t{\bar{t}}}\), we will also need their definitions in terms of the hadronic invariants. For the case of \(p_\mathrm{T}^t\) and \(y^t\), the relations are

$$\begin{aligned} y^t = \frac{1}{2}\ln \left( \frac{T_1}{U_1}\right) \, , \quad (p_\mathrm{T}^t)^2=\frac{T_1U_1}{S}-m^2, \end{aligned}$$
(9)

whereas for \(m^{t{\bar{t}}}\), pair-invariant mass kinematics is used, in which case the requirements on the integrals are

$$\begin{aligned} x_1^- = \frac{\left( m^{t{\bar{t}}}\right) ^2}{S} \quad \mathrm{and} \quad x_2^- = \frac{\left( m^{t{\bar{t}}}\right) ^2}{x_1 S}. \end{aligned}$$
(10)

In these kinematics, the relevant partonic invariants for writing the differential cross section in terms of \(m^{t{\bar{t}}}\) are,

$$\begin{aligned} t_1&= -\frac{\left( m^{t{\bar{t}}}\right) ^2}{2}\left( 1-\beta _t \cos \theta \right) \!, \end{aligned}$$
(11)
$$\begin{aligned} u_1&= -\frac{\left( m^{t{\bar{t}}}\right) ^2}{2}\left( 1+\beta _t \cos \theta \right) \!, \end{aligned}$$
(12)

with \(\beta _t = \sqrt{1-4m^2/\left( m^{t{\bar{t}}}\right) ^2}\) and \(\theta \) the scattering angle of the top quark. Full discussions of the kinematics to NLO for one-particle inclusive and pair-invariant mass kinematics are available in [11, 38] respectively.

In order to convert to cross section predictions with the mass in the \(\overline{\hbox {MS}}\) scheme, we start from the on-shell description:

$$\begin{aligned} \frac{\mathrm{d}\sigma (m_t^\mathrm{pole})}{\mathrm{d}X}&= \left( \frac{\alpha _\mathrm{s}}{\pi }\right) ^2 \frac{\mathrm{d}\sigma ^{(0)}(m_t^\mathrm{pole})}{\mathrm{d}X} \nonumber \\&\quad + \left( \frac{\alpha _\mathrm{s}}{\pi }\right) ^3 \frac{\mathrm{d}\sigma ^{(1)}(m_t^\mathrm{pole})}{\mathrm{d}X} + {\fancyscript{O}}(\alpha _\mathrm{s}^4) \end{aligned}$$
(13)

where \(X\) denotes any of the variables \(p_\mathrm{T}^t\), \(y^t\) and so on. If we now replace \(m_t^\mathrm{pole}\) with \(m(\mu _\mathrm{r})\) using Eq. (1), we can expand in \(\alpha _\mathrm{s}\) and obtain a description of the differential cross section in the \(\overline{\hbox {MS}}\) scheme.

$$\begin{aligned} \frac{\mathrm{d}\sigma (m(\mu _\mathrm{r}))}{\mathrm{d}X}&= \left( \frac{\alpha _\mathrm{s}}{\pi }\right) ^2\frac{\mathrm{d}\sigma ^{(0)}(m(\mu _\mathrm{r}))}{\mathrm{d}X} \nonumber \\&+ \left( \frac{\alpha _\mathrm{s}}{\pi }\right) ^3 \left\{ \frac{\mathrm{d}\sigma ^{(1)}(m(\mu _\mathrm{r}))}{\mathrm{d}X}\right. + d_1 m(\mu _\mathrm{r})\nonumber \\&\left. \times \frac{\mathrm{d}}{\mathrm{d}m_t}\!\left( \!\frac{\mathrm{d}\sigma ^{(0)}(m_t)}{\mathrm{d}X}\!\right) \biggr |_{m_t=m(\mu _\mathrm{r})}\right\} \! {+} {\fancyscript{O}}(\alpha _\mathrm{s}^4).\nonumber \\ \end{aligned}$$
(14)

The only extra part required is the mass derivative of the Born contribution. This has been computed semi-analytically for the \(p_\mathrm{T}^t\), \(y^t\), and \(m^{t{\bar{t}}}\) distributions. To see why we also need some numerical derivatives in this calculation, consider Eq. (6) for the Born contribution to the double differential cross section as a starting point:

$$\begin{aligned}&\!\!\!\int _{x_1^-}^1 \frac{\mathrm{d}x_1}{x_1} \int _{x_2^-}^1 \frac{\mathrm{d}x_2}{x_2} f_i(x_1,\mu ^2) f_j(x_2,\mu ^2) s^2\nonumber \\&\quad \times \frac{\mathrm{d}^2\sigma _{ij}^{(0)}}{\mathrm{d}t_1 \mathrm{d}u_1} \delta (s+t_1+u_1), \end{aligned}$$
(15)

where the delta function imposes Born kinematics and can be used to carry out the integral over \(x_2\) through its relation to \(s, t_1\) and \(u_1\). Re-writing the cross section in terms of \(p_\mathrm{T}^t\) and \(y^t\) provides us with the form of the integrand that will need to be evaluated,

$$\begin{aligned} \int _{x_1^-}^1 \mathrm{d}x_1 \mathcal{L}(x_1,x_2,\mu ^2) \frac{x_1 x_2 S}{x_1 S + U_1} \left. \frac{\mathrm{d}^2\sigma (s,t_1,u_1)}{\mathrm{d}y^t \mathrm{d}p_\mathrm{T}^{t2}}\right| _{x_2=-\frac{x_1 T_1}{x_1 S+U_1}},\nonumber \\ \end{aligned}$$
(16)

where \(\mathcal{L}(x_1,x_2,\mu ^2) = f_1(x_1,\mu ^2)f_2(x_2,\mu ^2)/x_1x_2\) is the differential parton luminosity.

The most important aspect to note is that both \(x_2\) and \(x_1^-\) depend on the top-quark mass through their relations to the Mandelstam variables. This means that the mass derivative of the PDFs needs to be done numerically using

$$\begin{aligned}&\!\!\!\frac{\mathrm{d}}{\mathrm{d}m} \mathcal{L}(x_1,x_2,\mu ^2)\nonumber \\&\quad = \frac{\mathrm{d}x_1}{\mathrm{d}m}\frac{\mathcal{L}(x_1+\delta ,x_2,\mu ^2) - \mathcal{L}(x_1-\delta ,x_2,\mu ^2)}{2\delta } \nonumber \\&\qquad +\frac{\mathrm{d}x_2}{\mathrm{d}m}\frac{\mathcal{L}(x_1,x_2+\delta ,\mu ^2) - \mathcal{L}(x_1,x_2-\delta ,\mu ^2)}{2\delta }. \end{aligned}$$
(17)

This form of the derivative is found to converge well. Aside from this, all other derivatives are known analytically. When compared with a fully numerical calculation of the derivative term, it is found that the two methods agree to less than 1 %. In the case of \(m^{t{\bar{t}}}\), the integration limits and variables do not depend on the top-quark mass (\(m\)) so all derivatives are computed analytically.

Using the relations presented here, we have computed the differential cross sections for \(t{\bar{t}}\)-production in terms of \(p_\mathrm{T}^t\), \(y^t\) and \(m^{t{\bar{t}}}\). We have used the program MCFM [39] for the NLO corrections [13, 40] in the conventional pole mass \(m_t^\mathrm{pole}\) scheme and a custom routine for the Born and mass derivative terms. The calculations were carried out using the ABM11 [30] and CT10 [36] PDFs at NLO. As a check, each curve was integrated to obtain a result for the full cross section. In all cases, the value agreed within less than 1 % of the cross section computed using Hathor. As well, the mass derivatives were checked by computing the differential cross sections at values of the top mass ranging between 150 and 180 GeV. A curve was fit to each point in the relevant spectrum to obtain the derivative at the given \(\overline{\hbox {MS}}\) mass. Again, these values agreed within less than 1 % of the (semi-)analytic derivatives used.

In Fig. 4 the rapidity distributions are shown for the \(\overline{\hbox {MS}}\) and pole mass schemes. It is clear from these that at NLO, the convergence of the perturbative series as well as the scale dependence improves. In the pole-mass scheme, a relative increase for the cross section ratios \(\sigma _\mathrm{NLO}/\sigma _\mathrm{LO} = 1.50\) is seen, while in the \(\overline{\hbox {MS}}\) scheme we have \(\sigma _\mathrm{NLO}/\sigma _\mathrm{LO} = 1.31\) at \(y^t= 0\). The scale variation in the on-shell scheme is \(\Delta \sigma _\mathrm{NLO} = {}^{+9.5\,\%}_{-14\,\%}\) while in the \(\overline{\hbox {MS}}\) scheme, we have \(\Delta \sigma _\mathrm{NLO} = {}^{+4.5\,\%}_{-12\,\%}\) again at \(y^t = 0\).

Fig. 4
figure 4

The differential cross section with respect to the rapidity \(y^t\) of the top quark in the pole (left) and the \(\overline{\hbox {MS}}\) (right) mass scheme at the LHC with \(\sqrt{S}=8\) TeV. The dotted (green) curves are the LO contributions while the dashed (blue) curves include NLO corrections and are obtained using the PDF set CT10 [36]. The scale dependence in the range \(\mu /m_t^\mathrm{pole}\) or \(\mu /m(m) \in [1/2,2]\) is shown as a band around the NLO curve

Figure 5 shows the transverse momentum distributions. Again we see an improvement when moving from the pole mass scheme to the \(\overline{\hbox {MS}}\) scheme. In this case the improvement in the NLO contribution is a bit better with \(\sigma _\mathrm{NLO}/\sigma _\mathrm{LO} = 1.50\) for the pole mass scheme and \(\sigma _\mathrm{NLO}/\sigma _\mathrm{LO} = 1.25\) in the \(\overline{\hbox {MS}}\) scheme. The scale variation goes from \(\Delta \sigma _\mathrm{NLO} = {}^{+13\,\%}_{-13\,\%}\) in the pole mass scheme to \(\Delta \sigma _\mathrm{NLO} = {}^{+6.4\,\%}_{-9.6\,\%}\) in the \(\overline{\hbox {MS}}\) scheme. The above values are taken near the maximum of the curve at \(p_\mathrm{T}^t = 75~\mathrm GeV \).

Fig. 5
figure 5

Same as Fig. 4 for the differential cross section with respect to the transverse momentum \(p_\mathrm{T}^t\) of the top quark

Finally, in Fig. 6 we show the invariant mass distributions. The increase at NLO here is \(\sigma _\mathrm{NLO}/\sigma _\mathrm{LO} = 1.54\) with scale variation \(\Delta \sigma _\mathrm{NLO} = {}^{+13\,\%}_{-13\,\%}\) in the pole mass scheme and \(\sigma _\mathrm{NLO}/\sigma _\mathrm{LO} = 1.30\) with scale variation \(\Delta \sigma _\mathrm{NLO} = {}^{+8.2\,\%}_{-9.6\,\%}\) in the \(\overline{\hbox {MS}}\) scheme. These values are taken at an invariant mass of \(m^{t{\bar{t}}} = 371~\mathrm GeV \).

Fig. 6
figure 6

Same as Fig. 4 for the differential cross section with respect to the invariant mass \(m^{t{\bar{t}}}\) of the top quark pair

Looking at just the leading order curves in Fig. 5 we see that the change to the \(\overline{\hbox {MS}}\) scheme seems to push the peak to higher values. To be sure that this does not contribute to the better convergence that we are seeing in the \(\overline{\hbox {MS}}\) scheme, we carry out the same analysis except with an \(\overline{\hbox {MS}}\) mass of 173 GeV. Figure 7 shows the LO and NLO contributions in the pole mass scheme and the \(\overline{\hbox {MS}}\) scheme.

Fig. 7
figure 7

The differential cross section with respect to the rapidity \(p_\mathrm{T}^t\) of the top quark in the pole (left) and the \(\overline{\hbox {MS}}\) (right) mass scheme at the LHC with \(\sqrt{S}=8\) TeV. The dotted (green) curves are the LO contributions while the dashed (blue) curves include NLO corrections and are obtained using the PDF set CT10 [36]

Inspection of the K-factors in these plots shows an increase at NLO of \(\sigma _\mathrm{NLO}/\sigma _\mathrm{LO} = 1.50\) in the pole-mass scheme and \(\sigma _\mathrm{NLO}/\sigma _\mathrm{LO} = 1.26\) in the \(\overline{\hbox {MS}}\) scheme. As in the previous discussion, this value is taken at \(p_\mathrm{T}^t = 75\) GeV. This should be compared the the value of \(\sigma _\mathrm{NLO}/\sigma _\mathrm{LO} = 1.25\) in the \(\overline{\hbox {MS}}\) scheme using a top mass of 163 GeV. We see that the improvement in convergence comes only from the inclusion of the derivative term in Eq. (14) and is largely unaffected by the choice of top mass. Note that this comparison between the two mass choices is consistent through the whole \(p_\mathrm{T}^t\) spectrum and is not a special case at the chosen value of \(p_\mathrm{T}^t\).

As we did for the total cross-section, we also consider the alternative scale choices \(m(\mu )\) for the differential cross-sections. Figure 8 is the same as Fig. 5 (right) except the band is now produced by using \(m(\mu )\) and we vary \(\mu /m(m) \in [1/2,2]\). Comparing Fig. 5 (right) and Fig. 8 we see that using a fixed \(m(m)\) is the more conservative choice when considering scale dependence at NLO. We now have \(\Delta \sigma _\mathrm{NLO} =^{+2.3\,\%}_{-3.7\,\%}\) at \(p_\mathrm{T}^t = 75\) GeV. This improvement in the scale dependence can also be seen in the NLO total cross-section in Fig. 3.

Fig. 8
figure 8

The scale dependence of the \(p^t_\mathrm{T}\) distribution at NLO using the running mass \(m(\mu )\) and the PDF set CT10. The yellow band shows the variation of the differential distribution in the range \(\mu /m(m) \in [1/2,2]\). This should be compared with the plot in Fig. 5 (right) where the same analysis was carried out using a fixed \(m(m)\)

Let us also comment on the difference between the size of predictions of the \(\overline{\hbox {MS}}\) scheme and pole scheme at NLO. When comparing, for example, the peak height in the \(p_\mathrm{T}^t\) distributions in Fig. 5 it is seen that the height differs by approximately 15 %. This difference can be attributed to the fact that we are using fixed order perturbation theory and reinforces the observation of faster convergence in the \(\overline{\hbox {MS}}\) scheme. The same effect can be seen at NLO in the total cross-section in Fig. 1, where the difference is again approximately 15 %. At NNLO however, the cross-section predictions are somewhat closer and only differ by about 6 %. A similar ‘convergence’ of the predicted distributions is expected when higher order corrections are included. Given that the NNLO contribution to the total cross-section is approximately 12 % in the pole mass scheme and only 3 % in the \(\overline{\hbox {MS}}\) scheme, we expect most of this difference to be made up by higher order corrections that are larger in the pole mass scheme than in the \(\overline{\hbox {MS}}\) scheme.

In addition to these improvements, moving from the pole mass to the \(\overline{\hbox {MS}}\) scheme changes the overall shape of the distributions so that the peaks generally become more pronounced. This can be attributed to our choice of mass and is a consequence of the radiative corrections being shifted to the threshold region as mentioned earlier. However, the peak positions in both the \(p_\mathrm{T}^t\) and \(m^{t{\bar{t}}}\) distributions are stable against radiative corrections. At most they are seen to shift by 1 %, which is unlike the case for \(t{\bar{t}}\)-production from \(e^+e^-\) collisions where the position of the \(t{\bar{t}}\)-threshold peak shifts significantly upon adding NLO and NNLO perturbative corrections to the total cross section expressed in terms of the pole mass [41].

Another salient feature not shown in Fig. 6 occurs in the \(\overline{\hbox {MS}}\) differential cross section with respect to the invariant mass of the \(t{\bar{t}}\) pair. Very close to the threshold of \(t{\bar{t}}\) production the contribution reponsible for the change in the mass renormalization scheme, i.e., the derivative term in Eq. (14), becomes large. This is due to the presence of a \(1/\beta _t\) which diverges as \(m^{t{\bar{t}}} \rightarrow m\), cf. Eq. (11). These large corrections have the effect of causing the invariant mass spectrum to dip below zero for values of \(m^{t\bar{t}} \gtrsim 2m_t\). The area under this (negative) divergent part of the curve represents approximately \(-\)0.7 % of the total area (i.e., total cross-section). In the full spectrum, however, this is counterbalanced by the positive contribution resulting in a cross section integrated over \(m^{t{\bar{t}}}\) that agrees within less than 1 % with the value calculated in Hathor.

Obviously, this behavior is an indication of the breakdown of fixed-order perturbation theory. First of all, bound-state effects in \(t{\bar{t}}\) production at hadron colliders arise in the kinematic region \(m^{t{\bar{t}}} \gtrsim 2m_t\), i.e., when the velocity \(\beta \) of the top quarks is small, \(\beta \ll 1\). In this region, the conventional perturbative expansion in \(\alpha _\mathrm{s}\) breaks down, owing to singular terms \(\sim (\alpha _\mathrm{s}/\beta )^n\) in the \(n\)-loop amplitude, which require the all-order resummation of the Coulomb corrections [42, 43]. This resummation for \(t{\bar{t}}\) dynamics close to threshold is carried out in a non-relativistic effective theory by means of a Schrödinger equation for which the pole mass definition seems to be the natural choice and which implies a certain power counting, so that all terms of order \(m_t\beta ^2 \sim m_t\alpha _\mathrm{s}^2\) are formally of equal size.

If the contribution for the change in the mass renormalization scheme \(\delta m^\mathrm{sd}\) from the pole mass to a so-called short-distance mass \(m_t^\mathrm{sd}\) such as the \(\overline{\hbox {MS}}\) mass \(m(\mu _\mathrm{r})\) is parametrically larger than \(m_t\alpha _\mathrm{s}^2\) that is \(\delta m^\mathrm{sd} \equiv m_t^\mathrm{pole} - m^\mathrm{sd} \sim m_t^\mathrm{sd}\alpha _\mathrm{s}\), then \(\delta m^\mathrm{sd}\) becomes the dominant term in the kinematic region \(m^{t{\bar{t}}} \gtrsim 2m_t\). Such situation is realized for \(\delta m^\mathrm{sd} \sim m_t \alpha _\mathrm{s}\), cf. Eq. (14), and excludes the \(\overline{\hbox {MS}}\) mass from being a useful mass near threshold. Of course, all these findings on the scheme choice for the mass definition close to the threshold are long known from studies for \(t{\bar{t}}\) production in \(e^+e^-\) collisions [41] (see [44, 45] for recent progress). Various solutions have been proposed, e.g., the alternative use of a so-called \(1S\) mass [46, 47] defined through the perturbative contribution to the mass of a hypothetical \(n = 1\), \(^3S_1\) toponium bound state, cf. [37] for an application to \(t{\bar{t}}\) hadro-production or the use of a “potential-subtracted” (PS) mass [48], recently considered in [49] in the context of finite-width effects in unstable-particle production at hadron colliders. In any case, since the conventional perturbative expansion of the cross section breaks down for \(m^{t{\bar{t}}} \gtrsim 2m_t\) we do not display this particular kinematic region in Fig. 6. Moreover, with the currently given experimental resolution for the \(m^{t\bar{t}}\)-bins, cf. [4], it will be difficult to access this region at the LHC at all.

For completeness we also provide a table of values for the cross section at LHC with \(\sqrt{S}=8\) TeV at binned values of \(y^{t}, p_{\mathrm{T}}^{t}\) and \(m^{t\bar{t}}\) with binning approximately equal to that of [4]. Comparing the data generated using ABM11 as compared to CT10, we see that there is an overall shift downward consistent with that observed for the total cross section, cf. Tables 1 and 2. The improvement of the apparent perturbative convergence and the scale stability when moving from the pole mass scheme to the \(\overline{\hbox {MS}}\) scheme is consistent for both PDF sets (Table 4).

Table 3 Values for the \(y^{t}\) differential cross section for top-quark pair-production at LO and NLO for various \(y^t\) using the PDF set CT10 [36] with \(\sqrt{S} = 8\)TeV. All rates are in pb
Table 4 The same as Table 3 but using the PDF set ABM11 [30]
Table 5 Values for the \(p_\mathrm{T}^t\) differential cross section for top-quark pair-production at LO and NLO for various \(p_\mathrm{T}^t\) using the PDF set CT10 [36]. All rates are in pb/GeV

In summary, we have shown how treating the differential cross sections for \(t{\bar{t}}\) production in the \(\overline{\hbox {MS}}\) scheme for the top-quark mass has benefits as compared to the pole mass scheme. The perturbative series shows the same improvement in convergence and scale dependence as has been observed for the total cross section. As a consequence the NLO contributions with a \(\overline{\hbox {MS}}\) mass are expected to provide already very precise cross section predictions. An extension to NNLO accuracy would provide results with a still smaller theoretical uncertainty from the scale variation. Yet, the predictions at the nominal scale, i.e., \(\mu _\mathrm{r}=m(m)\), are expected to remain largely unchanged (Tables 6, 8).

Table 6 The same as Table 5 but using the PDF set ABM11 [30]
Table 7 Values for the \(m^{t{\bar{t}}}\) differential cross section for top-quark pair-production at LO and NLO for various \(m^{t{\bar{t}}}\) using the PDF set CT10 [36]. All rates are in pb/GeV
Table 8 The same as Table 7 but using the PDF set ABM11 [30]

As future prospects we note that the refinement of the present phenomenological analysis to NNLO accuracy is certainly feasible once the complete NNLO QCD corrections for differential \(t{\bar{t}}\) production are available. As a first step in this direction, one may consider approximate NNLO corrections based, e.g., on the dominant threshold logarithms. Other obvious improvements are extension to double-differential distributions and other exclusive observables, even including top-quark decay.