1 Introduction

The top quark, which has the largest mass in the Standard Model (SM), is expected to provide hints of new physics owing to its inevitably large coupling with the Higgs boson discovered in 2012 at the Large Hadron Collider (LHC). The value of the top-quark mass is one of the key issues in current research. In particular, it plays an important role in testing the stability of the SM Higgs potential. The most precise measurement of the top-quark mass is 172.44 ± 0.13 (stat.) ± 0.47 (syst.) GeV from the published CMS result [1] at the LHC. The result is limited by the systematic uncertainty, which comes mainly from the modeling of hadronization. The ATLAS collaboration has also measured the top-quark mass, obtaining 172.84 ± 0.34 (stat.) ± 0.61 (syst.) GeV, which is limited mostly by the systematic uncertainty from the jet energy scale [2].

These measured top-quark masses are believed to differ from the pole mass due to non-perturbative effects such as hadronization. To date, the pole mass has been measured with a relatively large uncertainty of around 2 GeV from \(t\bar{t}\) cross-section measurements [3, 4] and from \(t\bar{t}\)+1 jet events [5]. In order to reduce the uncertainty, different approaches need to be explored. One such approach utilizes leptonic observables [6,7,8]. Since leptons are not directly involved in QCD activity, leptonic observables are advantageous for extracting the top-quark pole mass. The latest measurement using lepton differential distributions [9] has an uncertainty close to 1.5 GeV.

Among the approaches utilizing leptonic observables, the method proposed in Ref. [8] has a unique feature. The only observable required by this method is the lepton energy distribution from the top-quark decay in the laboratory frame, while the theoretical prediction “compared” with it is just the lepton energy distribution in the top-quark rest frame. In other words, this method has a boost-invariant nature and is independent of the top-quark velocities. It is called the “weight function method” because it uses a characteristic weight function \(W(E_\ell ,m)\). The essence of the method is as follows: given the lepton energy distribution \(D(E_\ell )\) from an experiment, there is an infinite number of weight functions for which the following quantity,

$$\begin{aligned} I(m) = \int dE_\ell D(E_\ell ) W(E_\ell ,m), \end{aligned}$$
(1)

vanishes when the parameter m is equal to the true mass value of the top quark, i.e. \(I(m=m_t^{true})=0\). For the details of the method, see Refs. [10, 11], which originally proposed the weight function method for the mass reconstruction of the Higgs boson.

A practical drawback of this method is that it requires the whole energy distribution at parton level, which includes the region outside the detector acceptance. In the LHC experiments, events with low-energy leptons are not available due to the lepton \(p_T\) trigger. In Ref. [8], this problem was addressed by compensating the low-energy part of the distribution with Monte-Carlo (MC) events. Since the compensating MC part has \(m_t\) dependence, a way to extract the top-quark mass independently of it was invented. In addition, in order to avoid uncertainties related to the MC events, a way of determining the normalization of the compensating part was devised. Furthermore, the event selection was carefully chosen so as not to deform the parton-level distribution of the remaining part. Although these procedures work well, they introduce additional complexity.

In this paper, we perform a feasibility study of the top-quark mass measurement using the weight function method at the LHC. In contrast to the study in Ref. [8], we use an unfolding technique to obtain the parton-level distribution, which makes this method simpler and easier to handle experimentally. The lepton energy distribution at parton level is obtained by a simple bin-by-bin unfolding for the lepton+jets decay channel. We adopt a more realistic event selection and detector simulation than the study in Ref. [8]. We also estimate some of the major uncertainties that are expected in realistic data analyses at the LHC. We show that this method can provide an independent verification of the top-quark mass measurement, and we point toward possible improvements with Run 2 data at the LHC.

2 Samples

Simulated pp collision data samples for the \(t\bar{t}\) process are generated at a center-of-mass energy of 14 TeV using MadGraph5 (v2.4.0) [12] at leading order, owing to limited computing resources, and interfaced with PYTHIA (v6.428) [13] for parton showering and hadronization. The \(t\bar{t}\) samples are generated for five different values of the top-quark mass: 167, 170, 173, 176 and 179 GeV. For each signal sample, 600 K events are generated. A sample with a top-quark mass of 173 GeV is generated separately to obtain the detector response correction required in the unfolding procedure. This sample, containing 1200 K events, is statistically independent of the signal samples in order to avoid any statistical bias.

To emulate detector performance, the generated events are processed through the DELPHES package (v3.3.2) [14] using the public CMS detector card. Similar to the CMS reconstruction, the objects from the particle-flow algorithm implemented in DELPHES are used throughout this analysis.

Pileup events are not simulated in this analysis. Although the effects of pileup can be merged with the simulated events in the DELPHES package, pileup mitigation techniques, which are being developed at the LHC experiments, can reduce the pileup effects significantly. It is also important to understand the physics in the absence of pileup. Therefore, we focus on the case where there are no pileup effects.

In the DELPHES fast simulation, the momenta of all physics objects, such as electrons, muons and jets, are smeared as functions of their transverse momenta (\(p_\mathrm{T}\)) and pseudorapidities (\(\eta \)) so that the detector effects in the CMS experiment are simulated. The reconstruction efficiencies of electrons, muons and jets are also parameterized as functions of \(p_\mathrm{T}\) and \(\eta \), based on public information from the CMS experiment.
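
As an illustration of this kind of parameterized response, the following minimal Python sketch applies a Gaussian smearing to lepton energies with a \(p_\mathrm{T}\)- and \(\eta\)-dependent relative resolution; the resolution values are placeholders, not the actual CMS parameterization used in the DELPHES card.

```python
import numpy as np

rng = np.random.default_rng(seed=42)

def smear_lepton_energy(energy, pt, eta):
    """Gaussian energy smearing with a pT- and eta-dependent relative resolution.
    The numbers below are illustrative placeholders, not the CMS parameterization
    implemented in the public DELPHES card."""
    rel_resolution = 0.01 + 0.005 * (abs(eta) > 1.5) + 1e-4 * pt  # placeholder values
    return energy * rng.normal(loc=1.0, scale=rel_resolution)
```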

The muon identification efficiency is set to 95% for \(p_\mathrm{T}\) > 10 GeV and \(|\eta |\) < 2.4. The electron identification efficiency is set to 95% for \(p_\mathrm{T}\) > 10 GeV and \(|\eta |\) < 1.5, and to 85% for \(p_\mathrm{T}\) > 10 GeV and 1.5 < \(|\eta |\) < 2.5. Isolated muons and electrons are selected by requiring a relative isolation \(I_{rel}\) < 0.1, where \(I_{rel}\) is defined as the sum of the energies of the surrounding particle-flow tracks, photons and neutral hadrons divided by the \(p_\mathrm{T}\) of the muon or electron.
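
As a concrete illustration, a minimal sketch of the relative-isolation requirement is given below; the inputs are assumed to be the scalar sums of the surrounding particle-flow track, photon and neutral-hadron contributions.

```python
def passes_relative_isolation(lepton_pt, sum_tracks, sum_photons, sum_neutral_hadrons,
                              max_rel_iso=0.1):
    """Return True if I_rel < 0.1, where I_rel is the sum of the surrounding
    particle-flow track, photon and neutral-hadron contributions divided by
    the lepton pT (all in GeV)."""
    i_rel = (sum_tracks + sum_photons + sum_neutral_hadrons) / lepton_pt
    return i_rel < max_rel_iso
```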

The particle-flow jets used in this analysis are clustered from particle-flow tracks and particle-flow towers. If a jet is already reconstructed as an isolated electron, muon or photon, it is excluded from further consideration. The b-tagging efficiency, parameterized as a function of \(p_\mathrm{T}\) and \(\eta \), ranges from 20 to 50%. The fake b-tagging rate for light-flavor jets is set to 0.1%, which corresponds to the tight working point described in the CMS paper of Ref. [15].

3 Event selection

Events are selected based on the decay topology of the top-quark pair in the lepton+jets channel. Each event is required to have exactly one isolated lepton (\(e, \mu \)) with \(p_\mathrm{T}\) > 20 GeV and \(|\eta |\) < 2.1. Events are further selected by requiring at least four jets with \(p_\mathrm{T}\) > 30 GeV and two b-tagged jets, in order to reject SM backgrounds such as W + jets and single-top events. The acceptance after all the requirements is 4.3%. After this typical event selection for the lepton+jets channel, the background contribution is expected to be below the 10% level [16]. In the unfolding procedure described in Sect. 4, the background remaining after the selection is assumed to be subtracted from data. A possible background contribution and its uncertainty are not considered in this analysis.
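
A minimal sketch of this selection is given below, assuming each event is represented by a simple Python dictionary of reconstructed leptons and jets (the field names are hypothetical, and the b-tag requirement is interpreted here as at least two b-tagged jets).

```python
def passes_selection(event):
    """Lepton+jets selection described above: exactly one isolated lepton (e or mu)
    with pT > 20 GeV and |eta| < 2.1, at least four jets with pT > 30 GeV,
    and at least two b-tagged jets among them."""
    leptons = [l for l in event["leptons"]
               if l["isolated"] and l["pt"] > 20.0 and abs(l["eta"]) < 2.1]
    if len(leptons) != 1:
        return False
    jets = [j for j in event["jets"] if j["pt"] > 30.0]
    if len(jets) < 4:
        return False
    return sum(1 for j in jets if j["btagged"]) >= 2
```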

4 Measurement

The weight function method requires the lepton energy distribution at parton level. In order to obtain the parton-level distribution, we use an unfolding technique [17] to remove the effects of detector performance, photon radiation, lepton isolation, the event selection, etc. The lepton energy distribution at reconstruction level is unfolded back to the parton-level distribution using a simple bin-by-bin unfolding. For the unfolding, the additional sample with a top-quark mass of 173 GeV is used. The lepton energy distribution at reconstruction level and the unfolded distribution are shown in Fig. 1. A bin width of 2 GeV is used for the lepton energy distributions. In reality, a more sophisticated unfolding, e.g. with regularization, might be required to correct for the effects of detector resolution and bin migration, which can arise from energy loss due to final-state radiation from a muon or bremsstrahlung from an electron.
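
A minimal sketch of the bin-by-bin unfolding is given below, assuming the data histogram at reconstruction level and the response sample's histograms at reconstruction and parton level share the same 2 GeV binning (the array names are hypothetical). The treatment of the region below the trigger threshold, where the data histogram is empty, is described later in this section.

```python
import numpy as np

def unfold_bin_by_bin(data_reco, response_reco, response_parton):
    """Bin-by-bin unfolding: scale each data bin by the ratio of the parton-level
    to reconstruction-level contents of the response sample.  Bins in which the
    response sample has no reconstructed events are left empty here."""
    data_reco = np.asarray(data_reco, dtype=float)
    reco = np.asarray(response_reco, dtype=float)
    parton = np.asarray(response_parton, dtype=float)
    correction = np.divide(parton, reco, out=np.zeros_like(parton), where=reco > 0)
    return data_reco * correction
```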

In this analysis, the unfolding is done in two steps. The first step corrects for the effect of the event selection: the energy distribution after the final selection is unfolded back to the distribution at preselection level, with only the single-lepton requirement applied. Since the events in this first step are within the acceptance, a data-driven method could be used. The second step corrects for detector effects, such as acceptance and resolution, on the lepton energy spectrum, in order to obtain the parton-level energy distribution from the distribution at preselection level. In this second step, the response sample with a top-quark mass of 173 GeV is used. In the unfolding procedure, it is important to use a statistically independent sample to avoid any bias; therefore, an additional 1200 K events are generated for the response distribution.

Fig. 1

Lepton energy distribution at reconstruction level for a top-quark mass of 173 GeV (red) and the unfolded distribution at parton level (blue). The distribution at reconstruction level is unfolded using an additional sample with a top-quark mass of 173 GeV

At reconstruction level, there are no events in the low-energy region below 20 GeV due to the lepton-trigger requirement of \(p_\mathrm{T}> 20\) GeV. Therefore, we rely on the MC simulation below the \(p_\mathrm{T}\) threshold of 20 GeV. Figure 2 shows the unfolded distributions for top-quark masses of 167, 173 and 179 GeV. The distribution below the 20 GeV threshold at parton level is taken from the response sample generated at \(m_t = 173\) GeV.

Fig. 2

Unfolded lepton energy distributions at parton level for top-quark masses of 167, 173 and 179 GeV

After the unfolding procedure, the unfolded energy distribution at parton level is used in the weight function method. The weight functions \(W(E_\ell ,m)\) used in this analysis are provided by the authors of Ref. [8]. The explicit form of the weight function is

$$\begin{aligned} W(E_\ell ,m) \propto \int dE \mathcal {D}_0 (E; m) \frac{1}{EE_\ell }\frac{(E_\ell /E)^n - (E/E_\ell )^n}{ [(E_\ell /E)^n + (E/E_\ell )^n]^2 }, \end{aligned}$$
(2)

where \(\mathcal {D}_0(E;m)\) is the leading-order theoretical prediction for the distribution of the lepton energy E, calculated in the top-quark rest frame with a top-quark mass value m. The weight functions corresponding to n = 2, 3, 5, 15 for \(m = 173\) GeV are shown in Fig. 3.
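
Note that the rational factor in Eq. (2) changes sign under the exchange \(E_\ell \leftrightarrow E\); this odd-function structure underlies the condition \(I(m=m_t^{true})=0\) [10, 11]. A minimal numerical sketch of Eq. (2) is shown below; \(\mathcal {D}_0(E;m)\) is left as a user-supplied placeholder since its explicit form is not reproduced in this paper, and the integration grid is purely illustrative.

```python
import numpy as np

def weight_function(E_lep, m, D0, n=2, E_grid=None):
    """Numerical evaluation of Eq. (2) for a single lepton energy E_lep (GeV).
    D0(E, m) is a user-supplied leading-order prediction for the lepton energy
    distribution in the top-quark rest frame (a placeholder here).  The overall
    normalization of W is irrelevant for locating the zero of I(m)."""
    if E_grid is None:
        E_grid = np.linspace(1.0, m / 2.0, 2000)  # illustrative grid, avoids E = 0
    x = E_lep / E_grid
    odd_factor = (x**n - x**(-n)) / (x**n + x**(-n))**2  # odd under E_lep <-> E
    integrand = D0(E_grid, m) * odd_factor / (E_grid * E_lep)
    return np.trapz(integrand, E_grid)
```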

With these weight functions, the following method is applied. As explained in Sect. 1, we calculate weighted integrals of the lepton energy distribution [Eq. (1)]. With binned data, the integral in Eq. (1) is replaced by a sum:

$$\begin{aligned} I(m) = \sum _i N_i W(E_i, m)\Big / \sum _j N_j, \end{aligned}$$
(3)

where \(N_i\) and \(E_i\) are the number of entries and the lepton energy, respectively, of the i-th bin of the lepton energy distribution. The reconstructed top-quark mass \(m_t^{rec}\) is then extracted through \(I(m=m_t^{rec})=0\). We multiply the unfolded energy distribution and the weight function bin by bin and obtain the weighted sum.
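
A minimal sketch of Eq. (3) and of the extraction of \(m_t^{rec}\) from the zero crossing of I(m) is given below, assuming the unfolded histogram is given as bin contents \(N_i\) with bin centers \(E_i\), and that W is a callable W(E, m) such as the weight_function sketch above (with \(\mathcal {D}_0\) fixed, e.g. via functools.partial) or a pre-tabulated weight function.

```python
import numpy as np

def weighted_sum(bin_contents, bin_centers, W, m):
    """Eq. (3): normalized weighted sum of the unfolded lepton energy
    distribution, with W(E, m) evaluated at each bin center."""
    N = np.asarray(bin_contents, dtype=float)
    weights = np.array([W(E, m) for E in bin_centers])
    return np.dot(N, weights) / N.sum()

def reconstruct_mass(bin_contents, bin_centers, W, m_scan):
    """Scan I(m) over the mass hypotheses m_scan and return the zero crossing,
    obtained by linear interpolation between the two neighbouring scan points
    where I(m) changes sign."""
    I = np.array([weighted_sum(bin_contents, bin_centers, W, m) for m in m_scan])
    crossings = np.where(np.diff(np.sign(I)) != 0)[0]
    if len(crossings) == 0:
        raise RuntimeError("no zero crossing of I(m) in the scanned mass range")
    i = crossings[0]
    m1, m2, I1, I2 = m_scan[i], m_scan[i + 1], I[i], I[i + 1]
    return m1 - I1 * (m2 - m1) / (I2 - I1)
```

For instance, a scan such as m_scan = np.arange(167.0, 179.01, 0.5) covers the range of generated mass values.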

Figure 4 shows the weighted sums over the parton-level energy distribution for the weight functions corresponding to \(n=\) 2, 3, 5, 15. The zeros of the weighted sums indicate the reconstructed top-quark masses. In this figure, the input top-quark mass of the data sample is 173 GeV. The plot shows that the input value is correctly reconstructed for each n using the unfolded energy distribution.

Fig. 3

Weight functions \(W(E_\ell ,m)\) provided by the authors of Ref. [8] corresponding to n = 2, 3, 5, 15 in Eq. (2) with \(m= 173\) GeV

Fig. 4

Weighted sums I(m) over the unfolded lepton energy distribution with the weight functions corresponding to n = 2, 3, 5, 15. The input value of the top-quark mass is 173 GeV; its position is indicated by the dashed line. The horizontal axis is the mass parameter m used in the calculation of \(W(E_\ell ,m)\). The top-quark mass is reconstructed from \(I(m=m_t^{rec})=0\)

Fig. 5

Validation with different input values of the top-quark mass using the response distribution with \(m_t = 173\) GeV. The vertical axis indicates the reconstructed top-quark mass and the horizontal axis indicates the input mass. The dashed line indicates the ideal case where the input mass is the same as the reconstructed mass. The statistical uncertainty is shown together with the central value

Table 1 Input and reconstructed masses using the response distribution with \(m_t = 173\) GeV. The statistical uncertainty for each mass value is 0.5%

5 Results

Figure 5 shows the reconstructed top-quark masses versus the input masses, in 3-GeV steps, for the \(n=2\) weight function. The response sample with a top-quark mass of 173 GeV is used for the unfolding, including the correction below the 20 GeV threshold. The statistical uncertainty is estimated with toy MC experiments, in which each bin of the unfolded distribution is varied randomly within its statistical uncertainty; the statistical uncertainty assigned to each bin of the unfolded distribution is the sum of the statistical uncertainties from the response distribution and the input distribution. The estimated statistical uncertainty is around 0.5%, which corresponds to about 0.8 GeV for a top-quark mass of 173 GeV. This statistical uncertainty is shown in Fig. 5.
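
A minimal sketch of this toy-MC procedure, reusing reconstruct_mass from the sketch in Sect. 4, is shown below; each bin of the unfolded distribution is fluctuated within its statistical uncertainty (here with a Gaussian, as an assumption) and the spread of the re-extracted masses is taken as the statistical uncertainty. The bin uncertainties are assumed to already include both the input and the response contributions.

```python
import numpy as np

def toy_mc_uncertainty(bin_contents, bin_errors, bin_centers, W, m_scan,
                       n_toys=1000, seed=1):
    """Statistical uncertainty on the reconstructed mass, estimated by fluctuating
    each bin of the unfolded distribution with a Gaussian of width equal to its
    statistical uncertainty and repeating the mass extraction for each toy."""
    rng = np.random.default_rng(seed)
    toy_masses = []
    for _ in range(n_toys):
        toy = rng.normal(loc=bin_contents, scale=bin_errors)
        toy = np.clip(toy, 0.0, None)  # keep bin contents non-negative
        toy_masses.append(reconstruct_mass(toy, bin_centers, W, m_scan))
    return np.std(toy_masses)
```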

Table 1 shows the input and reconstructed top-quark masses using the weight functions corresponding to n = 2, 3, 5, 15. The reconstructed top-quark masses are consistent with the input masses within the statistical uncertainties. Note that the results for different n involve different statistical and systematic uncertainties, although they are strongly correlated. In addition, there can be a systematic bias due to the 2-GeV bin width. Since a weight function with larger n has a sharper shape in the low-energy region (see Fig. 3), the bias due to the bin width is larger for larger n.

To study the effect of the lepton \(p_\mathrm{T}\) threshold on the result, the analysis is repeated with lepton \(p_\mathrm{T}\) thresholds of 22, 24 and 26 GeV. Increasing the threshold is expected to lead to a larger bias on the reconstructed top-quark mass. Table 2 shows the input and reconstructed masses for the various lepton \(p_\mathrm{T}\) thresholds. One can see in Table 2 that the reconstructed top-quark mass approaches the top-quark mass of the response sample, namely 173 GeV, as the threshold increases. With the 20 GeV threshold, the bias is sufficiently small compared with the statistical uncertainty.

Table 2 Input and reconstructed masses for various lepton \(p_\mathrm{T}\) thresholds from 20 GeV to 26 GeV in steps of 2 GeV. The weight function for \(n=2\) is used

6 Discussion

In this section, we discuss and estimate the main systematic uncertainties that can arise in this method. The most serious bias is caused by the fact that we have to rely on the response distribution from the MC sample for the lepton energy distribution below the threshold. As the lepton energy threshold goes up, we rely more and more on the response sample. In Sect. 5, we showed that this possible bias is negligible when a lepton energy threshold of 20 GeV is used; the result with this threshold is consistent with the input top-quark mass within the statistical uncertainty of 0.5%. However, with a threshold above 20 GeV, the bias becomes larger than the statistical uncertainty, and the simple approach used in this paper runs into difficulties. Therefore, it is desirable that the lepton \(p_\mathrm{T}\) threshold be as low as possible. Note that this is not a critical problem for the weight function method itself. As explained in Sect. 1, this problem is solved in Ref. [8] by imposing a consistency condition, namely that the reconstructed mass be equal to the mass of the MC sample used below the threshold. Our result demonstrates that the simple approach used in this analysis can be applied only with a \(p_\mathrm{T}\) trigger threshold as low as 20 GeV. With a lepton \(p_\mathrm{T}\) trigger above 20 GeV, it would be essential to apply a supplementary method such as the one described in Ref. [8].

There would also be a systematic uncertainty from the data-driven correction in the first step of the unfolding described in Sect. 4. In order to obtain the correction factor for the selection efficiency, an orthogonal event selection can be applied to enhance top-quark events; however, finding such an orthogonal selection would not be trivial. The uncertainty in this step is ignored in this analysis but can be significant in a real analysis. In addition, uncertainties on the background subtraction are not included in this analysis but can contribute. The largest experimental uncertainty in the top-quark pair cross section used to determine the pole mass comes from the lepton energy scale and resolution [3, 4].

Uncertainties from the factorization and renormalization scales are estimated by varying the scales of the input lepton energy distribution, at a top-quark mass of 173 GeV, by a factor of two up and down with respect to their reference values, while keeping the response sample with the nominal scales for the unfolding. An uncertainty of 0.3% is assessed from the difference in the results. It should be noted that these scale uncertainties are the dominant ones in the uncertainty estimate of Ref. [8]. The smallness of the scale uncertainties compared to those in Ref. [8] is an advantage of using the unfolding. An extensive validation of the unfolding using different MC generators would also be required to check for possible biases from the theoretical predictions; in particular, a validation with NLO generators would be very useful.

The weight function method is based on the assumption that the top quark is on-shell. Thus, the actual finite width of the top quark causes a deviation in the reconstructed mass. We estimate the size of this deviation by examining the invariant mass distribution of the top quark at parton level. With the cutoff for the Breit–Wigner distribution set to \(m_t\pm 50 \varGamma _t\) in the MadGraph configuration, the mean value of the mass distribution is shifted from the input mass value by 0.3 GeV for each mass point. Therefore, we expect a systematic shift of the order of 0.3 GeV in the reconstructed mass. In real experiments, the effect of the top-quark width can be estimated with simulation analyses using MC generators that take the top-quark finite-width effects into account more thoroughly (see, for example, Ref. [18]).
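
The origin of such a shift can be illustrated with a short numerical sketch that computes the mean of a Breit–Wigner shaped invariant-mass distribution truncated at \(m_t \pm 50\varGamma _t\). The relativistic line shape and the width value used below are assumptions for illustration only, not the exact MadGraph implementation, so the resulting number should not be identified with the 0.3 GeV quoted above.

```python
import numpy as np

def truncated_bw_mean_shift(mt=173.0, gamma_t=1.4, n_cut=50, n_points=200001):
    """Mean of a relativistic Breit-Wigner line shape in the invariant mass m,
    truncated at mt +/- n_cut*gamma_t, minus the input mass mt.  The asymmetry
    of the line shape in m makes the truncated mean differ from mt."""
    m = np.linspace(mt - n_cut * gamma_t, mt + n_cut * gamma_t, n_points)
    bw = 1.0 / ((m**2 - mt**2)**2 + (mt * gamma_t)**2)
    return np.trapz(m * bw, m) / np.trapz(bw, m) - mt
```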

Overall, the sum of the statistical and systematic uncertainties considered in this analysis is less than 1 GeV.

This study is performed at leading order, using a leading-order event generator and the leading-order theoretical prediction for the lepton energy distribution \(\mathcal {D}_0\). If the next-to-leading order (NLO) corrections are included in the on-shell scheme, the reconstructed mass is identified with the top-quark pole mass. More specifically, a weight function at NLO is calculated with \(\mathcal {D}^\mathrm{NLO}_0(E;m^\mathrm{pole})\), the NLO distribution of the lepton energy (in the top-quark rest frame) for a top-quark pole mass \(m^\mathrm{pole}\). One can then read off the NLO pole mass value from the weighted integral through \(I(m^\mathrm{pole}=m_t^{\mathrm{pole},\,rec})=0\). Note that for the purpose of extracting the NLO pole mass, only the corrections to the top-quark decay process are required, since the theoretical prediction used in the weight functions, \(\mathcal {D}_0^\mathrm{(N)LO}\), is the distribution in the top-quark rest frame. Thanks to the boost-invariant nature of the weight function method (mentioned in Sect. 1 and proven in Refs. [10, 11]), the method relies only on the top-quark decay process for the theoretical prediction. Note also that the theoretical corrections to the top-quark decay process are much smaller than those to the production process, so that the experimental consequences are insensitive to the corrections to the decay process. These considerations suggest that the application of this analysis to the NLO pole-mass extraction is straightforward and that the experimental uncertainties estimated in this leading-order study will not change significantly.

7 Conclusions

We estimated the sensitivity of a top-quark mass measurement with the weight function method using simulation samples. This method requires only the lepton energy distribution at parton level. Events with exactly one lepton, at least four jets and two b-tagged jets are selected. The lepton energy distribution at reconstruction level after the selection is unfolded back to the energy distribution at parton level. In the region below the energy threshold of 20 GeV, the response sample with a top-quark mass of 173 GeV is used. The top-quark mass reconstructed with the weight function method is consistent with the input mass within the statistical uncertainty of 0.8 GeV. We discussed and estimated some of the main systematic uncertainties expected in this method. Taking into account the statistical and systematic uncertainties, the estimated uncertainty of the reconstructed top-quark mass is of the order of 1 GeV. This uncertainty is comparable to the current uncertainty of 2 GeV in the measurement of the top-quark pole mass. Therefore, the weight function method could provide an alternative approach to measuring the top-quark mass without introducing the large systematic uncertainties that can arise from the jet energy measurement. This study shows that the weight function method can also provide an independent verification of the top-quark mass measurement and point toward possible improvements with Run 2 data.