1 Introduction

The top quark [1, 2] is the most massive known fundamental particle and its mass \(m_{\mathrm {t}}\) is an important parameter of the standard model (SM) of particle physics. Precise measurements of \(m_{\mathrm {t}} \) can be used to test the internal consistency of the SM [3,4,5] and to search for new physical phenomena. Since the top quark dominates the higher-order corrections to the Higgs boson mass, a precise \(m_{\mathrm {t}} \) determination is crucial to put constraints on the stability of the electroweak vacuum [6, 7].

At the CERN LHC, top quarks are predominantly produced in quark-antiquark pairs (\({\mathrm {t}\overline{\mathrm {t}}}\)) through the gluon fusion process, and decay almost exclusively to a bottom quark and a \(\mathrm {W}\) boson. Each \({\mathrm {t}\overline{\mathrm {t}}}\) event can be classified through the decays of the \(\mathrm {W}\) bosons. Events in the all-jets final state correspond to those that have both \(\mathrm {W}\) bosons decaying further into \(\mathrm {q}\mathrm {\overline{q}}'\) pairs, while events in the lepton+jets final state have one \(\mathrm {W}\) boson decaying to a charged lepton and a neutrino.

This paper presents a measurement of \(m_{\mathrm {t}} \) obtained in the \({\mathrm {t}\overline{\mathrm {t}}}\) all-jets decay channel using proton–proton (\(\mathrm {p}\mathrm {p}\)) collision data taken in 2016 by the CMS experiment at a center-of-mass energy of \(\sqrt{s}=13\,\text {TeV} \), corresponding to an integrated luminosity of \(35.9{\,\text {fb}^{-1}} \). The two bottom quarks and the four light quarks from the \({\mathrm {t}\overline{\mathrm {t}}}\) decay are all required to be physically separated in the laboratory frame of reference, and the nominal experimental signature is therefore characterized by six jets in the detector.

Although this final state provides the largest branching fraction of all \({\mathrm {t}\overline{\mathrm {t}}}\)  decays, this measurement of \(m_{\mathrm {t}} \) is particularly challenging, because of the large background from multijet production. A kinematic fit of the decay products to the \({\mathrm {t}\overline{\mathrm {t}}}\) hypothesis is therefore employed to separate signal from background events.

The value of \(m_{\mathrm {t}}\) is extracted using the ideogram method [8, 9], which is based on a likelihood function that depends either just on the mass parameter \(m_{\mathrm {t}}\), or on \(m_{\mathrm {t}}\) combined with an additional jet energy scale factor (JSF). In the second case, the invariant mass of the two jets associated with the \(\mathrm {W}\rightarrow \mathrm {q}\overline{\mathrm {q}}'\) decay serves as an observable to directly estimate the JSF.

Previous measurements in this decay channel have been performed by Tevatron and LHC experiments at lower center-of-mass energies [10,11,12,13,14]. The most precise one of these has been obtained by CMS at \(\sqrt{s}=8\,\text {TeV} \), resulting in a mass of \(m_{\mathrm {t}} =172.32\pm 0.25\,\text {(stat+JSF)} \pm 0.59\,\text {(syst)} \,\text {GeV} \). Combining the results of several measurements using different final states at \(\sqrt{s}=7\) and \(8\,\text {TeV} \), ATLAS and CMS reported values of \(m_{\mathrm {t}} =172.69\pm 0.48\,\text {GeV} \) [15] and \(172.44\pm 0.48\,\text {GeV} \) [12], respectively, while a value of \(m_{\mathrm {t}} =174.30\pm 0.65\,\text {GeV} \) was obtained by combining the Tevatron results [16].

The top quark mass has been measured for the first time with \(\mathrm {p}\mathrm {p}\) data at \(\sqrt{s} = 13\,\text {TeV} \), using the lepton+jets channel [17], yielding a value of \(m_{\mathrm {t}} = 172.25\pm 0.08\,\text {(stat+JSF)} \pm 0.62\,\text {(syst)} \,\text {GeV} \). A measurement using both \({\mathrm {t}\overline{\mathrm {t}}}\) all-jets and lepton+jets events is presented here. This is possible since the two measurements use the same mass extraction method, so a single likelihood can be used, rather than just combining the two results statistically. With this approach, no assumptions on correlations between different uncertainties of the measurements have to be made. This is the first report of a combined \(m_{\mathrm {t}}\) measurement in the lepton+jets and all-jets final states using a single likelihood function.

2 The CMS detector and event reconstruction

The central feature of the CMS apparatus is a superconducting solenoid of 6 \(\text { m}\) internal diameter, providing a magnetic field of 3.8 \(\text { T}\). Within the solenoid volume are a silicon pixel and strip tracker, a lead tungstate crystal electromagnetic calorimeter (ECAL), and a brass and scintillator hadron calorimeter (HCAL), each composed of a barrel and two endcap sections. Forward calorimeters extend the pseudorapidity (\(\eta \)) coverage provided by the barrel and endcap detectors. Muons are detected in gas-ionization chambers embedded in the steel flux-return yoke outside the solenoid.

Events of interest are selected using a two-tiered trigger system [18]. The first level, composed of custom hardware processors, uses information from the calorimeters and muon detectors to select events within a time interval of 4\(\,\mu \text {s}\) , resulting in a trigger rate of around 100 \(\text { kHz}\). The second level, known as the high-level trigger (HLT), consists of a farm of processors running a version of the full event reconstruction software optimized for fast processing, and reduces the event rate to around 1 \(\text { kHz}\) before data storage.

The particle-flow (PF) algorithm [19] aims to reconstruct and identify each individual particle in an event, with an optimized combination of information from the various elements of the CMS detector. The energy of photons is obtained from the ECAL measurement. The energy of electrons is determined from a combination of the electron momentum at the primary interaction vertex as determined by the tracker, the energy of the corresponding ECAL cluster, and the energy sum of all bremsstrahlung photons spatially compatible with originating from the electron track. The energy of muons is obtained from the curvature of the corresponding track. The energy of charged hadrons is determined from a combination of their momentum measured in the tracker and the matching ECAL and HCAL energy deposits, corrected for zero-suppression effects and for the response function of the calorimeters to hadronic showers. Finally, the energy of neutral hadrons is obtained from the corresponding corrected ECAL and HCAL energy.

The reconstructed vertex with the largest value of summed physics-object \(p_{\mathrm {T}} ^2\) is taken to be the primary proton–proton interaction vertex. The physics objects are the jets, clustered using the jet finding algorithm [20, 21] with the tracks assigned to the vertex as inputs, and the associated missing transverse momentum, taken as the negative vector sum of the transverse momentum \(p_{\mathrm {T}}\) of those jets.

Jets are clustered from PF objects using the anti-\(k_{\mathrm {T}}\) algorithm with a distance parameter of 0.4 [20,21,22]. Jet momentum is determined as the vectorial sum of all particle momenta in the jet, and is found from simulation to be within 5–10% of the true momentum over the whole \(p_{\mathrm {T}}\) spectrum and detector acceptance. Additional proton–proton interactions within the same or nearby bunch crossings (pileup) can contribute additional tracks and calorimetric energy depositions to the jet momentum. To mitigate this effect, tracks identified to be originating from pileup vertices are discarded, and an offset correction is applied to correct for remaining contributions from neutral hadrons. Jet energy corrections (JECs) are derived from simulation to bring the measured response of jets to that of particle level jets on average. In situ measurements of the momentum balance in dijet, photon+jet, \(\mathrm {Z}\) +jet, and multijet events are used to account for any residual differences in the jet energy scale in data and simulation [23]. Additional selection criteria are applied to each jet to remove jets dominated by anomalous contributions from various subdetector components or reconstruction failures [24].

A more detailed description of the CMS detector, together with a definition of the coordinate system used and the relevant kinematic variables, can be found in Ref. [25].

3 Event selection and simulation

Only jets with \(p_{\mathrm {T}} >30\,\text {GeV} \) reconstructed within \(|\eta | < 2.4\) are used in the analysis. For the identification of jets originating from the hadronization of \(\mathrm {b}\) quarks, the combined secondary vertex algorithm (CSVv2) \(\mathrm {b}\) tagger is used [26]. The chosen working point provides an identification efficiency of approximately \(50\%\) with a probability of misidentifying a \(\mathrm {u} \)/\(\mathrm {d} \)/\(\mathrm {s} \) quark jet or gluon jet as being a bottom jet of approximately \(0.1\%\), and a misidentification probability for \(\mathrm {c} \) quark jets of \(2\%\). The hadronic activity, used for the event selection, is defined as the scalar \(p_{\mathrm {T}}\) sum of all jets in the event,

$$\begin{aligned} H_{\mathrm {T}} \equiv \sum _{\text {jets}}p_{\mathrm {T}}. \end{aligned}$$

Data events are selected using an HLT that requires the presence of at least six PF jets with \(p_{\mathrm {T}} >40\,\text {GeV} \) and \(H_{\mathrm {T}} >450\,\text {GeV} \). Additionally, the HLT requires at least one jet to be \(\mathrm {b}\) tagged.

In the offline selection, an event must contain a well reconstructed vertex localized within 24 cm in the z direction and 2\(\,\text {cm}\) in the xy plane around the nominal interaction point. Selected events are required to contain at least six jets, at least two of which have to be tagged as \(\mathrm {b}\) jets. The sixth jet (\(\text {jet}_6\)), ordered in decreasing \(p_{\mathrm {T}}\), must fulfill \(p_{\mathrm {T}} (\text {jet}_6) > 40\,\text {GeV} \), and \(H_{\mathrm {T}} >450\,\text {GeV} \) is required. The two \(\mathrm {b}\) jets must be separated in \(\varDelta R = \sqrt{\smash [b]{\varDelta \phi ^2 + \varDelta \eta ^2}}\) by \(\varDelta R(\mathrm {b}\overline{\mathrm {b}}) > 2.0\).

The \({\mathrm {t}\overline{\mathrm {t}}}\) signal is simulated at an \(m_{\mathrm {t}}\) of \(172.5\,\text {GeV} \) using the powheg  v2 [27,28,29] matrix-element (ME) generator in next-to-leading order (NLO) perturbative quantum chromodynamics (QCD). For the parton distribution functions (PDFs), the NNPDF3.0 NLO set [30] is used with the strong coupling constant value of \(\alpha _S =0.118\). This is one of the first PDF sets to include the total \({\mathrm {t}\overline{\mathrm {t}}}\) cross section measurements from ATLAS and CMS at \(\sqrt{s}=7\) and \(8\,\text {TeV} \) as input. The parton shower (PS) and hadronization are handled by pythia  8.219 [31] using the CUETP8M2T4 tune [32, 33] and Geant4 is used to simulate the response of the CMS detector [34]. The simulated signal sample is normalized to the integrated luminosity of the data sample using a cross section of \(\sigma _{{\mathrm {t}\overline{\mathrm {t}}}} = 832\,\text { pb} \), calculated at next-to-next-to-leading order in QCD including resummation of next-to-next-to-leading logarithmic soft gluon terms [35]. In addition to the default sample, six other samples are used assuming top quark masses of 166.5, 169.5, 171.5, 173.5, 175.5, and 178.5\(\,\text {GeV}\), and using the corresponding cross sections.

For simulated events, a trigger emulation is used. The residual differences in the trigger efficiency between data and simulation are corrected by applying scale factors to the simulated events. These are obtained by measuring the trigger efficiency with respect to a reference \(H_{\mathrm {T}}\)  trigger for both data and simulation. The parameterized ratio as a function of \(p_{\mathrm {T}} (\text {jet}_6)\) and \(H_{\mathrm {T}}\) is used to reweight the simulated events. Additional \(\mathrm {p}\mathrm {p}\) collisions are included in the simulated events. These are weighted to match the pileup distribution in data. Finally, corrections to the jet energy scale and resolution, as well as to the \(\mathrm {b}\) tagging efficiency and misidentification rate, are applied to the simulated events.

4 Kinematic fit and background estimation

To improve the resolution of the top quark mass and decrease the background contribution, a kinematic fit is applied. It exploits the known topology of the signal events, i.e., pair production of a heavy particle and antiparticle, each decaying to \(\mathrm {W}\mathrm {b}\) with \(\mathrm {W}\rightarrow \mathrm {q}\overline{\mathrm {q}}'\). The three-momenta of the jets are fitted such that

$$\begin{aligned} {\chi ^2}&= \sum _{j\in \text {jets}}\left[ \frac{\left( {p_{\mathrm {T}}}_j^\text {reco} - {p_{\mathrm {T}}}_j^\text {fit}\right) ^2}{\sigma _{{p_{\mathrm {T}}}_j}^2} + \frac{\left( \eta _j^\text {reco} - \eta _j^\text {fit}\right) ^2}{\sigma _{\eta _j}^2}\right. \\&\qquad \left. +\frac{\left( \phi _j^\text {reco} - \phi _j^\text {fit}\right) ^2}{\sigma _{\phi _j}^2} \right] \end{aligned}$$

is minimized, where all jets assigned to the \({\mathrm {t}\overline{\mathrm {t}}}\) decay system are considered. The labels “reco” and “fit” denote the components of the originally reconstructed and the fitted jets, respectively, and the corresponding resolutions are labeled \(\sigma _X\). The minimization is performed, constraining the invariant mass of the jets assigned to each \(\mathrm {W}\) boson decay to \(m_{\mathrm {W}}=80.4\,\text {GeV} \). As an additional constraint, the two top quark candidates are required to have equal invariant masses.

All possible parton-jet assignments are tested using the leading six jets in the event, but only \(\mathrm {b}\)-tagged jets are used as \(\mathrm {b}\) candidates and equivalent choices (e.g., swapping the two jets originating from one \(\mathrm {W}\) boson) are not considered separately. Of the remaining 12 possibilities, only the assignment yielding the smallest \({\chi ^2} \) is used in the following. The \({\chi ^2} \) value can be used as a goodness-of-fit (gof) measure. For three degrees of freedom, it is translated into a p-value of

$$\begin{aligned} P_\text {gof} \equiv 1-{\text {erf}} \left( \sqrt{\frac{{\chi ^2}}{2}}\right) + \sqrt{\frac{2{\chi ^2}}{\pi }}\mathrm {e}^{-{\chi ^2}/2}. \end{aligned}$$

Events are required to fulfill \(P_\text {gof} >0.1\) for the best assignment.

In simulation, event generator information can be used to validate the assignment of the reconstructed jets to the top quark decay products. Events are classified accordingly as correct or wrong permutations. A parton-jet assignment is considered correct if the jets can be matched unambiguously to the right partons within \(\varDelta R < 0.3\). Wrong permutations can occur because of a wrong parton-jet assignment, yielding the smallest \({\chi ^2} \) or jets being out of acceptance, not being reconstructed, or failing the identification requirements.

The \(P_\text {gof}\) distribution is displayed in Fig. 1 (right). Requiring \(P_\text {gof} >0.1\) increases the fraction of correct permutations from 6 to \(51\%\). The fitted top quark mass (\(m_{\mathrm {t}}^\text {fit}\)) is calculated as the invariant mass of the corresponding jets returned by the kinematic fit. Compared to the mass calculated from the originally reconstructed jets, the mass resolution is improved from 14.0 to \(8.8\,\text {GeV} \) for the correct parton-jet assignments, where, in both cases, the same events passing the \(P_\text {gof} >0.1\) requirement are used.

The \(\varDelta R(\mathrm {b}\overline{\mathrm {b}}) > 2.0\) and \(P_\text {gof} >0.1\) requirements greatly reduce the background from QCD multijet production from approximately 80 to 25%, but a significant number of multijet events enters the signal selection owing to the large production cross section of that background contribution. These events can fulfill the goodness-of-fit criterion because of combinatorial chance, but not because of an underlying decay topology. Therefore, it is assumed that \(\mathrm {b}\) jets can be exchanged with light-flavor jets for the estimation of the background from data, because the probability for mimicking the \({\mathrm {t}\overline{\mathrm {t}}}\) topology is the same.

For the background estimation, the same selection as for the signal is applied, as described above, but instead of requiring two \(\mathrm {b}\)-tagged jets, events with exactly zero \(\mathrm {b}\)-tagged jets are used. For this veto, a very loose working point is used for the \(\mathrm {b}\) tagger, to suppress contamination from \({\mathrm {t}\overline{\mathrm {t}}}\)  events in this QCD-enriched sample. A prescaled trigger similar to the signal trigger is used for this selection, which does not require the presence of \(\mathrm {b}\) jets. The kinematic fit is applied as before, but here any of the six light-flavor jets can be assigned to the partons originating from the \(\mathrm {W}\) decays, as well as to the partons serving as \(\mathrm {b}\) quarks, leading to 90 possible permutations that have to be evaluated. This method allows one to determine the kinematic distributions of the background, but the normalization is unknown. In all plots, the background is normalized to the difference of the number of data events and the number of expected signal events. This data sample contains approximately five times the number of expected background events, so it provides good statistical precision.

Fig. 1
figure 1

The \(\varDelta R(\mathrm {b}\overline{\mathrm {b}})\) (left) and \(P_\text {gof}\) (right) distributions of data compared to simulated signal and the multijet background estimate. For each event, the parton-jet assignment yielding the smallest \(\chi ^2\) in the kinematic fit is used. The simulated signal events are classified as correct or wrong assignments and displayed separately, and the distributions are normalized to the integrated luminosity. For the background estimate, the total normalization is given by the difference of observed data events and expected signal events. The hashed bands represent the total uncertainty in the complete prediction. The lower panels show the ratio of data and prediction

Fig. 2
figure 2

The fitted top quark mass (left) and reconstructed \(\mathrm {W}\) boson mass (right) distributions of data compared to simulated signal and the multijet background estimate. The shown reconstructed \(\mathrm {W}\) boson mass is the average mass of the two W bosons in the event. For each event, the parton-jet assignment yielding the smallest \(\chi ^2\) in the kinematic fit is used. The simulated signal events are classified as correct or wrong assignments and displayed separately, and the distributions are normalized to the integrated luminosity. For the background estimate, the total normalization is given by the difference of observed data events and expected signal events. The hashed bands represent the total uncertainty in the prediction. The lower panels show the ratio of data and prediction

The final selected data set consists of 10,799 events with a signal purity of \(75\%\). Figure 1 shows the distributions of the separation of the two \(\mathrm {b}\) jets \(\varDelta R(\mathrm {b}\overline{\mathrm {b}})\) and the quantity \(P_\text {gof}\) in data, compared to the background estimate and \({\mathrm {t}\overline{\mathrm {t}}}\) simulation. For the \({\mathrm {t}\overline{\mathrm {t}}}\)  signal, correct and wrong parton-jet assignments are shown separately. The corresponding distributions of \(m_{\mathrm {t}}^\text {fit}\) and the reconstructed \(\mathrm {W}\) boson mass \(m_{\mathrm {W}}^\text {reco}\), calculated from the originally reconstructed jets, are shown in Fig. 2. These two quantities are used in the top quark mass extraction described in the following section.

5 Ideogram method

For the extraction of \(m_{\mathrm {t}}\), the ideogram method is used [8, 9]. Simultaneously, a JSF is determined that is used in addition to the standard CMS jet energy calibration [12] to reduce the corresponding systematic uncertainty. The distributions of \(m_{\mathrm {t}}^\text {fit}\) obtained from the kinematic fit and \(m_{\mathrm {W}}^\text {reco}\) are used in a combined fit. For \(m_{\mathrm {W}}^\text {reco}\), the average mass of the two \(\mathrm {W}\) bosons in an event is used.

The likelihood

$$\begin{aligned} \mathcal {L}\left( m_{\mathrm {t}},\text {JSF}\right)&= P\left( \text {sample}|m_{\mathrm {t}},\text {JSF}\right) \\&= \prod _{\text {events}}P\left( \text {event}|m_{\mathrm {t}},\text {JSF}\right) \\&= \prod _{\text {events}}P\left( m_{\mathrm {t}}^\text {fit},m_{\mathrm {W}}^\text {reco} |m_{\mathrm {t}},\text {JSF}\right) \end{aligned}$$

is maximized, yielding the best fit values for \(m_{\mathrm {t}}\) and JSF. A prior probability for the \(\text {JSF}\) can be incorporated by maximizing

$$\begin{aligned} P(\text {JSF}) P\left( \text {sample}|m_{\mathrm {t}},\text {JSF}\right) \end{aligned}$$

instead. Treating \(m_{\mathrm {t}}^\text {fit}\) and \(m_{\mathrm {W}}^\text {reco}\) as uncorrelated, as verified using simulated events, the probability \(P\left( m_{\mathrm {t}}^\text {fit},m_{\mathrm {W}}^\text {reco} |m_{\mathrm {t}},\text {JSF} \right) \) factorizes into

$$\begin{aligned}&P\left( m_{\mathrm {t}}^\text {fit},m_{\mathrm {W}}^\text {reco} |m_{\mathrm {t}},\text {JSF}\right) \\&\quad {}={}f_\text {sig} P\left( m_{\mathrm {t}}^\text {fit},m_{\mathrm {W}}^\text {reco} |m_{\mathrm {t}},\text {JSF}\right) \\&\qquad {}+{}\left( 1-f_\text {sig}\right) P_\text {bkg}\left( m_{\mathrm {t}}^\text {fit},m_{\mathrm {W}}^\text {reco} \right) \\&\quad {}={}f_\text {sig} \sum _{j}f_{j}P_{j}\left( m_{\mathrm {t}}^\text {fit} |m_{\mathrm {t}},\text {JSF}\right) P_{j}\left( m_{\mathrm {W}}^\text {reco} |m_{\mathrm {t}},\text {JSF}\right) \\&\qquad {}+{}\left( 1-f_\text {sig}\right) P_\text {bkg}\left( m_{\mathrm {t}}^\text {fit} \right) P_\text {bkg}\left( m_{\mathrm {W}}^\text {reco} \right) , \end{aligned}$$

where \(f_j\) with \(j\in \left\{ \text {correct}, \text {wrong}\right\} \) is the relative fraction of the different permutation cases and \(f_\text {sig}\) is the signal fraction.

The probability densities \(P_{j}\left( m_{\mathrm {t}}^\text {fit} |m_{\mathrm {t}},\text {JSF}\right) \) and \(P_{j}\left( m_{\mathrm {W}}^\text {reco} |m_{\mathrm {t}},\text {JSF}\right) \) for the signal are described by analytic functions parametrized in \(m_{\mathrm {t}}\) and JSF. For the determination of the parameters, a simultaneous fit to simulated samples for seven different generated top quark masses \(m_{\mathrm {t}}^\text {gen}\) and five different input JSF values is used. The background shape is described by a spline interpolation as a function of \(m_{\mathrm {t}}^\text {fit}\) and \(m_{\mathrm {W}}^\text {reco}\), but independent of the model parameters \(m_{\mathrm {t}}\) and JSF.

Three variations of a maximum likelihood fit are performed to extract the top quark mass. In the one-dimensional (1D) analysis, the JSF is fixed to unity (corresponding to a Dirac delta function for the prior probability), i.e., the standard CMS jet energy calibration. For the two-dimensional (2D) analysis, the JSF is a free parameter in the maximum likelihood fit, making possible a compensation of part of the systematic uncertainties. The signal fraction and correct permutation fraction are free parameters in both cases. The third (hybrid) method is a weighted combination of both approaches, corresponding to a measurement with a Gaussian constraint on the JSF around unity. In the limit of an infinitely narrow JSF constraint, the hybrid method is identical to the 1D method, while for an infinitely broad prior probability distribution, the 2D method is recovered. The width of the Gaussian constraint in the hybrid method is optimized to yield the smallest total uncertainty.

To calibrate the mass extraction method, pseudo-experiments are performed for the seven different generated values of \(m_{\mathrm {t}}^\text {gen}\) and three input JSF values (0.98, 1.00, and 1.02). The extracted \(m_{\mathrm {t}}\) and JSF values are compared to the input values and the residual slopes in \(m_{\mathrm {t}}^\text {gen}\) and JSF are used as calibration. The residual biases after the calibration are shown in Fig. 3 for pseudo-experiments with different JSF and \(m_{\mathrm {t}}^\text {gen}\) values. As expected, neither a significant residual offset nor a slope are observed after the calibration procedure.

Fig. 3
figure 3

Difference between extracted and generated top quark masses (upper panel) and JSFs (lower panel) for different input masses and JSFs after the calibration in the all-jets channel. The values are extracted using the 2D method

6 Systematic uncertainties

A summary of the systematic uncertainty sources is shown in Table 1. The corresponding values are obtained from pseudo-experiments, using Monte Carlo (MC) signal samples with variations of the individual systematic uncertainty sources. In the following, details for the determination of the most important uncertainties are given. Most systematic uncertainty sources are shifted by \(\pm 1\) standard deviation, and the absolute value of the largest resulting shifts in \(m_{\mathrm {t}} \) and JSF are quoted as systematic uncertainties for the measurement. For some uncertainties, different models are compared, and are described individually. The maximum of the statistical uncertainty on the observed shift and the shift itself is used as the systematic uncertainty.

Table 1 List of systematic uncertainties for the all-jets channel. The signs of the shifts (\(\delta x = x_\text {variation} - x_\text {nominal}\)) correspond to the \(+\,1\) standard deviation variation of the systematic uncertainty source. For linear sums of the uncertainty groups, the relative signs have been considered. Shifts determined using dedicated samples for the systematic variation are displayed with the corresponding statistical uncertainty
  • Method calibration The quadratic sum of the statistical uncertainty and the residual bias of the calibration curve (shown in Fig. 3) after the calibration is used as the systematic uncertainty.

  • JECs Jet energies are scaled up and down according to the \(p_{\mathrm {T}}\)- and \(\eta \)-dependent data/simulation uncertainties [23]. The correlation groups (called Intercalibration, MPFInSitu, and Uncorrelated) follow the recommendations documented in Ref. [36].

  • Jet energy resolution Since the jet energy resolution measured in data is worse than in simulation, the simulation is modified to correct for the difference [23]. The jet energy resolution in the simulation is varied up and down within the uncertainty.

  • \(\mathrm {b}\) tagging The \(p_{\mathrm {T}}\)-dependent uncertainty of the \(\mathrm {b}\) tagging efficiencies and misidentification rates of the CSVv2 \(\mathrm {b}\) tagger [26] are taken into account by reweighting the simulated events accordingly.

  • Pileup To estimate the uncertainty in the determination of the number of pileup events and the reweighting procedure, the inelastic proton–proton cross section [37] used in the determination is varied by \({\pm }\,4.6\%\).

  • Background An uncertainty in the background prediction is obtained by applying the method to simulation and comparing the obtained estimate to the direct simulation, i.e., generated QCD multijet events passing the signal selection. A linear fit to the ratio is consistent with a constant value of unity. The slope is varied up and down within its uncertainty and used to reweight the events used for the determination of the background probability density function.

  • Trigger To estimate the uncertainty in the trigger selection, the data/simulation scale factor described in Sect. 3 is omitted. Additionally, a base trigger requiring the presence of one muon is used to obtain the correction factor. The maximum of the observed shifts with respect to the nominal correction is quoted as an uncertainty.

  • JEC flavor The difference between Lund string fragmentation and cluster fragmentation is evaluated comparing pythia  6.422 [38] and herwig++  2.4 [39]. The jet energy response is compared separately for each jet flavor [23]. Uncertainties for jets from different quark flavors and gluons are added linearly, which takes into account possible differences between the measured JSF, which is mainly sensitive to light quarks and gluons, and the \(\mathrm {b}\) jet energy scale.

  • \(\mathrm {b}\) jet modeling The uncertainty associated with the fragmentation of \(\mathrm {b}\) quarks is split into three components. The Bowler–Lund fragmentation function is varied within its uncertainties as determined by the ALEPH and DELPHI Collaborations [40, 41]. As an alternative model of the fragmentation into \(\mathrm {b}\) hadrons, the Peterson fragmentation function is used and the difference obtained relative to the Bowler–Lund fragmentation function is assigned as an uncertainty. The third uncertainty source taken into account is the semileptonic \(\mathrm {b}\) hadron branching fraction, which is varied by \(-0.45\%\) and \(+\,0.77\%\), motivated by measurements of \({\mathrm {B}^0}\)/\({\mathrm {B}^{+}}\)decays and their corresponding uncertainties [42].

  • PDF The 100 PDF replicas of the NNPDF3.0 NLO (\(\alpha _S = 0.118)\) set are used to repeat the analysis [30]. The variance of the results is used to determine the PDF uncertainty. In addition, the \(\alpha _S \) value is changed to 0.117 and 0.119. The maximum of the PDF uncertainty and the \(\alpha _S \) variations is quoted as uncertainty.

  • Renormalization and factorization scales The renormalization and factorization scales for the ME calculation are varied. Both are multiplied independently from each other, and simultaneously by factors of 0.5 and 2 with respect to the default values. This is achieved by appropriately reweighting simulated events. The quoted uncertainty corresponds to the envelope of the resulting shifts.

  • ME/PS matching The matching of the powheg ME calculations to the pythia PS is varied by shifting the parameter \(h_{\text {damp}}=1.58^{+\,0.66}_{-0.59}\) [33] within the uncertainties. The jet response \(p_{\mathrm {T}} ^{\text {reco}}/p_{\mathrm {T}} ^{\text {gen}}\) as a function of \(p_{\mathrm {T}} ^{\text {gen}}\) is rescaled in the variation samples to reproduce the response observed in the default sample.

  • ISR PS scale For initial-state radiation (ISR), the PS scale is varied in pythia. The ISR PS scale is multiplied by factors of 2 and 0.5 in dedicated MC samples.

  • FSR PS scale The PS scale used for final-state radiation (FSR) is scaled up by \(\sqrt{2}\) and down by \(1/\sqrt{2}\) [32], affecting the fragmentation and hadronization, as well additional jet emission. The jet response is rescaled in the variation samples to reproduce the response observed in the default sample.

  • Top quark \(p_{\mathrm {T}}\) Recent calculations suggest that the top quark \(p_{\mathrm {T}}\) spectrum is strongly affected by next-to-next-to-leading-order effects [43]. The \(p_{\mathrm {T}}\) of the top quark in simulation is varied to match the distribution measured by CMS [44, 45] and its impact on the \(m_{\mathrm {t}}\) measurement is quoted as a systematic uncertainty.

  • Underlying event Measurements of the underlying event have been used to tune pythia parameters describing nonperturbative QCD effects [32, 33]. The parameters of the tune are varied within their uncertainties.

  • Early resonance decays Modeling of color reconnection (CR) introduces systematic uncertainties which are estimated by comparing different CR models and settings. In the default sample, the top quark decay products are not included in the CR process. This setting is compared to the case of including the decay products by enabling early resonance decays (ERD) in pythia 8.

  • CR modeling In addition to the default model used in pythia 8, two alternative CR models are used, namely a model with string formation beyond leading color (“QCD inspired”) [46] and a model allowing the gluons to be moved to another string (“gluon move”) [47]. Underlying event measurements are used to tune the parameters of all models [32, 33]. The largest shifts induced by the variations are assigned as the CR uncertainty.

    This approach, as well as the ERD variation, is new relative to the Run 1 results at \(\sqrt{s}=7\) and \(8\,\text {TeV} \), because these CR models have become only recently available in pythia 8. The new models were first used to evaluate the \(m_{\mathrm {t}}\) uncertainty due to CR in Ref. [17]. Like in this analysis, the same increase in systematic uncertainty with respect to the Run 1 result has been observed.

A summary of the systematic uncertainties described above is given in Table 1. In Ref. [17], an ME generator uncertainty has been considered: Instead of using powheg  v2 as ME generator, the MadGraph 5_amc@nlo 2.2.2 generator with the FxFx matching scheme is used [48, 49]. The difference between the results obtained with the two generators is \(\delta m_{\mathrm {t}} ^{\text {hyb}}=+\,0.31\pm 0.52\) for the hybrid method in the all-jets channel. However, this is not significant because of the insufficient statistical precision of the available MadGraph 5_amc@nlo sample. Since the radiation after the top quark decay is described by pythia, no significant impact of the ME generator choice is expected beyond the variation of the PS scales and matching. Therefore, no ME generator uncertainty is considered in the total uncertainty of the measurement, but the number is just quoted here as a cross-check.

7 Results

For the 2D fit using the 10 799 \( {\text{t}}{\bar{\text{t}}} \) all-jets candidate events, the extracted parameters are

$$\begin{aligned} m_{\mathrm {t}} ^{\text {2D}}&= 172.43\pm 0.22 {\text{ (stat+JSF)}} \pm 0.81 {\text { (syst) GeV and}} \\ {\text {JSF}}^{\text{2D}}&= 0.996\pm 0.002 {\text{ (stat)}} \pm 0.009 {\text {(syst)}}. \end{aligned}$$

The corresponding 1D and hybrid fits yield instead

$$\begin{aligned} m_{\mathrm {t}} ^{\text {1D}}&= 172.13\pm 0.17\text { (stat)} \pm 1.03 {\text{ (syst) GeV}},\\ m_{\mathrm {t}} ^{\text {hyb}}&= 172.34\pm 0.20 {\text { (stat+JSF)}} \pm 0.70\,{\text {(syst) GeV, and}}\\ {\text {JSF}}^{\text {hyb}}&= 0.997\pm 0.002 {\text { (stat)}} \pm 0.007\,{\text {(syst)}}. \end{aligned}$$

In all cases the fitted values for the fraction of correct assignments, as well as the background fraction, are in agreement with the values expected from simulation. The hybrid measurement of \(172.34\pm 0.20\,\text {(stat+JSF)} \pm 0.43\,\text {(CR+ERD)} \pm 0.55\,\text {(syst)} \,\text {GeV} \) is the main result of this analysis, since it is constructed to provide the smallest uncertainty. The color reconnection and early resonance decay parts are separated from the rest of the systematic uncertainties. Because of the larger data sample used in this analysis, the statistical uncertainty is reduced with respect to the result of \(m_{\mathrm {t}} =172.32\pm 0.25\,\text {(stat+JSF)} \pm 0.59\,\text {(syst)} \,\text {GeV} \) obtained at \(\sqrt{s}=8\,\text {TeV} \). The new result is in good agreement with the value measured at \(\sqrt{s}=8\,\text {TeV} \), where a leading-order \( {\text{t}}{\bar{\text{t}}} \) simulation has been employed to calibrate the measurement, whereas an NLO simulation has been used here. The systematic uncertainty is increased with respect to the Run 1 result, because a broader set of CR models has been compared, which have become available in pythia  8.

8 Combined measurement with the lepton+jets final state

This measurement is combined with the lepton+jets final state, where only electrons and muons are explicitly considered as leptons, while tau leptons enter the selection only when they decay leptonically. The corresponding analysis for the lepton+jets final state is described in Ref. [17]. All selection and analysis steps are kept unchanged. Since the same method for the mass extraction is used, a combination with the all-jets channel at the likelihood level is possible.

The total likelihood \(\mathcal {L}\) is constructed from the single-channel likelihoods \(\mathcal {L} _i\),

$$\begin{aligned} \mathcal {L} (m_{\mathrm {t}}, \text {JSF}) = \mathcal {L} _\text {A} (m_{\mathrm {t}}, \text {JSF}) \mathcal {L} _\text {L} (m_{\mathrm {t}}, \text {JSF}) , \end{aligned}$$

where the indices A and L indicate the all-jets and lepton+jets channel, respectively.

No extra calibration of the mass extraction is performed, but the single-channel calibrations are applied. Figure 4 shows the extracted values for the top quark mass and JSF for different input values as a validation. No residual dependence is observed.

Fig. 4
figure 4

Difference between extracted and generated top quark masses (upper panel) and JSFs (lower panel) for different input masses and JSFs after the single-channel calibrations for the combined measurement. The values are extracted using the 2D method.

The systematic uncertainties are evaluated as described above for the all-jets channel. For the pseudo-experiments, the systematic uncertainty sources are varied simultaneously for both channels. An exception are uncertainties that only affect a single channel. These uncertainty sources are only varied for the corresponding channel. For the all-jets channel, these are the background and trigger uncertainties. In addition, uncertainties specific to the lepton+jets channel are introduced, including the background and trigger uncertainties, as well as the uncertainties arising from the lepton isolation and identification criteria, and are described in Ref. [17]. The complete list of uncertainties is shown in Table 2. A comparison of the hybrid mass uncertainties can be found in Table 3 for the all-jets and lepton+jets channels as well as for the combination. In general, the uncertainties for the combination are smaller than those for the all-jets channel and are close to the lepton+jets uncertainties, as expected because the combination is dominated by this channel. The total uncertainty for the combination is slightly smaller than that for the lepton+jets channel.

Table 2 List of systematic uncertainties for the combined mass extraction. The signs of the shifts (\(\delta x = x_\text {variation} - x_\text {nominal}\)) correspond to the \(+1\) standard deviation variation of the systematic uncertainty source. For linear sums of the uncertainty groups, the relative signs have been considered. Shifts determined using dedicated samples for the systematic variation are displayed with the corresponding statistical uncertainty
Table 3 Comparison of the hybrid mass uncertainties for the all-jets and lepton+jets [17] channels, as well as the combination. The signs of the shifts follow the convention of Tables 1 and 2

The combined measurement yields

$$\begin{aligned} m_{\mathrm {t}}^{\text {2D}}&= 172.39\pm \,0.08\,{\text {(stat+JSF)}} \pm 0.71{\text { (syst) GeV and }} \\ {\text {JSF}}^{{\text{2D}}}&= 0.995\pm 0.001{\text{ (stat)}} \pm 0.010\,{\text{(syst)}} \end{aligned}$$

for the 2D method and

$$\begin{aligned} m_{\mathrm {t}}^{\text{1D}}&= 171.94\pm \,0.05\,{\text {(stat)}} \pm 1.07 {\text{ (syst) GeV}} \\ m_{\mathrm {t}} ^{\text{hyb}}&= 172.26\pm 0.07{\text { (stat+JSF)}} \pm 0.61 {\text{ (syst) GeV, and }} \\ {\text {JSF}}^{{\text {hyb}}}&= 0.996\pm 0.001{\text { (stat)}} \pm 0.007{\text { (syst)}} \end{aligned}$$

for the 1D and hybrid fits. The likelihood contours for \(-2\varDelta \ln \mathcal {L} =2.3\), corresponding to the \(68\%\) confidence level, in the \(m_{\mathrm {t}}\)-\(\text {JSF}\) plane are shown in Fig. 5 for the hybrid measurement results for the all-jets and lepton+jets channels, as well as for the combination. Additionally, the likelihood profiles are displayed as a function of \(m_{\mathrm {t}}\). Both channels are in statistical agreement with each other. The result of the combination is closer to the lepton+jets channel, as expected.

Fig. 5
figure 5

Likelihood contours for \(-2\varDelta \ln \mathcal {L} =2.3\), corresponding to the \(68\%\) confidence level, in the \(m_{\mathrm {t}}\)-\(\text {JSF}\) plane (upper panel) and the likelihood profiles for the top quark mass (lower panel), where the level corresponding to one standard deviation (\(\sigma \)) is indicated. The hybrid measurement results for the all-jets and lepton+jets channels, as well as for the combination, are shown

Just as for the single-channel results, the hybrid measurement provides the best precision and is considered the main result. This is the first top quark mass measurement using the \({\mathrm {t}\overline{\mathrm {t}}} \) lepton+jets and all-jets final states combined in a single likelihood function. The largest uncertainty contribution is related to the modeling of color reconnection, as it was observed for the all-jets channel and the lepton+jets channel before using the same CR models. Accordingly, the quoted systematic uncertainty is larger than those reported in the most precise combination reported by the CMS Collaboration [12], and comparable to the value reported by the ATLAS Collaboration [50].

9 Summary

A measurement of the top quark mass (\(m_{\mathrm {t}}\)) using the all-jets final state is presented. The analyzed data set was collected by the CMS experiment in proton–proton collisions at \(\sqrt{s}=13\,\,\text {TeV} \) that correspond to an integrated luminosity of \(35.9{\,\text {fb}^{-1}} \). The kinematic properties in each event are reconstructed using a constrained fit that assumes a \({\mathrm {t}\overline{\mathrm {t}}}\) hypothesis, which suppresses the dominant multijet background and improves the mass resolution.

The value of \(m_{\mathrm {t}}\) and an additional jet energy scale factor (\(\text {JSF}\)) are extracted using the ideogram method, which uses the likelihood of the values of \(m_{\mathrm {t}}\) and \(\text {JSF}\) in each event to determine these parameters. The resulting \(m_{\mathrm {t}}\) is measured to be \(172.34\pm 0.20\,\text {(stat+JSF)} \pm 0.70\,\text {(syst)} \,\text {GeV} \). This is in good agreement with previous CMS results obtained at \(\sqrt{s} = 7\), 8, and \(13\,\text {TeV} \). The modeling uncertainties are larger than in the previous measurements at lower center-of-mass energies because of the use of new alternative color reconnection models that were not previously available.

A combined measurement using also the lepton+jets final state results in \(m_{\mathrm {t}} = 172.26\pm 0.07\,\text {(stat+JSF)} \pm 0.61\,\text {(syst)} \,\text {GeV} \). This is the first combined \(m_{\mathrm {t}}\) result obtained in the all-jets and lepton+jets final states using a single likelihood function.