1 Introduction

The exclusive simulation of processes that feature heavy unstable particles, such as the top, W, Z, and Higgs in the Standard Model, is complicated for a variety of reasons. Conceptually, the most straightforward approach is to choose a (set of) decay channel(s) for the unstable particle(s) and compute the process where the initial and final states only include light partons and the products of such decay channels. For example, in the case of \(t\bar{t}\) production, one may focus on a dilepton channel, and thus simulate (in hadronic collisions)Footnote 1

$$\begin{aligned} pp\;\longrightarrow \;b\bar{\ell }_i\nu _i\bar{b}\ell _j\bar{\nu }_j\,, \end{aligned}$$
(1)

rather than

$$\begin{aligned} pp\;\longrightarrow \;t\bar{t}\,. \end{aligned}$$
(2)

From the viewpoint of computing resources, the process of Eq. (1) is much more expensive than that of Eq. (2). This does not change when one takes into account the fact that Eq. (2) must be supplemented by the simulation of the leptonic decays of the tops, namely:

$$\begin{aligned} t\;\longrightarrow \;b\bar{\ell }_i\nu _i\,,\;\;\;\;\;\;\;\; \bar{t}\;\longrightarrow \;\bar{b}\ell _j\bar{\nu }_j\,, \end{aligned}$$
(3)

since the complexity of such a simulation is generally smaller than that of either Eq. (1) or (2).Footnote 2 Having said that, one remarks that the combination of Eqs. (2) and (3) is equivalent to Eq. (1) only in the limit of a vanishing top-quark width (the so-called narrow-width approximation); away from that limit, the agreement between the results of the two approaches might be degraded by the presence of non-resonant (i.e., non-\(t\bar{t}\) mediated) contributions. Even when only resonant contributions are present, it is generally the case that the incoherent simulation of the production process (Eq. (2)) and of the decay processes (Eq. (3)) constitutes a much poorer physics description than that emerging from Eq. (1), and this because of two mechanisms. Firstly, the kinematics of a decay product of a given unstable particle may be affected by that of partons/leptons in the event which are themselves not decay products of that unstable particle. These effects are called (production) spin correlations, and cannot be accounted for by the separate simulations of the individual decays, while they are correctly included by the matrix elements that underpin Eq. (1). Secondly, the kinematics of the decay products is affected by the emissions of the extra partons that appear in the hard process when perturbative corrections are considered, as in the case of NLO+PS simulations. By construction, a parton shower mimics the effects of such extra emissions, however with an accuracy which decreases with the hardness of the recoils they induce. Therefore, while perturbative corrections to Eq. (1) will automatically include hard-recoil effects, such effects are poorly described when the simulation of the decays of Eq. (3) is followed by an ordinary parton shower.

As far as the first mechanism is concerned, spin correlations are nowadays routinely taken into account in the context of production+decay simulations (Eqs. (2) and (3)) by means of the procedures of Ref. [2] (especially in NLO+PS approaches) or Ref. [3]; we shall not discuss them any further in this work. Conversely, hard recoils, absent computations of the complete processes such as that of Eq. (1), are approximated by Monte Carlo event generators (MC henceforth) by means of the so-called Matrix Element Corrections (MEC henceforth). The manner in which MEC are simulated depends on the specific MC one employs, but they all rely on (exact or approximated) tree-level matrix elements that describe the emission(s) of interest.

Fig. 1
figure 1

Sample graphs relevant to MEC in top production (left panel) and top decay (right panel). The top quark is depicted by means of a thicker fermion line. The hard production process \(\mathcal H\) is indicated by the blue circle

Before the widespread adoption of matching and merging techniques, MCs (and specifically Pythia) used to add MEC to both the production (Eq. (2)) and the decay (Eq. (3)) processes. These mechanisms will henceforth be referred to as production and decay MEC, respectively; sample Feynman graphs involved in their computations, in the specific case of top-quark production and decay, are depicted in Fig. 1. It must be stressed that MEC graphs for production and decay are not allowed to interfere; in other words, they are employed incoherently.

In the context of an NLO+PS simulation,Footnote 3 MEC can potentially spoil the accuracy of the computation by double counting (essentially, by generating again an emission already accounted for at the hard matrix element level – this is the case for the contribution shown on the left-hand panel of Fig. 1). In fact, this is not an issue for decay MEC, even in the presence of spin-correlation corrections, since the latter do not feature (at present) any extra emissions. There is therefore a good motivation to always include decay MEC in one’s simulation, since they may induce, and accurately account for, large shifts in observables sensitive to emissions off decay products.Footnote 4 In contrast, the case of production MEC must be considered more carefully. In MC@NLO-type [5] simulations, the MC counterterms that guarantee the absence of double counting are determined by considering the showers without any additional MEC; this implies that production MEC are certain to spoil the accuracy of an MC@NLO simulation. This is not a simple formal problem; since emissions in MC@NLO are not ordered in hardness, it is not unlikely that the effects of such double counting can be visible in phenomenologically-relevant quantities. On the other hand, in a POWHEG-type simulation the hardest emission is always generated before the MC shower is added; potential NLO effects driven by production MEC are thus restricted to the region where the POWHEG-generated emission did not occur, i.e. a small-\(p_{\textsc {T}}\) region where the cross section is Sudakov suppressed.

The bottom line is that both MC@NLO- and POWHEG-based simulations are compatible with, and phenomenologically benefit from, decay MEC. Conversely, while production-MEC effects are essentially negligible in POWHEG and can be either included or discarded, they are responsible for double counting in MC@NLO, and must therefore not be employed there. Thus, the procedure to adopt in MC@NLO and POWHEG is in principle clear. In practice, unfortunately, the MC@NLO-type MC interfaces to date have disabled all MEC instead of just those necessary to avoid double counting. We have already mentioned that the phenomenological implications of such choices should be restricted to a certain narrow class of observables. However, observed differences between MC@NLO- and POWHEG-based \(t\bar{t}\) predictions at the LHC are commonly attributed to the different underlying matching mechanisms, whereas it might be the case, and certainly for the observables mentioned above, that they stem from the absence as opposed to the presence, respectively, of decay-MEC effects.

The aim of this note is to document how the separation of production and decay MEC can be achievedFootnote 5 in Pythia8  [7], thus paving the way for MC@NLO-based simulations which are phenomenologically more complete than those carried out thus far. We shall present examples for \(t\bar{t}\) production, where Pythia8 standalone runs are compared with POWHEG-based simulations performed with POWHEG-BOX  [8], and MC@NLO-based simulations performed with MadGraph5-aMC@NLO [1] (shortened as MG5_aMC henceforth).

2 Matrix element corrections in the Pythia8 dipole shower

An introduction to the basics of MEC has been given in Sect. 1; here, we focus on their implementations within Pythia8, and explicitly consider the various settings that must be chosen in order to preserve the accuracy and precision of the underlying perturbative predictions. The Pythia8 standalone parton shower algorithm is based on a leading-logarithm DGLAP evolution, which is by default corrected by means of MEC to reproduce the appropriate tree-level matrix element behaviour, including in the cases where Pythia8 is interfaced with external calculations. Technically, this is done by populating all of the phase space by candidate emissions whose number is overestimated, and which are then possibly vetoed with a rejection rate given by the ratio of the matrix elements over their shower approximations. While the MEC provide continuity between the exact matrix-element based calculation and the parton shower in the hard emission limit, it also provides the soft emission limit necessary for particles with mass on the order of the soft emission scale. Various options are available that allow for all, some, or none of the MEC to be included in a given simulation. It is therefore important to understand which of these options need to be employed in the context of NLO+PS simulations, so as to avoid the double counting issue discussed in Sect. 1.

We start by pointing out that the separation between MEC in production and decay, exemplified by the two graphs in Fig. 1, is conceptually well defined in the zero-width limit (the width being that of the particle that decays), since in such a limit the interference between the two types of tree-level graphs vanishes; this is important, because MEC are effected at the level of amplitude squared. However, this is not the way in which MCs, and specifically Pythia8, operate, since the primary distinction there is that between emissions off initial-state and off final-state legs (conventionally referred to as ISR and FSR, respectively). Thus, production MEC are relevant to both ISR and FSR, whereas decay MEC are solely relevant to FSR. This implies that production MEC constitute a difficult problem, since in general at the level of amplitude squared one cannot unambiguously separate the effects due the graphs where the extra parton is emitted by an initial-state leg from those due to emissions by a final-state legFootnote 6.

The Pythia8 solution to the problem posed by production MEC is rather draconian. Namely, MEC are applied to ISR only for colour-singlet resonant production [9] (i.e. to \(q\bar{q}\rightarrow V\), with V a heavy EW boson, and to \(gg/\gamma \gamma \rightarrow H\)), since in such a case the ambiguity mentioned before is absent. As far as FSR is concerned, one applies MEC by considering, for each possible emitting colour dipole, a matrix element deemed equivalent to it, according to Table 1 of Ref. [10] (for example, for \(t\bar{t}\) production, the relevant matrix element is \(\gamma ^*\rightarrow t\bar{t}g\)).

According to what was said previously, the latter FSR mechanism applies to decay MEC as well. However, before resorting to that, priority is given to matrix elements which exactly match the decay one is consideringFootnote 7; if those exist, they are used for the MEC. In the Standard Model, these include the \(t\rightarrow Wbg\) and the \(V\rightarrow q\bar{q}\) matrix elements; in particular, this implies that a top that decays hadronically is never seen as such, but rather as a decay chain of a top decaying to a Wb pair, followed by a hadronic W decay.

Regardless of whether equivalent or exactly-matching matrix elements are employed for MEC, their effects can be iterated by Pythia8 for subsequent emissions as well (as opposed to limiting them to the first emission). This is done by applying MEC on the system obtained by discarding all radiated partons except that whose kinematical configuration one seeks to correct presently; this may require a momentum reshuffling (for example: if a \(t\rightarrow Wbgg\) configuration is obtained by means of two subsequent emissions off the b, such emissions can be both corrected, one after the other, by using the \(t\rightarrow Wbg\) matrix element; \(g\rightarrow gg\) branchings are not corrected). For a decay chain (such as \(t\rightarrow W(\rightarrow q\bar{q})b\)), the iteration of MEC is applied individually to each resonance that decays (here, the top and the W).

The various strategies described above are controlled in Pythia8 by means of the following parameters, separately for ISR (X=SpaceShower) and FSR (X=TimeShower), which are to be set equal to either on (which is the default for all of them) or off:

  • X::MEcorrections: If =on, use MEC whenever available, regardless of whether they entail the use of equivalent or exactly-matching matrix elements.

  • X::MEextended: If =on, use equivalent matrix elements for MEC; ignored if X::MEcorrections=off.

  • X::MEafterFirst: If =on, apply MEC also to all emissions after the first; ignored if X::MEcorrections=off.

Among other things, the above implies that if X::MEcorrections=on, then the MEC stemming from exactly-matching matrix elements are always applied, irrespective of the value of X::MEextended. Thus, presently one can generally switch production MEC off while keeping decay MEC on, but not the other way around. This is just as well, since as it was discussed in Sect. 1 MEC in production have to be employed only when neither NLO matching nor multi-jet merged simulations are available for the process of interest (which nowadays is a virtual impossibility), whereas MEC in decay are important for better phenomenological predictions in the context of relatively cheap (CPU-wise) generations. Having said that, this discussion clarifies that a more flexible approach to MEC in Pythia8 with respect to that given by the three (per shower type) parameters above would be desirable, namely one that allows the MEC to be applied or disabled until a certain condition is met.Footnote 8

In summary, the necessity of avoiding double counting in MC@NLO-type simulations led to the recommendation that all MEC be switched off, by means of:

figure a

An undesirable by-product of these settings is the absence of MEC for decays. However, we have seen that even with the public versions of Pythia8 (starting with Pythia8.219), this can be avoided by setting:

figure b

in the majority of the cases of interest including, but not limited to, the processes that feature a \(t\bar{t}\) final state; both values of TimeShower::MEafterFirst are acceptable.

We stress again that this is not a clean separation between production and decay MEC in the context of a generic process, and therefore the above setting recommendations must not be seen as universal. As a cautionary tale, consider Z-mediated dijet production, \(q\bar{q}\rightarrow Z^\star \rightarrow q\bar{q}\). This process is seen by Pythia8 as the production of a colour singlet followed by its decay, rather than as a unique \(2\rightarrow 2\) production process. As such, exactly-matching matrix elements exist for both production and decay (\(Z\leftrightarrow q\bar{q}g\)), which would not be the case had the \((2\rightarrow 2)\)-process picture been adopted. It follows that the value of MEextended is irrelevant in this case, and MEC are applied or not depending solely on the value of MEcorrections. Thus, if the latter is equal to on, one has (does not have) double counting if the process is generated by MG5_aMC as a \(2\rightarrow 2\) (\(2\rightarrow 1\) followed by tree-level Z decay). The bottom line is that, since avoiding any double counting is of paramount importance, in case of doubt the user is encouraged to contact the Pythia8 authors.

3 Monte Carlo event samples

We now proceed to evaluate the impact of MEC in resonance decays in realistic NLO+PS-accurate MC simulations of relevance for LHC phenomenology. For this, we consider different MC samples for the production of stable top-quark pairs in proton-proton collisions at \(\sqrt{s}=13\) TeV.

The first set of samples is generated using the MG5_aMC [1] program at the leading- and the next-to-leading-order (the latter with MC@NLO matching) in the strong coupling constant. The renormalization and factorization scales are set equal to the sum of the transverse energies of the final-state particles at the hard-process level, namely the top, the antitop, and when present a light quark or a gluon. The top quarks are subsequently decayed either leptonically or semileptonically (by including only electrons and muons), and preserving the tree-level spin correlations, with MadSpin [11]. A second sample is generated using standalone Pythia8.309 (thus, at the LO), with scales set equal to the geometric mean of the squared transverse masses of the two outgoing particles. The last sample is generated at the NLO with the HVQ [12] program in POWHEG-BOX-V2 [8, 13]. In POWHEG-BOX the scales are set equal to \(\sqrt{m_t^2+p_{\textsc {T}}^2}\), with \(m_t\) and \(p_{\textsc {T}}\) the top quark mass and transverse momentum, respectively, evaluated by employing the underlying Born configuration; the \(h_{\textrm{damp}}\) parameter is set equal to \(m_t\). The top decay is performed at the tree-level, and including the effect of spin correlations, using the approach of Ref. [2] as is implemented in the POWHEG-BOX internal routines. In all samples the NNPDF31_nnlo_as_0118 [14] parton distributions are used, and the value of the top quark mass is set equal to \(m_t=172.5\) GeV. Both the MG5_aMC LO sample and the Pythia8 standalone one are normalized, prior to any final-state cuts, to the NLO total cross section as is predicted by MG5_aMC.

All of the predictions are then interfaced to Pythia8.309 (employed with the Monash tune [15]) to include the effects of parton showering, multiple parton interactions and underlying event. For the standalone Pythia8 and for the POWHEG-BOX samples, MEC are included wherever available, for both production and decay. For each of the MG5_aMC LO and NLO event sets, we shower the events twice. Namely, we use the current recommended MEC settings, in which matrix-element corrections are completely switched off, as well as the newer settings presented in Sect. 2, in which MEC are included only in the decays. Finally, the showered particle-level events are passed through Rivet [16]-based analyses that implement the observables of interest.

4 Phenomenological results

The inclusion of MEC to resonance decays is in general expected to affect the kinematics of the reconstructed decay products. We shall show in this section a few illustrative observables that document the extent of these effects. In particular, the Pythia8 LO+PS and POWHEG-BOX +Pythia8 NLO+PS predictions that include MEC to production and decays are compared with MG5_aMC NLO+PS predictions where MEC are either switched off (the current default), or included only for the decay (the settings proposed here). Since the focus of this work is the impact of MEC, no effort is made to simulate hard radiation in production in a manner which is as mutually consistent as possible across the various programs we employ.Footnote 9 We note that the corresponding phase-space region is in any case better described by merged approaches, which also feature smaller systematics w.r.t. those of matched simulations. In order to facilitate the disentangling of the effects due to decay MEC from those due to hard radiation in the context of simulations stemming from exactly the same assumptions, in Appendix B we present comparisons between the LO- and NLO-accurate predictions of MG5_aMC, for the same observables as those considered in this section.

All of the figures of this section have the same layout, namely consist of a main frame and a lower inset. In the main frame the four predictions (and possibly the experimental data) are displayed in absolute value, as blue (MG5_aMC with decay MEC), green (MG5_aMC without MEC), orange (POWHEG-BOX +Pythia8), and red (Pythia8) histograms. The lower inset shows the ratios of the latter three predictions (and possibly of the data) over the one for MG5_aMC with decay MEC, using the same colour patterns as in the main frame. For ease of reading the plots, the labels indicate, where relevant, which kind of MEC are applied: MEC in decay (\(\textrm{MEC}^\textrm{dec}\)) for MG5_aMC, and MEC in both production and decay (\(\textrm{MEC}_\textrm{prod}^\textrm{dec}\)) for POWHEG-BOX and Pythia8 standalone. For the case of the semileptonic channel, we can expect differences in the reconstructed hadronic W-boson and top quark masses. To define the mass of the hadronic W-boson, we consider all pairs of non-b-tagged jets, and then choose as W-boson candidate the pair whose invariant mass is closest to the W pole mass. We then obtain two top quark candidate masses by summing the momentum of the reconstructed W-boson to the momentum of each of the two identified b-jets. We do not try to distinguish which of the two b-jets is the correct one to combine with the W-boson. Both candidate top momenta are constructed and we fill the histogram twice for each of the candidate masses, as dictated by the rivet routines we employ. These two distributions are shown in Fig. 2. The MEC in decay lead to a difference of about 10% (20%) in the reconstructed W-boson (top-quark) mass shape in the vicinity of the resonance peak. After inclusion of MEC, the three generators considered are in a better than 5% agreement with each other around the resonances peak, where the validity of the narrow-width approximation is good, and the impact of production hard radiation is negligible. As one can see from Fig. 6, hard radiation does have a visible (but still moderate) impact at large invariant masses, more pronounced in the case of the reconstructed top mass than for the W mass, which explains the small residual differences among our three benchmark results in that region.

Fig. 2
figure 2

The distribution of the reconstructed top quark (left panel) and W-boson (right panel) mass in semileptonic \(t\bar{t}\) events

Fig. 3
figure 3

The distribution in the invariant mass of the (lepton,b-jet) pair (left panel) and the leading b-jet transverse momentum (right panel) in dileptonic \(t\bar{t}\) events

Fig. 4
figure 4

The differential b-jet shape distribution (left panel) and the scaled energy fraction of the B-hadron (right panel) in semileptonic \(t\bar{t}\) events

Fig. 5
figure 5

The groomed subjet distance (left panel) and the jet width (right panel) distribution as reconstructed from charged particle tracks in semileptonic \(t\bar{t}\) events, compared to measured data by the CMS Collaboration [18]

Fig. 6
figure 6

The distribution of the reconstructed top quark (left panel) and W-boson (right panel) mass in semileptonic \(t\bar{t}\) events

Fig. 7
figure 7

The distribution in the invariant mass of the (lepton,b-jet) pair (left panel) and the leading b-jet transverse momentum (right panel) in dileptonic \(t\bar{t}\) events

Fig. 8
figure 8

The differential b-jet shape distribution (left panel) and the scaled energy fraction of the B-hadron (right panel) in semileptonic \(t\bar{t}\) events

Fig. 9
figure 9

The groomed subjet distance (left panel) and the jet width (right panel) distribution as reconstructed from charged particle tracks in semileptonic \(t\bar{t}\) events, compared to measured data by the CMS Collaboration [18]

The invariant mass of the (lepton, b-tagged jet) pair, \(m(l,b_\textrm{jet})\) and the leading b-tagged jet transverse momentum are shown in Fig. 3 for dileptonic \(t\bar{t}\) events. The \(m(l,b_\textrm{jet})\) distribution exhibits a kinematic edge around \(\sqrt{m_t^2-m_W^2}\sim 150\) GeV, sensitive to the value of the top mass. We observe that decay MEC shift the position of the peak, and after their inclusion in MG5_aMC this simulation and the POWHEG-BOX one (and to a good extent that of Pythia8 as well) agree fairly well with each other around and below this peak. At larger \(m(l,b_\textrm{jet})\) values the detailed description of the production mechanism, which is treated differently in the three codes, becomes important, and we observe relative differences of up to 30% in this region. Inspection of the left panel of Fig. 7 confirms that the vast majority of these discrepancies are indeed due to hard radiation, with a residual \(\mathcal{O}(5\%)\) effect stemming from decay MEC. The impact of decay MEC is also evident at small values of the b-jet transverse momentum (for \(p_{\textsc {T}}\) smaller than about 50 GeV); thus, in this region decay MEC improve the agreement between the three generators. However, at variance with \(m(l,b_\textrm{jet})\), for the present observable the separation between hard-radiation and decay-MEC effects is less clear-cut; this can be best understood by looking at the right panel of Fig. 7. Having said that, it is at large \(p_{\textsc {T}}\) that the impact of hard radiation is larger than that of decay MEC; hence, the inclusion of decay MEC in MG5_aMC does not help reduce significantly the \(\mathcal{O}(10\%)\) discrepancies between MG5_aMC, POWHEG-BOX, and Pythia8.

Since decay MEC modify the kinematic properties of the b-quark and of the B-hadron resulting from the hadronization of the former, they also have an impact on the radiation pattern inside its corresponding b-tagged jet. In order to illustrate this, in Fig. 4 we consider the distribution of the b-jet profile, and of the scaled B-hadron energy spectrum. The b-jet profile r, a.k.a. b-jet shape, is defined as the average fraction of the jet transverse energy that lies inside an inner cone of radius \(r<R\), with R the jet-radius parameter; jets are defined according to the anti-\(k_{T}\) algorithm [17], with parameters that depend on the specific analysis one considers. The scaled B-hadron energy, defined as the ratio of the B-hadron energy over the energy of the b-jet containing it (with energies defined in the lab frame), is a proxy for the longitudinal b-quark fragmentation function. The decay MEC are found to narrow the distribution of energy around the b-jet axis (i.e. there are comparatively more events at small r values), and to shift the peak of the scaled B-hadron energy spectrum to higher values. The effect of decay MEC is significant in the whole r range considered, while that of hard radiation is negligible (see the left panel of Fig. 8): thus, after their inclusion, the agreement among the MG5_aMC, POWHEG-BOX, and Pythia8 simulations is of \(\mathcal{O}(2\%)\). The radiation pattern is more complicated in the case of the scaled B-hadron energy (see the right panel of Fig. 8), with residual non-negligible hard-radiation effects at small values; still, in the bulk of the distribution the agreement among our three benchmark results is at the same level as for the jet energy profile.

We finally compare our predictions to experimental data of jet substructure in \(t\bar{t}\) semileptonic events that use charged-particle tracks from the CMS collaboration [18]. In Fig. 5 we consider two observables that are sensitive to the radiation pattern inside b-tagged jets: the groomed subjet distance, \(\Delta R_g\), and the jet width, \(\lambda _1^1\). As is clearly demonstrated by Fig. 9, these observables are essentially insensitive to hard radiation, and decay MEC dominate their behaviour. Given what has been observed so far, it is therefore not particularly surprising that the inclusion of decay MEC in MG5_aMC vastly improve its agreement with both POWHEG-BOX and Pythia8 results, whereas the previous MEC settings in MG5_aMC led to discrepancies of \(\mathcal{O}(\pm 15\%)\) with these two codes. The description of the data is also significantly improved.

5 Conclusions

We have illustrated how matrix element corrections to resonance decays as implemented in the Pythia8 parton shower can be consistently included in MC@NLO-type NLO+PS simulations in order to improve their phenomenological accuracy, without resulting in any double counting. We have discussed the impact of these corrections in a process of particular relevance for LHC physics, namely the production of top quark pairs. We have also verified that the same conclusions apply to the associated production of top-quark pairs and a Higgs boson – this must be expected, since this kind of effects are thought to largely factorize w.r.t. the hard process; we thus regard our findings as to be universally valid. We have found that decay MEC can have a relative impact on the shape of distributions of up to 20%. By comparing NLO+PS predictions stemming from MC@NLO- and POWHEG-type matching with standalone Pythia8 ones we find that a significant reduction in the spread among the results occurs if the MEC are included whenever possible in the various simulations. We thus encourage the usage of the MEC settings proposed in this paper for any practical applications, in order to improve the phenomenological accuracy of MC@NLO-type simulations, and to reduce systematic uncertainties.