1 Introduction

One of the main objectives of Run 3 of the Large Hadron Collider (LHC) will be the further investigation of the Higgs sector. Most studies directly targeting the Higgs boson will focus on its onshell production and subsequent decay. Indeed, one might naively expect that the cross section to produce an offshell Higgs boson is negligible, due to the extremely narrow width of the Higgs boson of about 4 MeV in the Standard Model (SM). However, contrary to this expectation, it is known that approximately 10% of \(gg \rightarrow H^* \rightarrow VV\) events are produced with an invariant mass \(m_{VV}\) above the \(2 m_V\) production threshold [1]. The importance of the offshell region for Higgs phenomenology was further highlighted in Ref. [2], which showed that a comparison of onshell and offshell data can provide stringent constraints on the width of the Higgs boson (see also Refs. [3, 4]). While later work indicated that such constraints are not model-independent, they also revealed the potential of using offshell data to probe the couplings of the Higgs boson [5,6,7,8,9,10,11]. Offshell analyses have been performed by both ATLAS [12, 13] and CMS [14,15,16,17], and have succeeded in constraining the Higgs boson width to \(\mathcal {O}\)(10 MeV). This is several orders of magnitude smaller than a direct constraint, which is limited by the detector resolution. Nevertheless, offshell analyses are currently still limited by the available statistics. Further studies of offshell Higgs boson production will therefore be a key component of the investigations of the Higgs sector during both Run 3 and in the high luminosity phase of the LHC.

In this paper, we will focus on the production of an offshell Higgs boson through gluon fusion and its subsequent decay into a pair of electroweak gauge bosons. To this end we consider the signal Higgs production process \(gg \rightarrow H^* \rightarrow VV\) together with the corresponding continuum background process \(gg \rightarrow VV\) and their interference. We study the two diboson modes \(VV=\{ZZ,W^+W^-\}\) and we assume leptonic decays of the diboson pair. In the following, for brevity, we often denote the processes according to the intermediate diboson resonances (ZZ\(W^+W^-\)). However by this we always refer to the full four-lepton offshell processes, including the interference between Z and offshell photon production.

The signal process proceeds predominantly through a top-quark loop. For onshell Higgs production, the top-quark mass is the largest scale in the process and can be approximated as infinitely heavy, allowing this loop-induced process to be reduced to a tree-level one. Using this approximation, the next-to-next-to-next-to-leading order (N3LO) corrections to Higgs production have been computed [18,19,20]. However, this approximation is not valid for offshell Higgs production, since the virtuality of the Higgs may be comparable to (or even larger than) the top-quark mass. This means that a leading-order (LO) prediction for offshell Higgs production requires the computation of a one-loop amplitude with the full top mass dependence, while the next-to-leading order (NLO) correction requires a two-loop amplitude. By itself, this would not be so onerous, but there is a second reason why predictions for offshell Higgs production are more demanding than for onshell Higgs production. It is well-known that the interference effects between the signal \(gg \rightarrow H^* \rightarrow VV\) and the background process \(gg \rightarrow VV\) can be sizeable and thus must be taken into account [1]. Moreover, as we discussed above, the impact of top quarks in the loops cannot be neglected, and this means that in the computation of the background amplitudes \(gg \rightarrow VV\) the contribution from both massless and massive quarks circulating in the loops should be considered. For on-shell Higgs production NLO simulations matched to parton showers are indeed readily available [21, 22] without any approximations for the heavy quark loops.

Results for offshell Higgs production including the mass dependence of quarks in the loop and interference effects are known at LO [1, 3, 4, 23]. Results in the presence of an additional radiated jet have also been presented [24, 25]. At NLO, the two-loop \(gg \rightarrow VV\) amplitudes for massless quarks circulating in the loop have been known for several years [26, 27], and allow for offshell vector bosons. However, the corresponding amplitudes for massive quark loops have only recently become available [28,29,30]. We note that these amplitudes treat the vector bosons as being onshell and thus are only valid above the \(2m_V\) threshold. Although offshell amplitudes are required for a consistent NLO description in the entire phase-space, the onshell treatment of the vector bosons is not a serious deficiency for offshell Higgs studies. This means that a fully consistent NLO prediction with the exact dependence on the top-quark mass is in sight in this kinematic regime but still not available.

However, NLO calculations including interference effects in \(gg \rightarrow ZZ\) have been presented based on an expansion in \(1/m_t\) [31,32,33]. This expansion is not valid for high energies, but has been shown to work well below the top-pair production threshold \(2m_t\). In fact, Ref. [32] uses a conformal mapping and Padé approximants to extend the results beyond the top-pair threshold. More recently, it has been demonstrated that using an expansion in \(1/m_t\) together with a threshold expansion as inputs for Padé approximants can lead to improved estimates for both \(gg \rightarrow HH\) and \(gg \rightarrow VV\) amplitudes [34, 35]. In Ref. [36] the massive two-loop amplitude for \(gg \rightarrow ZZ\) has been computed in the high-energy expansion \(s,|t| \gg m_t^2\), which opens the door for a NLO description of this process in the phase space \(m_{ZZ} > 2 m_t\). However, even disregarding these methods, there is a significant region of the offshell phase space with \( m_{ZZ} < 2\,m_t\) in which the \(1/m_t\) expansion is expected to be reliable, and hence where a good approximation to the NLO corrections can be obtained. We base the Monte Carlo generator for \(gg \rightarrow ZZ\) presented here on such an approximation, following the calculation of Ref. [33]. Additionally, we employ the reweighting of mass effects in the one-loop amplitudes to estimate the unknown contribution affecting massive two-loop \(gg \rightarrow ZZ\) amplitudes, which allows us to also cover the region \(m_{ZZ} > 2\,m_t\). When considering the \(gg \rightarrow WW\) process, we only employ the one-loop reweighting procedure. We emphasize that, when the exact massive two-loop amplitudes become available, it will be immediate to extend the generator by replacing these approximate treatments with the exact ones.

Reliable NLO corrections to the continuum background \(gg \rightarrow VV\) alone can be obtained ignoring heavy quark contributions (or these can be incorporated via a reweighting of the massless two-loop amplitude with the LO mass dependence). They are available in the literature both for \(gg \rightarrow ZZ\) [37, 38] and \(gg \rightarrow W^+W^-\) [39, 40].Footnote 1 Formally these are of \(\mathcal {O}(\alpha _S^3)\) with respect to the LO \(pp\rightarrow VV\) process, i.e. they contribute beyond the order of the known NNLO corrections to the quark-induced channels [43,44,45,46,47,48] - yet they yield phenomenologically relevant contributions.

The NLO results of Refs. [32, 33, 37,38,39,40,41] are at fixed-order parton level, meaning that they do not account for radiation beyond one additional jet. This, together with the fact that unweighted events are not available, prevents their direct use in event simulations. In this paper, we report on NLO calculations for offshell Higgs production, including interference effects, matched to parton showers using the POWHEG method [49,50,51,52]. The implementation extends earlier work by two of us [53] that considered the background process \(gg \rightarrow ZZ \rightarrow 4\ell \) only. Furthermore, in contrast to Ref. [53], here we also include the contribution from qg- and \(q\bar{q}\)-initiated channels. This implementation allows the generation of unweighted events with additional radiation included through the parton shower, and should facilitate the use of the NLO calculations in experimental analyses. The corresponding POWHEG-BOX-RES generator gg4l will be made publicly available in due time.

The paper is organized as follows. In Sect. 2, we briefly discuss the technical details involved in the parton-level calculation as well as in the matching procedure. In Sect. 3, we summarize the numerical inputs that we use. In Sect. 4, we present fixed-order results validating our calculation and investigate the applied approximations. Finally in Sect. 5 we present numerical results for ZZ and WW production matched to parton showers. We conclude in Sect. 6.

2 Computational setup

In this section, we describe the matching of the NLO calculation of gluon-induced four-lepton production to parton showers through the POWHEG method implemented in POWHEG-BOX-RES . We first describe the structure of the fixed-order NLO computation and then discuss several details relevant for the matching to PYTHIA 8 .

2.1 Structure of the NLO computation

We begin by summarizing the salient features of the NLO calculation, and refer the reader to Ref. [33] for additional discussion. As mentioned in the previous section, we need to consider both Higgs-mediated amplitudes \(gg \rightarrow H^* \rightarrow VV\) as well as continuum production \(gg \rightarrow VV\) amplitudes. We therefore write the full amplitude for gluon-induced VV production as

$$\begin{aligned} A = A_\mathrm{signal}+ A_\mathrm{bkgd}\end{aligned}$$
(1)

where \(A_\mathrm{signal}\) refers to Higgs-mediated amplitudes, while \(A_\mathrm{bkgd}\) refers to amplitudes without any Higgs propagators. Squaring this equation gives

$$\begin{aligned} |A|^2 = |A_\mathrm{signal}|^2 + |A_\mathrm{bkgd}|^2 + 2\mathrm{Re}\left( A_\mathrm{signal}A_\mathrm{bkgd}^*\right) . \end{aligned}$$
(2)

Upon integrating over the phase space for the final state particles, the first two terms on the right-hand side give the signal and background results, respectively, while the third term gives the interference contribution

$$\begin{aligned} \mathrm {d}\sigma _\mathrm{full} = \mathrm {d}\sigma _\mathrm{signal} + \mathrm {d}\sigma _\mathrm{bkgd} + \mathrm {d}\sigma _\mathrm{intf}. \end{aligned}$$
(3)

In Sects. 4 and  5 we will present results for these contributions separately, as well as for their sum \(\mathrm {d} \sigma _\mathrm{full}.\)

As mentioned in the previous section, the LO amplitudes for both \(A_\mathrm{signal}\) and \(A_\mathrm{bkgd}\) are well known [1, 3, 4, 23, 54,55,56]. At NLO, we have to compute the real and virtual corrections to \(A_\mathrm{signal}\) and \(A_\mathrm{bkgd}\). The corrections to \(A_\mathrm{signal}\) have been known for some time [57,58,59,60]. On the other hand, the NLO corrections to the background amplitude \(A_\mathrm{bkgd}\) are more involved, and deserve a separate discussion.

We begin by examining the virtual corrections to the \(gg \rightarrow ZZ\) process. In this case, one can clearly separate massless loops of the first five flavours, and massive top-quark loops. The virtual (two-loop) amplitudes for the former are known [26, 27], and we construct these using the ggVVamp library [27]. Results for two-loop amplitudes with massive quarks and onshell Z bosons were presented very recently [28, 30]. However, here we follow the approach of Refs. [31, 33] and use an expansion in \(1/m_t\) for the massive amplitudes. This implies that our NLO results for the ZZ production process are only valid below the top pair production threshold \(m_{ZZ} < 2m_t\).

An alternative option is to approximate the mass effects of the two-loop amplitudes through a reweighting procedure, using the known one-loop amplitudes. However, contrary to the \(1/m_t\) expansion, which is systematically improvable and with a well-defined validity regime, there is no clear indication of how to estimate the accuracy of an approximation in which the virtual amplitude is reweighted using the leading-order mass dependence. Nonetheless, in Sect. 1 we explore this alternative procedure and compare kinematic distributions obtained using the \(1/m_t\) expansion to those obtained by reweighting the virtual background amplitude using the exact LO top-mass dependence. As shown there, the reweighting procedure is in excellent agreement with the \(1/m_t\) expansion below the \(m_{ZZ} < 2m_t\) threshold, and starts to display sizeable differences (up to several percent) above this threshold.

Finally, we need to include double-triangle amplitudes, where each triangle can have either massless or massive quarks in the loop. We employ analytic results for these amplitudes taken from Refs. [61, 62].

We now discuss the case of WW production. Since top and bottom quarks mix in the loop, there is no longer a clear division into massive and massless loops. For this reason, Ref. [33] only considered four massless quark flavours in the loop for WW, neglecting the third generation entirely. Here we take a slightly different approach. We compute the two-loop amplitudes \(A_\mathrm{bkgd}^\mathrm{2loop}\) using ggVVamp assuming massless quarks for four active flavours. We then reweight the two-loop helicity amplitude using ratios of massive and massless helicity amplitudes, computed at one-loop:

$$\begin{aligned} \mathcal {A}_\mathrm{bkgd}^{(2),WW}\!\! \approx \! \mathcal {A}_\mathrm{bkgd}^{(2),WW}\!(d,u,s,c) \times \! \frac{\mathcal {A}_\mathrm{bkgd}^{(1),WW}\!(d,u,s,c,b, \mathbf {t})}{\mathcal {A}_\mathrm{bkgd}^{(1),WW}\!(d,u,s,c)}, \end{aligned}$$
(4)

where \(\mathcal {A}_\mathrm{bkgd}^{(1),WW}(d,u,s,c,b, \mathbf {t})\) is the one-loop amplitude at fixed helicity with full mass dependence for the third-generation quarks and \(\mathcal {A}_\mathrm{bkgd}^{(1),WW}(d,u,s,c)\) the one-loop helicity amplitude with four active flavours.Footnote 2 We will comment on the accuracy of this approach in Sect. 4.2.

The real corrections to \(gg \rightarrow VV\) include both the purely gluonic channel \(gg \rightarrow VV + g\) as well as channels with initial state quarks \(qg \rightarrow VV+q\) and \(q\bar{q} \rightarrow VV+g \) (see Fig. 1) and their crossings. At \(\mathcal {O}(\alpha _s^3)\), the former can be unambiguously identified as corrections to the loop-induced process \(gg \rightarrow VV\). The qg and \(q\bar{q}\) channels are more intricate. These channels also appear in the \(\mathcal {O}(\alpha _s^3)\) corrections to the \(q\bar{q} \rightarrow VV\) process, and it is not possible to parametrically distinguish these corrections from the corrections to the loop-induced process that we are interested in. For this reason, these channels were not included in Ref. [33]. On the other hand, there is no obstacle to computing these corrections, as they form a gauge invariant subset and their only infrared singularities are removed through the collinear renormalization of the parton distribution functions. Indeed, results for these channels were included in Refs. [38, 40, 41]. In this paper, we choose to include these channels in our nominal predictions at NLO and also in the results after matching to the parton shower. At the same time we will also investigate the impact of these channels in Sect. 5.4, so that both their magnitude and their impact on the scale variation can be properly assessed. Note that we define these contributions to include any amplitudes with at least one vector boson attached to a closed fermion loop. In particular, amplitudes such as those represented in Fig. 1d (contributing to \(gg\rightarrow ZZ\)) are included.Footnote 3

Fig. 1
figure 1

Real corrections to gluon-induced VV production, with different partonic channels

We compute all the real correction loop-squared amplitudes using OpenLoops 2 [63, 64] including massless and massive quark contributions in the loop, allowing us to retain the full dependence on the top quark mass (and bottom quark mass where applicable). This is in contrast to the approach of Ref. [33] where the real amplitudes for \(gg \rightarrow ZZ+g\) involving a top-quark loop were computed using an expansion in \(1/m_t\). Finally, we note that our calculation includes single-resonant amplitudes in all partonic channels.

In order to ensure numerical stability across the whole phase-space, including in the IR regions close to the soft or collinear limits of the real radiation, we rely on the OpenLoops stability system, which automatically reevaluates all phase-space points with the two reduction methods implemented in Collier [65]. For unstable points matrix elements are set to zero. We verified that varying the corresponding threshold stability_kill2 by a factor of 10 around a central value of \(10^{-2}\) leaves all results unchanged (see Ref. [64] for documentation of this threshold parameter).

In order to optimize the treatment of all the colorless resonances, we take advantage of the POWHEG-BOX-RES  framework [52], which, despite being specifically designed to handle the subtractions when intermediate colored resonances are present, can at the same time improve the phase-space sampling of any resonance. This is achieved by first manually specifying the resonance histories.Footnote 4 The POWHEG-BOX-RES then decomposes the cross section into contributions associated to a well-defined resonance structure, which are enhanced on that particular cascade chain. Each contribution is separately integrated at this point with a dedicated resonance-aware phase space sampling which makes use of a resonance-aware subtraction procedure. The resonance-aware subtraction makes use of a mapping from a real to the underlying Born configuration that preserves the virtuality of intermediate resonances. Due to the absence of QCD divergences in the resonances, the resonance-aware subtraction is strictly speaking not necessary for the processes considered here. However, we choose to adopt it because its usage improves the statistical errors for observables directly probing the resonance structure. The last essential feature of the POWHEG-BOX-RES implementation is the ability to generate remnants and regular events even when the corresponding cross section is negative, which was not possible in previous versions of the POWHEG-BOX that were instead assuming them to be positive. Despite usually being squares of matrix elements, in this process remnants and regulars contributions might indeed assume negative values in the calculation of the interference terms. Technical details about necessary modifications in POWHEG-BOX-RES to deal with the processes at hand are given in Sect. 1.

2.2 Matching to PYTHIA 8 

We next discuss the matching of the NLO calculation of \(gg \rightarrow VV\) to the PYTHIA 8  parton shower in the framework of POWHEG-BOX-RES .

The resonance structure that we construct at the partonic level is further preserved by the parton shower by specifying the input resonance cascade chain at the Les Houches event level (LHE) and making sure that the shower does not distort it through recoil effects. This is achieved by using the PowhegHooks class in PYTHIA 8 . However, the PowhegHooks class needs to know the number of final state particles involved in the LO process once the resonance decays are stripped. Since in the POWHEG-BOX-RES  this number is not fixed, we modified the PowhegHooks class accordingly, following the recipe adopted in Ref. [66].

The PYTHIA 8  parton shower implements two recoil schemes for initial-state radiation (ISR): in both cases the recoil is always applied to all final-state particles to absorb the transverse momentum imbalance due to ISR off an initial-initial dipole. In the default scheme [67] the same is also done for initial-final dipoles. There is, however, also the option to use a fully local scheme [68], in which when an initial-state emission takes place from an initial-final dipole, the final-state spectator absorbs the transverse-momentum recoil and the other particles in the event are left unchanged.

The default recoil scheme is the recommended option to handle the s-channel production of colour singlets, while the alternative one was originally designed to handle deep inelastic scattering and vector boson fusion events. Since at LO the process considered in this paper describes the production of a colour singlet, we maintain as our baseline the default recoil scheme of PYTHIA 8 . However, since in principle we can also generate events with a hard final state jet, in our numerical results we compare the two recoil prescriptions for exclusive observables that might be sensitive to this choice.

In all our showered predictions we include underlying event simulation and hadronization effects. However, in order to simplify the identification of the leptons, we turn off QED radiation and the decay of unstable hadrons.

3 Numerical setup

In this section we present the numerical inputs for the results presented in the following sections.

Coupling and mass input parameters are fixed to the following values:

$$\begin{aligned} m_{\mathrm {Z}}&=91.1876~\mathrm {GeV}\,\!\!,&\Gamma _{\mathrm {Z}}&=2.4952~\mathrm {GeV}\,\!\!,\\ m_{\mathrm {W}}&=80.3980~\mathrm {GeV}\,\!\!,&\Gamma _{\mathrm {W}}&=2.1054~\mathrm {GeV}\,\!\!,\\ m_{\mathrm {H}}&=125.1~\mathrm {GeV}\,\!\!,&\Gamma _{\mathrm {H}}&=4.03\cdot 10^{-3}~\mathrm {GeV}\,,\\ m_{\mathrm {t}}&=173.2~\mathrm {GeV}\,\!\!,&G_{F}&=1.16639 \cdot 10^{-5}\,\mathrm{GeV}^{-2}, \end{aligned}$$

where \(G_{F}\) denotes the Fermi constant and

$$\begin{aligned} \alpha =\frac{\sqrt{2}}{\pi }m_W \sin ^2\theta _W, \end{aligned}$$

with the real-valued weak mixing angle

$$\begin{aligned} \sin ^2\theta _W=1-\frac{m_W^2}{m_Z^2}. \end{aligned}$$

In general, the gg4l generator allows for finite bottom-quark masses. In this work, we mostly use the \(N_F=5\) flavour scheme and treat the bottom quark as massless \(m_{\mathrm {b}}=0~\mathrm {GeV}\,\). Only in Sect. 4.1, where we validate against the results of Ref. [33], do we use a non-zero bottom-quark mass, and there we choose \(m_{\mathrm {b}}=4.5~\mathrm {GeV}\,\).

We use the partonic luminosities and strong coupling from the NNPDF30_lo_as_0130 and the NNPDF30_nlo_as_0118 sets [69] for the validation against the results of Ref. [33] that we present in Sect. 4.1. For all other results, we use the NNPDF31_nlo_as_0118 set [70].

We consider center-of-mass energies of 13 \(\mathrm {TeV}\) , and set as renormalization and factorization scales for all modes

$$\begin{aligned} \mu =\mu _{R}=\mu _{F}&=\frac{m_{{\mathrm 4}\ell }\,}{2}, \end{aligned}$$
(5)

where

$$\begin{aligned} m_{{\mathrm 4}\ell }\,^{2}=\left( \sum _{i \in \{\ell , \nu \}} p_{i}\right) ^{2}. \end{aligned}$$
(6)

We obtain scale uncertainty bands by independently varying the renormalization and factorization scales by a factor of two and omitting antipodal variations.

At the generator level the following kinematic cuts are applied in the ZZ channel,

$$\begin{aligned} 5~\mathrm {GeV}\,&<m_{\ell \ell }\,<180~\mathrm {GeV}\,, \end{aligned}$$
(7)
$$\begin{aligned} 70~\mathrm {GeV}\,&<m_{{\mathrm 4}\ell }\,<340~\mathrm {GeV}\,. \end{aligned}$$
(8)

We need to impose such an upper cut on \(m_{{\mathrm 4}\ell }\,\) because, as discussed in the previous sections, the virtual corrections are computed using a \(1/m_t\) expansion which is no longer valid for large values of \(m_{{\mathrm 4}\ell }\,\) [33]. For WW production we only require

$$\begin{aligned} m_{{\mathrm 2}\ell {\mathrm 2}\nu }\,>1~\mathrm {GeV}\,, \end{aligned}$$
(9)

to ensure the renormalization and factorization scales remain inside the perturbative domain. We do not impose any transverse momentum or rapidity requirements on the final-state leptons.

In order to avoid numerical instabilities of the loop-induced amplitudes we need to impose additional mild technical cuts at the generation level. For the Born kinematics, we discard configurations where the transverse momentum of the vector boson is smaller than 100 MeV. For the real corrections, we neglect configurations where the transverse momentum of the radiated parton is smaller than 100 MeV, as also done in Ref. [53]. Indeed this region only gives rise to power-suppressed contributions that do not significantly change the total cross-section. We verified that our results are independent of these technical cuts varying them by a factor of 5 from 0.1 GeV to 0.5 GeV.

Finally, we reconstruct jets with the anti-\(k_{\scriptscriptstyle \mathrm T}\) algorithm [71] as implemented in the Fastjet  package [72, 73], with jet radius \(R=0.4\) and \(p_{T,j} > 20\) GeV.

4 Fixed-order NLO results

In the following we present selected fixed-order results that we used to validate our implementation and to investigate the accuracy of the applied approximation for the treatment of mass effects in the virtual corrections.

4.1 Validation

As a validation of our implementation, we compare the fixed-order LO and NLO cross sections for \(gg \rightarrow ZZ\rightarrow e^{+} e^{-} \mu ^{+} \mu ^{-}\) and \(gg \rightarrow W^+W^-\rightarrow e^{+} \nu _e \mu ^{-} \bar{\nu }_{\mu }\) against the results of Ref. [33] in Table 1, with selection cuts as specified in Ref. [33]. The signal, background and interference contributions are shown separately. Following the approach of Ref. [33], a finite bottom-quark mass \(m_b\) is used everywhere in the signal (\(|A_\mathrm{signal}|^2\)). For the ZZ channel, the background (\(|A_\mathrm{bkgd}|^2\)) is computed with \(m_b=0\) and the interference (\(2 \mathrm {Re}(A_\mathrm{signal}A_\mathrm{bkgd}^*)\)) is computed with a finite \(m_b\) besides from the virtual contribution which is computed analytically in a mixed-mass scheme, where the bottom mass is neglected in the background amplitude \(A_\mathrm{bkgd}\).Footnote 5 For the WW channel, \(A_\mathrm{bkgd}\) is evaluated with \(n_f=4\), for both the background and the interference channels.

Table 1 Comparison of LO and NLO cross sections for the signal, background, and interference contributions to \(gg \rightarrow ZZ\rightarrow e^{+} e^{-} \mu ^{+} \mu ^{-}\) (top) and \(gg \rightarrow W^+W^-\rightarrow e^{+} \nu _e \mu ^{-} \bar{\nu }_{\mu }\) (bottom) with those of Ref. [33]. The qg- or \(q\bar{q}\)-induced channels are not considered

For this validation we do not consider quark-initiated channels in the real contribution, consistent with Ref. [33]. Within numerical accuracy we find convincing agreement between the two calculations at LO and NLO.

4.2 Mass effects

As discussed in Sect. 2.1, the massive contributions to the two-loop virtual amplitudes are incorporated via approximations in our calculation. All other ingredients including the real amplitudes retain full mass dependence. For the ZZ process the approximation for the massive two-loop amplitudes is based on an expansion in \(1/m_t\). As discussed in detail in Ref. [33] the resulting accuracy is estimated to be at the percent level for \(m_{ZZ} < 2m_t\) and quickly deteriorates beyond this, as shown in Sect. 1. For the WW process we use a reweighting procedure to approximate the massive two-loop amplitudes. As previously mentioned, it is difficult to assess the accuracy of such a procedure. Nevertheless, we will attempt to gauge its impact by comparing against results obtained using only two massless generations for two-loop \(A_\mathrm{bkgd}\) amplitudes, while all other contributions are computed using full mass dependence as usual. We plot these results for the transverse mass \(m_{T}\) of the four lepton system in \(W^+W^-\) production in Fig. 2. This observable is defined as

$$\begin{aligned} m_{T} = \sqrt{2\, E_\mathrm{T,miss}\, p_\mathrm{T,\ell ^+\ell ^-} \,(1-\cos (\phi _\mathrm{miss, \ell ^+\ell ^-})}, \end{aligned}$$
(10)

where the missing transverse energy \(E_\mathrm{T,miss}\) is given by the neutrino momenta at truth level and \(\phi _\mathrm{miss, \ell ^+\ell ^-}\) is the angle between the sum of the neutrino momenta and the sum of the lepton momenta. For the background contribution the effect of the reweighting is at the few percent level for the bulk of the cross section and increases up to about \(15\%\)-\(20\%\) at large transverse masses. For the interference the impact is at the \(15\%\) level inclusively and mildly increases in the tail of the transverse mass distribution. The sum of all contributions – which also includes the signal where no approximations are needed – receives inclusive variations due to the reweighting procedure of \(0.7\%\). In the tail this increases to \(10\%\)-\(15\%\) but also the associate statistical error grows significantly, to the point that it is difficult to reach a firm conclusion.

5 NLO results matched to parton showers

In this section we present our numerical results matched to the PYTHIA 8  parton shower. We consider the different-flavour decay modes \(gg \rightarrow e^{+} e^{-} \mu ^{+} \mu ^{-}\) and \(gg \rightarrow e^+ \nu _e \mu ^{-} \bar{\nu }_{\mu }\) and for simplicity denote them ZZ and \(W^+W^-\) production respectively. The same-flavour leptonic decay modes will also be made available in the gg4l generator, but are not the focus of this study.

Fig. 2
figure 2

Differential distribution in the transverse mass \(m_{T}\) of the four lepton system in \(gg \rightarrow e^{+} \nu _e \mu ^{-} \bar{\nu }_{\mu } \) at fixed-order NLO. We show results with and without reweighting of the heavy-quark mass effects in the virtual amplitude using dashed and dashed-dotted curves, respectively. The comparison is shown for the full (red), the background (blue) and the interference (orange) contributions. The lower panels show the bin-by-bin ratios of the results without reweighting to those with reweighting. For the nominal prediction we use a symlog scale with a linear threshold=\(10^{-7}\)

Fig. 3
figure 3

Differential distribution in invariant mass \(m_{4\ell }\) of the four-lepton system in \(gg \rightarrow e^{+} e^{-} \mu ^{+} \mu ^{-}\) at NLO matched to PYTHIA 8 . The upper panel shows nominal predictions at fixed-order NLO (dashed) for the background (blue), the signal (green) and the interference (orange) separately and their sum (red) together with NLO+PS predictions (solid). For the nominal prediction we use a symlog scale with a linear threshold=\(10^{-6}\). The first ratio plot shows the relative yield of the different contributions with respect to the full, both at the fixed-order NLO level and also after parton shower. The lower four ratio plots show the LHE level (dotted) and fully showered corrections with respect to fixed-order NLO for the sum of all contributions (second ratio plot), the background only (third ratio plot), the signal only (fourth ratio plot), and for the interference only (fifth ratio plot). The band associated to NLO+PS predictions indicates the 7-point scale variation uncertainty

5.1 ZZ production

In Figs. 3, 4, 5 and 6 we present numerical results at NLO, LHE level and NLO matched to PYTHIA 8  (NLO+PS) for gluon-induced ZZ production, showing the full result as well as the signal, background and interference contributions separately.

In Fig. 3 the invariant mass of the four-lepton system is shown. The Higgs-mediated signal shows the resonance peak at the Higgs boson mass together with the well-known significant offshell tail. The background clearly exhibits a single-resonant peak at \(m_{4\ell }=m_Z\)Footnote 6 and increases significantly for \(m_{4\ell }> 2m_Z\), where both intermediate Z bosons can become onshell. In this region the interference also starts to become relevant. As a consequence of the very inclusive phase-space cuts employed in our numerical analysis, both the signal and the interference reach about 10% of the full result at large \(m_{4\ell } \approx 2m_t\), with the interference being destructive. It is well known that the interference provides an even larger destructive contribution at higher values of \(m_{4\ell }\), which are however beyond the validity of the 1/mt expansion used in our calculation. The \(m_{4\ell }\) observable is inclusive in QCD radiation and consequently parton-shower corrections are marginal for all contributions (individually and in their sum). In fact, for all production modes the fixed-order NLO prediction agrees at the percent level with both the LHE level prediction and the fully showered prediction. Scale uncertainties at the fully showered level are approximately 20%. At small invariant masses (\(m_{4\ell }<150\) GeV) the interference becomes very small and consequently Monte Carlo statistics deteriorate quickly in this regime.

Fig. 4
figure 4

Differential distribution in \(H_\mathrm{T}\) in \(gg \rightarrow e^{+} e^{-} \mu ^{+} \mu ^{-}\) at NLO matched to PYTHIA 8 . Predictions, colour coding and bands as in Fig. 3

Fig. 5
figure 5

Differential distribution in the transverse momentum of the four lepton system \(p_\mathrm{T,4\ell }\) in \(gg \rightarrow e^{+} e^{-} \mu ^{+} \mu ^{-}\) matched to PYTHIA 8 . Predictions, colour coding and bands as in Fig. 3

Fig. 6
figure 6

Differential distribution in the transverse momentum of the hardest jet \(p_\mathrm{T,j_1}\) in \(gg \rightarrow e^{+} e^{-} \mu ^{+} \mu ^{-}\) at NLO matched to PYTHIA 8 . Predictions, colour coding and bands as in Fig. 3

Figure 4 shows the distribution in

$$\begin{aligned} H_T=\sum \limits _{i \in \{ \ell ,\nu ,j\}} p_\mathrm{T,i}, \end{aligned}$$
(11)

where the sum over the transverse momenta considers all leptons and reconstructed jets. In this distribution the signal peaks at \(H_T=m_H\), while the background peaks at \(H_T=2 m_Z\). For small \(H_T\) parton-shower corrections are mostly driven by the first radiation already present at the LHE level. For the background contribution, these corrections are small, but for the signal contribution they lead to a negative correction of about 50%. A possible explanation is that the signal distribution is strongly peaked around \(m_H\) and therefore very sensitive to additional radiation that moves events away from the peak. For large \(H_T\), the parton showers provide substantial positive corrections up to a factor of 2, while the scale uncertainties can be as large as 50%. This effect can be understood as follows. The upper cut on the invariant mass of the four leptons Eq. (8) also restricts \(H_T < 340\) GeV at LO. However, the phase space for \(H_T > 340\) GeV can be filled via additional QCD radiation. This leads to significant NLO corrections (not shown here), as well as to sizable parton-shower corrections and LO-like scale uncertainties.

Figures 5 and 6 display the transverse momentum of the four-lepton system and of the hardest jet respectively. For the latter no lower cut on the jet transverse-momentum is applied. The two distributions are identical at fixed-order (they only differ in the first bin which for \(p_\mathrm{T,4\ell }\) includes the Born and virtual contributions proportional to \(\delta (p_\mathrm{T,4\ell })\)). The fully showered predictions include a Sudakov suppression which can clearly be seen at the lower end of both the \(p_\mathrm{T,4\ell }\) and the \(p_\mathrm{T,j_1}\) distributions. We also observe that the parton shower changes the sign of the lowest bin in the \(p_\mathrm{T,4\ell }\) spectrum. This can be understood as follows: the virtual contribution, proportional to \(\delta (p_\mathrm{T,4\ell })\), always comes with an opposite sign of the corresponding real contribution.After the shower (and even after the first POWHEG emission) the virtual contribution gets spread out at finite values of \(p_\mathrm{T,4\ell }\). This results in a change of sign in the first bin.

Turning now to the opposite end of the spectrum, the \(p_\mathrm{T,4\ell }\) distribution corresponds to the entire QCD recoil of the four-lepton system and for all contributions receives large parton shower corrections in the tail, while LHE level corrections are largest at \(p_\mathrm{T,4\ell }\approx 40-50\,\mathrm {GeV}\,\) and become small in the tail, where the Sudakov suppression fades away. Similar behaviour was observed for the background contribution studied in Ref. [53]. As already discussed in Ref. [53] the large parton-shower corrections can be explained by the fact that, by adding further radiation, the shower increases the transverse momentum of the colour-neutral four lepton system, which has to recoil against the sum of all emitted particles. The observed large effects are strongly dependent on the employed recoil scheme, as will be discussed in Sect. 5.3. On the contrary, in the tail of \(p_\mathrm{T,j_1}\) no such enhancement of the corrections due to the parton shower is observed. In fact, by construction the shower emissions are subdominant with respect to the leading jet and on average are separated enough not to be clustered with it. With respect to the LHE level we observe small and negative parton-shower corrections, being compatible within scale uncertainties.

5.2 \(W^+W^-\) production

Fig. 7
figure 7

Differential distribution in transverse mass \(m_{T}\) of the four-lepton system in \(gg \rightarrow e^{+} \nu _e \mu ^{-} \bar{\nu }_{\mu } \) at NLO matched to PYTHIA 8 . Predictions, colour coding and bands as in Fig. 3

Fig. 8
figure 8

Differential distribution in \(H_\mathrm{T}\) in \(gg \rightarrow e^{+} \nu _e \mu ^{-} \bar{\nu }_{\mu } \) at NLO matched to PYTHIA 8 . Predictions, colour coding and bands as in Fig. 3

Fig. 9
figure 9

Differential distribution in the transverse momentum of the four-lepton system \(p_\mathrm{T,2\ell 2\nu }\) in \(gg \rightarrow e^{+} \nu _e \mu ^{-} \bar{\nu }_{\mu } \) at NLO matched to PYTHIA 8 . Predictions, colour coding and bands as in Fig. 3

Fig. 10
figure 10

Differential distribution in the transverse momentum of the hardest jet \(p_\mathrm{T,j_1}\) in \(gg \rightarrow e^{+} \nu _e \mu ^{-} \bar{\nu }_{\mu } \) at NLO matched to PYTHIA 8 . Predictions, colour coding and bands as in Fig. 3

In Figs. 7, 8, 9 and 10 we present numerical results at NLO, LHE level and NLO matched to PYTHIA 8  for gluon-induced \(W^+W^-\) production, showing again the signal, background, interference, and full results.

In contrast to the corresponding results for ZZ production, here we consider the distribution in the transverse mass \(m_{T}\) of the four-lepton system, as defined in Eq. (10), instead of the invariant mass of the colour-singlet system. This is shown in Fig. 7. As for the invariant mass in ZZ production, the impact of the parton-shower corrections on the transverse mass in \(W^+W^-\) production is marginal, as expected from its inclusive (with respect to QCD radiation) nature. It is noteworthy that the interference becomes very large at high \(m_T\) and eventually contributes beyond \(-50\%\) for \(m_{T}>300\,\mathrm {GeV}\,\). However, also for the interference alone parton-shower corrections are marginal for the entire \(m_T\) range considered.

A similarly strong enhancement of the interference can also be observed at large \(H_\mathrm{T}\), as shown in Fig. 8. In the tail of this observable, parton-shower corrections are again sizable. However in contrast to ZZ production, here no upper boundary on the four-lepton invariant mass is applied and the parton-shower corrections level off for large \(H_\mathrm{T}\) at around \(50\%\) for the background and \(70\%\) for the full.

We finally consider the QCD recoil for \(W^+W^-\) production in Fig. 9 and the transverse-momentum distribution of the hardest jet in Fig. 10. We observe similar behaviour as for ZZ production: the anticipated Sudakov suppression at the low end of both spectra, very large scheme dependent (see Sect. 5.3 below) parton-shower corrections in the tail of \(p_\mathrm{T,2\ell 2\nu }\), and mild corrections in the entire \(p_\mathrm{T,j_1}\) spectrum.

5.3 Shower recoil scheme

As discussed in Sect. 2.2PYTHIA 8  implements two alternative shower recoil schemes: the default scheme in which the transverse momentum imbalance after an initial-final dipole emission is democratically distributed among all final-state particles, including the four lepton system, and a fully local scheme, in which the recoil is entirely absorbed by the coloured spectator.Footnote 7 In Fig. 11 we compare these two schemes considering the transverse momentum of the four lepton system in the background contribution to ZZ production.

As already anticipated in Sect. 5.1 the default recoil scheme leads to a very hard spectrum in the tail (with a 50% increase with respect to the LHE distribution around 100 GeV). Conversely the dipole scheme remains close to the LHE level at large \(p_{T,4\ell }\). For small values of \(p_{T,4\ell }\), the dominant contribution should arise from several (soft) emissions whose total transverse momentum sum up to zero. However, in the dipole scheme, the transverse momentum recoil for ISR is not always absorbed by the final-state colour singlet. This explains why for very small values of \(p_{T,4\ell }\) the local recoil leads to a significantly smaller cross section compared to the default scheme. Indeed if we dress the LO \(gg\rightarrow VV\) event with multiple soft gluon emissions well separated in rapidity, all the emissions must be independent. However, if we adopt the local recoil, the final-state parton always absorbs the transverse momentum recoil due to emission from an initial-final dipole. This will happen even if the incoming parton is tagged as emitter, and hence even when there is a large angle separation between the final-state spectator and the newly emitted gluon.

Thus, the default scheme yields a better description of the logarithmically enhanced region, while it also overpopulates the hard region of the spectrum. A detailed discussion of the logarithmic accuracy of the parton shower goes beyond the purposes of this article and the choice of the recoil scheme has important implications at higher logarithmic orders [74,75,76,77,78,79]. However, since the choice of the recoil scheme only affects our predictions beyond the claimed accuracy, a comparison of the two options available in PYTHIA 8  should help assess the size of the total theoretical uncertainty.

Fig. 11
figure 11

Differential distribution in transverse momentum of the four charged leptons in \(gg \rightarrow e^+ e^- \mu ^+ \mu ^- \) for the background contribution at the LHE level (black), and at the particle level, using the default PYTHIA 8  recoil (blue) and the fully local dipole recoil (violet). In the lower panel the ratio with respect to the LHE level is shown

5.4 Effect of qg and \(q\bar{q}\) channels

In contrast to the calculations in Ref. [33, 53], in this study we do include the qg and \(q\bar{q}\) induced channels contributing to the real radiation at NLO.Footnote 8 Here we would like to explicitly highlight the impact of these production channels. To this end in Figs. 12 and 13 we illustrate at the LHE level the impact of the qg and \(q\bar{q}\) channels with respect to only the gg channels for the different production modes, considering the \(m_{4\ell }\) and \(H_\mathrm{T}\) distributions in ZZ production. We find very similar results also for the \(W^+W^-\) production mode. In the \(m_{4\ell }\) distribution the impact of the \(qg/q\bar{q}\) channels is rather flat and about \(25\%\) for all production modes. For \(H_\mathrm{T}\) it is increasing with increasing \(H_\mathrm{T}\) and reaches up to \(50\%\) in the considered range. Clearly, any precision analysis of gg-induced four-lepton production should include these additional partonic channels opening up at NLO.

Fig. 12
figure 12

Differential distribution in invariant mass \(m_{4\ell }\) of the four-lepton system in \(gg \rightarrow e^{+} e^{-} \mu ^{+} \mu ^{-}\) at NLO and LHE level. Colour coding of the different production modes as in Fig. 3. The lower ratio plots show the full LHE contribution including all partonic channels (\(gg+qg+qq\)) over only the gg channel contribution for the different production modes

Fig. 13
figure 13

Differential distribution in \(H_\mathrm{T}\) in \(gg \rightarrow e^+ e^- \mu ^+ \mu ^- \) at NLO and LHE level. Predictions, colour coding and bands as in Fig. 12

6 Conclusions and outlook

Gluon-induced four-lepton production offers a unique laboratory for the measurements of offshell Higgs bosons. At the same time precision studies of diboson processes and corresponding background estimates in new physics searches are becoming sensitive to the accuracy of the modeling of the gluon-induced production modes. Having this in mind, in this paper we presented an implementation of the loop-induced processes \(gg \rightarrow ZZ\) and \(gg \rightarrow W^+W^-\) including offshell leptonic decays and non-resonant contributions at NLO matched to the PYTHIA 8  parton shower event generator. We consistently include the continuum background contribution, the Higgs-mediated signal, and their interference. All of these are loop-induced processes and therefore their implementation in a fully-exclusive NLO event generator matched to parton showers poses a significant technical challenge.

In inclusive observables, such as the four-lepton invariant-mass distribution in ZZ production, the parton-shower corrections are found to be marginal, while in more exclusive observables like the recoil of the four-lepton system they can become substantial. For the latter we highlighted the importance of the parton-shower recoil scheme. Furthermore we investigated the relevance of the \(qg/q\bar{q}\) induced production channels, which partly overlap with the higher-order corrections to quark-induced diboson production.

In our calculation all ingredients have been treated exact at the NLO level apart from the massive amplitudes contributing to the two-loop virtuals, which are incorporated via approximations. Exact results for the latter have become available very recently (albeit for onshell vector bosons) and could be incorporated in an updated version of the gg4l generator presented here. Moreover, the generator will be made publicly available in the POWHEG-BOX-RES  framework.