1 Introduction

Only a few months ago we celebrated the tenth-year anniversary of the Higgs boson discovery at the CERN Large Hadron Collider (LHC) [1, 2]. Since then, enormous efforts both from the theory and the experimental communities have been devoted to a precise determination of the Higgs boson properties such as its mass, decay width and couplings to other Standard Model (SM) particles. Indeed, an in-depth exploration of the Higgs sector is one of the main goals of the current and future LHC precision physics program [3].

The predominant mechanism of Higgs boson production at the LHC is gluon fusion. The \(H\rightarrow \gamma \gamma \) and \(H\rightarrow ZZ^*\rightarrow 4l\) decay channels, despite having small branching ratios, provide a very clean environment for the study of Higgs-boson properties. Measurements in the diphoton channel allowed for a determination of the Higgs boson mass with an uncertainty of \(260~\textrm{MeV}\), roughly half of which is systematic [4]. The ZZ channel allows for an even better determination, thanks to a very good experimental control on the final-state leptons. Indeed, in this channel the Higgs boson mass has been measured with an accuracy of \(184~\textrm{MeV}\) [5]. In this case, the uncertainty is mostly dominated by statistics, which accounts for \(180~\textrm{MeV}\).

Measuring the Higgs boson total decay width \(\varGamma _H\) is much more challenging, because of the extremely narrow nature of the Higgs resonance. Indeed, the predicted SM value of roughly \(4~\textrm{MeV}\) has to be confronted with an experimental sensitivity of the order of \(1{-}2~\textrm{GeV}\) [6, 7]. Therefore, one has to resort to indirect analysis to extract bounds on the Higgs boson width. One option is to perform a global fit of SM parameters, e.g. within the context of the Standard Model Effective theory [8,9,10]. However, it has also been pointed out in the literature that one can harness the sensitivity of specific observables on the Higgs width to constrain the latter. One proposal is to exploit the peculiarities of the off-shell Higgs cross-section in the four-lepton channel [11,12,13], which allows one to probe values of \(\varGamma _H\) as small as the SM one [14, 15]. However, such a technique relies on some underlying theoretical assumptions, see e.g. Refs. [16,17,18].Footnote 1 Another proposal is to exploit signal-background interference effects in the diphoton channel [20, 21]. Such effects are expected to shift the Higgs invariant mass distribution peak by a value that depends on the Higgs boson width. In the SM, the shift turns out to be quite small. Initial theoretical analyses estimated it to be of about 50–100 \(\textrm{MeV}\) [21, 22]. A more robust estimate, that took into account realistic experimental conditions, was conducted by the ATLAS collaboration and found a somewhat smaller mass-shift of about \(40~\textrm{MeV}\) [23]. The predicted experimental sensitivity on the mass-shift at the LHC is of about few hundred \(\textrm{MeV}\) [7, 24], which translates to an upper bound on the Higgs width of about 5–30 times the standard model value [21, 22, 25, 26]. While such bounds are not as constraining as the ones obtained from the off-shell method, they do not suffer from the same model dependence and provide therefore important complementary information.

For a reliable extraction of the Higgs boson width from the mass-shift, one needs a good theoretical control on the latter. The preliminary LO studies of Ref. [20], which considered the dominant \(gg\rightarrow \gamma \gamma \) channel, showed that the apparent mass-shift could be of \({\mathcal {O}}(100{-}200\,\textrm{MeV})\) for typical collider energies and setup. This LO analysis was later refined in Ref. [22] via the inclusion of qg, \(\bar{q}g\) and \(q\bar{q}\) initiated processes, which account for a shift of \({\mathcal {O}}(30\,\textrm{MeV})\), carrying an opposite sign with respect to the gg case. The impact of higher-order QCD correction was soon after addressed in Ref. [21], which also explicitly noted that a measurement of the mass-shift could be used to put indirect bounds on \(\varGamma _H\). The results of Ref. [21] were later confirmed in Ref. [26]. Additionally, the impact of small-\(p_T\) resummation [25] and extra hard QCD radiation [27] on the mass-shift were also studied. One of the main outcomes of Ref. [21], is that NLO QCD corrections account for a large \({\mathcal {O}}(40\%)\) effect on the mass shift, hence highlighting the relevance of higher-order corrections. Furthermore, the bulk of the effect comes from the low \(p_{T,\gamma \gamma }\) region [21, 27, 28].

The primary goal of this article is to improve on the current predictions for the mass-shift and extend the analysis beyond NLO QCD. For a long time, this was prevented by the lack of the relevant multi-loop amplitudes for the \(gg\rightarrow \gamma \gamma \) continuum production. This bottleneck has recently been overcome and analytic results for the three-loop helicity amplitudes for \(gg\rightarrow \gamma \gamma \) [29], as well as for the two-loop ones for \(\gamma \gamma +j\) production [30, 31] are now available. In principle, this – together with the well-known analogous results for the \(gg\rightarrow H\rightarrow \gamma \gamma \) signal – allows for a complete NNLO evaluation of the signal-background interference and hence of the mass-shift. In practice however, such an endeavour is non trivial as it requires very good numerical control of the two-loop 5-point scattering amplitudes in soft-collinear regions. In this paper, we perform a first step towards the full NNLO calculation and work in the so-called soft-virtual approximation. Within such approximation, one retains the full information of virtual corrections and soft real emissions, but neglects the impact of hard radiation. Since the bulk of the interference is dominated by the region where the \(\gamma \gamma \) pair has a low transverse momentum [21, 27, 28], it is reasonable to expect this approximation to capture the main effects of higher-order corrections.

The remaining of this paper is organised as follows. In Sect. 2 we review the theoretical background of Higgs interferometry in diphoton production. In Sect. 3 we provide details of our calculation, discussing all necessary ingredients with a special focus on the soft-virtual approximation for colour-singlet production. In Sect. 4 we discuss our phenomenological results. First, we validate the soft-virtual approximation at NLO, and then use it to estimate the impact of NNLO QCD corrections. We finally conclude in Sect. 5.

2 Theoretical background

In this section, we briefly review the main aspects of signal-background interference for Higgs-mediated diphoton production at the LHC. For the sake of illustration, we discuss the main features of the interference at LO, focusing on the gluon-fusion channel. The complete analysis will be presented in Sect. 3.

2.1 Higgs interferometry

We consider diphoton production at the LHC in the gluon-fusion channel. At order \(\alpha _{s}^2\) two main mechanisms contribute: the Higgs-mediated process \(gg\rightarrow H\rightarrow \gamma \gamma \) and the continuum process \(gg\rightarrow \gamma \gamma \). We refer to the former as our “signal” and to the latter as our “background”. Schematically, we write the scattering amplitude for this process as

$$\begin{aligned} {\mathcal {M}}_{gg\rightarrow \gamma \gamma } = \frac{{\mathcal {M}}_{\textrm{sig}}}{m_{\gamma \gamma }^2 - m_H^2 + i \varGamma _H m_H} + {\mathcal {M}}_{\textrm{bkg}}, \end{aligned}$$
(1)

where \(m_{\gamma \gamma }\) is the diphoton invariant mass and where we have explicitly factored out the Higgs-boson propagator. In order to improve readability, we dropped helicity labels, which are understood. It is helpful to further separate the real and imaginary parts of \({\mathcal {M}}_{\textrm{sig,bkg}}\), i.e.

$$\begin{aligned} {\mathcal {M}}_{\textrm{sig,bkg}} = \textrm{Re}{\mathcal {M}}_{\textrm{sig,bkg}} + i\,\textrm{Im}{\mathcal {M}}_{\textrm{sig,bkg}}. \end{aligned}$$
(2)

Since we will be ultimately interested in the diphoton invariant-mass distribution, we need to consider the square of the amplitude in Eq. (1), which reads

$$\begin{aligned} |{\mathcal {M}}_{gg\rightarrow \gamma \gamma }|^2&= \frac{|{\mathcal {M}}_{\textrm{sig}}|^2}{\left( m_{\gamma \gamma }^2 - m_H^2\right) ^2 + \varGamma _H^2 m_H^2} + |{\mathcal {M}}_{\textrm{bkg}}|^2 \nonumber \\&\quad + 2\,\textrm{Re}\left( \displaystyle \frac{{\mathcal {M}}_{\textrm{sig}}}{m^2_{\gamma \gamma }-m_H^2+i \varGamma _H m_H}{\mathcal {M}}_{\textrm{bkg}}^{\dagger }\right) . \end{aligned}$$
(3)

The invariant-mass distribution can then be schematically organised as follows

$$\begin{aligned} \displaystyle \frac{\mathrm d\sigma }{\mathrm dm_{\gamma \gamma }} \sim |S|^2 + |B|^2 + I, \end{aligned}$$
(4)

where the three terms S, B and I are in one-to-one correspondence with those on the right-hand side of Eq. (3). The signal-background interference part is the last one in the equation above. One can get insight on the structure of the interference contribution by further separating it into a so-called “real part” \(I_\textrm{Re}\) and an “imaginary part” \(I_\textrm{Im}\) [32], i.e. \(I = I_\textrm{Re}+ I_\textrm{Im}\). These two components can be expressed through the actual real and imaginary parts of the amplitudes in Eq. (2) and are given by

$$\begin{aligned} I_{\textrm{Re}}&\propto \displaystyle \frac{2}{(m_{\gamma \gamma }^2-m_H^2)^2+\varGamma _H^2 m_H^2}\left( m_{\gamma \gamma }^2-m_H^2\right) \nonumber \\&\quad \times \left[ \textrm{Re}{\mathcal {M}}_{\textrm{bkg}}\textrm{Re}{\mathcal {M}}_{\textrm{sig}}+ \textrm{Im}{\mathcal {M}}_{\textrm{bkg}}\textrm{Im}{\mathcal {M}}_{\textrm{sig}}\right] , \nonumber \\ I_{\textrm{Im}}&\propto \displaystyle \frac{2}{(m_{\gamma \gamma }^2-m_H^2)^2+\varGamma _H^2 m_H^2} \varGamma _H m_H \end{aligned}$$
(5)
$$\begin{aligned}&\quad \times \left[ \textrm{Re}{\mathcal {M}}_{\textrm{bkg}}\textrm{Im}{\mathcal {M}}_{\textrm{sig}}- \textrm{Im}{\mathcal {M}}_{\textrm{bkg}}\textrm{Re}{\mathcal {M}}_{\textrm{sig}}\right] . \end{aligned}$$
(6)
Fig. 1
figure 1

Real and imaginary parts of the signal-background interference terms, see text for details. This figure is just for illustration purposes. Our best prediction for these curves will be described in details in Sect. 4

It is clear from these equations that the real part of the interference is an antisymmetric function of the diphoton invariant mass \(m_{\gamma \gamma }^2\) around the Higgs resonance, and therefore does not contribute to the total cross-section. This is not the case for the imaginary part, which is instead symmetric around the resonance. This is illustrated in Fig. 1, where we plot \(I_\textrm{Re}\) and \(I_\textrm{Im}\) in the gg channel up to NLO, to better visualise their independent effects.

Naively, one may think that the relative impact of the interference on the Higgs total cross section could be quite sizeable, since the signal is a two-loop process while the background starts at one loop. Because of this, one may expect a loop-enhancement factor of the interference with respect to the signal. However, a close inspection of Eq. (6) shows that the contribution to \(I_{\textrm{Im}}\) from the imaginary part of the background is strongly suppressed at leading order. This follows from the fact that the Higgs boson, being a scalar, only decays into a pair of photons with identical helicity. In turn, if the photons have equal helicities the imaginary part of the background at leading order vanishes, unless the process is mediated by a massive quark. In our calculation we keep full dependence on the bottom quark at leading order but, since its contribution is mass-suppressed, the net effect is small and can be safely neglected at higher orders. Starting from two loops, the relevant background helicity amplitudes develop an imaginary part. This leads to \(I_\textrm{Im}\) having a destructive impact of around 1–2% [26, 33]. As far as the real part is concerned, there is no mass suppression at LO. Although such effect does not contribute to the total cross section, c.f. Eq. (6), it leads to a non-negligible shift of events around the diphoton invariant-mass peak, as it was first noted in Ref. [20].

Moreover, it was observed in Ref. [21] that the integrated interference I has a different dependence on the Higgs boson production and decay couplings compared to the integrated signal. Specifically, if we schematically denote with \(\lambda _g\) and \(\lambda _{\gamma }\) the Higgs couplings to gluons and photons, we find that the signal is proportional to \(\lambda _g^2 \lambda _{\gamma } ^2 / \varGamma _H\), while the interference is only proportional to \(\lambda _g \lambda _{\gamma }\). The main idea of Ref. [21] is to exploit this intertwined dependence on the Higgs boson couplings and its decay rate to extract information on \(\varGamma _H\) and to look for possible deviations from its SM value. Current experimental measurements constrain the Higgs-diphoton rate to be the same of its SM prediction to within 10% [34,35,36]. At this level of accuracy, one can neglect the small impact of the destructive interference \(I_\textrm{Im}\) on the cross section. One then schematically writes this experimental observation as

$$\begin{aligned} \displaystyle \frac{\lambda _g^2 \lambda _{\gamma } ^2}{\varGamma _H} \approx \displaystyle \frac{\lambda _{g,\mathrm SM}^2 \lambda _{\gamma ,\mathrm SM} ^2}{\varGamma _{H,\mathrm SM}}, \end{aligned}$$
(7)

where \(\lambda _{i}\), \(\varGamma _H\) are the “true” Higgs-boson couplings and width and \(\lambda _{i,\textrm{SM}}\), \(\varGamma _{H,\textrm{SM}}\) are their respective SM predictions. Equation (7) holds in particular under appropriate rescalings of \(\lambda _{i,\textrm{SM}}\) and \(\varGamma _{H,\textrm{SM}}\), e.g. \(\lambda _i \rightarrow \kappa \lambda _{i,\textrm{SM}}\) and \(\varGamma _H \rightarrow \kappa ^4 \varGamma _{H,\textrm{SM}}\), so it does not allow for a simultaneous direct extraction of couplings and width. However, if one supplements Eq. (7) with the observation that \(I_\textrm{Re}\propto \lambda _g\lambda _\gamma \), then one obtains

$$\begin{aligned} \displaystyle \frac{I_\textrm{Re}}{I_{\textrm{Re},\textrm{SM}}} \propto \sqrt{\displaystyle \frac{\varGamma _H}{\varGamma _{H,\textrm{SM}}}}. \end{aligned}$$
(8)

Hence, a measurement of \(I_\textrm{Re}\) allows for an extraction of \(\varGamma _H\), provided that the SM prediction \(I_{\textrm{Re},\textrm{SM}}\) is under good theoretical control.

As mentioned before, the main effect of \(I_\textrm{Re}\) is to distort the Higgs invariant-mass distribution, effectively shifting the position of the peak. This translates into an apparent mass-shift with respect to the SM value [20]. A measurement of such mass-shift can then be used to constrain \(\varGamma _H\). In the next section, we review how one can extract the mass-shift from the knowledge of the Higgs invariant-mass distribution.

2.2 Extraction of the mass-shift

One first remark is that the Higgs resonance is extremely narrow, and therefore strongly smeared by the finite detector resolution. To properly take this into account, one should convolve theoretical prediction with a full detector simulation. However, such a study can only be carried out by experimental collaborations and it is outside the scope of this theoretical work. In this paper we estimate the mass-shift using two different, yet related, proxies for it that have been presented in the literature. Namely, we will consider the first moment of the diphoton invariant-mass distribution [20] and a full gaussian fit to it [21].

Before reviewing these techniques, we stress that they are inherently different, so one should not expect identical results for the mass shift. In other words, we can think of these procedures as a measurement of an observable that is strongly correlated, yet not identical, to the mass shift that experimentalists would measure. Nevertheless, we may imagine that they capture similar physics, and hence they should receive comparable radiative corrections. In Sect. 4, we will see that this is indeed the case. This gives us confidence that, even if our theoretical predictions for the absolute values of the mass shift are inherently limited by our experimental modeling, our results for the QCD K-factors are quite robust.

We now describe the two methods in some detail. In both cases, we simulate detector effects by smearing the diphoton invariant-mass distribution using a Gaussian function with \(\sigma =1.7~\textrm{GeV}\) [23].

The first-moment method [20] is based on the observation that, from the theoretical side, a very simple way to access the mass shift is to consider the first moment of the invariant-mass distribution, i.e.

$$\begin{aligned} \langle m_{\gamma \gamma }\rangle = \displaystyle \frac{1}{\sigma _{gg \rightarrow \gamma \gamma }}\int d m_{\gamma \gamma } m_{\gamma \gamma } \displaystyle \frac{d \sigma _{gg \rightarrow \gamma \gamma }}{dm_{\gamma \gamma }} \end{aligned}$$
(9)

where \(\sigma _{gg \rightarrow \gamma \gamma }\) is the fiducial cross-section. The mass shift is then defined as

$$\begin{aligned} \varDelta m_{\gamma \gamma } = \langle m_{\gamma \gamma }\rangle _{\mathrm {sig+int}}-\langle m_{\gamma \gamma }\rangle _{\textrm{sig}}. \end{aligned}$$
(10)

The main advantage of this method is that it is theoretically very clean. Also, it is not very sensitive to overall normalisation issues, but rather focuses on the position of the peak. On the practical level however, it requires exquisite resolution on the invariant mass distribution which is very hard to achieve experimentally. It also strongly depends on the technical details of the theoretical analysis. For example, in Ref. [20] the author found that the mass-shift strongly depends on the choice of the upper and lower integration boundaries in Eq. (9). Indeed, a change of \({\mathcal {O}}(1\,\textrm{GeV})\) in such choice modifies the mass-shift estimate by almost \(20\%\) at leading order.

A different proposal that addresses at least some of the shortcomings of the first-moment method is to simply perform a Gaussian fit of the diphoton invariant mass distribution [21]. One then extracts the mass shift by comparing predictions obtained with and without including interference effects. It was argued in Ref. [21] that this method is more resilient against specific details of analysis with respect to the first-moment one.

These two methods predict a mass-shift of \({\mathcal {O}}(100)~\textrm{MeV}\) at LO. More precisely, the first-moment technique gives results in the interval \(\varDelta m_{\gamma \gamma } \in \{-250,-150\}~\textrm{MeV}\) depending on the integration window in Eq. (9) [20]. The likelihood fit of Ref. [21] instead predicts \(\varDelta m_{\gamma \gamma } = -120~\textrm{MeV}\). As we have stressed before, these two methods measure correlated yet slightly different observables, so one should not expect identical results.

3 Higher-order QCD corrections

In this section, we present the technical details of our calculation. We start by discussing LO predictions for the interference. At order \(\alpha _{s}^2\), we need to consider three different partonic channels, see Fig. 2 (top). We then write

$$\begin{aligned} \textrm{LO}= \mathrm {LO(gg)}+ \mathrm {LO(qg)}+ \textrm{LO}(\textrm{q}\bar{\textrm{q}}). \end{aligned}$$
(11)

At this order, we compute the signal retaining the full top- and bottom- mass dependence. For the background, we neglect top-mediated contributions, since they are heavily suppressed. As we have explained in Sect. 2.1, the bottom-quark contributions are the only ones leading to a non-vanishing \(I_\textrm{Im}\) at this order. This effect is very small [26], so we neglect it at higher orders. Because of this, when computing (N)NLO corrections we consider Higgs production in the heavy-top effective theory, described by the following Lagrangian

$$\begin{aligned} {\mathcal {L}}_{\textrm{eff}} = -\frac{\lambda }{4} \,H G_{\mu \nu }^a G^{\mu \nu }_a, \end{aligned}$$
(12)

where \(\lambda = -\alpha _{s}/(3\pi v) + {\mathcal {O}}(\alpha _{s}^2)\) is the bare coupling and v is the Higgs vacuum expectation value.Footnote 2 At higher-orders, we still treat the Higgs decay to photons at LO, with exact mass dependence. Indeed, QCD radiative corrections are known to be small [37]. For convenience, we report the relevant amplitude in Appendix A. As far as the background is concerned, we set the bottom mass to zero beyond LO.Footnote 3

It is well-known that the gg channel is the dominant one for the signal process. For the interference, this statement is somehow weakened but still true. Indeed, the qg channel (see Fig. 2) accounts for about 30% of the result [22]. While it is easy with current technology to perform a full NLO analysis, in this paper we are mostly interested in the impact of NNLO corrections. We therefore focus our attention on the gg channel only.

Fig. 2
figure 2

Representative diagrams contributing to the interference up to order \(\alpha _s^4\). The red dot denotes the effective vertex described through the Lagrangian in Eq. (12), while the blue dot denotes the Higgs decay via heavy quark loops and W boson, described in Eq. (A.1)

Strictly speaking, if we include only gluon-induced processes, NLO corrections are given by only the first three out of the four \(\delta \textrm{NLO}\) diagrams of Fig. 2. Of course, this would imply that one should evolve parton distribution functions (PDFs) in the same approximation. However, one can also retain formally subleading effects. In particular, the quark-induced contribution in the third row of Fig. 2 is linked to the gg channel through PDF evolution. One may then expect that its inclusion in the DGLAP evolution of the PDFs would alleviate the factorisation-scale dependence of the result. Therefore, following Ref. [21], in our analysis we use standard parton distributions and include the last diagram in the third row of Fig. 2. In what follows, we call ‘\(\delta \)NLO’ the contribution coming from the second and third rows of that figure. We stress that this is not the full NLO correction to the interference. It may also be instructive to consider only gluon-induced diagrams at this order. Indeed, comparing this against our default setup may give us a handle on the large gluon-density approximation. In what follows, we define corrections computed using only the first three diagrams of the block of diagrams (\(\delta \)NLO) in Fig. 2 as ‘\(\delta \)NLO(gg)’. Correspondingly we call the the full NLO result

$$\begin{aligned} \textrm{NLO}= \textrm{LO}+ \delta \textrm{NLO},~~ \textrm{NLO}(gg) = \textrm{LO}+ \delta \textrm{NLO}(gg), \end{aligned}$$
(13)

with \(\textrm{LO}\) defined in Eq. (11).

We now move to NNLO corrections. As we have said, at this order we work in the soft-virtual approximation. This is justified by the fact that, at least at lower orders, interference effects are stronger in the region where the diphoton transverse momentum is small [21, 28]. The soft-virtual approximation and various refinements of it have been extensively adopted for Higgs predictions [38,39,40,41,42]. The main advantage of working in this limit is that the only process-dependent part is encoded in purely virtual contributions, see e.g. Ref. [43] for an explicit derivation.

Here we only sketch the structure of the soft-virtual approximation, and refer the reader to Ref. [43] for more details. We write the fully-differential hadronic cross section for diphoton production in the gg channel as

$$\begin{aligned} d\sigma&(\tau ,y,\{\theta _i\}) = \int d\xi _1 d\xi _2 dz f_g(\xi _1,\mu _F) f_g(\xi _2,\mu _F)\nonumber \\&\times \,\delta (\tau -\xi _1\xi _2 z) d{\hat{\sigma }} \left( z,{\hat{y}},\{{\hat{\theta _i\}}},\alpha _{s},\displaystyle \frac{Q^2}{\mu _R^2},\displaystyle \frac{Q^2}{\mu _F^2}\right) . \end{aligned}$$
(14)

In this equation, \(\tau = Q^2/s_{\textrm{had}}\) with Q being the invariant mass of the diphoton system and \(s_{\textrm{had}}\) the square of the collider energy, y is the rapidity of the diphoton system in the laboratory frame and \(\theta _{i}\) are a set of variable that fully describe the diphoton system (e.g. scattering angles). Variables with hats represent the corresponding partonic quantities. Equation (14) is fully differential in the kinematics of the diphoton system, but retain no information on extra QCD radiation. We note that the partonic variables \(\{{\hat{\theta }}_i\}\) depend on their hadronic counterparts \(\{\theta _i\}\) and also on \(z,\xi _1,\xi _2\). Finally, \(\alpha _{s}= \alpha _s(\mu _R)\) is the renormalised QCD coupling constant evaluated at scale \(\mu _R\).

Major simplifications occur in the soft \(z\rightarrow 1\) limit. First, in this case the rapidity dependence of Eq. (14) is entirely fixed by the inclusive cross section, up to power corrections [44,45,46]. The partonic cross section in Eq. (14) simplifies to

$$\begin{aligned}&d\hat{\sigma }\left( z,{\hat{y}},\{\hat{\theta }_i\},\alpha _{s},\displaystyle \frac{Q^2}{\mu _R^2},\displaystyle \frac{Q^2}{\mu _F^2}\right) \nonumber \\&\quad \approx d\hat{\sigma }_\textrm{Born}(\{\hat{\theta }_i\},\alpha _{s}) \,z\, G\left( z,\alpha _{s},\displaystyle \frac{Q^2}{\mu _R^2},\displaystyle \frac{Q^2}{\mu _F^2} \right) , \end{aligned}$$
(15)

where we neglected power corrections in \((1-z)\). In Eq. (15), \(d\sigma _\textrm{Born}(\{{\hat{\theta _i\}}},\alpha _{s})\delta (1-z)\) is the Born cross section and G is the inclusive coefficient function in the soft limit, normalised such that \(G = \delta (1-z) + {\mathcal {O}}(\alpha _{s})\). In the soft limit, the diphoton kinematics is identical to the LO one, except that the partonic center-of-mass energy is rescaled by z. Such a rescaling factor can be absorbed by boosting an individual leg, i.e. by using the partonic momenta \(\xi _{1,\mathrm Born} \rightarrow z \,\xi _{1,\mathrm Born}, \xi _{2,\mathrm Born}\rightarrow \xi _{2,\mathrm Born}\) (and vice versa). Alternatively, one can boost both legs at the same time, i.e. \(\xi _{i,\mathrm Born} \rightarrow \sqrt{z} \,\xi _{i,\mathrm Born}\), \(i=1,2\). In the soft-limit, the two are formally equivalent and, in principle, one can consider both and treat their difference as an uncertainty. In practice, we expect this difference to be small [47], so for simplicity we always boost only one leg in this work.

If we write the perturbative expansion of the coefficient function as

$$\begin{aligned} G\left( z,\alpha _{s}\right) = \delta (1-z) + \sum _{n=1}^{\infty }\left( \displaystyle \frac{\alpha _{s}}{2\pi }\right) ^n G^{\left( n\right) }(z), \end{aligned}$$
(16)

then in the soft-virtual approximation the individual coefficients \(G^{(n)}\) have the form

$$\begin{aligned} G^{(n)}\left( z, \displaystyle \frac{Q^2}{\mu _R^2},\displaystyle \frac{Q^2}{\mu _F^2}\right) = c^{(n)}_0 \delta (1-z) + \sum _{k=1}^{2n-1} c^{(n)}_k {\mathcal {D}}_k(z), \end{aligned}$$
(17)

where \(c^{(n)}_k = c^{(n)}_k (Q^2/\mu _R^2,Q^2/\mu _F^2)\) and \({\mathcal {D}}_k\) are the standard plus distributions

$$\begin{aligned} {\mathcal {D}}_k(z) = \left[ \frac{\ln ^k(1-z)}{1-z} \right] _+. \end{aligned}$$
(18)

We present explicit formulas for the NLO and NNLO coefficients, retaining full scale dependence, in Appendix B. The coefficients \(c^{(n)}_k\) in Eq. (17) in principle depend on the process under consideration. However, it turns out that the only process dependence arises from finite remainders of purely virtual contributions.

For the analysis in the next section, we also require the signal process at NNLO accuracy. In principle, computing this exactly does not pose significant challenges. Nevertheless, we expect the exact result to be very well described by a (refined) version of the soft-virtual approximation [38,39,40,41]. While the pure soft-virtual prediction provides a good approximation to the interference, this is not the case for the signal. We therefore employ a modified soft-virtual approximation to describe the latter. Specifically, we follow the approach of Ref. [40] which is known to reproduce the exact NNLO prediction to within few-percent accuracy. In practice, this amounts to modifying the plus distributions in Eqs. (B.11, B.15) according to [40]

$$\begin{aligned} {\mathcal {D}}_i(z) \rightarrow {\mathcal {D}}_i(z) + (2-3z+2z^2)\frac{\ln ^i\frac{1-z}{\sqrt{z}}}{1-z} - \frac{\ln ^i(1-z)}{1-z}.\nonumber \\ \end{aligned}$$
(19)

This modification captures soft emission at next-to-leading power, as well as part of the hard collinear emission. Such an approximation was already used for estimating signal-background interference effects for the \(gg\rightarrow VV\) process [47]. In our case, we have specifically checked that for on-shell Higgs, within the fiducial volume used in our analysis, the soft-virtual approximation improved according to Eq. (19) agrees with the exact NNLO result to within few percent. This is good enough for the kind of accuracy targeted in our analysis. In the following section, we will refer to this improved soft-virtual approximation as “\(\mathrm {NNLOsv'}\)”. The reason why we do not adopt such an improved approximation for the background is because it is currently unknown how to properly capture next-to-leading power soft term in this case. Indeed, a simple modification of emission off external legs works for the Higgs point-like interaction, but not for the background amplitude. Fortunately, the interference seems to be dominated by low-\(p_T\) physics so the lack of such an improved approximation is less problematic than for the signal.

We conclude this section by listing the various ingredients of our calculation. The LO amplitudes, including the quark-mass effects, are well known, see e.g. [48] and references therein. At NLO, we took the one-loop amplitudes for the background from Refs. [29, 31, 49, 50], borrowing most of them from MCFM [51,52,53]. We note that the two-loop amplitude for \(gg\rightarrow \gamma \gamma \) was first computed in Ref. [54]. NLO predictions were computed using FKS subtraction [55]. Finally, the three-loop \(gg\rightarrow \gamma \gamma \) amplitude relevant for NNLO corrections was taken from Ref. [29]. All the tree- and one-loop results for the interference were cross-checked against a dedicated version of OpenLoops2 [56, 57].

4 Results

In this section, we discuss our main findings. We start by specifying the setup we are using.

We employ the so-called \(G_\mu \) input scheme for the electroweak parameters and we use

$$\begin{aligned} G_F= & {} 1.16639\cdot 10^{-5}~\textrm{GeV}^{-2}, \\ m_W= & {} 80.398~\textrm{GeV}, \quad m_Z = 91.1876~\textrm{GeV}\end{aligned}$$

that give the electroweak coupling constant \(\alpha = 1/132.338\). We set the Higgs mass to \(m_H=125~\textrm{GeV}\). Finally, we use \(m_t = 173.2~\textrm{GeV}\) and \(m_b=4.18~\textrm{GeV}\) for the top and bottom mass, respectively. We use the NNPDF31_nnlo_as_0118 [58] parton distribution functions as distributed through the LHAPDF library [59] and we use Hoppet [60] for various PDFs manipulations. The value of the strong coupling constant is extracted from the PDF set, with \(\alpha _{s}(m_Z)=0.118\). For the QCD factorisation and renormalisation scales, we choose the common value \(\mu _F = \mu _R = m_{\gamma \gamma }/2\), where \(m_{\gamma \gamma }\) is the invariant mass of the diphoton system. All of the results presented in this section have been derived for the \(13.6~\textrm{TeV}\) LHC, i.e. proton-proton collisions. We define the fiducial region by imposing the following set of cuts on the final-state photons

$$\begin{aligned}&p_{T,\gamma }> 20\,\textrm{GeV}, \quad \quad \sqrt{p_{T,\gamma _1} p_{T,\gamma _2}}> 35\,\,\textrm{GeV}, \nonumber \\&|y_{\gamma }| < 2.5, \quad \quad \varDelta R_{\gamma _1\gamma _2} > 0.4, \; \end{aligned}$$
(20)

which are designed to reduce sensitivity on infrared physics [61]. The cuts in Eq. (20) are different from the ones used in Refs. [21, 22]. Because of this, it is not immediate to compare our results with the ones presented there. Nevertheless, we have validated our LO and NLO calculations against Refs. [21, 22] in an equivalent setup. In particular, we have reproduced the results for the mass shift shown in Refs. [21, 22]. We have also used MCFM [51,52,53], together with in-house codes, to validate the signal and the reliability of the \(\mathrm {NNLOsv'}\) approximation.

We start the discussion of our results by assessing the validity of the soft-virtual approximation for the interference at NLO. We do this by first comparing the exact prediction including only the gg initiated process, which we dub \(\textrm{NLO}(\textrm{gg})\), to the corresponding soft-virtual approximation, \(\textrm{NLOsv}(\textrm{gg})\).

Fig. 3
figure 3

Left pane: comparison of the exact NLO calculation and the soft-virtual approximation in the gg channel. Right pane: complete NLO prediction, inclusive of all channels, compared to the corresponding soft-virtual approximation

This comparison is shown in the left pane of Fig. 3. Second, we compare the complete NLO calculation, i.e. including the qg and \(\bar{q}g\) initiated channelsFootnote 4 to the NLO soft-virtual approximation (\(\textrm{NLOsv}\)), which retains all LO contributions, but includes NLO corrections in the soft-virtual approximation for the gg channel only. This comparison is presented in the right pane of Fig. 3. The red and blue bands shown in both figures represent the envelope from a simultaneous rescaling of \(\mu _{F,R}\) by a factor of two up and down with respect to the central value. We note that our result suggests that the uncertainty due to scale variations does not provide a reliable estimate of the actual error of the NLO prediction. In the left pane of Fig. 3 one can see that the soft-virtual approximation does a remarkably good job in describing the dominant gg contribution. The shape of the interference is very well captured and the scale-variation bands almost perfectly match throughout the relevant interval of invariant mass. In the right side of Fig. 3 we notice a slight degrading of the approximation, mostly for what concerns the scale variation bands. This is because the soft-virtual approximation does not capture the full scale-compensation between different channels that happens at NLO. However, the shape, which is the relevant factor for the extraction of the mass-shift, is still described at a satisfactory degree.

This comparison gives us some confidence that the soft-virtual approximation can adequately describe higher-order QCD corrections to the signal-background interference. Furthermore, we note that the mass-shift extracted at NLO in the soft-virtual approximation differs from the exact value by \(5\%\), whilst the genuine NLO correction is of \({\mathcal {O}}(30/40\%)\) and the theory uncertainty is \({\mathcal {O}}(10\%)\). The ability of the soft-virtual approximation in reproducing the interference effects can be better understood by keeping in mind that the predominant contributions come from the low \(p_{T,\gamma \gamma }\) regions, as we already mentioned in Sect. 1. We point out that the fiducial setup adopted in Eq. (20), in particular with the choice of product cuts, plays a relevant role in the reliability of the soft-virtual approximation. We have indeed explicitly checked that imposing asymmetric cuts on the final-state photons, as it is routinely done in Higgs analysis, breaks the quality of the approximation.

Fig. 4
figure 4

Signal-background interference contribution to the diphoton invariant mass distribution after Gaussian smearing. Bands represent the envelope given by the scale variation

As already discussed in Sect. 2.2, in order to get a more realistic picture of the interference effect at the detector level, we convolute the invariant mass distribution with a Gaussian function with a standard deviation of \(\sigma = 1.7~\textrm{GeV}\) [20, 23]

The invariant mass distribution arising from signal-background interference after the smearing is shown in Fig. 4. The LO features the well known antisymmetric behaviour around the peak coming from the real part of the interference, whereas the NLO curve is shifted to the left and to the bottom, due to the impact of the imaginary part. In the same figure, we can appreciate for the first time the effect of the NNLO QCD corrections. The curve is further shifted down, thus depleting even more the number of events around the Higgs peak and softening the impact on the mass-shift.

Fig. 5
figure 5

Real part (left pane) and imaginary part (right pane) of the interference at LO, NLO and NNLOsv after Gaussian smearing

Moreover, from Fig. 4 one can see that the theory uncertainty, which arises almost completely from renormalisation scale variations, gets generally reduced at NNLO, except in the region where the NLO band shrinks to zero. This behaviour can be understood keeping in mind that the interference is the sum of the real and imaginary parts which have different shapes, as discussed in Sect. 2.1. As shown in Fig. 5, scale-variation bands are separately well behaved for the real and imaginary parts, but on the left hand side of the Higgs boson peak a cancellation occurs, whereas on the right the two effects sum up (both negative in sign).

Before moving to our results for the mass shift, we briefly discuss the impact of the imaginary part of the interference to the total cross section in our fiducial region. As we have explained in Sect. 2.1, at LO this interference is very small because it only comes from contributions mediated by virtual bottom quarks, either in the signal or in the background. These are either mass (background) or Yukawa (signal) suppressed. The smallness of the LO imaginary part is indeed seen in Fig. 5. In our setup, we find

$$\begin{aligned} \sigma _S^{\textrm{LO}} = 24.21^{+15\%}_{-14\%}~\textrm{fb},~~~~~~ \sigma _I^\textrm{LO}= -0.11^{+20\%}_{-17\%}~\textrm{fb}. \end{aligned}$$
(21)

Here and in the following the quoted uncertainties are obtained by coherently varying the renormalisation and factorisation scales by a factor of two around the central value \(\mu = m_{\gamma \gamma }/2\). At LO, we find that more than 80% of the destructive interference quoted above comes from the imaginary part of the signal interfering with the real part of the background. This gives us confidence that neglecting mass effects in the background prediction does not significantly impact our result. Furthermore, as far as the signal goes, we note that the bulk (about 95%) of the imaginary part is generated by bottom-mass effects in the production amplitude. This is easy to understand just by looking at the relative importance of the top, bottom and W contributions to the production and decay amplitudes.

At higher orders however, a larger interference is generated by the imaginary part of the background, which no longer requires the presence of bottom quarks (see the discussion in Sect. 3). Because of this, beyond LO we only compute radiative corrections in the infinite-top approximation and drop any mass dependence in the background amplitudes. At NLO, we obtain

$$\begin{aligned} \sigma _S^\textrm{NLO}= 58.12^{+20\%}_{-14\%}~\textrm{fb},~~~~~~ \sigma _I^\textrm{NLO}= -0.72^{+27\%}_{-21\%}~\textrm{fb}. \end{aligned}$$
(22)

These results are consistent with the analysis in Ref. [26]. Our best prediction beyond NLO is obtained within the soft-virtual approximation described in Sect. 3. We find

$$\begin{aligned} \sigma ^{\textrm{NNLOsv}'}_{S} = 72.21^{+8\%}_{-8\%}\; \textrm{fb}, ~~~~ \sigma ^{\textrm{NNLOsv}}_{I} = -1.21^{+7\%}_{-10\%}\; \textrm{fb},\nonumber \\ \end{aligned}$$
(23)

hence the destructive interference reduces the total rate by \(1.7\%\).Footnote 5 Given the theoretical [62] (see also Refs. [63, 64]) and experimental [35, 36] uncertainty on the Higgs total cross section, this effect is actually not negligible and it can be used to further constrain the Higgs width [26]. We do not pursue this line of investigation here, but we estimate that, with current uncertainties, one could already constrain the Higgs width to about 20–30 times the Standard Model.

Table 1 Mass-shift at different proton-proton collider energies with Gaussian fit method
Table 2 Mass-shift at different proton-proton collider energies with first moment method

We can finally present the main result of our study, i.e. the prediction for the mass-shift at NNLO. As discussed in Sect. 2.1, we adopt two different methods to estimate the mass-shift induced by the interference term. In Table 1 we show the results obtained by performing a chi-squared fit of the smeared signal-plus-interference distribution with a Gaussian function of standard deviation \(\sigma =1.7\,\textrm{GeV}\). The mass-shift is obtained as the difference between the obtained mean value and the input Higgs boson mass \(m_H = 125~\textrm{GeV}\). In Table 2 instead we present results derived by computing the first moment of the signal-plus-interference distribution after smearing. In both tables we show predictions for different collider energies, but in the same fiducial region specified in Eq. (20). As mentioned earlier, the denoted ranges represent the theoretical uncertainty related to a change of the central scale \(\mu =m_{\gamma \gamma }/2\) by a factor of two and a half. The entry \(\textrm{NNLOsv}\) indicates the result obtained by considering both signal and interference terms in the soft-virtual approximation. The \(\mathrm {NNLOsv'}\) entry instead, refers to the “improved” soft-virtual approximation discussed at the end of Sect. 3. Specifically, in this case we still use the \(\textrm{NNLOsv}\) approximation for the interference, but compute the signal in the \(\mathrm {NNLOsv'}\) framework. As we explained in Sect. 3, we expect this setup to be the most reliable one. Still, we find it useful to present numbers in both frameworks as a conservative way of estimating the uncerainties related to these approximations. In this respect, we note that the results obtained in the \(\textrm{NNLOsv}\) and \(\mathrm {NNLOsv'}\) approximations are compatible within their uncertainties.

We immediately notice that in both extraction methods, the estimated mass-shift is rather insensitive to the collider energy.

Table 3 Comparison of K-factors, measured w.r.t. the LO value, for the mass-shift at \(\sqrt{s}=13.6~\textrm{TeV}\) calculated via a gaussian fit method and via a first-moment method

Although a Gaussian fitting procedure and the evaluation of the first moment return different values for the mass-shift, smaller for the former and larger for the latter, we notice that the K-factors relative to the corresponding LO prediction are almost identical. We stress that this is a welcome feature since our analysis does not include reliable detector simulation. The results in Table 3 show that radiative corrections seem rather insensitive to the precise definition of the observable used to extract the mass shift.

We also observe how the mass-shift gets systematically reduced in absolute value when including higher-order corrections. This is mainly because the K-factor for the signal turns out to be larger than the one for the interference, thus leading to a reduction of the mass-shift. This trend, already observed at NLO [21], seems to persist at NNLO (at least in the \(\textrm{NNLOsv}\) approximation).

Fig. 6
figure 6

Mass-shift as a function of the Higgs boson width

Finally, we study how the bounds on \(\varGamma _H\) are affected by higher-order QCD corrections, following the discussion at the end of Sect. 2.1. The results at NLO and \(\mathrm {NNLOsv'}\) (i.e. our best prediction) are shown in Fig. 6. Our NLO curve is compatible with the one obtained in [21].Footnote 6 We note that the two curves do not overlap. This is a direct consequence of the analogous feature for the interference shown in Fig. 4. As explained in the comment of that figure, this is at least partially due to competing effects coming from real and imaginary part of the interference, see Fig. 5. Furthermore, similar to the Higgs-signal case, the interference receives rather large corrections as well. Finally, we note that the NNLO soft-virtual curve lies above the NLO one, thus loosening potential bounds on \(\varGamma _H\). For instance, if at the LHC the error on the mass-shift reaches roughly \(150~\textrm{MeV}\), one could exclude values of \(\varGamma _H\) larger than 10–20 times the SM prediction. This has to be compared with the corresponding bound at NLO of \(\varGamma _H<(6\text {--}9)\varGamma _{H,\textrm{SM}}\).

5 Conclusions

We presented a calculation of the NNLO soft-virtual corrections to signal-background interference for gluon-fusion Higgs production in the diphoton channel at the LHC. More specifically, we focused on the shift in the diphoton invariant mass distribution which arises from the inclusion of the interference between the Higgs-mediated process and its continuum background, as first observed in Ref. [20]. Such a study was only known up to NLO QCD so far [21], due to unavailability of the relevant background amplitudes.

Recent developments in multi-loop scattering amplitudes techniques enabled the computation of such amplitudes [29,30,31]. We were therefore able to extend this study one order higher in perturbation theory. We employed the soft-virtual approximation, motivated by previous studies which showed how the contribution to the mass-shift is enhanced in the low \(p_{T,\gamma \gamma }\) region, and therefore potentially dominated by virtual corrections and soft emissions. We have validated this approximation at NLO and shown that indeed it seems to capture the bulk of the corrections, to within few percent. We stress that this statement holds in our fiducial volume, which is designed to avoid spurious sensitivity to infrared physics [61]. Using more standard asymmetric cuts at fixed order, instead, would lead to a deterioration of the quality of the soft-virtual approximation.

We calculated the mass-shift by means of two different methods and found that, despite the absolute value being dependent on the observable chosen for its extraction, the K-factors for the NNLO soft-virtual corrections are very similar in both cases. Specifically, we found that the mass-shift further decreases at NNLO soft-virtual, confirming the trend already observed at NLO QCD. Within our approximation, we find that NNLO corrections are large, and amount to a decrease of almost 30% on top of the NLO prediction. Furthermore, these corrections are not captured by the standard scale variation band of the NLO prediction.

We used our results to obtain an updated prediction for the bounds that can be put on \(\varGamma _H\) as a result of the mass-shift determination. We observed that the decrease in the mass-shift prediction results in a loosening of such bounds. If we estimate the experimental error on the mass shift to drop to \({\mathcal {O}}(150~\textrm{MeV})\), we find that the Higgs boson width can be constrained to \(\varGamma _H\lesssim (10{-}20)\varGamma _{H,SM}\). Interestingly, we find that the imaginary part of the interference instead grows with the perturbative order. Hence, it leads to a shift of the total cross section that could be used to extract information on the Higgs total width as well [26]. Although a detailed extraction based on this is beyond the scope of this paper, we estimate that with the current experimental and theoretical uncertainties one could bound the Higgs width to within 20–30 times its Standard Model value.

There are several avenues for improving our predictions. First, one could perform an exact NNLO study. As we already mentioned, the main complication of such a calculation is a purely technical one, namely the numerical control of two-loop 5-point amplitudes (as well as 6-point one-loop ones) near singular contributions. Although we believe that our calculation captures the bulk of NNLO corrections, such an exact result would allow for a better modeling of the \(p_T\)-dependence of the interference, which would provide useful information for an actual extraction of the mass-shift. Indeed, the \(p_T\) dependence of the result can be used to define signal and control regions to extract the mass shift within the diphoton channel alone thus minimising systematic uncertainties.

From a theoretical point of view, perhaps an even more interesting line of investigation would be to understand how to improve the soft-virtual approximation for the background amplitude. This would require improving our understanding of next-to-leading power soft corrections to cope with the case at hand. We leave these avenues for future work.