1 Introduction

The boson discovered at the LHC [1, 2] seems to be in very good agreement with prescription of a Standard Model like Higgs boson. In the Standard Model the Higgs mass is the only free parameter in the theory and its precise determination was one of the main experimental targets [3]. Furthermore, the Standard Model predicts the shape of the Higgs potential, so a measurement of the parameters of the potential will allow us to discriminate a Standard Model Higgs boson from various BSM scenarios. This however requires the measurement of the Higgs self coupling, which can be measured in Higgs boson pair production processes. The value of the Higgs mass allows for measurements in a variety of decay channels and both ATLAS and CMS have performed studies to measure the Higgs self coupling, e.g. in the decay channels \(\gamma \gamma b\bar{b}\) [4,5,6,7], \(b\bar{b}b\bar{b}\) [6, 8,9,10,11], \(\gamma \gamma W W^*\), \(b\bar{b}W W^*\), \(\tau ^+\tau ^-b\bar{b}\) [6, 12,13,14,15,16,17,18].

From a Standard Model calculational point of view, the signal process (i.e. the production of a Higgs boson pair) is known at leading order in the full theory [19,20,21], and in various approximations taking higher order corrections into account [22,23,24,25,26,27,28,29,30,31]. Only very recently the NLO result taking the full top mass dependence into account became available [33, 34].

In this paper we focus on one possible decay channels, namely where one Higgs decays into a pair of photons, whereas the second decays into a pair of b-quarks. This process can be seen as a compromise between a four b-quark signal and a four photon signature. The first would benefit from a large \(H\rightarrow b \bar{b}\) branching ratio but suffers from a large irreducible QCD background, whereas the latter exhibits a very clean signal with four photons, but suffers from a very small \(H\rightarrow \gamma \gamma \) branching ratio.

In the case of massless b-quarks the process \(\gamma \gamma b \bar{b}\) can be seen as a subset of the process \(\gamma \gamma jj\), which is known at NLO in QCD [35,36,37]. The main motivation for the general two jet process was, however, more to assess the background of a single Higgs in VBF production rather than focusing on final state b-quarks. As we will see, the tagging of two final state b-jets significantly alters the behavior of the higher order corrections and therefore this process cannot be directly compared to the general two jet process.

The paper is organized as follows. In Sect. 2 we describe the setup that has been used to obtain the numerical results which we discuss in Sect. 3. Finally we conclude in Sect. 4.

2 Calculational setup

The NLO corrections are calculated by combining the two automated programs GoSam [38, 39] for the generation and evaluation of the virtual one-loop amplitudes, and the Monte Carlo event generator Sherpa [43]. The combination between the two is realized using the standardized Binoth Les Houches Accord [44, 45].

GoSam is based on an algebraic approach where d-dimensional integrands are generated using Feynman diagrams. It uses QGraf [40] and Form [41, 42] for the diagram generation, and Spinney [46] and Form to write an optimized Fortran output. For the reduction of the tensor integrals we used Ninja [47,48,49], which carries out the reduction on the integrand level in a fully automated way via Laurent expansion. Alternatively one can choose other reduction strategies such as the OPP reduction method [50,51,52], which is implemented in Samurai [53] or methods based on tensor integral reduction as implemented in Golem95 [54,55,56,57]. For the evaluation of the remaining scalar integrals we have used OneLoop [58].

The evaluation of all tree-level like matrix elements within Sherpa has been performed using Comix [59], the subtraction terms have been calculated with the Sherpa’s implementation of the Catani–Seymour dipole formalism [60, 61].

3 Numerical results

In the following we present numerical results for the LHC with a center of mass energy of \(\sqrt{s}=13\; \text {TeV}\). To assess b-mass effects the calculation has been carried out with both massless b-quarks in the five-flavor scheme as well as with massive b-quarks in the four-flavor scheme.

3.1 Cuts and parameter settings

For the massless case we have used the CT10nlo pdf set [62] and the CT10nlo_nf4 set for the massive case respectively. In the massive case the b-mass has been set to 4.7 GeV.

Renormalization and factorization scales are set to be equal and the central scale was chosen to be

$$\begin{aligned} \mu _R = \mu _F =\frac{1}{2}\sqrt{m^2_{\gamma \gamma } +\left( \sum _i p_{T,i}\right) ^2}\;, \end{aligned}$$
(1)

where the sum runs over the final state partons. As this process contains external photons the electroweak coupling constant \(\alpha \) is set to \(\alpha = 1/137.03599976\). We have included top-quark loops in the virtual corrections with a top mass of \(m_t = 171.2 \) GeV.

The presence of final state photons requires the application of a photon isolation criterion to render the NLO corrections finite. We employed a smooth cone isolation criterion [63] with the following parameters:

$$\begin{aligned} R=0.4,\quad \epsilon =0.05,\quad n=1\;. \end{aligned}$$
(2)

For tight isolation the smooth cone isolation has been shown to be in good agreement with the alternative approach of applying a photon fragmentation function (see e.g. [68]). Additionally the isolated photons are required to fulfill

$$\begin{aligned} p_{T,\gamma } > 30\; \text {GeV},\quad |\eta _{\gamma }|<2.5\;. \end{aligned}$$
(3)

The QCD partons are clustered with an anti-\(k_T\) algorithm [64] contained in the Fastjet package [65]. The jet radius has been set to \(R=0.4\) and events where both b-quarks are clustered into a jet are rejected in order to ensure that there are at least two b-jets present in the final state. For the jets we require

$$\begin{aligned} p_{T,j} > 20\; \text {GeV}, \quad |y_j|<4.4\;. \end{aligned}$$
(4)

3.2 Cross sections and differential distributions

We start the discussion of the numerical results with the case of a massless b-quark. We assess the theoretical uncertainty by usual scale variation of a factor of 2 around the central scale. Based on the cuts and settings described above we find for the total cross section

$$\begin{aligned} \sigma _{LO} = 38.6^{+22\%}_{-17 \%}\; \text {fb},\quad \sigma _\mathrm{NLO} = 56.2^{+20\%}_{-15\%}\;. \end{aligned}$$
(5)

From Eq. (5) one can see that the NLO corrections enhance the total cross section by almost 50 per cent. It also shows that the theoretical uncertainty does not improve at NLO, instead for both LO and NLO one obtains an uncertainty of 15–20% in each direction when varying the scale by a factor of 2. This situation is shown more explicitly in Fig. 1 where we show the cross section as a function of the scale for a broader range. Looking at the curve for the inclusive NLO result one sees a born-like behavior even at NLO. In particular no reduction on the scale dependence is obtained throughout the whole range of scales. The typical turnover that one expects at NLO is not present.

Fig. 1
figure 1

Total cross sections at LO and NLO for massless b-quarks. In addition it shows the NLO cross section where the initial qg-channel has been removed as well as the exclusive two b-jet cross section where a veto on a third jet has been imposed

A special feature of this process is that the leading order process is mediated by two types of initial state, the \(q\bar{q}\)- and the gg-channel. At NLO, however, also the quark–gluon channel is opening up in the real emission. In order to investigate whether it is this channel that is responsible for the tree-level like behavior we made two different checks. First, we completely remove the quark–gluon channel from the process (green curve). This has a tremendous impact on the NLO result rendering the corrections negative over the whole range and the absolute value increases when going to smaller scales which even leads to unphysical negative cross sections for scales smaller than the central scale. Removing a production channel is of course not a physical meaningful procedure but it shows that this channel is indeed responsible for the behavior of the inclusive NLO cross section. A physically well-defined strategy, however, is to impose a jet veto on a possible third jet. This means that we remove events where a third QCD parton (independent whether it is a b-quark or not) would form an independent jet applying the anti-kt algorithm and which passes the jet cuts in Eq. (4). A jet veto effectively cuts away an intrinsically positive contribution from the real emission and will therefore lead to a decrease of the NLO result. The exclusive NLO result is given by the turquoise curve. Interestingly the two approaches lead to very similar results. Even though vetoing a jet is a well-defined procedure it leads to negative cross sections for scales smaller than the central scale. This behavior is driven by the virtual corrections that yield a negative correction and their absolute value increases when going to smaller scales. For small scales they dominate the total cross section and supersede the leading order contribution. This general behavior is expected and it indicates that the central scale could be chosen to be larger although it has been proven to be a good choice for the general diphoton plus two jets process [35]. The general two jet process of [35] is, however, different in that respect that the qg-channel already occurs at leading order which explains the different behavior of the total cross section under scale variation.

A lowering of the veto threshold would lead to a further decrease of the total cross section as one cuts away a larger part of the positive real emission contribution. It is clear that imposing a jet veto raises the question to what extent possible resummation effects can change the result and the associated theoretical uncertainty. This is, however, beyond the scope of this paper. The results show that this process is highly sensitive to an additional jet veto and that the scale variation might therefore not be an accurate measure of the theoretical uncertainty. The inclusive NLO result seems, however, suitable as a conservative estimation.

We now turn to a discussion of the differential distributions. Fig. 2 shows the \(p_T\) distribution of the two leading b-jets, where the jets are \(p_T\)-ordered. For both jets the NLO corrections for low values of \(p_T\) are relatively small which means that the NLO result agrees with the leading order result within the systemic uncertainty. Also the size of the NLO uncertainty is reduced compared to the leading order uncertainty. Going higher in \(p_T\), however, very rapidly increases the NLO corrections and from the order of 100 GeV on the differential k-factors are in the range of 2–2.5. Also the size of the NLO uncertainty band increases and for values beyond \(\sim 100\) GeV the uncertainties at NLO are roughly twice as big as the LO ones.

Fig. 2
figure 2

Transverse momentum of the two leading b-jets

Fig. 3
figure 3

Transverse momentum of the two photons

For the transverse momentum distribution of the photons shown in Fig. 3 the behavior is less pronounced than for the jets. For the leading photon the corrections are smallest for low values of \(p_T\) and rise almost linearly with increasing transverse momentum leading to a k-factor of almost 2 for values around 500 GeV. Similar to the jet distributions there is basically no overlap between the uncertainty bands. The subleading photon shows a milder behavior compared to the leading photon. Although the uncertainty bands also hardly overlap the differential k-factor is flat to a good approximation. For both photons one sees that the size of the NLO uncertainty is roughly of the same size as the LO uncertainties whereas for the jets the NLO uncertainties were larger except for small values of \(p_T\).

Fig. 4
figure 4

Invariant mass distributions of the two leading jets (upper left), the two photons (upper right) and the total invariant mass

Fig. 5
figure 5

R-separation of the two leading jets (l.h.s.) and the two photons (r.h.s.)

Fig. 6
figure 6

Azimuthal separation of the two leading jets (l.h.s.) and the two photons (r.h.s.)

As this process constitutes a background to double Higgs production, the invariant mass distributions are also essential. In the upper row of Fig. 4 we show the invariant masses of the two leading b- jets and of the two photons. In both cases one observes a significant shape distortion by the NLO corrections. They exhibit large corrections at low values followed by a minimum in the range of 60–80 GeV. In the case of the jets the NLO corrections then increase roughly linearly again, leading to substantial corrections for invariant masses beyond say 200 GeV. For the photons this behavior is mitigated and the differential k-factor is flat to a good approximation in the mass range beyond 200 GeV. It is worth noting that in the range around the Higgs mass the corrections are rather mild and one still finds an overlap between the uncertainty bands. And in particular the NLO behavior for low invariant masses allows one to reduce the NLO corrections by imposing an appropriate cut around the Higgs mass. The increase in the low invariant mass region at NLO can be understood by keeping in mind that additional radiation decreases the available phase space for each particle and the two photons or two jets, respectively, become closer in terms of both energy and angular separation. In a region where the invariant masses are small they are more sensitive to extra radiation and therefore one observes a shift towards small invariant masses. The plot in the lower row of Fig. 4 shows the total invariant mass of the final state where the sum runs over the two photons and the jets. There we see a drastic change in the shape of the distribution when going from leading order to next-to-leading order. At low invariant masses the NLO corrections are negative and substantial but then increase linearly and lead to substantial positive corrections in the region above \(\sim 400\) GeV. With the additional quark–gluon channel in the real radiation it is not surprising that the kinematics of the process changes compared to the leading order behavior, and this observable, being very inclusive in the final state probes the underlying kinematics of the process. In addition, for high center of mass energies (and this is essentially what this observable shows) it is more likely to have a third jet than for small center of mass energies. The real emission contribution will therefore play a more important role for high invariant masses whereas for small invariant masses the chance that the event is rejected increases as the available energy is distributed among more particles which increases the chance that one or more particles might not pass the cuts. One can expect also differences between signal and background in various angular distributions as in the case of the signal the b-jets and the photons stem from the decay of a spin-0 particle, whereas for the background processes the angular correlations are different. Figure 5 shows the R-separation between the two leading jets and the two photons respectively. Both distributions exhibit large corrections for small values for the separation with a minimum around \(\pi \). A similar behavior is also found for the azimuthal angle which is shown in Fig. 6 for the leading jets (l.h.s.) and the two photons (r.h.s.). Also there one finds for both the jets and the photons the largest contributions for small angles followed by a constant decrease. For \(\Delta \phi \approx \pi \) the NLO result agrees with the LO result within the theoretical uncertainty. For Figs. 5 and 6 concerning the increase for small values at NLO the same argument as for the invariant masses of the two photons and the two leading b jets can be applied. Additional radiation leads to a decrease of angular separation among all particles which leads to an enhancement of small separation at NLO.

3.3 Massive b-quarks

In this section we will scrutinize the validity of treating the b-quark as a massless particle. For this purpose we set the mass of the b-quark to its pole mass of 4.7 GeV. For a consistent treatment we employ the four-flavor scheme and use the CT10nlo_nf4 pdf set. Table 1 shows the total cross sections for the central scale at LO and NLO for massive quarks in direct comparison to the massless results. The massive LO order result is reduced by \(\sim 10\%\), at NLO the massive result is \(\sim 16\%\) smaller than the massless result. This reduces the k-factor by \(\sim 7 \%\).

At first it may seem unreasonable that the introduction of the b-mass does have such an influence on the result given that the mass is relatively small compared to all other scales in this process. However, one should keep in mind that there are several effects that need to be taken into account. The biggest effect certainly comes from the change of the pdf set that comes along with a lower value of \(\alpha _s\). \(\alpha _s(M_Z)\) is \(\sim 4.5 \%\) smaller in the massive case. This effect is the driving force in the reduction of the cross section. In addition, for this process the subprocesses with initial state b-quarks are enhanced due to t-channel like diagrams with the b-quark line going from initial to final state. These type of diagrams yield a large contribution that enhances the importance of initial state b-quarks compared to the other sea-quark contributions. A sample of diagrams that are only present in the five-flavor scheme but not in the four-flavor scheme is shown in Fig. 7.

Fig. 7
figure 7

Sample of tree-level diagrams that are only present in the five-flavor scheme and not in the four-flavor scheme

This effect has also been observed in the context of multiple b-quark production [66, 67] and also there the overall effect has been found to be large [69]. From comparing LO order results within the four-flavor scheme for the massless and the massive case we estimate the pure mass effect contributing to \(\sim 40 \%\) of the reduction of the cross section.

Table 1 Total cross sections at LO and NLO for the central scale and for massless and massive b-quarks
Fig. 8
figure 8

Transverse momentum distribution of the two leading jets for massive b-quarks

Fig. 9
figure 9

Transverse momentum distribution of the two photons for massive b-quarks

Fig. 10
figure 10

Invariant mass distribution for the two leading jets and the two photons (upper row) and for the total invariant mass (lower row) for massive b-quarks

It is now also important to investigate if and how big the massive b-quark will affect differential distributions. For simplicity we present the massive results only for the central scale. The focus here is on the change of the shape caused by the mass effects and we assume that the theoretical uncertainty will be of a very similar size as for the massless case. In Fig. 8 we show the transverse momentum distribution for the two leading jets. The upper ratio plot shows the ratio of the massive LO contribution over the massless result, the lower ratio plot shows the same for the NLO result. For comparison we also show the scale uncertainty for the massless case. The mass effects are dominated by the general decrease of the cross section in the massive case, but the differential k-factor is flat to a quite good approximation and the central scale of the massive result is still within the uncertainty band of the massless result except for the first bin where the uncertainty band becomes smaller. One can therefore conclude that the uncertainty from setting the mass to a non-zero value is contained within the systematic uncertainty from scale variation. The transverse momentum distribution of the two photons shows exactly the same behavior as can be seen in Fig. 9. Also here the differential k-factor is flat to a good approximation and the massive result is still in agreement within the uncertainty of the massless result. The same is also true for the invariant masses of the two leading jets and the two photons as can be seen in Fig. 10. Also here the massive result can be incorporated in the systematic uncertainty of the massless calculation. For the total invariant mass shown in the lower row of Fig. 10 the situation is a bit more special. Also here the mass leads to a flat shift downwards, but the ratio plots shows that for the NLO result the error band becomes very small in the region between 250 and 300 GeV. The reason for this behavior is that the upper and the lower scale cross the central scale in that region which makes the scale uncertainty vanish and leaving the massive result outside the estimated uncertainty. This might also be interpreted such that for this observable our scale choice is not suitable to describe this particular observable and give a reliable estimation of the underlying uncertainties. In general on would expect that if the introduction of a massive b-quark yields a shape distortion compared to the massless case, then this should preferably show up in distributions that separate regions of low and high energy/transverse momentum, such that there are regions where the b-mass becomes large compared to the other scales in the process. Distributions like the transverse momenta of the b-jets or the invariant mass of the dijet system seem to be the ideal candidates. However, as we have seen above, even in these distributions we do not observe a significant shape distortion and the effects of the b-mass are essentially reduced to a global shift induced by the different value of \(\alpha _s\). It is therefore not surprising that also in angular distributions we do not observe a different pattern. We exemplify this by showing the angular separation between the two leading jets and the two photons in Fig. 11. As for the transverse momentum distribution of the jets one observes a small effect in the first bin where the massive result is slightly below the uncertainty band of the massless result but also here the differential k-factor is flat over the whole range. For the separation between the photons the situation is very similar to the massive result being at the lower end of the uncertainty band with an otherwise flat k-factor.

Fig. 11
figure 11

Angular separation between the two leading jets (l.h.s.) and the two photons (r.h.s.) for massive b-quarks

In summary, the inclusion of the b-mass has a substantial effect on the total cross section and on differential distributions. However, it leads just to a global shift towards smaller values largely caused by the four-flavor pdf set and the smaller value for \(\alpha _s\). But it does not lead to significant distortions of shapes of the differential distributions. A shift via a global k-factor would therefore be able to accurately describe the mass effect.

4 Conclusions

The measurement of the triple Higgs coupling is an essential ingredient to completely determine the structure of the Higgs potential and to answer the question whether the Higgs boson is in agreement with the prediction from the Standard Model. The production of two Higgs bosons via gluon fusion yields the biggest contribution that includes the triple Higgs vertex.

In this paper we investigated the background of one of the most import decay channels, where one Higgs would decay into a \(b\bar{b}\) pair and the other Higgs would decay into a pair of photons. We calculated the \(\mathcal{O}(\alpha _s^2 \alpha ^2)\) contribution at next-to-leading order in QCD in the fully automated Sherpa + GoSam setup. We found large corrections due to new partonic channels opening up for the real emission contribution leading to a tree-level like behavior of the cross section under variation of renormalization- and factorization scale. The inclusion of NLO effects is therefore viable for a reliable theoretical prediction.

We also assessed the impact of a massive bottom quark. In a consistent treatment the inclusion of the mass comes along with a four-flavor scheme pdf set and therefore also the removal of subprocesses with initial state b-quarks. Altogether we found a significant reduction of the cross section which, however, is largely caused by the pdf set and the smaller value of \(\alpha _s\). The actual mass only plays a minor role. The massive result is still contained within the systematic uncertainty of the massless one and the shapes of the differential distributions are unchanged to a good approximation. This means that the mass effects can effectively be described by applying a global k-factor to the massless results.