1 Introduction

Both the ATLAS and the CMS collaborations reported the observation of an excess of events in the diphoton mass distribution around 750 GeV [1, 2] in roughly \(3\,\mathrm{fb}^{-1}\) of data recorded in 2015 at \(\sqrt{s}=13\) TeV, which was found to be compatible with the data collected at 8 TeV. The possibility that this corresponded to a new resonance state generated a great deal of theoretical interest, and a wide range of BSM models describing it were proposed; see for example [3] for a review and further references. While no significant excess was observed in the larger data set collected in 2016 [4, 5], it is nonetheless interesting to consider how a potential resonance of this type, which might be present at higher mass and hence be observable at the LHC or a future collider, can be produced.

One interesting, and in some sense natural possibility, for an excess that is seen in the \(\gamma \gamma \) decay channel, is that the resonance may couple dominantly to photons, with the coupling to gluons and other coloured particles being either suppressed or absent entirely. This has been discussed in [6,7,8,9,10,11,12,13,14,15]. More generally, we may expect significant couplings to quarks and gluons to be present. A method to distinguish between these different production modes, discussed in for instance [8, 9, 16,17,18], is to measure the multiplicity of jets produced in addition to the resonance. In this case we may naturally anticipate that if the resonance is dominantly produced in \(\gamma \gamma \) collisions, then the average jet multiplicity will be lower compared to the \(q\overline{q}\) and gg cases. While additional jets in this case are not parametrically suppressed by \({\sim }\alpha /\alpha _s\)—no additional \(O(\alpha )\) suppression is introduced by requiring that the quark produced in the initial-state \(q\rightarrow q\gamma \) splitting leads to a visible jet in the final state—nonetheless some suppression is present due to the smaller size of \(\alpha \) and lower photon branching probability. It is in addition well known that the particle multiplicity associated with an initial-state gluon is higher compared to the quark case, and so again some difference in jet activity can be expected here. For the vector boson fusion (VBF) WW-initiated channel the resonance is generally produced in association with at least two additional jets, due to the relatively high \(p_\perp \) recoiling quarks in the final state.

With this in mind, in this paper we will consider the case of a scalar resonance R for three mass points \(M_R=0.75, 1.5, 2.5\) TeV, to demonstrate the viability of the method, although this can be readily extended to other spin-parities. We will present a detailed analysis of the expected jet multiplicities corresponding to the different production scenarios, focussing on the gg, light \(q\overline{q}\) and heavy \(b\overline{b}\), \(\gamma \gamma \) and WW cases. We will show that the above expectations are indeed born out by a more precise MC analysis, which accounts for the decay of the resonance and full experimental acceptances and jet selection.

To demonstrate how such an approach may be applied to data, we will then compare to the 2015 ATLAS [1] measurement of the jet multiplicity in the spin-0 event selection sample, and demonstrate that this limited statistics data already show some mild tension with a dominantly gg-initiated scenario, while the light quark and in particular \(\gamma \gamma \)-initiated scenarios give good descriptions. The continuum background-only hypothesis also gives a somewhat worse description than the latter cases, and the WW hypothesis is found to give a particularly poor description, with a dominantly WW-initiated production mechanism in strong tension with the data; a similar conclusion will hold for the ZZ, and to a lesser extent, \(Z\gamma \)-initiated mechanisms. In the \(b\overline{b}\)-initiated case there is expected to be a sizeable fraction of b-jets observed in the final state, rendering such a possibility relatively easy to confirm or constrain, even for this limited data set; indeed the ATLAS measurement of the b-jet fraction in the signal region disfavours any sizeable bb-induced production mode. We also compare in an appendix to the 2015 ATLAS data collected with a spin-2 event selection, but still assuming a scalar resonance signal. While the total number of observed events in the signal region is larger, the S / B ratio is lower, and we find that all hypotheses give acceptable descriptions of the data.

Although no excess at 750 GeV was observed in the updated 2016 data set, this nonetheless demonstrates the discriminating power of this observable. For this reason we will also present expected exclusion limits on different production scenarios with integrated luminosity, and we will show that with just a few 10’s of \(\mathrm{fb}^{-1}\) it is possible to place strong constraints on the gg production mechanism in favour of the \(\gamma \gamma \)/\(q\overline{q}\) cases, and vice versa. It is more challenging to distinguish between the \(u\overline{u}\) and \(\gamma \gamma \) modes via this method, although still possible with enough data.

The outline of this paper is as follows. In Sect. 2 we present the details of our analysis and the simplified production model we consider. In Sect. 3 we present results for the predicted jet multiplicities corresponding to the different production scenarios. In Sect. 4 we compare to the ATLAS jet multiplicity measurement, and comment on the implications for the various production modes. In Sect. 5 we present the expected exclusion limits on different production scenarios as a function of the integrated luminosity collected at the LHC. Finally, in Sect. 6 we conclude. In the appendix we compare to the ATLAS spin-2 event selection, and show that all initial-state hypotheses provide acceptable descriptions.

2 Production model and analysis

To model the production of a scalar resonance \(X\rightarrow \gamma \gamma \) via \(\gamma \gamma \), gg and \(q\overline{q}\) initiated production we use an effective theory approach, with corresponding Lagrangian terms

$$\begin{aligned} \begin{aligned}&\mathcal {L}^{gg}=g_{g} \,G^{\mu \nu }G_{\mu \nu }\,R,\\&\mathcal {L}^{\gamma \gamma }=g_{\gamma } \,F^{\mu \nu }\,F_{\mu \nu }R,\\&\mathcal {L}^{q\overline{q}}=g_q\, q\overline{q}R,\\&\mathcal {L}^{WW}=g_W\, W^{\mu \nu }\,W_{\mu \nu }R, \end{aligned} \end{aligned}$$
(1)

where we make no assumptions as regards the size of the couplings g. We consider resonance masses of \(M_R=0.75, 1.5, 2.5\) TeV, with a uniform width of \(\Gamma _\mathrm{tot}=45\) GeV, although the results which follow are largely independent of the size of the width.

Parton-level events are then generated at LO with up to 2 additional partons in the final state using MadGraph 5 [19], which are MLM merged at scale \(Q_\mathrm{cut}=125,\,250,\,350\) GeV for \(M_R=0.75, 1.5, 2.5\) TeV, respectively, to parton shower generated with Pythia 8 [20, 21], including hadronisation and multiple parton interactions. Events for the Standard Model continuum \(\gamma \gamma \) process, which proceeds dominantly via \(q\overline{q}\rightarrow \gamma \gamma \), are generated in the same way. For the WW initial state only the Born-level process, which generally leads to two additional quark jets in the final state, is generated and passed to Pythia for parton shower and hadronisation, with no matching required. For the initial-state photon PDFs we apply the approach described in [9, 22], with the MMHT14 LO [23] PDF set used for all other partons, however, the normalised distributions we present below are relatively insensitive to these choices. All the analyses performed below make use of Rivet [24].

Fig. 1
figure 1

Exclusive jet multiplicities for minimum jet \(p_\perp >25\) GeV, for the gg and \(\gamma \gamma \) initial-state resonance production processes, and the SM continuum \(\gamma \gamma \) production process. Results shown for resonance masses \(M_R=0.75, 1.5, 2.5\) TeV

For concreteness, we apply the event selection:

  • Two reconstructed photons, satisfying \(p_\perp {>}0.4(0.3) M_{\gamma \gamma }\) for the leading (subleading) photon and \(|\eta _\gamma |<2.7\).

  • The diphoton invariant mass lies in the range \(M_R-25 \,\mathrm{GeV}<M_{\gamma \gamma }<M_R+25\) GeV.

  • An isolation requirement for all particles in a cone \(\Delta R=0.4\) around the photons direction \(E_\perp ^\mathrm{iso}<0.05 E_\perp ^\gamma + 6\) GeV.

This is guided by the ATLAS [1] selection for the spin-0 resonance sample, however, we note that the results which follow are largely insensitive to these precise choices, for example, if a looser set of cuts on the final-state photons is imposed. We reconstruct jets with the anti-\(k_t\) algorithm [25] with distance parameter \(R=0.6\), \(p_\perp ^j>25\) GeV and \(|\eta ^j|< 4.4\). We choose this somewhat larger value of R compared to that taken in [1] as this allows a greater discrimination between the different production modes, while still being relatively insensitive to the influence of the underlying event generated with Pythia for the corresponding minimum jet \(p_\perp \).

Fig. 2
figure 2

Exclusive jet multiplicities for minimum jet \(p_\perp >50\) GeV, for the gg and \(\gamma \gamma \) initial-state resonance production processes, and the SM continuum \(\gamma \gamma \) production process. Results shown for resonance masses \(M_R=0.75, 1.5, 2.5\) TeV

3 Jet multiplicities

Using the event selection and model choice outlined in the previous section we can then evaluate the cross section for resonance production via the \(\gamma \gamma \), gg and \(q\overline{q}\) initial states. In the latter case we will consider both \(u\overline{u}\) and \(b\overline{b}\), since as we will see there is a non-negligible difference in the expected distributions for light and heavy quark mediated processes. The expected distributions for the other light \(d\overline{d}\) and \(s\overline{s}\) quark initiated modes are very similar to the \(u\overline{u}\), and thus while we will for concreteness refer in the following to the ‘\(u\overline{u}\)’ production mode, this can be considered as the prediction for any light quark-induced process.

In addition, as discussed in [18] this method may also be applied to the case of, for example, WW-initiated VBF. We will consider this below, but it is worth pointing out that, as discussed in [9], due to the relatively large mass of the exchanged t-channel W bosons the transverse momentum of the final-state \(\gamma \gamma \) system may also be a sensitive observable. In particular within the simple approach of that paper we expect that only roughly \({\sim }20\)% of the cross section has \(p_\perp ^{\gamma \gamma } < 60\) GeV. The same effect will also be similarly present in the ZZ and \(Z\gamma \) channels.

Fig. 3
figure 3

Exclusive jet multiplicities for minimum jet \(p_\perp>\) 25 GeV, for the \(u\overline{u}\), \(b\overline{b}\) and WW initial-state resonance production processes. Results shown for resonance masses \(M_R=0.75, 1.5, 2.5\) TeV

Fig. 4
figure 4

Exclusive jet multiplicities for minimum jet \(p_\perp>\) 50 GeV, for the \(u\overline{u}\), \(b\overline{b}\) and WW initial-state resonance production processes. Results shown for resonance masses \(M_R=0.75, 1.5, 2.5\) TeV

In Figs. 1 and 2 we show the cross section for resonance production accompanied by \(N_\mathrm{jet}\) jets, for \(p_\perp ^j> 25(50)\) GeV, respectively. The results are presented with the fixed merging scales described in the previous section, but we have checked that when varying these between a wider range the results presented below do not vary by more than \({\sim }10\%\), and usually much lower; this can be considered as an estimate of the theoretical uncertainty in our approach, and all results should be interpreted with this in mind. We can see that the difference between the gg and \(\gamma \gamma \) initial states for the lower \(p_\perp ^j> 25\) GeV cut is dramatic, with over \(50\%\) of the \(\gamma \gamma \) initiated events having no jets passing the selection, while for the gg-initiated process this is below \(20\%\). In addition, the overall shape of the jet multiplicity distribution is correspondingly different, with the \(\gamma \gamma \) distribution peaking at 0 jets, while the gg exhibits a peak at \(N_\mathrm{jet}=1-2\). At higher \(p_\perp ^j>\) 50 GeV the difference is less dramatic, although the population of the 0 jet bin is still almost a factor of 2 higher in the \(\gamma \gamma \) case. The cause of this remarkably different behaviour is discussed in detail in [9], and is generated by the simple fact that the initial-state coloured gluons exhibit a strong preference to radiate further, so that there is a strong Sudakov suppression in the cross section for no further emission within the corresponding acceptance; the initial-state photons, on the other hand, exhibit a much weaker preference to radiate further, and while for example the parent quark in the \(q\rightarrow q\gamma \) splitting may lead to an observable jet in the final state, the effect of the jet veto is much less significant. A similar trend is seen in all three mass points, although in general the jet multiplicity is observed to increase with increasing mass for the gg and continuum cases, as expected from the increasing phase space for bremsstrahlung gluon emission. Interestingly, for the \(\gamma \gamma \) initial state the multiplicity in fact decreases as \(M_R\) is increases. A possible explanation for this effect may be the fact that to very good approximation the only observed jets here are due to the final \(q\rightarrow q\gamma \) splitting before the hard process; the \(N_j \ge 3\) population is extremely low. As \(M_R\) increases the average momentum fraction x carried by the parent quark, and hence the produced quark jet, increases. This will lead to the jet being produced on average at higher rapidity, making it more like to fail the \(|\eta _j|<4.4\) requirement. As a result of these trends, for a \(M_R=2.5\) GeV resonance the \(\gamma \gamma \) to gg ratio in the 0 jet bin is particularly large, and the discriminating power of this approach improves with increasing resonance mass.

In Figs. 3 and 4 we show the predicted distribution for the WW-initiated VBF channel for \(p_\perp ^j>25(50)\) GeV, respectively: here, the resonance is produced in association with two outgoing quarks recoiling against the exchanged W bosons, and which carry on average a relatively high transverse momentum \(p_\perp \sim M_W\). These therefore tend to produce observable jets in the final state, so that the predicted jet multiplicity is very high, larger still than in the gg case, and strongly peaking at \(N_\mathrm{jet}=2\). As in the \(\gamma \gamma \) case, as the resonance mass is increased the average multiplicity is seen to decrease, with the 0 and 1 jet bins become more populated.

We also show in Figs. 3 and 4 the predictions for heavy \(b\overline{b}\) and light \(u\overline{u}\) quark mediated production. The results for the three mass points are similar, with some increase in jet multiplicity with increasing \(M_R\), similar to the gg case. We observe a non-negligible difference in the distributions between the heavy and light quark cases, with the \(b\overline{b}\) distribution following the gg closely, while the \(u\overline{u}\) is expected to be accompanied by a lower jet multiplicity. The difference between the \(u\overline{u}\) and gg cases is in line with QCD expectations, for which the average particle multiplicity from an initial-state gluon is higher than for a quark [26]. For the \(b\overline{b}\) case, as the b-quark PDF is generated entirely by DGLAP emission above the b-quark threshold, there is a much greater contribution from the \(g\rightarrow b\overline{b}\) process in the initial state, where the outgoing b-jet is observed within the jet acceptance. Therefore, by requiring one or more b-jets in the final state, the \(b\overline{b}\) initiated production mode may be identified or constrained. In Fig. 5 we show the b-jet multiplicity in this production mode for a range of jet \(p_\perp \) cuts, and we can see that roughly \({\sim }50\%\) of events are expected to be accompanied by an additional b-jet in the final state (for other initial states this fraction is of course significantly lower). The case of \(M_R=0.75\) TeV is shown for concreteness, although similar results hold for the other mass points.

Although Figs. 1, 2, 3 and 4 only consider two choices of jet cut, we can also generalise this, showing in Fig. 6 the ratio of 0 to \({\ge }1\) jet cross sections as a function of the minimum jet \(p_\perp \). The same hierarchy in jet ratios between the different production modes and the same trends with increasing \(M_R\) described above are clear. Such distributions have been considered in [17] within a distinct analytic SCET approach, for all cases but the \(\gamma \gamma \) and WW initial states, and the results are found to be similar. In addition, in [9] a simple analytic approach is taken to calculate the 0-jet cross sections in gg and \(\gamma \gamma \)-mediated production, and again the results are very similar to those here for these cases.

It is interesting to observe that the expected trend with the jet \(p_\perp \) cut in Fig. 6 is also in general different between the various production modes. In particular, as the \(p_\perp \) cut is increased the jet ratio in the quark mediated processes increases more rapidly, such that for \(p_\perp \gtrsim 60 \) GeV the \(u\overline{u}\) ratio is in fact higher than the \(\gamma \gamma \). On the other hand, while for lower jet \(p_\perp \) cut, the \(\gamma \gamma \) continuum ratios are very similar to the \(u\overline{u}\), as expected from the dominantly light \(q\overline{q}\) initial state for this process, as the \(p_\perp \) cut is increased, and the jet ratios become more sensitive to the structure of the production process, this is no longer the case, with the jet multiplicity being higher in the continuum case. This indicates that at these higher \(p_\perp \) values the predictions may be sensitive to the precise nature of the resonance production process, which is not included in the effective description considered here.

Finally, in Fig. 7 we show the 0 to \({\ge }2\) jet ratios. The relative hierarchy and trends with the jet \(p_\perp \) cut are comparable to the 0 to \({\ge }1\) case, but in fact the separation between the \(\gamma \gamma \) and gg predictions, for example, is increased. This indicates that a greater discrimination can be achieved by considering the full jet multiplicity distribution as in Fig. 1 rather than an individual ratio. In the following section we will consider quantitatively what limits may be set on the different production modes using such distributions.

Fig. 5
figure 5

Exclusive b jet multiplicities for different minimum jet \(p_\perp \), for the \(b\overline{b}\rightarrow R\) production process and for resonance mass \(M_R=0.75\) TeV

Fig. 6
figure 6

Ratio of 0 jet to \(\ge \) 1 jet cross sections as a function of the minimum jet \(p_\perp \) for a range of initial-state resonance production processes, and for the continuum \(\gamma \gamma \) background. Results shown for resonance masses \(M_R=0.75, 1.5, 2.5\) TeV

Fig. 7
figure 7

Ratio of 0 jet to \(\ge \) 2 jet cross sections as a function of the minimum jet \(p_\perp \) for a range of initial-state resonance production processes, and for the continuum \(\gamma \gamma \) background. Results shown for resonance masses \(M_R=0.75, 1.5, 2.5\) TeV

Fig. 8
figure 8

Exclusive jet multiplicities, for different initial-state resonance production processes, and the SM continuum \(\gamma \gamma \) production process, compared to ATLAS [1] measurement in the range \(700<M_{\gamma \gamma }<840\) GeV. The continuum background is taken from [1], and is included in all signal distributions, assuming a S / B ratio of 1

4 Comparison to ATLAS data

In the ATLAS analysis [1] a measurement is presented of the exclusive jet multiplicity in the signal \(700<M_{\gamma \gamma }<840\) GeV region for the spin-0 resonance selection described in Sect. 2. Although no significant excess is observed in the larger 2016 data set, it is nonetheless instructive to compare the results of our study with these data. It should, however, be emphasised that such data have limited statistics and, moreover, as these are not presented in an unfolded form, our comparison can only be approximate, as it omits a full ATLAS detector simulation as well as the inclusion of pile-up (although pile-up jets are largely rejected by cuts based on tracking information). Nonetheless for these data the dominant source of experimental uncertainty is certainly statistical and, moreover, the more significant differences predicted between for example the \(\gamma \gamma \) and gg are visible even in light of this, as we will show. In our comparison we include a contribution from the SM continuum \(\gamma \gamma \) background, which to be as close to the ATLAS analysis as possible, we simply take from their quoted SHERPA [27] MC predictions for the \(\gamma \gamma \) continuum (which represent 90% of the total background). We take a S / B ratio of 1, roughly corresponding to that seen in the data, and compare to the \(N_\mathrm{jet}\le 3\) bins, where 29 of the 31 observed events lie, and the theoretical predictions are more reliable.

The predicted distributions for the production processes discussed above are shown in Fig. 8, along with the ATLAS data. From a straightforward visual comparison, it is clear that the data show some preference for the \(\gamma \gamma \) and \(u\overline{u}\) scenarios, in comparison to the background-only and in particular the gg and WW cases. To give a more precise estimate, we can evaluate the corresponding \(\chi ^2\) valuesFootnote 1, as shown in Table 1. These results confirm the above expectation, with the description in the gg case being poor; if we assume \(n_\mathrm{dof}=n_\mathrm{bins}-1=3\) for these normalised distributions, this corresponds to roughly a \({\sim }95\%\) exclusion. On the other hand, the description is very good in the \(\gamma \gamma \) and \(u\overline{u}\) cases.Footnote 2 Interestingly, the description for the pure \(\gamma \gamma \) background hypothesis is also relatively poor. Finally, the description for a purely WW initial state is very poor indeed, corresponding to a \({\sim }99.8\%\) exclusion; we expect similar results in the ZZ and, to a lesser extent, \(\gamma Z\) modes.

The description in the \(b\overline{b}\) case is reasonable, but we note that here by far the most discriminating variable is, as discussed in the previous section, the b-jet multiplicity. Even with the relatively low statistics data sample, a measurement of this observable would lend strong support, or place strong constraints on, such a scenario. Indeed, in [1] it is reported that, with a b-tagging efficiency of about 85%, roughly 8% of events in the signal region are found to contain b-jets, consistent within statistical uncertainties with the sideband regions. Comparing with Fig. 5, we can see that \({\sim }60\%\) of signal events in the purely bb-initiated scenario are predicted to contain b-jets in the ATLAS event selection. Even accounting for the continuum background such a result is in strong tension with a dominantly bb-induced production mode, although to evaluate the exact constraints would require a more precise comparison accounting for the b-jet efficiency and mis-ID rate. We will therefore not consider the \(b\overline{b}\) case in the following.

While the precise values of the \(\chi ^2\) depend on the uncertain S / B ratio, and merging scale \(Q_\mathrm{cut}\), these are only found to vary by \({\sim }10\%\) for reasonable changes in these parameters. Taking our simulation for the \(\gamma \gamma \) continuum background leads to somewhat larger values of \(\chi ^2\), although still with the \(\gamma \gamma \) and \(u\overline{u}\) cases giving very good descriptions of the data, with the same hierarchy observed for the other scenarios.

Before concluding this section, it is worth noting that the transverse momentum \(p_\perp ^{\gamma \gamma }\) of the diphoton system has also been measured in [1], where it is found that \({\sim }60\%\) of the observable cross section has \(p_\perp ^{\gamma \gamma }<60\) GeV. Such an observable is also sensitive to the production mode of the resonance, with the gg and in particular WW hypotheses predicting broader distributions. While the description of the measured \(p_\perp ^{\gamma \gamma }\) distribution is in fact relatively poor in all cases, including the background-only hypothesis, due to the apparent downward fluctuation in the second bin, the same hierarchy in the quality of the data descriptions is found as in the case of the jet multiplicity. While these two observables are evidently correlated, this nonetheless gives a consistency check of the overall approach, and indeed with further data and finer binning, the \(p_\perp ^{\gamma \gamma }\) observable may also allow further discrimination between the modes.

Finally, it is worth emphasising that the results presented above are from a statistical point of view expected to be conservative. In particular, the statistical errors on the normalised jet multiplicity distribution presented by ATLAS in [1] are simply found by rescaling the errors on the observed event numbers in each bin by the total number of observed events. However, this ignores the positive correlation between the measured numbers of events in each bin with the total number of observed events, which will lead in general to a reduction in the error size as well as a correlation between different bins for the normalised distribution; for the relatively low number of measured events this effect will not necessarily be negligible. Indeed, taking a simplified approach and treating all uncertainties on the normalised distribution as Gaussian, we find a that a more complete treatment including the full correlation matrix between the different bins leads to a further deterioration in the quality of the description for the gg, WW and background-only scenarios. However, to be conservative we do not present such a comparison in detail here.

Table 1 The \(\chi ^2\) values for the description of the four \(N_\mathrm{jet} \le 3\) bins for the ATLAS [1] measurement of the exclusive jet multiplicities, for different initial-state resonance production processes. The contribution from the SM \(\gamma \gamma \) continuum, taken from [1], is included, and \(S/B=1\) is assumed. These values correspond to the distributions shown in Fig. 8
Fig. 9
figure 9

The expected 95% exclusion limits on the fractional contributions to a \(M_R=0.75\) TeV scalar resonance production cross section, described in the text, against the hypotheses of pure gg, \(\gamma \gamma \) or \(u\overline{u}\)-initiated production. For example, in the upper two plots, \(f_{g\gamma }=1(0)\) corresponds to purely gg(\(\gamma \gamma \)) induced production, and similarly for the other plots. The dashed lines correspond to the median value of the limit, and the \({\pm }1\) and 2 \(\sigma \) ranges are shown

5 Expected limits on production modes

Having found such promising results when comparing to the limited 2015 ATLAS jet multiplicity data, it is natural to consider what limits we can expect on the production modes. To be more precise, we will perform a confidence limit (CL) hypothesis test, analogous to those used by the ATLAS [28] and CMS [29] analyses for determining the spin and parity of the SM Higgs. The test statistic is defined as

$$\begin{aligned} Q=-2\ln \frac{\mathcal {L}(h_1)}{\mathcal {L}(h_2)}, \end{aligned}$$
(2)

for the initial-state hypothesis, i.e. \(h=\gamma \gamma , gg, q\overline{q}\) or more generally some non-trivial admixture of these. This can be used to discriminate between an initial-state production hypotheses \(h_1\) in favour of \(h_2\). For an observed value of \(Q_\mathrm{obs}\) the exclusion of the \(h_2\) in favour of \(h_1\) is given in terms of the modified confidence level

$$\begin{aligned} \mathrm{CL}_s=\frac{P(Q\ge Q_\mathrm{obs}|h_1)}{P(Q\ge Q_\mathrm{obs}|h_2)} \end{aligned}$$
(3)

where \(P(Q\ge Q_\mathrm{obs}|h)\) is the probability that the test statistic is at least as high as \(Q_\mathrm{obs}\) under a hypothesis h.

We can thus use this approach to calculate the expected exclusion limits on a specified production mode \(h_2\), assuming the resonance is produced via \(h_1\). To be more concrete we assume three simplified scenarios where the produced resonance is only produced via two of the three \(\gamma \gamma \), gg and \(u\overline{u}\) initial states, i.e.

$$\begin{aligned} \sigma _{gg}&=f_{g\gamma }\, \sigma _\mathrm{R} \quad \sigma _{\gamma \gamma }=(1-f_{g\gamma })\,\sigma _\mathrm{R},\nonumber \\ \sigma _{gg}&=f_{g u}\, \sigma _\mathrm{R} \quad \sigma _{u\overline{u}}=(1-f_{g u})\,\sigma _\mathrm{R}, \nonumber \\ \sigma _{\gamma \gamma }&=f_{\gamma u}\, \sigma _\mathrm{R} \quad \sigma _{u\overline{u}}=(1-f_{\gamma u})\,\sigma _\mathrm{R}, \end{aligned}$$
(4)

where \(\sigma _R\) is the signal cross section; we do not consider the WW initial state for simplicity, but will comment on this below. We take the case of \(M_R=0.75\) TeV in the following, but similar results follow for higher mass points. To see how these limits can be expected to extend given a certain amount of luminosity, we will for concreteness consider an observed cross section of 7 fb after cuts in the narrower \(725<M_{\gamma \gamma }<775\) GeV region, with the SM continuum (taken now from our MC sample) and resonance signal present equally, i.e. \(S/B=1\). We also include a 10% systematic uncertainty on the data, as a fairly conservative assumption.

The expected 95% exclusion limits on the cross section fractions in (4) are shown in Fig. 9. In each case we consider the limit on the fraction f against the hypotheses that the resonance is purely produced in the corresponding production modes; for example we calculate the exclusion on \(f_{g\gamma }\) under both the purely gg and the \(\gamma \gamma \) hypotheses. We can see that already for the integrated luminosity of 3.2 \(\mathrm{fb}^{-1}\) corresponding to the ATLAS data, a 95% exclusion of the purely gg hypotheses, corresponding to \(f_{g\gamma },f_{gu}=1\), is consistent with the projected limits, and with the results of the previous section. Indeed, if we treat the fractions f in (4) as free parameters and perform a \(\chi ^2\) minimisation and profile test with respect to the ATLAS data we find that \(f_{g\gamma } (f_{g u}) < 0.71 (0.76)\) at 95% confidence, with minima at 0, while the \(f_{\gamma u}\) is unconstrained; these results are again completely consistent with these expected confidence limits. We can moreover see that with just a few 10s of \(\mathrm{fb}^{-1}\) any significant gg-induced component can be excluded in favour of the purely \(\gamma \gamma \) and \(u\overline{u}\) hypotheses, and conversely any significant \(\gamma \gamma \) or \(u\overline{u}\) induced component can be excluded in favour of the purely gg.

On the other hand, for the \(u\overline{u}\) and \(\gamma \gamma \) scenario, for which the two available production modes predict closer jet multiplicity distributions, see Fig. 8, the situation is more challenging, and a larger O(100) \(\mathrm{fb}^{-1}\) sample is required for a sizeable contribution from the \(\gamma \gamma \) mode to be excluded in favour of the purely \(u\overline{u}\), and vice versa. In such a situation it is likely that other methods for distinguishing between the different initial states will be more competitive. For example, as discussed in [6, 9], a measurement of just a few resonance events in the essentially background free exclusive channel, where both protons remain intact after the collision, will provide strong evidence in favour of a significant contribution from the \(\gamma \gamma \) mode, and conversely any lack of observed events will disfavour this. Alternatively, by selecting events with rapidity gaps in the final state the \(\gamma \gamma \)-initiated contribution may be isolated: it is worth recalling, in particular, that a significant fraction of \(\gamma \gamma \)-initiated events are expected to occur due to low-scale coherent emission from the protons, which naturally lead to rapidity gaps in the final state; see [22].

Finally, returning to the VBF channel, if we consider the simplified scenario that the resonance is produced by the WW and \(\gamma \gamma \) modes, i.e.

$$\begin{aligned} \sigma _{WW}=f_{W\gamma }\, \sigma _\mathrm{R} \quad \sigma _{\gamma \gamma }=(1-f_{W\gamma })\,\sigma _\mathrm{R}, \end{aligned}$$
(5)

then we already find that \(f_{W\gamma }<0.48\) at 95% confidence, with similar limits if the additional mode is gg or \(u\overline{u}\).

6 Conclusion

The observation by the ATLAS and CMS collaborations, in roughly \(3\,\mathrm{fb}^{-1}\) of data at \(\sqrt{s}=13\) TeV recorded in 2015, of an excess of events around 750 GeV in the diphoton mass spectrum provoked a great deal of theoretical interest. Although no significant excess was seen in the increased 2016 data set, this initial observation has motivated us to discuss in detail the possibility of using measurements of the jet multiplicity associated with the production of a high-mass resonance as a means to distinguish between different production mechanisms.

Specifically, we have considered the \(\gamma \gamma \), gg, WW, light quark \(q\overline{q}\) and heavy \(b\overline{b}\) quark initiated processes and shown that in each case the predicted level of jet activity is quite different. In particular, due to the small size of \(\alpha \) and the lower photon branching probability, the jet multiplicity is expected to be lower in the \(\gamma \gamma \) case compared to the QCD quark and gluons. As the particle multiplicity associated with an initial-state gluon is higher compared to the quark case, the higher jet activity in the gg case also allows this mechanism to be separated from the \(q\overline{q}\). For the WW-initiated channel the resonance is generally produced in association with at least two additional jets, due to the relatively high \(p_\perp \) recoiling quarks in the final state. For the heavy \(b\overline{b}\) the most discriminating observable is instead simply the b-jet fraction, which is predicted to be significantly higher than in all other scenarios.

To demonstrate how such an approach may be applied to data, in this paper we have compared these results with the 2015 ATLAS measurement of the jet multiplicity in the spin-0 signal region associated with the initial diphoton excess observation. We have found that even with these fairly limited data, a purely gg-initiated scenario is disfavoured, while the light \(q\overline{q}\) and \(\gamma \gamma \) scenarios provide a good description. The continuum background-only hypothesis also gives a somewhat worse description than the latter cases. In addition, we have found that a dominantly WW-initiated production mechanism is in strong tension with the data, and we have seen that the relatively small observed b-jet fraction disfavours a dominantly \(b\overline{b}\)-initiated production mechanism. We have also presented expected exclusion limits on different production hypotheses with the collected integrated luminosity. With just a few 10’s of \(\mathrm{fb}^{-1}\), we can expect to rule out any sizeable fraction of gg in favour of light \(q\overline{q}\) and \(\gamma \gamma \) production, and vice versa. It is more challenging to distinguish between the \(u\overline{u}\) and \(\gamma \gamma \) modes, although still possible with enough data.

If a high-mass resonance is observed at the LHC or a future collider, one of the first tasks will be to determine as precisely as possible the nature of such a new state. It has been the goal of this paper to present a method for distinguishing between various production modes, which can be applied to any heavy resonance which we might hope to find in the future.

Fig. 10
figure 10

Exclusive jet multiplicities, as in Fig. 8, but compared to data taken with the ATLAS [1] spin-2 event selection. The continuum background is taken from [1] and is included in all signal distributions, assuming a S / B ratio of 0.5