1 Introduction

Resonance search is one of the most simple and direct ways to establish the presence of new particles at a collider. With high centre-of-mass energies available at hadronic colliders such as the Large Hadron Collider (LHC), it serves as a perfect playground for resonance searches of massive particles predicted in many beyond the standard model (BSM) scenarios. The LHC energies and available integrated luminosity can help search for these exotic particles beyond 4–5 TeV. The envisaged 100 TeV hadron machine [1] will expectedly improve this range by nearly a factor of magnitude and more so for strongly interacting particles. However, finding a signal for such strongly interacting heavy particles which dominantly decay to either quarks,Footnote 1 gluons or both against the huge QCD background is very difficult. The major challenge at a hadronic machine such as the LHC is compounded by the fact that final states are swamped with jets which one needs to understand and unravel in order to extract signal for new physics in the all hadronic final state. However, there are techniques to reduce these huge QCD backgrounds from the signal. There already exist searches for heavy resonances in the dijet channel by CMS [2,3,4,5,6,7] and ATLAS [8,9,10,11]. The experimental collaborations provide an upper limit at 95% C.L. on the cross section for heavy resonance production and its decay to three different decay channels, viz. qq, qg, and gg, separately. Although they do not distinguish between quark or gluon jets explicitly in the final state, they use the line-shape information of the resonance peak to set individual limits [12] for the three different types of final state. In this work, we use jet substructure techniques in studying several exotic resonances in the dijet final state and show that in addition to the line-shape, if we can distinguish between the flavourFootnote 2 of the jets, existing and future limits on the cross section for heavy resonances can be significantly improved. Thus a good knowledge of discrimination between quark and gluon jets will not only help in improving the resonance searches at the HE-LHC and future 100 TeV machine but also help in pinning down the new physics scenario by giving hints on the interaction Lagrangian.

Table 1 Examples of resonant particles with the production cross section at 13 TeV LHC for each resonant particle at 2 TeV

The recent developments in the study of jet substructure provide us with one of the best ways to gather more information from a collider event with large hadronic activity [13,14,15,16,17]. The use of jet substructure is becoming popular day by day due to the recent developments in both theoretical and experimental understanding of physics inside the jets, where perturbative and non-perturbative effects dominate in different regions in the energy scales. It has been shown that theoretical understanding about the substructure of a jet can even allow us to differentiate between quark-initiated and gluon-initiated jets to a certain extent [18,19,20,21,22,23,24,25,26,27,28,29] due to the difference in their radiation pattern inside the jet. Further improvements in this direction can be possible with the use of machine learning techniques which along with the substructure picture of the jets makes the discrimination even more robust and allows us to enhance the discrimination power between quark and gluon jets [30,31,32,33,34,35,36,37]. This quark and gluon tagging can therefore be used to improve the search for resonances in several BSM physics scenarios at the colliders.

While discrimination between a quark and a gluon jet would perhaps be a novel approach to identify certain BSM scenarios, in this work we take a slightly different approach. Instead of distinguishing a quark jet from a gluon jet on a jet-by-jet basis, we take an event containing several jets as a whole and the analysis is done on the event variables as well as on the jet substructure observables. The jet substructure observables chosen for quark-gluon discrimination will have different probability distributions for quark and gluon jets. In general, all the jets in a set of events produced at the collider are not entirely quark-initiated or gluon-initiated jets, but a certain fraction of them are quark jets or gluon jets contributing to the events. If these fractions are different for signal events from the background events, there will be some degree of distinguishing power between signal and background. Through this work, we try and show that even if we do not tag the jets in a particular event as quark or gluon jets, we can still conclude with some confidence on whether the event is a signal-like event or a background-like event.

The fraction of quark or gluon jets in a set of events, which we talked about in the last paragraph, can be a fixed fraction generated in every single event or it can be an overall fraction from a set of events. For example, if a strongly interacting heavy fermion (e.g. excited quarks \(q^*\)) decays to two light-flavoured jets, then one expects that there will be 50% quark jets and 50% gluon jets in each event and hence the fraction will be 50% in the whole set of events for both quark and gluon jets. On the other hand, if a heavy coloured boson (e.g. a coloron) decays to two light-flavoured jets then it will either decay to a pair of quark jets or to a pair of gluon jets. Hence the overall fraction of quark jets and gluon jets in the collection of events will be determined by the relative coupling of the heavy boson with a quark pair and a gluon pair. However, note that such a fraction in the events could also originate from non-resonant dijet or multijet production processes at a collider. In fact, the Standard Model (SM) dijet background is a non-resonant production of two light flavoured jets. The probability distribution for jet substructure observables will nearly be the same for a given fraction irrespective of whether it originates from every single event or from a collection of events. Thus closer scrutiny of the events with jet tagging, comparing quark-initiated and gluon-initiated jets would be crucial in understanding the heavy resonances.

In this work, we highlight the improvements that can be achieved in dijet resonant searches by the use of jet substructure techniques. Similar studies have been done previously, where in Ref. [38] the jet substructure observables have been suggested to improve the searches while Refs. [39, 40] use color discriminant variable and jet energy profile as the means of improving the sensitivity in the dijet channel. However, all three studies considered parton showered events only. Our work is more along the lines presented in Ref. [38] but with substantial improvements in the analysis. We include the SM QCD background and use a fast detector simulation and account for the correction to jet energy scales in the computation of the event variables, similar to the procedure followed by experimental collaborations. Moreover, we use an additional set of observables which are considered to be useful in CMS and ATLAS analyses [21,22,23,24,25]. We also show how one can use jet substructure technique and the boosted decision tree (BDT) multivariate method to distinguish different type of resonances in the dijet channel.

2 Dijet resonances

In this study we consider three different kind of dijet resonances, each of which gives us one of the three types of dijet signals: qq, qg, or gg. For resonances leading to qq final state, each of the jets will be quark-like while it will be gluon-like for gg final state. On the other hand, for qg final state, its properties will be an admixture of both quark and gluon-like. Hence, if we consider jet substructure observables of the hardest jet (or second hardest jet), we expect their distributions (observables sensitive to jet-types) to be different for these three cases. The distributions for observables of qq will be far apart from those of gg final state while for the qg final state, the distribution is expected to show an overlap with above two.

In Table 1, we list a few important BSM models and their relevant part of Lagrangians which would yield the dijet signal processes in these models, (e.g. see Refs. [41,42,43]). Note that we have classified the choice of models based on the intrinsic spin of the resonance in the dijet signal. A more comprehensive list of all possible coloured particles that give a dijet resonance at the LHC can be found in Ref. [44]. We also list the production cross sections for the resonant exotic particle at 13 TeV LHC for the specific values of the parameters listed in Table 1 for each model. The parameter values have been chosen such that the cross sections remain below the existing upper limit from the dijet resonance search by CMS [2]. For further analyses, we restrict ourselves to only three specific models (as no significant difference was observed in the current analysis between models where the resonance differed in spin but gave the same dijet final state). We therefore chose the coloron model [45,46,47] for qq resonance, color octet scalar model [48,49,50] for gg resonance, and excited quark model [51,52,53,54,55] for qg resonance.

Fig. 1
figure 1

Distribution of jet substructure observables for SM gg events (blue dashed), qq events (green solid), and qg events (red dot-dashed). The distributions shown are for a Les Houches Angularity (LHA) and for b \(p_T D\)

3 Jet substructure observables

For the jet substructure analysis we first list out the criterion and observables we are interested in. We classify our requirements based on the event variables and therefore we choose only the simple variables, viz. \(p_T\) and \(\eta \) of the leading and sub-leading jet (arranged according to larger transverse momenta) and their angular separation \(\varDelta R(j_1,j_2) =\sqrt{(\varDelta \eta )^2 + (\varDelta \phi )^2}\) defined between the two jets. For the jet substructure observables, we refer to the two types of observables as pointed out in Ref. [19, 23, 24], viz. discrete and continuous observables. The most important ones are:

  • Particle and charged particle multiplicity inside a jet [19].

  • Les Houches Angularity: \(\mathrm{LHA}=\sum _{i\in J} \dfrac{p_{T_i}}{p_{T_J}} \sqrt{\dfrac{\varDelta R(i,J)}{R}}\) [20, 56].

  • Girth: \(g = \sum _{i\in J} \frac{p_{T_i}}{p_{T_J}}\left( \dfrac{\varDelta R(i,J)}{R}\right) ^2\) [19].

  • Width: \(w = \sum _{i\in J} \frac{p_{T_i}}{p_{T_J}}\left( \dfrac{\varDelta R(i,J)}{R}\right) \) [57,58,59].

  • Two-point energy correlation variables:

    \(e_\beta = \sum _{i>j\in J} \frac{p_{T_i}p_{T_j}}{p_{T_J}^2}\left( \dfrac{\varDelta R(i,j)}{R}\right) ^\beta \) [60].

  • \(p_T^2\) weighted jet minor axis (\(\sigma _2\)) with respect to the jet axis in \(\eta -\phi \) plane [23, 61].

  • \(p_T D = \dfrac{\sqrt{\sum _{i\in J}p_{T_i}^2}}{\sum _{i\in J} p_{T_i}}\) [23].

where R is the radius parameter of the jet and, in the subscripts, ij represent the constituents of the jet while J represents the jet.

We know from first principle as well as from data collected in collider experiments that radiation pattern inside a gluon jet differs from that of the light-quark jets. Due to the difference in radiation pattern, many jet substructure observables have been proposed in the literature to discriminate between quark and gluon jets. Though our primary aim in this work is not distinguishing a quark jet from a gluon jet, our analysis procedure is still guided by similar jet substructure observables that help in quark-gluon discrimination. The choice of the observables can be explained as follows.

  • Gluon fragments more than a quark due to its colour factor (colour factor ratio is \(\frac{C_A}{C_F}= \frac{9}{4}\)), which results in higher particle multiplicity in a gluon jet compared to a quark jet.

  • Charged particle multiplicity is also more in case of gluon jets than in quark jets.

  • Gluon fragmentation function is softer than that of a quark, i.e. constituents of gluon jets tend to be softer than a quark jet. This means \(p_T D \rightarrow 0\) in the case of a gluon jet while \(p_TD \rightarrow 1\) in case of quark jet.

  • Gluon jets are less collimated compared to quark jets. This gives wider radius for gluon jet with respect to quark jet. So, if the shape of a jet is approximated to an ellipse in \(\eta -\phi \) plane, a gluon jet tends to have longer minor axis than a quark jet.

  • Combination of the last two points also tends to give higher values for girth, width, LHA and two-point energy correlation variables (\(e_\beta \)) for gluon jets compared to quark jets.

To illustrate the difference in the distribution of jet substructure observables between quark and gluon jets, in Fig. 1 we show area normalized distributions for the two jet substructure observables, both for quark (green solid) and gluon (blue dashed) jets in (qq) and (gg) subprocesses in SM respectively. The jet substructure observables are calculated from the leading jets in the subprocesses. We can clearly see that the distribution for a quark jet is quite different from that of a gluon jet. However, for a mixed sample of quark and gluon jets, the distribution will be smeared out. The same two observables are plotted (red dot-dashed) in Fig. 1 for the leading jet in qg events. A selection criteria of \(p_{T_J} > 500\) GeV and \(|\eta |<2.5\) for both the jets was imposed on the event sample. The anti-kt algorithm with radius parameter \(R=0.4\) was used for jet clustering. The distribution of the jet substructure observables shown in Fig. 1 are for Les Houches Angularity (LHA) and for \(p_TD\). This difference in distribution among quark, gluon and admixture of quark and gluon will form the basis of the analyses to follow.

4 Analysis and results

To facilitate this study, Universal FeynRules Object (UFO) [62] files were generated corresponding to the Lagrangian listed in Table 1 using FeynRules2.0 [63, 64]. The mass of the heavy resonances have been taken to be 2 TeV for all the three types of resonances. Parton-level dijet final states from the decay of resonant particles were then simulated at \(\sqrt{s} = 13\) TeV using the UFO files with the help of MadGraph5 [65] with a hard \(p_T\) cut to populate the relevant phase space with enough events. The events generated with \(p_T > 700\) GeV for both the jets (parton-level) from MadGraph5 were parton showered and hadronized using Pythia8235 [66] with the default tune implemented in Pythia. These showered events were then passed on to Delphes3.4.2 [67] for detector simulation. An average of 25 pile-ups have been merged with the generated events inside the Delphes. Jets were clustered using the anti-kt algorithm with radius \(R=0.4\) from the Particle Flow outputs from Delphes with the help of FastJet 3.3.2 [68]. In order to reduce the effects of contamination from Underlying Event (UE) and pile-up (PU), many different taggers and groomers have been proposed in the literature e.g. trimming [69], pruning [70, 71], mass drop tagger [72] and soft drop [73] groomer. In this work, we use the soft drop groomer which is a IRC safe groomer. After jet clustering, the soft drop groomer was used to groom away the UE and the PU with soft drop parameter \(\beta =1.0\) and \(z_\mathrm{cut}=0.1\). Both the groomed jets from each event are required to have \(p_T\) more than 700 GeV.

Next, we perform jet energy correction on the groomed jets, i.e. correct the detector-level jet to the particle-level jet following the procedure prescribed in Ref. [74]. We correct the jet four-momenta of the groomed jets by calibrating groomed jets with respect to the Monte Carlo (MC) truth jets. For this purpose, SM dijet samples were generated using MadGraph5 [65] in \(p_T\) and \(\eta \) bins. The MC truth jets were then constructed from the stable particles after showering the dijet events using Pythia8235 [66]. The groomed jets were constructed from the Particle Flow outputs from Delphes following the same procedure described in the previous paragraph. The algorithm and parameters for jet clustering and soft drop groomer were taken to be the same in this case also. The groomed jets have been constructed after merging an average of 25 pile-up event, while no pile-up events have been considered in the case of MC truth jet sample. For the calibration, we imposed a matching condition of \(\varDelta R < 0.25\) between MC truth jets and groomed jets. The events containing at least two matching jets were further considered for the calibration. For each event, we then calculate response as

$$\begin{aligned} R\left( p_T^\mathrm{groomed}\right) = \frac{p_T^\mathrm{groomed}}{p_T^\mathrm{truth}} \end{aligned}$$
(1)

for each of the two jets separately. In each \(p_T\) and \(\eta \) bin, we next compute average of \(p_T^\mathrm{groomed}\) \(\left( \overline{p_T^\mathrm{groomed}}\right) \) and mean response \(\Big (\overline{R} \Big )\) as a function of \(\overline{p_T^\mathrm{groomed}}\). In each \(\eta _j\) bin, the response function \(\mathscr {R}_{\eta _j}\!\left( p_T^\mathrm{groomed}\right) \) is obtained from the fit of \(\left( \overline{R_i}, \overline{p_{T_i}^\mathrm{groomed}} \right) \) data, where i runs over \(p_T\) bins. The fitting function \(\mathscr {R}_{\eta _j}\) is parametrized as

$$\begin{aligned} \mathscr {R}_{\eta _j}\!\left( p_T^\mathrm{groomed}\right) = a_0 + a_1 \ln \left( \overline{p_T^\mathrm{groomed}}\right) , \end{aligned}$$
(2)

where \(a_0\) and \(a_1\) are fitting parameters. The response function has been computed for leading and sub-leading groomed jets separately. The corrected four momenta (p) of the groomed jets are then obtained by multiplying with the inverse of the response function

$$\begin{aligned} p^\mathrm{corrected}_{\eta _j} = \frac{p^\mathrm{groomed}_{\eta _j}}{\mathscr {R}_{\eta _j}\!\left( p_T^\mathrm{groomed}\right) } \,\, . \end{aligned}$$
(3)
Fig. 2
figure 2

Distribution of dijet invariant mass for three different types of signals, viz. gg, qg and qq resonance signals

With the four momenta corrected groomed jets, we then reconstruct the mass of the heavy resonance by taking the invariant mass of the dijet system. The normalized distribution of dijet invariant mass is plotted in Fig. 2. The dijet invariant mass distribution is one way of looking at the parton content of the jets making up the resonance. It is quite clear that the line-shapes of the resonant mass are different for \(qq,\ qg,\) and gg resonances as also mentioned in Ref. [2]. The CMS collaboration has actually used the line shape information to put 95% C.L. upper bound on the three above mentioned configurations of the dijet resonances [2]. A quick comparison of the line-shape plot with that of Ref. [2] finds that the \(m_{jj}\) distributions are in agreement reasonably well. An upper cut on \(|\eta _j| < 2.15\) for both the jets has been imposed for further analysis. Note that all jet substructure observables have been constructed after the soft drop grooming procedure. However, the four momenta, \(p_T\) and invariant mass of the dijet system are taken only after jet energy correction.

With the generated events and variables, we then try to find distinguishing score of the three different types of resonances from the SM background. Since, at a hadron collider, one cannot really avoid SM QCD background, we consider the non-resonant SM dijet as our background in this analysis. Note that for the SM QCD background, we shall use the same tools and parameters discussed earlier. The dijet background has been simulated at the leading order (LO) of QCD. The properties of the leading and sub-leading jets of the QCD background will again be a mixture of the properties of pure quark and pure gluon jets. So we expect better discrimination for qq and gg resonances from the SM QCD dijet background. However, we still expect some degree of discrimination for qg resonance since the fraction of quark and gluon will be different for the SM background from the qg resonance, where it is expected to be an admixture of 50% quark and 50% gluon.

Fig. 3
figure 3

Normalized distribution of jet substructure observables for the three types of signals as well as for the SM dijet background. In all the panels, blue dashed lines are for gg resonance, green solid lines are for qq resonance, red dot-dashed lines are for qg resonance, and black dotted lines are for SM background

Before going into the details of our results, it is useful to highlight the distributions of various event variables and jet substructure (JSS) observables. We therefore plot the distributions of some of the important event variables as well as JSS observables in Fig. 3, which shows the normalized distributions of these observables for leading (\(j_1\)) and sub-leading (\(j_2\)) jets. In all the panels of Fig. 3, the blue dashed lines represent the distributions of gg resonance, while the green solid lines are for qq resonance and the red dot-dashed lines are for qg resonance. The normalized distributions for the SM dijet background are also shown in black dashed lines for all the variables. We note that the distributions of JSS observables for \(j_1\) and \(j_2\) (arranged in descending order of \(p_T\)) are similar for different types of signals as well as for the background. This is expected because of the following reasons. In the case of gg (qq) resonance signal, both the jets are gluon-initiated (quark-initiated) jets. In the case of qg resonance signal, the percentage of quark-initiated and gluon-initiated jets in \(j_1\) is the same as in \(j_2\) in a set of events. The same argument applies for the SM dijet background as well.

One may now proceed on to perform a cut based analysis to maximize the different types of signals with respect to the background using the shapes of the distributions. However, a multivariate analysis (MVA) further enhances the signal with respect to background by providing a classifier variable after combining all the input variables in an optimized way. In this study, we used BDT classifier variable for the rest of the analyses to follow. For the implementation of BDT, the TMVA2.0 [75] package which is built-in in Root6 [76, 77] was used. At this point, we note that though the resonance masses of all the three different types of resonances have been taken to be the same (2 TeV), the peak positions and spreads of the invariant mass distribution, as shown in Fig. 2, for different types of resonances are slightly different. Hence, in order to maximize the effect of BDT analysis, we put the following cut on the invariant mass of the different types of resonancesFootnote 3 before passing them on to the BDT analysis.

  • \(1580< M_{gg} < 2100\)

  • \(1680< M_{qq} < 2180\)

  • \(1600< M_{qg} < 2120\)

Table 2 List of values of variable importance defined as the weighted fraction of the times a particular variable has been used in the boosted decision tree training. All the values are calculated in the training with JSS observables
Fig. 4
figure 4

BDT response for the signal and the background for three resonance cases. In all the panels, red solid (black dashed) line represents BDT response for the signal (background) without using the JSS observables while the green (blue) shade represents the same with the JSS observables. The responses shown are for a gg resonance, b qq resonance, and c qg resonance

As mentioned in Refs. [19, 20, 23, 24], some jet substructure observables are better suited for quark and gluon jet tagging than others. In our work we too perform a thorough investigation of the jet substructure observables to find out which of the observables are best suited for better distinguishing power. We also note that the same jet substructure observables will be equally important in distinguishing the different types of final state events. To highlight this, we list the variable importance in BDT [75] in Table 2. The variable importance is defined as the weighted fraction of how many times a particular variable is used to split the decision tree nodes. The weight is the product of the separation gain-squared it has achieved in each split occurrence and the number of events in the node. As we see that the most important variables are the event variables, viz. \(m_{jj}\), \(\varDelta R(j_1,j_2)\) and \(\varDelta \eta \). However, the importance of JSS observables are not very small. Combination of JSS observables give rise to substantial contribution.

In this study, our primary aim is to investigate how much improvement one can get over the current resonant dijet search by the use of jet substructure observables. Hence, we optimize BDT response in two stages. In order to see that the use of jet substructure observables help in discriminating signal from the background, we first optimize BDT response for the signal using only event variables, viz. \(p_T\), \(\eta \) of both the jets, \(\varDelta R(j_1,j_2)\), \(|\varDelta \eta |\), and \(m_{jj}\) for the discrimination of signal from the SM dijet background. We then supplement the analysis by adding jet substructure (JSS) observables, viz. particle multiplicity, LHA, \(p_T D\), \(\sigma _2\) of leading as well as sub-leading jets in addition to the simple event variables. To illustrate the effect of JSS observables, we plot the distributions for BDT response for different types of signals as well as for the background in Fig. 4. In all the panels of Fig. 4, the red solid (black dashed) line represents the BDT response for the signal (background) without using the JSS observables while the green (blue) shade represents the same with the JSS observables. The responses shown are for (a) gg resonance, (b) qq resonance, and (c) qg resonance. The improvement of signal-background separation can be understood from the relative shift in the BDT response distribution for the signal and the background in Fig. 4. As we can see from the figure that the distribution of BDT response for the signal (red solid) and that for the background (black dashed) are close to each other when JSS observables were not considered. However, after the incorporation of the JSS observables, the two distributions (green and blue shaded regions) are better separated. This tells us that the use of JSS observables will further improve the probe of the signals over the background although the degree of improvement will be different for different types of signals. A quick comparison among the BDT responses tells us that the improvement will be better for gg and qq resonance signals than the qg signal.

Fig. 5
figure 5

Illustrating ROC curves for the a gg resonance, b qq resonance and c qg resonance signal against the SM dijet background for analyses with (blue solid) and without (red dashed) JSS observables

Although Fig. 4 shows some degree of improvement when JSS observables are used along with event variables, it does not provide any quantitative assessment of the improvement. One of the ways to compare and see the improvement is via receiver operating characteristic (ROC) curve. A ROC curve gives the background rejection efficiency, i.e. the fraction of background rejected out of the total background for a given signal efficiency (the fraction of signal accepted out of the total signal). For a single variable, ROC curve can easily be obtained by sliding upper or lower cut on that particular variable and plotting the values of background rejection efficiency versus signal efficiency. However, for more than one variable, the ROC curve is not unique. Since the MVA already provides single optimized classifier variable from more than one input variable, we can use this classifier variable to draw ROC curves for the three different types signals with respect to the background. We plot the ROC curves in Fig. 5 with-JSS (blue solid) and without-JSS (red dashed) observables for all the three types of resonances. We can see, quite clearly, a significant improvement if we add JSS observables to the analysis.

Fig. 6
figure 6

(Left) variation of signal significance as a function of relative systematic uncertainty (\(\lambda \)) for with-JSS (blue-solid) and without-JSS (red-dashed) case. (Right) variation of significance improvement as a function of \(\lambda \). The variations plotted are for gg resonance signal

Fig. 7
figure 7

(Left) variation of signal significance as a function of relative systematic uncertainty (\(\lambda \)) for with-JSS (blue-solid) and without-JSS (red-dashed) case. (Right) variation of significance improvement as a function of \(\lambda \). The variations plotted are for (top) qq and (bottom) qg resonance signal

Once the numbers of signal and background events are estimated, we are ready to calculate signal significance by the traditional cut and count method. In this method, one generally counts the number of signal (S) and background (B) events after imposing a cut at a specific point in the variable space. The cut is chosen in such a way so that the significance, defined as \(S/\sqrt{B}\), is maximized. Since BDT response is already an optimized variable, this single variable can be used for cut and count analysis method. However, estimation of signal significance using the shape of the signal and background distribution of a particular variable gives significant improvement over the simple cut and count method. In this work, we performed profile likelihood ratio method to calculate signal significance. We used profile likelihood calculator, which is implemented in Roostats [78] package, after building probability distribution functions (PDF) for the signal and for the background using Number Counting Pdf Factory  [79,80,81] from the BDT response histograms. In each bin of the histograms, the factory incorporates relative systematic uncertainty (\(\lambda _i\)) to the fraction of events in the sideband region (defined as \(\tau _i\) in Refs. [79,80,81]), as follows

$$\begin{aligned} \tau _i = \frac{1+\sqrt{1+4\lambda _i^2}}{2\lambda _i^2 b_i} \end{aligned}$$
(4)

where \(b_i\) is the number of background and i represents the index of the bin. In this analysis, the relative uncertainty in the background is taken to be flat, i.e. \(\lambda _i=\lambda \) for all the bins. The variation of the signal significance as a function of relative systematic uncertainty (\(\lambda \)) is shown in the left panel of Fig. 6 for gg resonance signal. The red-dashed line is for the analysis “without-JSS” case while the blue-solid line is for “with-JSS” case. The luminosity is taken to be 100 fb\(^{-1}\) in the signal significance calculation. From the figure, we see that the signal significance can be improved if we use JSS observables in addition to the event variable. To quantify the improvement of using JSS observables, we define a significance improvement variable as

$$\begin{aligned} \mathrm{significance\ improvement} = \frac{\sigma ^\text {JSS}}{\sigma } \end{aligned}$$
(5)

where \(\sigma \) and \(\sigma ^\mathrm{JSS}\) represent significances for without-JSS and with-JSS case respectively. significance improvement is plotted as a function of \(\lambda \) in the right panel of Fig. 6. We can easily say that the signal significance can be improved by about 40% with the use of JSS observables even with systematic uncertainty higher than 2%.

The plots in Fig. 6 consider the same values of \(\lambda \) for both the with-JSS and without-JSS case and is more of a phenomenological choice. In an experimental analysis, with-JSS case may have additional source of uncertainty and therefore the plots for the individual cases should be seen with respect to the absolute systematics and not relative to each other. However, from Fig. 6, we find that with-JSS case still performs better, even with a \(\sim \) 2% additional uncertainty. Hence, one would still benefit from the study with jet substructure once more accurate estimates of systematics is performed in the experimental analysis.

Fig. 8
figure 8

Normalized distribution of BDT responses for a gg vs. qq, b gg vs. qg, and c qg vs. qq signal separation. In all the panels, red solid and black dashed lines represent the classifier responses for the case when JSS observables are not used while green and blue shaded regions represent the case when JSS observables are used along with event variables

We note that we did not carry out any explicit analysis on the systematics. One of the major source of systematic uncertainty could be from the determination of jet energy scale. Though we have corrected the jet four-momenta for the jet energy scale, we did not really estimate the uncertainty in that. Another major source of uncertainty can be from PDF and parton shower scale uncertainty. Also, it is not guaranteed that the uncertainty would remain uniform in the whole range of signal efficiency. However, our analysis is still useful in inferring that JSS observables help improve the performance of existing dijet analysis even in the case of non-zero systematics.

Another important point to discuss is the calibration and estimation of systematic uncertainties in BDT data. The distributions of the input variables to the BDT can be validated in data in a background enriched region. The background enriched region, for example, can be considered to be the region in dijet mass distribution which are already excluded by experiments at 95% CL. Similarly, the distributions of BDT output can also be validated in data in such a background enriched region. A correction can be derived and/or an additional uncertainty can be assigned if any systematic shift is observed. Similarly, the systematic uncertainties affecting the input variables can be propagated to the BDT ouput by evaluating the BDT response for up and down variations of each of the input variables.

We carried out a very similar exercise (analyses) for the other two types of resonant configuration in the dijet, i.e. the qq and qg resonances. The results are shown in Fig. 7. In the left panel of Fig. 7, we plotted the variation of signal significance as a function of relative systematic uncertainty (\(\lambda \)) for the qq (top) and qg (bottom) resonance signals against SM dijet background. Significance improvement as a function of \(\lambda \) is plotted in the right panel of the figure. We again observe reasonable improvements as seen for gg resonance. For qg resonance the improvement is slightly depleted compared to the other two cases. This is because both qg signal and SM dijet background has an admixture of both quark and gluon jet properties in either jet in the dijet event. Nevertheless, we could achieve some degree of improvement since the fraction of quark and gluon jets are different in the signal with respect to the background. A quantitative comparison of the upper limits on the product of cross section (partonic) and acceptance (A) at 95% CL is also shown in Table 3 for a fixed value of the integrated luminosity of 36 fb\(^{-1}\). It is worth pointing out here that we make some approximations, such as normalizing our background estimates in the relevant mass bin with that of the CMS analysis and account for approximate acceptance values A,Footnote 4 to arrive at the upper limits shown in Table 3.

Table 3 Comparison of 95% CL upper limit on \(\sigma \times A\) between our analyses vs. CMS result [2] at \(M=2\) TeV. The upper limits are given for two different values of relative systematic uncertainty \(\lambda \). The analysis is performed for \(\sqrt{s} = 13\) TeV and \(\mathscr {L}=\) 36 fb\(^{-1}\)

5 Discrimination between gg vs. qg vs. qq resonances

Once the signal is identified over the SM background, we can expect to discriminate between different types of resonances, e.g. discriminating gg resonance from qq and qg resonances. This is in similar spirit of determining the CP properties [82, 83] of the observed 125-GeV scalar at the LHC. For the purpose of discriminating different types of signals, the BDT is trained for one signal against the other signal. If we consider three different types of signals, we have three different combinations, viz. (a) gg vs. qq, (b) gg vs. qg and (c) qg vs. qq. The BDT responses for these three combinations are shown in Fig. 8. Panel (a) shows BDT responses for gg vs. qq signal separation while panels (b) and (c) show BDT responses for gg vs. qg signal discrimination and qg vs. qq discrimination respectively. In all the panels, red solid and black dashed lines represents the classifier response for the case when JSS observables are not used while green and blue shaded regions represent the case when JSS observables are used along with event observables. In this study we used the same set of event variables and JSS observables as done in the analysis of signal separation from the SM dijet background. All the panels in Fig. 8 show similar features, as seen in Fig. 4, i.e. the distributions of BDT response of two resonances for “without-JSS” case have higher overlap compared to the case when JSS observables were used. This tells us that different types of resonances can be better separated if we use JSS observables in addition to the event variables.

To get a quantitative idea, we calculated separation power [75] for the three different combinations, viz. (a) gg vs. qq, (b) gg vs. qg and (c) qg vs. qq. The separation power is defined as

$$\begin{aligned} \langle S^2\rangle = \frac{1}{2}\int dx \frac{\left( p(x)-q(x)\right) ^2}{p(x)+q(x)} \end{aligned}$$
(6)

where p(x) and q(x) are normalized distribution functions for the two resonances in consideration for discrimination. The values for separation power for the BDT response in “without-JSS” and “with-JSS” analysis is listed in Table 4. We can see from Table 4 as well as from Fig. 8 that the use of JSS observables help us to discriminate between the different types of resonances much better.

Table 4 Separation power between different resonance hypotheses for without-JSS and with-JSS case

6 Summary and outlook

To summarize, we studied dijet resonances at the LHC and look at the application of jet substructure techniques to such resonances. These resonances may carry different partonic imprints in the jets which will be driven by the spin and color structure of the on-shell particle produced as a resonance. We note that JSS techniques have become an essential part of today’s collider physics and are being utilized as a very effective tool to understand physics at high energy colliders. Our aim of using jet substructure in this work is to make a statement on the improvements one can achieve in resonant search strategies at the colliders in the dijet final state. However, such an improvement is not restricted to only an exclusive dijet final state but can be applied when final states are of multijet in nature.

Currently, experimental collaborations put 95% C.L. upper limit on the production cross section of different types of resonances, viz. gg, qq, and qg resonances using only line-shape information of these types of resonances at 13 TeV collider. In this work, we attempt to make use of JSS to improve the existing search strategies of heavy colored resonances in dijet channel. We improve upon earlier studies by performing a more realistic event analysis by including fast detector simulation and jet energy correction. We show how the jet substructure observables help in discriminating a signal for any of the heavy dijet resonances, viz. gg, qq, and qg, from the SM QCD dijet background. We utilize BDT multivariate classifier for the discrimination. We highlight our results in the form of ROC curves where we find considerable improvement in the ROC curves when we use jet substructure observables in addition to the simple event variables. We also find that the improvement in signal significance is substantial even in the presence of non-zero systematic uncertainties. This suggests that, with the help of jet substructure technique, the search for different types of heavy resonances can be improved to a great extent at the LHC. Furthermore, we also establish and show that distinction between different types of resonances can also be achieved to a higher degree if jet substructure observables are used in addition to the event variables. The same technique can also be effectively applied to proposed future high energy machine although analyses with only 13 TeV has been presented in this article.