Massive coloured scalar particles that decay into gluons are predicted in several extensions of the Standard Model (SM). The most prominent examples are the scalar partners of a Dirac gluino called scalar gluons (‘sgluons’) in extended supersymmetric models like the \(\mathcal{N}=1/\mathcal{N}=2\) hybrid model [1, 2] or the R-symmetric MSSM [3, 4]. These particles are also present in compositeness models [59] where they are called hyperpions. While single production of sgluons is possible, the production cross section depends strongly on the masses of the supersymmetric particles and, in typical supersymmetric scenarios, is of the same order as the pair production cross section. On the contrary the pair production cross section does not, at leading order, depend on supersymmetric parameters except for the sgluon mass. Since the sgluon has positive R-parity [10] and since the sgluon coupling to quark–antiquark pairs is suppressed by the quark mass, light sgluons, i.e. sgluons with masses of the order of 100 GeV, are expected to decay to two gluons with a branching ratio close to one [2, 4]. Pair production of sgluons each decaying to two gluons, leading to a four-jet final state, is therefore used as a benchmark process. ATLAS has previously searched for signatures of these particles in the dataset of 34 pb−1 recorded in 2010 [11], excluding at the 95 % confidence level (CL) sgluons with masses of 100 GeV to 180 GeV, with the exception of a mass window of 5 GeV around 140 GeV. The search described in this paper, using data recorded in 2011, explores the mass region from 150 GeV up to about 300 GeV.

The strategy of the analysis is to first reconstruct the two sgluon candidates. The relative mass difference and the ΔR between the two jets associated to a reconstructed sgluon are used to select well reconstructed sgluons with a small mass difference, where \(\Delta R=\sqrt{(\Delta\phi)^{2}+(\Delta\eta)^{2}} \) with Δϕ and Δη being the difference in azimuth and pseudorapidity of the two jets. The distribution of the reconstructed average mass of the two sgluon candidates is then analysed for evidence of a signal with a fit to the background plus a signal template of variable strength. The background for this search is the SM multijet background, due to its large cross section.

ATLAS is a multipurpose detector [12, 13] with nearly 4π coverage in solid angle. The inner detector, consisting of silicon pixel and microstrip detectors as well as a transition radiation tracker, is immersed in a 2 T axial magnetic field. In the pseudorapidityFootnote 1 region |η|<3.2, high-granularity lead/liquid-argon (LAr) electromagnetic (EM) sampling calorimeters are used. An iron/scintillator-tile calorimeter provides hadronic coverage for |η|<1.7. The end-cap and forward regions, spanning 1.5<|η|<4.9, are instrumented with LAr calorimeters for both EM and hadronic measurements. The calorimeters are surrounded by a muon spectrometer which consists of three large superconducting toroids, a system of precision tracking chambers, and fast detectors for triggering. ATLAS uses a three-level trigger system. The first-level trigger is implemented in custom hardware, the other two trigger levels are implemented in software running on commercially available PC farms.

A data-driven method is used for the background estimation. The method is developed and validated on SM multijet Monte Carlo (MC) samples. The samples are also used to determine a systematic error on the background determination. To incorporate detector effects, these events are passed through a detailed simulation of the ATLAS detector [14] based on Geant4 [15]. Alpgen [16] SM multijet samples are generated with the MLM matching scheme [17] and interfaced to Herwig [18] for the parton shower and fragmentation processes and to Jimmy [19] for the simulation of the underlying event. The Alpgen samples are generated with the CTEQ6L1 parton distribution functions (PDFs) [20] (underlying event tune AUET2-CTEQ6L1 [21]). Pythia [22] SM multijet samples are generated with the LO MRST PDFs [23] (underlying event tune AUET2B LO∗∗ [21]). The sgluon pair production differential cross section [2] is implemented as an external process interfaced to Pythia. The decay of the sgluons as well as fragmentation and hadronisation are performed by Pythia. For the normalisation of the signal a next-to-leading-order (NLO) cross section is used [24].

Signal samples of 40k to 120k events each were generated with sgluon masses of 150, 200, 250, 300 and 350 GeV and passed through the fast simulation of the ATLAS detector. Additionally the fast simulation was verified by passing some samples through the detailed simulation which leads to results with good agreement.

Jets are reconstructed using the anti-k t jet clustering algorithm [25] with a radius parameter of 0.4. The inputs to the jet algorithm are three-dimensional clusters [26] formed from energy deposits in the calorimeters. The jets are calibrated using transverse momentum (p T) and \(\mathrm{\eta}\)-dependent correction factors based on MC simulations and validated by test beam and collision data studies [27]. Quality criteria are applied to reject jets produced by non-collision backgrounds [28]. Such jets are typically produced by hardware problems in the calorimeters, LHC beam-gas interactions or cosmic-ray induced showers. The jet energy resolution for a jet with a p T of 80 GeV is about 11 % [29].

The analysis uses collision data collected in the year 2011 at a centre-of-mass energy of \(\sqrt{s} = 7\ \mbox{TeV}\) and corresponding to an integrated luminosity of 4.6 fb−1. The data were recorded with a multijet trigger requiring at least four jets. Reflecting the threshold of the third level trigger of 45 GeV, the trigger efficiency does not depend strongly on the p T of the four highest-p T jets for p T>80 GeV. To ensure full trigger efficiency for the analysis, the four highest-p T jets in a selected event are required to be separated from each other by ΔR>0.6. The resulting trigger efficiency of at least 99 % obtained with simulated events was verified in data with the use of a single-electron trigger. The average number of proton–proton interactions in the same event (pile-up) has increased to about 12 events over the course of the run. Simulated minimum-bias events are overlayed onto the simulated samples of the hard-scattering processes. The resulting events are reweighted to reproduce the luminosity profile of the data. The requirement on the p T of the jets makes the analysis robust with respect to the increase of pile-up.

A selected event must contain at least one reconstructed primary vertex with five or more associated tracks each having p T>400 MeV. At least four jets with p T>80 GeV and |η|<1.4 are required, since the signal is produced centrally, in contrast to the SM multijet background. These selection criteria together with the trigger requirements are referred to as preselection in the following. To improve the sensitivity of the analysis, the jet p T threshold is defined as a function of the probed sgluon mass. The p T of the fourth highest-p T jet is required to be greater than the maximum between 80 GeV and 0.3 times the sgluon mass plus 30 GeV (p T (4th jet)>max(0.3×m sgluon+30,80) GeV). The stringent p T requirements select sgluons produced with a boost and hence lead to an accumulation of jet pairs from sgluon decays with ΔR≈1. Taking advantage of this property to reconstruct the two sgluon candidates, the four highest-p T jets in a preselected event are paired by minimising the quantity |ΔR pair1−1|+|ΔR pair2−1|. The event is rejected if, for the chosen combination, a jet pair has ΔR>1.6. The reconstructed masses of the sgluons are denoted m 1 and m 2. The reconstructed average mass is (m 1+m 2)/2. The scattering angle is defined as the angle between the direction of motion of the reconstructed sgluons in the rest frame of the four highest-p T jets and the boost direction between the lab frame and the rest frame of the four highest-p T jets. The magnitude of the cosine of the scattering angle |cos(θ )| is required to be less than 0.5. In fact the SM multijet background is peaked in the forward region, reflecting t-channel gluon exchange, while the signal is produced centrally due to the (mainly) s-channel production and scalar nature of the sgluon. Finally, to further improve the rejection of the SM multijet background, the relative difference between the two reconstructed masses |m 1m 2|/(m 1+m 2) is required to be less than 0.15. The requirement on the relative mass difference selects well reconstructed events increasing the bulk of the signal distribution with respect to the tails. Loosening the requirement, i.e., accepting events with larger mass differences, leads to an increase of signal events but also an increase of background events. The selection that increases the number of events in the control regions without decreasing the sensitivity is chosen. As the requirement on the 4th jet is mass dependent, a selection efficiency of 0.6 % is achieved for all simulated samples of the different sgluon masses.

The reconstructed average mass distribution after all cuts is shown for sgluon signals with varying masses in Fig. 1. The natural width of the sgluon in this mass range is negligible and the width of the observable is entirely dominated by the instrumental mass resolution. As the mass of the sgluon increases, the requirement on the transverse momentum of the jets becomes relatively less stringent. An increase of 100 GeV in the sgluon mass leads to an increase of only 30 GeV in the requirement on the jet transverse momenta. As a consequence the radiative tails to lower masses are more evident for higher masses in the figure, as they are less sculpted by the p T cut.

Fig. 1
figure 1

The reconstructed average mass distribution after all analysis cuts for the signal samples with m sgluon=150, 250 and 350 GeV. The curves are normalised to unit area

After applying preselection and jet pairing, the primary variables used in the analysis are compared to the Alpgen and Pythia multijet simulation. Backgrounds other than SM multijet events are estimated to be smaller than 1 % of the total background sample and are thus neglected. Since the analysis requires at least four jets, Alpgen is expected to give a better description than Pythia, which generates the third and fourth jets via a parton shower.

The MC samples are normalised to the data after the preselection described above. The normalisation factor of 1.25 obtained for Alpgen is compatible with the one (1.26) obtained in Ref. [11]. For Pythia the normalisation factor is 0.75, close to the value of 0.65 [30] obtained with the 2010 data using a different tune (AMBT1). After this normalisation, for the transverse momentum of the 4th jet (Fig. 2(a)), the separation of the two jets of the sgluon candidate with the highest transverse momentum (Fig. 2(b)), the relative mass difference (Fig. 2(c)) and the cosine of the scattering angle (Fig. 2(d)), the agreement between the data and the MC simulations is, in general, at the 20 % level. To reduce the dependence on simulation, the background is estimated from data, taking advantage of the kinematic properties of the sgluon signal. Only the systematic error on the method is taken from the Monte Carlo studies.

Fig. 2
figure 2

The kinematic variables of the analysis are shown after applying the preselection and pairing the four highest-p T jets: (a) is the transverse momentum of the fourth highest-p T jet; (b) is the ΔR between the two jets of the reconstructed sgluon candidate with the highest transverse momentum jet; (c) is the relative mass difference; (d) is the cosine of the scattering angle in the four-jet centre-of-mass frame. The black histogram is the signal for a sgluon mass of 150 GeV normalised to the NLO cross section. Data (dots) are compared to the Alpgen (triangles) and Pythia (rectangles) SM multijet samples where the MC samples are normalised to the data. The ratio data/MC is also shown separately for Alpgen and Pythia with its statistical uncertainty

The main discriminating variable for the analysis is the reconstructed average mass. To determine the background normalisation as well as the shape of the background in the signal region, an ABCD method is used. The data sample is divided into one signal region (A) and three background-dominated regions (B, C and D). The variables used to define the four regions are |cos(θ )| and |m 1m 2|/(m 1+m 2). The regions defined in Table 1 are chosen as a compromise between the statistical significance of the signal in region A and the statistical uncertainty in the regions B, C and D which feeds into the uncertainty on the background prediction.

Table 1 Definition of the four regions for the background determination. Region A is the signal region

The correlation between the two variables is less than 0.1 % in the four regions in the data and less than 1 % in the Pythia samples, so the normalisation of the background in the region A is derived from the ratio of events in the control samples using \(N_{\mathrm{A}}^{\prime}=N_{\mathrm{B}}\cdot N_{\mathrm {C}}/N_{\mathrm{D}}\). A closure test is performed with the Pythia and Alpgen MC samples and shows that \(N_{\mathrm{A}}^{\prime}\) reproduces the actual number of events in region A, N A, within 2 %. The difference is assigned as a systematic uncertainty on the background prediction.

Table 2 summarises the results obtained in data, for the five sgluon mass hypotheses and corresponding signal regions, A, together with the corresponding background predictions. Good agreement is observed. The assumption of the ABCD method that the shapes of the reconstructed average mass distributions in regions A and B are the same was verified on the Monte Carlo samples. The last column gives the p-value obtained from a Kolmogorov–Smirnov test between the shapes of the reconstructed average mass distributions for data in regions A and B. Satisfactory p-values are found in this test on the data, which considers statistical uncertainties only.

Table 2 Comparison of the data in the signal region with the background prediction. The first column is the sgluon mass hypothesis, the second column is the corresponding minimum p T requirement on the four jets, followed by the number of observed data events in the signal region A (third column) and the number of predicted events using the ABCD method (fourth column), where the first uncertainty given is statistical and the second systematic. The last column gives the p-value obtained from a Kolmogorov–Smirnov test between the shapes of the reconstructed average mass distributions in regions A and B. Only statistical uncertainties are considered in this test

The result of the background estimation is shown in Fig. 3 for sgluon masses of 150, 250, 300 and 350 GeV. The data in region A are compared to the data in the control region B normalised using the ABCD method. The expected sgluon signal in region A is also shown. The ratio of background to data as well as the significance, in standard deviations, of the difference between the data and the prediction are shown as a function of the reconstructed average mass. The significance takes into account only statistical uncertainties.

Fig. 3
figure 3

The comparison of the data in the signal region with the background prediction is shown for: (am sgluon=150 GeV, (bm sgluon=250 GeV, (cm sgluon=300 GeV and (dm sgluon=350 GeV. The points are the data in the signal region (region A). The plain histogram (red) is the expected signal in region A normalised to the NLO cross section. The prediction of background in region A based upon the data in region B normalised using the ABCD method is shown as the rectangles which include the statistical uncertainty. The data/background ratio and the statistical significance of its difference from one, in standard deviations, are also shown in the lower panels

The average mass distribution of the data in region A is compared to the background-only prediction and to the background-plus-signal prediction using a binned likelihood test, which incorporates the signal contamination in the control regions and systematic uncertainties via nuisance parameters. Systematic uncertainties affecting the simulated sgluon signal shapes are incorporated in the fit by varying signal templates, taking into account the migration of events between the regions.

The systematic uncertainties on the acceptance, and the correlation assumed for each uncertainty source between the four regions, are listed in Table 3 for a sgluon mass of 300 GeV. The uncertainty on the jet energy scale as well as the uncertainty on the jet energy resolution impacts both the signal shape and the acceptance. These uncertainties were measured with the complete 2010 dataset [31] to which an extra uncertainty for the higher pile-up in the 2011 run was added. The variation of the Pythia parameters controlling initial state and final state radiation in a range consistent with experimental data [32] produces only a small effect on the reconstructed average mass distribution. Therefore, this systematic uncertainty is taken into account for the signal acceptance. An uncertainty of 1 % is assigned to the trigger efficiency for sgluon signals. The uncertainty on the signal acceptance due to the signal MC statistical uncertainty is uncorrelated among the four regions. The acceptance uncertainty due to the PDFs is estimated using the independent CT10 [20] error sets. The uncertainty on the luminosity is taken to be 3.9 % [33, 34].

Table 3 The systematic uncertainties on the signal due to the jet energy scale (JES), jet energy resolution (JER), initial and final state radiation (ISR/FSR), the trigger efficiency (Trigger), the Monte Carlo signal statistics (MC Statistics), the choice of parton distribution functions (PDFs) and the integrated luminosity (Luminosity). The relative uncertainty of the signal acceptance is given for the four regions and for a sgluon mass of 300 GeV. The JES uncertainty is treated as asymmetric, corresponding to upward and downward fluctuations of the JES. For the JER uncertainty, only an upward fluctuation of the JER is considered, i.e. only a degradation of the JER, with respect to the nominal MC JER. The last column shows the expected correlation among the four regions

To probe for the presence of a signal, a fit with a freely varying signal strength parameter is performed for each mass hypothesis. No significant deviation from zero is found, and limits on sgluon production are derived.

The profile likelihood ratio \(\tilde{q}_{\mu}\) [35] is used as test statistic, and exclusions are determined using the CL s approach [36]. Exclusion limits are computed using samples of pseudo-experiments generated taking into account all uncertainties and also contamination of the control regions by signal events. The normalisation and shape of the event distribution in region B are used, whereas for regions C and D only the normalisation is used. MC templates are used to generate the shape of the signal in each pseudo-experiment. In each pseudo-experiment the statistical and systematic uncertainties are randomised, using Poisson and Gaussian distributions.

Figure 4 shows the expected and observed 95 % CL upper bounds on the product of the scalar pair production cross section and the branching ratio to gluons as a function of the scalar mass. For a mass of 150 GeV (350 GeV), a limit of 70 pb (10 pb) on the scalar gluon pair production cross section at the 95 % CL is obtained. The solid line corresponds to the prediction of the sgluon pair production cross section at NLO [24], which is larger than the leading order cross section by a factor of about 1.6. The hatched band indicates the systematic uncertainty due to the choices of renormalisation and factorisation scales. Due to this recent NLO calculation, the previously unexcluded mass region around 140 GeV [11] is now excluded by reinterpreting the limits obtained with the data recorded in 2010. For the analysis of the data recorded in 2011, sgluons with a mass from 150 GeV to 287 GeV are excluded. The endpoint of the mass limit is defined as the intersection of the cross-section limit with the NLO cross section minus one standard deviation of the theory uncertainty.

Fig. 4
figure 4

Expected and observed 95 % CL upper bounds on the product of the scalar pair production cross section and the branching ratio to gluons as a function of the scalar mass using the profile likelihood ratio as the test statistic. The predictions of the sgluon and hyperpion pair production cross section are also shown as well as the observed limit from a previous ATLAS search [11]. The sgluon cross section is at NLO [24]

The dashed line is the prediction for the hyperpion cross section of a compositeness model, obtained by rescaling the sgluon cross section according to the ratios from Ref. [7]. Since the ratios were calculated at leading order, this line should only be considered as an approximate indication of the excluded mass region.

In summary, using 4.6 fb−1 of \(\sqrt{s}=7\ \mbox{TeV}\) proton–proton collision data, collected by the ATLAS detector, four-jet events have been analysed to search for the pair production of a new scalar particle decaying into two gluons. The data in the signal region are in good agreement with the data-driven background estimation. No evidence for new phenomena is found. Cross section times branching ratio limits as a function of the mass of the scalar particle are derived. Interpreting the cross-section limit with the NLO cross-section calculation, sgluons with masses from 150 GeV to 287 GeV are excluded at 95 % CL using the data recorded in 2011 while the range 150 GeV to 306 GeV is expected to be excluded in the absence of a signal. Combining the exclusions obtained with the analyses of 2010 and 2011 data, sgluons with masses from 100 GeV to 287 GeV are excluded at the 95 % CL.