1 Introduction

The ATLAS experiment [1], one of the four major experiments at the LHC [2], relies on the precise measurement of particle showers as the starting point for physics object reconstruction. Calorimeter showers are reconstructed by clustering groups of topologically connected calorimeter cells, the first step in this process involving cells with high absolute energy relative to calorimeter noise [3]. The produced clusters are referred to as topo-clusters and provide the input to the reconstruction of a number of physics objects, including jets, electrons, \(\tau \)-leptons and missing transverse momentum (\(E_{\textrm{T}}^{\textrm{miss}}\)). The topological cell clustering algorithm is well established and performed very well during the LHC Run 1 and Run 2 data-taking periods, from 2010 to 2012 and 2015 to 2018 respectively. However, further improvement is possible, especially in view of the increased luminosity foreseen in LHC Run 3 and beyond.

At the LHC, bunches of about \(10^{11}\) protons collide every 25 ns, giving rise to multiple pp interactions in each bunch-crossing.Footnote 1 Signals from particles produced in additional soft pp collisions pile up on top of those from the pp hard-scattering that triggered the ATLAS data-recording process. If the additional particles are produced in the same bunch-crossing that produced the recorded event’s pp hard-scattering, they are referred to as in-time pile-up, whereas if they are produced in the previous or next bunch-crossings they are called out-of-time pile-up. Calorimeter signals are sensitive to both in-time and out-of-time pile-up, which can result in the formation of additional topo-clusters, as well as spurious contributions to those originating from the hard scattering. Multiple pile-up suppression techniques have been studied in ATLAS, such as the grooming algorithms used for the reconstruction of boosted objects [5,6,7] or the Constituent Subtraction [8] and Soft Killer [9] methods used prior to jet reconstruction [10]. These methods, however, are all applied after topo-cluster reconstruction, whereas in this paper a low-level mitigation technique is explored with the goal of reducing pile-up contributions at cell level while building the topo-clusters.

The ATLAS calorimeters provide a measurement of the signal time, in addition to the deposited energy. In this paper, the topo-clustering algorithm is modified by applying a calorimeter time-measurement selection criterion (the time cut in the following) to cells with large signal-to-noise ratio in order to reject contributions from out-of-time pile-up. Multiple versions of the time cut were compared and tested on both Monte Carlo (MC) simulated events and a sample of ATLAS data events. An Upper Limit, switching off the time cut for very high energy signals, was explored and it was adopted as a safety measure to avoid rejecting potential new-physics processes which might produce out-of-time signals. The performance of the new algorithm has been tested in the context of jet reconstruction, focusing on the suppression of pile-up-initiated jets as well as its effects on jet energy calibration and resolution.

The rest of this paper is organised as follows. Section 2 describes the detector. Section 3 lists the data and MC samples used for this analysis. Section 4 describes the current topo-cluster reconstruction algorithm and Sect. 5 introduces the time cut. Section 6 discusses the time cut’s performance on topo-cluster and jet kinematics as evaluated using MC samples. Section 7 discusses the time cut’s performance on ATLAS data taken during Run 2 of the LHC, including checks for possible time-cut inefficiency and the cut’s impact on the ATLAS event size. Conclusions are presented in Sect. 8.

2 ATLAS detector

The ATLAS detector [1] at the LHC covers nearly the entire solid angle around the collision point.Footnote 2 It consists of an inner tracking detector surrounded by a thin superconducting solenoid, electromagnetic and hadron calorimeters, and a muon spectrometer incorporating three large superconducting air-core toroidal magnets.

The inner-detector system is immersed in a \(2~\textrm{T}\) axial magnetic field and provides charged-particle tracking in the range \(|\eta | < 2.5\). The high-granularity silicon pixel detector covers the vertex region and typically provides four measurements per track, the first hit normally being in the insertable B-layer [11, 12] installed before Run 2. It is followed by the silicon microstrip tracker, which usually provides eight measurements per track. These silicon detectors are complemented by the transition radiation tracker (TRT), which enables radially extended track reconstruction up to \(|\eta | = 2.0\). The TRT also provides electron identification information based on the fraction of hits (typically 30 in total) above a higher energy-deposit threshold corresponding to transition radiation.

The calorimeter system covers the pseudorapidity range \(|\eta | < 4.9\). Within the region \(|\eta |< 3.2\), electromagnetic (EM) calorimetry is provided by barrel and endcap high-granularity lead/liquid-argon (LAr) calorimeters, with an additional thin LAr presampler covering \(|\eta | < 1.8\) to correct for energy loss in material upstream of the calorimeters. Hadron calorimetry is provided by the steel/scintillator-tile calorimeter (Tile calorimeter in the following), segmented into three barrel structures within \(|\eta | < 1.7\), and two copper/LAr hadronic endcap calorimeters. The solid angle coverage is completed with forward copper/LAr and tungsten/LAr calorimeter modules optimised for electromagnetic and hadronic energy measurements respectively. Electromagnetic and hadronic calorimeters based on LAr technology are collectively referred to as the LAr calorimeter in the following. Additional details about the timing measurements in the ATLAS calorimeters are provided in Sect. 2.1.

The muon spectrometer comprises separate trigger and high-precision tracking chambers measuring the deflection of muons in a magnetic field generated by the superconducting air-core toroidal magnets. The field integral of the toroids ranges between 2.0 and \(6.0~\textrm{T}\cdot \textrm{m}\) across most of the detector. Three layers of precision chambers, each consisting of layers of monitored drift tubes, cover the region \(|\eta | < 2.7\), complemented by cathode-strip chambers in the forward region, where the background is highest. The muon trigger system covers the range \(|\eta | < 2.4\) with resistive-plate chambers in the barrel, and thin-gap chambers in the endcap regions.

Interesting events are selected by the first-level (L1) trigger system implemented in custom hardware, followed by selections made by algorithms implemented in software in the high-level trigger (HLT) [13]. The first-level trigger accepts events from the \(40~\textrm{MHz}\) bunch-crossings at a rate below \(100~\textrm{kHz}\), which the high-level trigger reduces in order to record events to disk at about \(1~\textrm{kHz}\). Triggers with acceptance rates that are too large are prescaled, i.e. only a fraction of the events satisfying the trigger are written to disk. The prescale factor is then applied to estimate the original rate. An extensive software suite [14] is used in data simulation, in the reconstruction and analysis of real and simulated data, in detector operations, and in the trigger and data acquisition systems of the experiment.

2.1 Calorimeter timing measurement

In addition to the energy deposited in each cell, both the LAr and Tile calorimeters can measure the time of arrival of the particle depositing the energy. LAr signals are read out using Front End Boards (FEB). They shape the signal and sample it at \(40~\textrm{MHz}\), four samples are then digitised if the event passes the L1 trigger. The shape of the LAr signal is optimised to minimise the noise contribution [15]. An example of the typical LAr calorimeter pulse shape is shown in Fig. 1. The long negative tail of the shaped LAr pulse implies that out-of-time pile-up provides a negative energy contribution on average, while the contribution from in-time pile-up is positive on average. Because of this, the average pile-up energy per event is zero. While this feature is very useful in correcting for the average pile-up energy deposition, suppression methods like the one presented in this paper are still very useful in suppressing pile-up contributions to individual physics objects.

Fig. 1
figure 1

Example of the LAr (central EM barrel) calorimeter pulse shape from Ref. [3]. The unipolar triangular pulse is the one generated by fast ionising particles in the liquid argon. Its characteristic time is the drift time \(t_{\textrm{d}} \simeq 450~\textrm{ns}\) in the example shown. The shaped pulse is superimposed, with a characteristic duration of \(t_{\textrm{signal}} \simeq 600~\textrm{ns}\). The full circles on the shaped pulse indicate the nominal bunch-crossings at 25 ns intervals

The signal amplitude A (proportional to its energy) and timing (t) are both reconstructed using an optimal filtering algorithm [16] applied to the digitised samples \(S_i\):

$$\begin{aligned} A = \sum _{i=1}^{n_{\textrm{samples}}} a_i S_i, \qquad t = \frac{1}{A} \sum _{i=1}^{n_{\textrm{samples}}} b_i S_i, \end{aligned}$$
(1)

where the optimal filtering coefficients \(a_i\) and \(b_i\) are computed from the predicted pulse shape and measured noise. The cell time is only measured if the detected energy is above a certain configurable threshold. Typically, threshold values equal to three times the cell noise (\(3\sigma _{\textrm{noise}}\)) are used. If the reconstructed energy is below threshold, then the cell time is not computed and \(t=0\) is stored.

The LAr time measurement is synchronised with the LHC clock and fine-tuned at the FEB level [17]. Time alignment corrections are recalculated using beam-splash and early collision data [18]. The measured time is also monitored as part of the data quality assessment [19] in order to ensure its stability. The LAr time resolution is typically parameterised as the sum in quadrature of a constant term and a \({\sim }1/E\) noise term:

$$\begin{aligned} \sigma _t = \frac{p_1}{E} \oplus p_0. \end{aligned}$$

In Run 2 and after applying calibration constants obtained from data, the constant term \(p_0\) was found to reach \({\sim }200\,\text {ps}\), while the noise term \(p_1\) was \(\mathcal {O}(1~\text {GeV ns})\) [18].

The Tile calorimeter signal is also shaped, amplified and sampled at \(40~\textrm{MHz}\) by the front-end electronics. Unlike the LAr calorimeter, seven rather than four samples are digitised and the cell energy and time are reconstructed via an optimal filtering method as described by Eq. (1). If the signal amplitude is below 5 ADC counts,Footnote 3 then the timing is not measured and a default value of \(t=0\) is stored. Since most Tile calorimeter cells are read out by two independent channels, the average of the two times is taken as the measured cell time [20]. The Tile calorimeter uses its Laser calibration system [21] to monitor the timing of all channels during physics runs. Laser events are recorded during empty bunch-crossings in physics data-taking and the stability of the timing is monitored as part of the data quality assessment, allowing corrections for possible misconfiguration to be made in the main data processing. Channels that exhibit a large number of timing jumps are flagged as bad timing and are not used for further physics object reconstruction [21]. Cells flagged as Tile bad timing are not considered when implementing the time cut. The Tile timing resolution is described with a functional form similar to the one commonly used for the calorimeter energy resolution:

$$\begin{aligned} \sigma _t = p_0 \oplus \frac{p_1}{E} \oplus \frac{p_2}{\sqrt{E}}. \end{aligned}$$

In Run 2, the constant term was found to be \({\sim }300~\text {ps}\), while \(p_1\) and \(p_2\) are \(\mathcal {O}(1~\text {GeV ns})\) and \(\mathcal {O}(1~\sqrt{\textrm{GeV}} \text { ns})\), respectively, with variations depending on the gain [22].

3 Data and Monte Carlo samples

The time cut’s performance has been studied on simulated multi-jet event samples as well as pp collision data from the ATLAS detector. The data sample used corresponds to one run, taken by ATLAS in 2017 during Run 2 of the LHC at a centre-of-mass energy \(\sqrt{s}=13\) TeV, with an integrated luminosity of \(\mathcal {L} = 76 \pm 2~\hbox {pb}^{-1}\). Because the goal of this study is only to compare jet kinematic distributions, a larger data sample is not needed. In the selected run the LHC provided collisions from bunch trains with a nominal bunch-crossing spacing of 25 ns. The recorded average mean number of interactions per bunch-crossing \(\langle \mu \rangle = 38.8\) is consistent with the average \(\langle \mu \rangle \) recorded during the 2017 data-taking period. Data quality requirements are applied to ensure that all detector components were fully operational [23]. Data were collected using multiple single-jet triggers (see Sect. 7). Events selected by each trigger are scaled by the corresponding trigger prescale factor. In a few Luminosity BlocksFootnote 4 at the beginning of the run, the prescale factors for the chosen single-jet triggers were found to be very large, resulting in an apparent loss of statistical precision due to recording few events with very large trigger weights. To avoid this issue, the Luminosity Blocks in question were excluded from the analysis. The loss of integrated luminosity due to this additional rejection amounts to \(6.5\%\). The integrated luminosity after data quality requirements are applied is \(\mathcal {L} = 71 \pm 2\,\,\hbox {pb}^{-1}\). The uncertainty in the 2017 integrated luminosity is assumed for this run; it amounts to \(2.4 \%\) [4], obtained using the LUCID-2 detector [24] for the primary luminosity measurements.

Additional data samples were used to check for inefficiencies caused by time miscalibration and to test the ATLAS event size for reductions due to the time cut. For time-cut inefficiency checks, one Luminosity Block of data collected in 2018 was used, with an average number of interactions per bunch-crossing of \(\langle \mu \rangle = 42.5\). The data used for the event size test also consists of one Luminosity Block, taken in 2022. The average pile-up for this run is \(\langle \mu \rangle = 41.3\).

Multi-jet production was modelled using PYTHIA 8.230 [25] with leading-order matrix elements for dijet production which were matched to the parton shower. The renormalisation and factorisation scales were set to the geometric mean of the squared transverse masses of the two outgoing particles in the matrix element, \(\sqrt{(p_{\textrm{T}, 1}^{2} + m_1^2) (p_{\textrm{T}, 2}^{2} + m_2^2)}\). The NNPDF2.3lo set of parton distribution functions (PDFs) [26] was used in the matrix element generation, the parton shower, and the simulation of the multi-parton interactions. The A14 [27] set of tuned parameters was used. The modelling of fragmentation and hadronisation was based on the Lund string model [28, 29]. The effect of both in-time and out-of-time pile-up was modelled by overlaying the simulated hard-scattering event with inelastic proton–proton (pp) events generated with PYTHIA 8.186 [30] using the NNPDF2.3lo PDF set [26] and the A3 set of tuned parameters [31].

The MC events have been reweighted to account for the difference between the simulated number of interactions per bunch-crossing (\(\mu \)) profile and the one from the single run used for this paper. The correction factor is defined in bins of \(\mu \), as the ratio of the data to MC sample \(\mu \) spectra, normalised to unity and prior to any event selection. This reweighting is not applied when performing MC-only studies, to allow a wider \(\mu \) range to be covered.

An additional MC sample was used to assess the ATLAS event size reduction due to the time cut. This MC sample describes \(t\overline{t}\) production in the fully hadronic decay channel. The production of \(t\bar{t}\) events was modelled using the Powheg Box v2 [32,33,34,35] generator at NLO with the NNPDF3.0nlo [36] PDF set and the \(h_{\textrm{damp}}\) parameter set to 1.5 \(m_{\textrm{top}}\) [37]. The events were interfaced to PYTHIA 8.307 [25] to model the parton shower, hadronisation, and underlying event, with parameters set according to the A14 tune [27] and using the NNPDF2.3lo set of PDFs [26]. The decays of bottom and charm hadrons were performed by EvtGen 2.1.1 [38].

4 The topo-cluster reconstruction algorithm

The cell time cut is introduced into the existing ATLAS topo-cluster reconstruction algorithm, a detailed description is provided in Ref. [3]. In short, ATLAS topo-clustering is based on the cell signal significance \(\zeta _{\textrm{cell}}^{\textrm{EM}}\), defined as the ratio of the cell signal energy to the average expected noise (Eq. (2)). Both are measured at the electromagnetic (EM) scale, defined by calibrating the calorimeter energy measurement to the energy deposited by an electron or photon, without any correction for the non-compensating response of the calorimeter.

$$\begin{aligned} \zeta _{\textrm{cell}}^{\textrm{EM}}= \frac{E_{\textrm{cell}}^{\textrm{EM}}}{\sigma _{\textrm{noise,cell}}^{\textrm{EM}}}. \end{aligned}$$
(2)

Topo-clusters are formed by a growing-volume algorithm, configured by three threshold parameters \(\{S, \ N, \ P\}\):

$$\begin{aligned}{} & {} {|}\zeta _{\textrm{cell}}^{\textrm{EM}}{|}>S \quad (\text {primary seed threshold}), \end{aligned}$$
(3)
$$\begin{aligned}{} & {} {|}\zeta _{\textrm{cell}}^{\textrm{EM}}{|}>N \quad (\text {threshold for growth control}), \end{aligned}$$
(4)
$$\begin{aligned}{} & {} {|}\zeta _{\textrm{cell}}^{\textrm{EM}}{|}>P \quad (\text {principal cell filter}); \end{aligned}$$
(5)

set to \(\{S=4, \ N=2, \ P=0\}\) in Runs 1, 2 and 3. The algorithm is seeded by cells (seed cells in the following) whose signal significance exceeds a threshold S (Eq. (3)) and which have not been marked as having either read-out or general signal extraction problems in the actual run conditions. Seed cells are then sorted from highest to lowest energy significance and topo-clusters are grown by adding all neighbouring cells that satisfy Eq. (5).

Two cells are considered to be neighbours if they are directly adjacent in a given sampling layer, or, if in adjacent layers, if they at least partially overlap in the plane. If a neighbouring cell has a signal significance above the threshold N in Eq. (4), then the procedure is iterated over its neighbours, while if it satisfies Eq. (5) but not Eq. (4), its neighbours are not included in the growing topo-cluster. This algorithm iterates over further neighbours until no more neighbouring cells satisfying Eq. (4) are found. Since the S, N and P thresholds are applied to the absolute significance, cells of either positive or negative energy can seed a cluster or be included in one. As discussed in Sect. 2.1, negative cell signals in the ATLAS calorimeters are often the result of out-of-time pile-up, since the residual signal trace produced by out-of-time particles is scaled by the negative tail of the LAr calorimeter signal-shaping function [15]. Only clusters with an overall positive energy are used as inputs for jet reconstruction. The algorithm described so far, however, has no limitations to how large a cluster can grow and it could potentially lead to very large, unphysical clusters. For this reason, clusters are then split around local energy maxima, as detailed in Ref. [3].

5 The cell time cut

As discussed in Sect. 2.1, both the ATLAS LAr and Tile calorimeters provide a time measurement for signals of sufficiently large energy. Figure 2 shows the distribution of the energy significance and time of calorimeter signals in real data. Calorimeter cells belonging to the LAr EM barrel region are picked as an example, but similar distributions occur in other calorimeter regions. At high energy significance, two secondary peaks at \(\pm 25\) ns are clearly visible, associated with out-of-time pile-up from the previous and next bunch-crossings. The asymmetry visible between positive and negative cell time is due to the LAr calorimeter being affected by a larger number of bunch-crossings before the current one than after it.

Fig. 2
figure 2

Time and energy significance for calorimeter cells in the second layer of the LAr EM barrel calorimeter. a Shows the two-dimensional cell time vs significance spectrum. The cell time is computed if the cell energy exceeds a given threshold (typically \(3\cdot \sigma _{\textrm{noise,cell}}^{\textrm{EM}}\)), below which a default value of \(t=0\) is stored. The dashed vertical line represents the seed candidate requirement \(S=4\) (Eq. (3)), while the horizontal lines represent the cell time rejection limits of \(\pm 12.5~\textrm{ns}\). The three dotted vertical lines represent the three possible values considered for the Upper Limit (see Sect. 5.3). b Shows a comparison between the inclusive cell time spectrum and the one for seed candidates (\(|E|/\sigma _E>4\)). The vertical lines represent the cell time rejection limits of \(\pm 12.5~\textrm{ns}\)

At lower energy significance the time resolution is poorer, and the secondary peaks cannot in general be distinguished. When limiting the study to candidate seed cells, i.e. those with energy significance greater than \(S=4\), the three peaks are separated well enough to justify a selection on the absolute cell time, requiring it to be within \(\pm 12.5\) ns of the midpoint between the secondary peaks. Multiple approaches to implementing a cell time cut in topo-cluster building were explored. The most relevant are detailed below.

5.1 The Seed time cut

The requirement for seeding a topo-cluster is modified. Seed cells must satisfy Eq. (6), i.e. have both an absolute cell signal significance larger than \(S=4\) and a cell time within \(\pm 12.5\) ns:

$$\begin{aligned} \left\{ \begin{array}{l} {|}\zeta _{\textrm{cell}}^{\textrm{EM}}{|}>S \quad (\text {primary seed threshold, default }S = 4) \\ {|}t_{\textrm{cell}}{|} < 12.5 \text { ns}. \end{array} \right. \end{aligned}$$
(6)

The cluster growth and splitting stages of the topo-clustering algorithm are left untouched. Since this selection is exclusively applied to topo-cluster seeds, it is referred to as the Seed cut in the following.

5.2 The Seed Extended time cut

In the Seed cut implementation, a cell exceeding the signal significance threshold in Eq. (6), but failing the time cut, would be prevented from seeding a cluster. It could, however, be included in another cluster as a neighbouring cell if it falls in the vicinity of an in-time signal. Due to the high level of pile-up at the LHC, this phenomenon is not unlikely. To prevent their inclusion, a tighter version of the cut was defined, in which candidate seed cells that satisfy Eq. (3), but not Eq. (6), are vetoed from being included in growing topo-clusters. In the following, this version of the time cut is referred to as the Seed Extended cut.

5.3 The upper limit

Although this paper focuses on the reduction of out-of-time pile-up, delayed calorimeter signals with substantial energy can also arise from new physics, for example from the decays of slow-moving long-lived particles (LLPs) that are predicted by several theories for physics beyond the Standard Model [39, 40]. Optimising the time cut for LLP signals would require taking into account the many signatures considered for LLP searches and is outside the scope of this paper. A conservative approach was therefore adopted, where the time cut is not applied to signals with very large significance, thus leaving the decision of whether or not to reject such signals to a later stage of the reconstruction chain, where differentiation between standard and LLP-dedicated selections is possible.

The time cut was therefore modified by introducing an Upper Limit (UL). The time cut is turned off if the cell signal significance \(\zeta _{\textrm{cell}}^{\textrm{EM}}= E_{\textrm{cell}}^{\textrm{EM}}/\sigma _{\textrm{noise,cell}}^{\textrm{EM}}\) is positive and larger than a given value \(X_{\mathrm{{UL}}}\), i.e. if:

$$\begin{aligned}{} & {} \left\{ \begin{array}{l} {|}\zeta _{\textrm{cell}}^{\textrm{EM}}{|}>S \quad (\text {primary seed threshold, default }S = 4)\\ {|}t_{\textrm{cell}}{|} < 12.5 \text { ns} \end{array} \right. \\{} & {} \quad \text {OR} \quad \zeta _{\textrm{cell}}^{\textrm{EM}} >X_{\textrm{UL}}. \end{aligned}$$

A lower value of \(X_{\textrm{UL}}\) implies a larger phase space left untouched by the time cut. Three values \(X_{\textrm{UL}}=10\), 20 and 40 were tested in order to estimate how low the Upper Limit can be set without reintroducing too much pile-up into the event. While the Upper Limit could in principle be combined with either the Seed cut or the Seed Extended cut, in this paper it is only studied in combination with the Seed Extended cut, as the latter appeared more promising than the Seed cut (Sect. 6). The Upper Limits \(X_{\textrm{UL}}=10\), 20 and 40 are shown as dotted lines in Fig. 2. As expected, \(X_{\textrm{UL}}=40\) affects a very limited number of out-of-time cells and it is expected to have a negligible effect, while \(X_{\textrm{UL}}=10\) is expected to have the largest impact.

6 Performance of the time cut on MC simulation

The performance of the time cut was evaluated using a PYTHIA 8.230 dijet MC sample, as described in Sect. 3. The event selection is kept as loose as possible in order to observe ample ranges of cluster energy and jet energy.

6.1 Performance on topo-clusters

Since the time cut is applied in the topo-clustering algorithm, the overall number of clusters is expected to be reduced, because the applied cut leads to a smaller number of cluster seeds. It should be noted, however, that the topo-clusters built with the time cut applied are not necessarily a subsample of those built using only the cell energy significance. Excluding out-of-time signals may change the number of local maxima, prompting additional cluster splitting. Figure 3 shows the number of clusters reconstructed per event; only positive-energy topo-clusters (i.e. eligible inputs to the jet reconstruction) are shown. The number of positive-energy clusters in each event is on average \({\sim }15\%\) smaller when the time cut is applied, with very little difference between the different choices for the cut. The Seed and Seed Extended cuts differ only in their treatment of out-of-time signals with significant energy when these signals are included as neighbouring cells, and hence the two algorithms can be expected to differ more strongly in their effect on topo-cluster properties than on the number of reconstructed topo-clusters. Since the UL only truncates the relatively sparsely populated high-energy tail of the topo-cluster spectrum, its effect then becomes subdominant when considering the inclusive number of clusters.

Fig. 3
figure 3

Number of positive energy (\(E_{\textrm{cl}}>0\)) reconstructed topo-clusters per event: comparison between no time cut and multiple choices for the time cut. The first and last bins show the overflow and underflow

Removing out-of-time contributions can also result in a change in the kinematic properties of the reconstructed topo-clusters. This effect is expected to be more significant when applying the Seed Extended time cut as opposed to the Seed time cut. Figure 4 shows the multiplicity of topo-clusters that have various kinematic properties, comparing the various time cut options. Each kinematic moment of the topo-clusters is defined as the weighted average of the property of interest over the cells contributing to the cluster. A full description of the topo-clusters’ properties is given in Ref. [3]. An additional calibration using the local cell weighting (LCW) scheme is appliedFootnote 5 to take into account the non-compensating response of the calorimeter, out-of-cluster energy deposits and energy deposited in the dead material within the detector [3]. Only the topo-clusters with overall positive energy in each event are shown. Bottom panels show the “cut/no-cut” ratio plot. Uncertainties in the ratios must take into account that the distributions with and without the time cut are obtained from the same events, and hence are correlated in a non-trivial way. To properly account for the correlations, uncertainties are computed by splitting the available sample into subsamples of approximately 10,000 events each, and the cut/no-cut ratio is recomputed for each subsample. The ratio’s uncertainty is estimated from the standard deviation of the resulting distribution. The estimated uncertainty is propagated to the inclusive ratio as a relative uncertainty. This method is used to compute the uncertainties in the cut/no-cut ratio plots throughout this paper.

Fig. 4
figure 4

Kinematics of reconstructed topo-clusters, compared between no time cut, the Seed cut, the Seed Extended cut and the Seed Extended cut combined with multiple choices of Upper Limit: \(X_{\textrm{UL}}=40\), \(X_{\textrm{UL}}=20\) and \(X_{\textrm{UL}}=10\). Each kinematic moment of the topo-clusters is defined as the weighted average of the property of interest over the cells contributing to the cluster [3]. Topo-clusters with energy \(E_{\textrm{cl}}>0\) are shown. Plots are normalised to the total number of positive-energy topo-clusters per event. Uncertainties in the cut/no-cut ratios are computed by splitting the available sample into subsamples and recomputing the cut/no-cut ratio for each subsample. The standard deviation of the distribution of the ratio is used to estimate the ratio’s uncertainty

Figure 4a shows the cluster time. When no time cut is applied, two shoulders are clearly visible near \(\pm 25\) ns, consistent with being due to out-of-time pile-up from the previous and next bunch-crossings. Once the time cut is applied, the number of clusters around \(\pm 25~\text {ns}\) is reduced by more than \(80\%\) by the Seed Extended time cut, smoothing out the two shoulders, while the main peak at 0 ns appears unchanged. The Seed cut has a smaller impact, reducing the two peaks by \({\sim }70\%\), while the impact of the Seed Extended plus Upper Limit \(X_{\textrm{UL}}=10\) cut lies between those of the Seed and Seed Extended cuts. Overall, topo-clusters with absolute time greater than 12.5 ns are not removed entirely, since out-of-time contributions can still occur due to predominantly low-energy cells being included in topo-clusters as neighbouring cells.

Fig. 5
figure 5

Topo-cluster a isolation and b distance \(\lambda \) from the calorimeter front face. A comparison between no time cut, the Seed cut, the Seed Extended cut and the Seed Extended cut combined with \(X_{\textrm{UL}}=40\), \(X_{\textrm{UL}}=20\) and \(X_{\textrm{UL}}=10\) is shown. Each kinematic moment of the topo-clusters is defined as the weighted average of the property of interest over the cells contributing to the cluster [3]. Topo-clusters with energy \(E_{\textrm{cl}}>0\) are shown. Plots are normalised to the total number of positive-energy topo-clusters per event. Uncertainties in the cut/no-cut ratios are computed by splitting the available sample into subsamples and recomputing the cut/no-cut ratio for each subsample. The standard deviation of the distribution of the ratio is used to estimate the ratio’s uncertainty

The relative effect of the time cut on the number of clusters is also found to vary with jet rapidity (Fig. 4b), the largest effect being a \({\sim }25\%\) reduction in the cluster multiplicity for \(2 \lesssim |y| \lesssim 3.5\). The energy deposited by pile-up is expected to increase at larger absolute rapidity, but the time cut has only a \({\sim }10\%\) effect above \(|y|\sim 3.5\) because each forward calorimeter cell covers a larger pseudorapidity interval. This geometrical effect implies that larger numbers of both in-time particles and out-of-time particles deposit energy in the same cell, thus resulting in a less effective distinction between in-time and out-of-time signals. The effect of the time cut is also seen to vary with the cluster energy (Fig. 4c). The number of clusters with energy between about 1 GeV and 100 GeV is reduced by \({\sim }25\%\), with the reduction decreasing to \({\sim }10\%\) at both very low and very high energies. Finally, the dependence on the cluster \(p_{\textrm{T}\,\,}\)(Fig. 4d) provides the best way to observe the effect of the Upper Limit. As expected, \(X_{\textrm{UL}}=40\) has hardly any effect, while \(X_{\textrm{UL}}=20\) produces a small difference and it starts to diverge from the Seed Extended cut for clusters with \(p_{\textrm{T}}\gtrsim 15\) GeV. However, \(X_{\textrm{UL}}=10\) has a larger impact, which starts to appear for clusters with \(p_{\textrm{T}}\gtrsim 2\) GeV. At \(p_{\textrm{T}\,\,}=5~\textrm{GeV}\) for instance, the Seed Extended cut reduces the number of clusters by \({\sim }20\%\), while the Seed Extended plus \(X_{\textrm{UL}}=10\) cut reduces their number by only \({\sim }15\%\).

Figure 5 shows two topo-cluster moments. The topo-cluster isolation (Fig. 5a) is computed from the number of non-clustered cells on the outer perimeter of the topo-cluster [3]: an isolation value close to 1 indicates an isolated topo-cluster, while it tends to 0 when the topo-cluster is not isolated. Since the lower values of the isolation spectrum are affected the most by the time cut, it can be concluded that topo-clusters reconstructed with the time cut tend to be more isolated than those from the standard algorithm. This observation can be explained as a combination of two effects. Firstly, isolated topo-clusters are mostly produced by EM showers, and are less sensitive to pile-up because of their smaller size. Secondly, the pile-up reduction introduced by the timing cut is likely to produce more isolated topo-clusters. Specific studies are needed to further understand the interplay between these two effects. The distance \(\lambda \) of the cluster’s centre of gravity from the calorimeter’s front face [3] is shown in Fig. 5b. The region of \(\lambda \lesssim 100\) mm is known to be dominated by EM clusters, while clusters located at \(\lambda \gtrsim 400\) mm are predominantly hadronic [3]. In Fig. 5b very little difference is visible in the region dominated by EM clusters while a relatively uniform reduction of about 20% is visible for predominantly hadronic clusters. The area between \(\lambda \sim 100\) mm and \(\lambda \sim 400\) mm corresponds to a mixture of EM and hadronic clusters. Here, peaks are visible in the spectrum of standard clusters, while they are almost entirely removed by the time cut. This effect hints that the timing cut may increase the separation between hadronic and EM showers in the presence of pile-up.

6.2 Performance on jets

Topo-clusters serve as inputs to the reconstruction of many physics objects in ATLAS. This paper focuses on the effect of the time cut on jets reconstructed by applying the anti-\(k_{t}\) clustering algorithm with radius parameter \(R=0.4\) [41]. In ATLAS, jets are most often reconstructed from particle-flow objects [42, 43], which are built by combining topo-clusters with matching inner-detector tracks. Despite it being the preferred method in ATLAS, particle flow adds a layer of complexity to the jet reconstruction: jets built directly from topo-clusters provide a more straightforward way of evaluating the time cut’s performance and are hence used in this paper. Preliminary studies show that the effect of the time cut on \(R=0.4\) particle-flow jets is about the same size as the effect on those built directly from topo-clusters. Since the ATLAS tracking is performed within the triggered bunch crossing, it is reasonable to expect that most out-of-time calorimeter signals would not match tracks, so the time cut can be expected to primarily affect objects classified as neutral by the particle-flow algorithm, which in turn consist only of topo-clusters, with no track-based correction.

6.2.1 Jet calibration

Jets are calibrated to correct for both the pile-up contribution and the detector response. A complete description of jet calibration in ATLAS can be found in Ref. [43]. The calibration procedure consists of three stages. First, a two-step pile-up correction is made. Then a MC-based calibration factor is applied to correct for the detector response (MC jet energy scale: MC-JES) and improve the jet resolution by reducing the calibration’s dependence on additional jet properties (Global Sequential Calibration). Finally, an in situ correction is applied to account for discrepancies between data and MC events.

The first stage of the calibration procedure starts with an area-based correction in which the average pile-up contribution is subtracted from the jet’s transverse momentum:

$$\begin{aligned} p_{\textrm{T}\,\,}^{\textrm{area}} = p_{\textrm{T}\,\,}- \rho \times A, \end{aligned}$$

where \(p_{\textrm{T}\,\,}\)is the jet \(p_{\textrm{T}\,\,}\), A is the jet catchment area and \(\rho \) is the median \(p_{\textrm{T}\,\,}\)density of the event. The value of \(\rho \) is calculated using jets reconstructed by applying the \(k_{t}\) algorithm [44, 45] to positive-energy topo-clusters with \(|\eta |<2\). A residual pile-up correction is then applied to account for residual dependence of the jet \(p_{\textrm{T}\,\,}\)on pile-up activity in the event. Two \(\eta \)-dependent correction terms are derived independently by fitting the dependence of the jet \(p_{\textrm{T}\,\,}\)on \(\mu \) and the number of reconstructed primary vertices per event, \(N_{\textrm{PV}}\). In general, \(\mu \) provides an estimate of the amount of out-of-time pile-up in the event, while \(N_{\textrm{PV}}\) estimates the amount of in-time pile-up. The residual correction is then applied as:

$$\begin{aligned} p_{\textrm{T}\,\,}^{\textrm{residual}} = p_{\textrm{T}\,\,}^{\textrm{area}} - \alpha \times (N_{\textrm{PV}}-1) - \beta \times \mu , \end{aligned}$$

where \(\alpha \) and \(\beta \) are the correction factors estimated from MC simulation.

The pile-up contribution to jets is expected to be affected by the time cut. Figure 6 shows the spectrum of the median \(p_{\textrm{T}\,\,}\)density \(\rho \), as well as the dependence of \(\rho \) on \(\mu \). When the time cut is applied, \(\rho \) is found to be \({\sim }20\%\) smaller overall, and also to increase more slowly as a function of \(\mu \). Since \(\rho \) is built from topo-clusters, the improved pile-up rejection of the time cut leads to smaller \(\rho \) values, decreasing the pile-up-dependent contribution to the jet energy. To account for possible remaining data–MC differences in the pile-up contribution to jets, not covered by the jet-area correction, the residual correction is recalculated after the application of the time cut. The standard procedure summarised above is repeated for both the Seed and Seed Extended cuts. Since the effect of the Upper Limit is smaller and limited to high-energy regions, the residual correction is not recalculated for any of the \(X_{\textrm{UL}}\) values, and the residual correction calculated for the Seed Extended cut is used.

Fig. 6
figure 6

Topo-cluster median \(p_{\textrm{T}\,\,}\)density (\(\rho \)): comparison between the no time cut and the considered time cut options. a \(\rho \) spectrum. b average \(\rho \) in bins of the number of interactions per bunch-crossing (\(\mu \)). Uncertainties in the cut/no-cut ratios are obtained by splitting the available sample into subsamples and recomputing the cut/no-cut ratio for each subsample. The standard deviation of the distribution of the ratio is used to estimate the ratio’s uncertainty

6.2.2 Jet selection

Reconstructed jets are calibrated by applying the pile-up correction and the MC-JES calibration. They are then preselected, requiring them to lie within \(|\eta |{<}4.5\) and satisfy \(p_{\textrm{T}\,\,}{>}7\) GeV. Jets are also required to be isolated, i.e. there must be no other preselected jet satisfying \(\Delta R(j_1,j_2){<}1.5\, R_{\textrm{jet}}\), where \(j_{1}\) and \(j_{2}\) are any two reconstructed jets and \(R_{\textrm{jet}}\) is the jet radius parameter used in the anti-\(k_{t}\) algorithm (here \(R_{\textrm{jet}}=0.4\)). Since this study focuses on both signal and pile-up jets, pile-up suppression requirements such as the one using the Jet Vertex Tagger [46] are not applied.

In order to distinguish jets produced by the hard-scattering from jets originating from pile-up, reconstructed jets are matched to truth-level jets in MC events. Truth-level jets are reconstructed by applying the anti-\(k_{t}\) algorithm (\(R=0.4\)) to stable final-state particles, defined as having \(c\tau > 10~\text {mm}\) in the generator’s event record, excluding muons and neutrinos. Three types of truth jets are distinguished. First, truth jets originating from the hard scattering (HS-truth jet) are obtained by clustering stable particles from the simulated hard-scattering event, excluding particles from pile-up interactions. Next, two types of pile-up truth jets are obtained by clustering stable particles from the minimum-bias events which are overlaid on the hard scattering in order to simulate pile-up [47]. Jets are clustered for each minimum-bias event individually and then grouped into two pile-up jet collections: jets from in-time overlay events are included in the in-time pile-up jet collection (IT-truth jet in the following), while those from out-of-time overlay events are included in the out-of-time pile-up jet collection (OOT-truth jet in the following). The latter are built from 32 bunch-crossings before the current one and 6 after [47].

Default cuts requiring \(p_{\textrm{T}\,\,}>10~\textrm{GeV}\) and \(p_{\textrm{T}\,\,}>15~\textrm{GeV}\) are applied to IT-truth jets and OOT-truth jets respectively. HS-truth jets are also preselected by requiring \(p_{\textrm{T}\,\,}>7\) GeV. All truth jets must satisfy \(|\eta |<5\). It should be noted that while a \(p_{\textrm{T}\,\,}\)match can be expected between HS-truth jets and reconstructed jets after the calibration is applied, this is less the case for OOT-truth jets: because of negative energy contributions, one might expect the reconstructed energy of an out-of-time pile-up jet to be lower than that of the truth jet. This is the reason for having a higher \(p_{\textrm{T}\,\,}\)selection for truth pile-up jets than for HS-truth jets. Truth-level jets are also required to be isolated: there must be no other truth jet satisfying \(\Delta R(j_1,j_2) < 2.5 \,R_{\textrm{jet}}\). Here \(j_{1}\) and \(j_{2}\) are any two truth jets, all three truth-jet categories being considered together when computing the isolation. Reconstructed jets are matched to truth jets via a geometrical matching requirement: a reconstructed jet \(j_{\textrm{r}}\) is said to match a truth jet \(j_{\textrm{t}}\) if their separation satisfies \(\Delta R(j_{\textrm{r}},j_{\textrm{t}})<0.3\). Reconstructed jets are first checked for matches with a HS-truth jet. If no such match is found, then matches with IT-truth and OOT-truth jets are attempted.

6.2.3 Jet kinematics

Figure 7 shows the calibrated transverse momentum distributions for the jet categories described in Sect. 6.2.2. Both the reconstructed jets matching HS-truth jets and those matching IT-truth jets show only a percent-level impact from the time cut. It should be pointed out that the time cut can affect the reconstructed jet kinematics, possibly resulting in bin-to-bin migrations in the spectra shown in this subsection. Energy (and \(p_{\textrm{T}\,\,}\)) variations can occur in either direction, since the cell energies included in the topo-clusters can be either positive or negative. Out-of-time cells often produce negative energy signals and this explains why the ratio plots in Fig. 7a, b have values above unity in some places. The small \(p_{\textrm{T}\,\,}\)-dependent decrease in the fraction of jets matching HS-truth jets was found to be due to random matches. The fraction of jets that match OOT-truth jets, on the other hand, is drastically reduced by the Seed Extended cut, especially in the high \(p_{\textrm{T}\,\,}\)region. The Seed Extended cut reduces the multiplicity of out-of-time jets by \({\sim }60\%\) at \(p_{\textrm{T}\,\,}=20~\textrm{GeV}\), and no out-of-time jets are found above \(\sim 45~\textrm{GeV}\). The Seed cut, on the other hand, reduces the multiplicity of the out-of-time jets by only \({\sim }30\%\) at 20 GeV and only \(\sim 20\%\) at 50 GeV, indicating that the Seed Extended cut rejects out-of-time pile-up more effectively. Figure 7 also illustrates the effect of the different Upper Limit values. At \(p_{T} \sim 45~\textrm{GeV}\), the reduction in out-of-time jet multiplicity is \({\sim }90\%\) for an Upper Limit of 40, \({\sim }70\%\) for an Upper Limit of 20, and \({\sim }50\%\) for an Upper Limit of 10, confirming that lower Upper Limit values allow more pile-up in the event.

Fig. 7
figure 7

Comparison between jet \(p_{\textrm{T}\,\,}\)spectra obtained when the Seed, Seed Extended, or no time cut is used. The Seed Extended cut is also shown in combination with the Upper Limit for \(X_{\textrm{UL}}=40\), \(X_{\textrm{UL}}=20\), and \(X_{\textrm{UL}}=10\). a Jets matching HS-truth jets. b Jets matching IT-truth jets. c Jets matching OOT-truth jets. Uncertainties in the cut/no-cut ratios are obtained by splitting the available sample into subsamples and recomputing cut/no-cut ratio for each subsample. The standard deviation of the distribution of the ratio is used to estimate the ratio’s uncertainty. The vertical dashed line represents the minimum jet \(p_{\textrm{T}\,\,}\)of 20 GeV normally required in ATLAS physics analyses

Fig. 8
figure 8

Comparison between jet rapidity spectra obtained when the Seed, Seed Extended, or no time cut is used. The Seed Extended cut is also shown in combination with the Upper Limit for \(X_{\textrm{UL}}=40\), \(X_{\textrm{UL}}=20\), and\(X_{\textrm{UL}}=10\). a Jets matching HS-truth jets. b Jets matching IT-truth jets. c Jets matching OOT-truth jets. Uncertainties in the cut/no-cut ratios are obtained by splitting the available sample into subsamples and recomputing the cut/no-cut ratio for each subsample. The standard deviation of the distribution of the ratio is used to estimate the ratio’s uncertainty

Figure 8 shows an analogous comparison for the jet rapidity. In order to limit the kinematics plots to the phase space of interest in physics analyses, an additional requirement of \(p_{\textrm{T}\,\,}>20~\textrm{GeV}\) is applied to the jets. Overall, all cuts except the Seed cut and Seed Extended plus Upper Limit 10 cut provide a \({\sim }40\%\) suppression of out-of-time jets within \(|y|<1.5\). The effect increases to a 50–60% reduction for \(1.5< |y| < 3.2\) and then decreases until it has negligible impact for \(|y|>4\). From these observations, it can be concluded that the Seed Extended time cut is preferable to the Seed cut because it is more effective in suppressing pile-up. It can also be seen that the Upper Limit \(X_{\textrm{UL}}=10\) allows too much pile-up to remain in the event, so the lowest practical Upper Limit among those tested is \(X_{\textrm{UL}}=20\).

6.2.4 Jet resolution

As discussed in Sect. 6.2.1, the time cut affects the jet calibration because of the different amount of pile-up remaining. To check for its impact on the performance, the jet energy resolution after calibration is compared for the different time cuts. The jet response is defined as the ratio of the calibrated energy of the reconstructed jet to the energy of the matching truth jet: \(R=E^{\textrm{calibrated}}_j/E^{\textrm{truth}}_j\). Only those jets matched to a HS-truth jet are considered for this study. The response distribution is computed in bins of truth-jet \(p_{\textrm{T}\,\,}\)and rapidity (|y|), or truth-jet \(p_{\textrm{T}\,\,}\)and the event \(\mu \). For each bin, the response distribution is fitted with a Gaussian function. The jet resolution is then computed for each bin, as the ratio of the fitted standard deviation of the jet response R to its fitted mean value (\(\sigma _{R}^{\textrm{fit}}/\langle R \rangle ^{\textrm{fit}}\)).

The estimated resolution is shown for two \(p_{\textrm{T}\,\,}\)bins in Fig. 9a, b as a function of the jet rapidity and in Fig. 9c, d as a function of \(\mu \). Some improvement in the jet resolution is visible, amounting to 5–10% for \(20~\textrm{GeV}\le p_{\textrm{T}\,\,}\le 30~\textrm{GeV}\), but no strong dependence on \(\mu \) or |y| is observed. A more accurate estimation of the jet resolution and calibration performance after the time cut is left for future studies, following the implementation of the algorithm in the standard ATLAS reconstruction chain.

Fig. 9
figure 9

Jet energy resolution after calibration as a function of a, b the jet \(|\eta |\) and c, d the number of interactions per bunch-crossing (\(\mu \)). Results are shown in bins of the jet \(p_{\textrm{T}\,\,}\): a, c \(20~\textrm{GeV}< p_{\textrm{T}\,\,}^{\textrm{truth}} < 30~\textrm{GeV}\); b, d \(30~\textrm{GeV}< p_{\textrm{T}\,\,}^{\textrm{truth}} < 60~\textrm{GeV}\). The resolution is defined as the ratio of the standard deviation and mean of a Gaussian function fitted to the response distribution

7 Performance on ATLAS data

As a further probe of its performance, the time cut was tested on data collected by the ATLAS detector in 2017, corresponding to an integrated luminosity of \(\mathcal {L} = 71 \pm 2~\hbox {pb}^{-1}\) after data quality requirements [23]. Single-jet triggers were used to collect the data, requiring at least one jet with sufficiently large \(p_{\textrm{T}\,\,}\). Multiple triggers, listed in Table 1 and implementing different minimum \(p_{\textrm{T}\,\,}\)requirements, were studied independently. Relatively low minimum \(p_{\textrm{T}\,\,}\)requirements are considered, as the time cut is known to mostly effect low-energy jets. Triggers with different L1 seeds are also considered: events passing triggers seeded by random L1 triggers are expected to have a larger contamination from out-of-time pile-up than those triggered by single jet at L1. Since all the triggers in Table 1 are prescaled, the data were reweighted event-by-event by the respective trigger prescale factor.

Data and simulated events are required to contain at least one primary vertex; the one with the highest sum of squared track \(p_{\textrm{T}\,\,}\)is considered in this section. In addition, at least one jet must satisfy a minimum \(p_{\textrm{T}\,\,}\)requirement, as listed in Table 1. This requirement is needed to restrict selected events to a phase space where the trigger efficiency is within \({\sim }20\%\) of its plateau value. A fully efficient trigger is not necessary for this study, as the time cut is not applied at trigger level and a smaller trigger efficiency would not affect the comparison between the cut and no-cut scenarios. In order to properly compare data and MC events, jets are calibrated using the full calibration chain, including the in situ correction.Footnote 6 All jets are required to have \(p_{\textrm{T}\,\,}> 20~\textrm{GeV}\). The Seed Extended cut and Seed Extended plus \(X_{\textrm{UL}}=20\) cut are compared with no-cut spectra for data and MC events. Figures 10, 11 and 12 show different jet kinematic properties, while topo-cluster properties are shown in Figs. 13, 14 and 15. Each histogram shows the distributions for events selected by a different trigger. Uncertainties shown for MC distributions are a combination of the MC statistical uncertainty and the luminosity uncertainty. The latter is based on the 2.4% luminosity uncertainty in 2017 [4]. Uncertainties shown for data are statistical only.

Table 1 Triggers used for data selection

The data and MC distributions are not in perfect agreement, and exhibit relatively uniform differences of about 30–40%. This is to be expected as the MC sample used is a dijet simulation at leading order: even though the hadronisation and jet characteristics are well described by PYTHIA 8.230, the overall MC normalisation is likely to be too low due to the missing higher orders. Moreover, low-\(p_{\textrm{T}\,\,}\)events are dominated by soft QCD radiation, which is known to be difficult to model. A complete jet cross-section measurement would have to use an improved simulation and consider additional correction factors. Since this study only seeks to verify that the time cut’s behaviour in data reflects what is observed in MC simulation, this level of agreement is considered acceptable.

The effect of the time cut depends on the trigger. Events passing HLT_j45, which has both a higher \(p_{\textrm{T}\,\,}\)threshold and a non-random L1 seed, are not affected at all by the time cut. The numbers of events passing the other three triggers exhibit a consistent reduction of \({\sim }20\%\) for values of the leading-jet \(p_{\textrm{T}\,\,}\)between 30 and \(50~\textrm{GeV}\) (Fig. 10). Effects of similar size can be seen throughout the tested kinematic spectra.

Except for the HLT_j45 trigger, the time cut tends to have a more pronounced effect on data than on MC events, the difference being of \(\mathcal {O}(10\%)\) in most of the phase space. This is most likely due to the simulation not accounting for events consisting exclusively of pile-up. Such events are expected to be present in non-negligible numbers for the other three triggers, which have relatively low \(p_{\textrm{T}\,\,}\)thresholds and are seeded by random events. In addition, the MC simulation does not reproduce bunch-to-bunch luminosity fluctuations, which have a more significant effect on the reconstruction of out-of-time signals [3].

Fig. 10
figure 10

Leading jet \(p_{\textrm{T}\,\,}\)spectrum in data and MC selected multi-jet events. Four triggers are compared: a HLT_j15, b HLT_j25, c HLT_j45, and d HLT_j45_L1RD0_FILLED. The error bars convey the statistical and luminosity uncertainties. Uncertainties in the cut/no-cut ratios are obtained by splitting the available sample into subsamples and recomputing the cut/no-cut ratio for each subsample. The standard deviation of the distribution of the ratio is used to estimate the ratio’s uncertainty. The luminosity uncertainty does not apply to the cut/no-cut ratio. In the plots, lines represent MC events and markers represent data, different colours and styles represent different cuts

Fig. 11
figure 11

Jet \(p_{\textrm{T}\,\,}\)spectrum in data and MC selected multi-jet events. All jets passing the minimum \(p_{\textrm{T}\,\,}\)requirement (\(p_{\textrm{T}\,\,}> 20~\textrm{GeV}\)) are shown. Four triggers are compared: a HLT_j15, b HLT_j25, c HLT_j45, and d HLT_j45_L1RD0_FILLED. The error bars convey the statistical and luminosity uncertainties. Uncertainties in the cut/no-cut ratios are obtained by splitting the available sample into subsamples and recomputing the cut/no-cut ratio for each subsample. The standard deviation of the distribution of the ratio is used to estimate the ratio’s uncertainty. The luminosity uncertainty does not apply to the cut/no-cut ratio. In the plots, lines represent MC events and markers represent data, different colours and styles represent different cuts

Fig. 12
figure 12

Jet \(|\eta |\) spectrum in data and MC selected multi-jet events. All jets passing the minimum \(p_{\textrm{T}\,\,}\)requirement (\(p_{\textrm{T}\,\,}> 20~\textrm{GeV}\)) are shown. Four triggers are compared: a HLT_j15, b HLT_j25, c HLT_j45, and d HLT_j45_L1RD0_FILLED. The error bars convey the statistical and luminosity uncertainties. Uncertainties in the cut/no-cut ratios are obtained by splitting the available sample into subsamples and recomputing the cut/no-cut ratio for each subsample. The standard deviation of the distribution of the ratio is used to estimate the ratio’s uncertainty. The luminosity uncertainty does not apply to the cut/no-cut ratio. In the plots, lines represent MC events and markers represent data, different colours and styles represent different cuts

Fig. 13
figure 13

Topo-cluster time spectrum in data and MC selected multi-jet events. Four triggers are compared: a HLT_j15, b HLT_j25, c HLT_j45, and (d) HLT_j45_L1RD0_FILLED. The error bars convey the statistical and luminosity uncertainties. Uncertainties in the cut/no-cut ratios are obtained by splitting the available sample into subsamples and recomputing the cut/no-cut ratio for each subsample. The standard deviation of the distribution of the ratio is used to estimate the ratio’s uncertainty. The luminosity uncertainty does not apply to the cut/no-cut ratio. In the plots, lines represent MC events and markers represent data, different colours and styles represent different cuts

Fig. 14
figure 14

Topo-cluster \(|\eta |\) spectrum in data and MC selected multi-jet events. Four triggers are compared: a HLT_j15, b HLT_j25, c HLT_j45, and d HLT_j45_L1RD0_FILLED. The error bars convey the statistical and luminosity uncertainties. Uncertainties in the cut/no-cut ratios are obtained by splitting the available sample into subsamples and recomputing the cut/no-cut ratio for each subsample. The standard deviation of the distribution of the ratio is used to estimate the ratio’s uncertainty. The luminosity uncertainty does not apply to the cut/no-cut ratio. In the plots, lines represent MC events and markers represent data, different colours and styles represent different cuts

Fig. 15
figure 15

Topo-cluster \(p_{\textrm{T}\,\,}\)spectrum in data and MC selected multi-jet events. Four triggers are compared: a HLT_j15, b HLT_j25, c HLT_j45, and d HLT_j45_L1RD0_FILLED. The error bars convey the statistical and luminosity uncertainties. Uncertainties in the cut/no-cut ratios are obtained by splitting the available sample into subsamples and re-omputing the cut/no-cut ratio for each subsample. The standard deviation of the distribution of the ratio is used to estimate the ratio’s uncertainty. The luminosity uncertainty does not apply to the cut/no-cut ratio. In the plots, lines represent MC events and markers represent data, different colours and styles represent different cuts

7.1 Checks for time cut inefficiencies

The time cut relies on the accurate measurement of signal timing provided by the calorimeters. The precision of the calorimeter timing measurement is guaranteed by periodic realignment and constant monitoring, as discussed in Sect. 2.1. As a further safety check, the robustness of the cut against a local time miscalibration was tested. The events selected for this check were those in which one calorimeter channel had been flagged as producing fake out-of-time signals due to cross-talk. The effect of the time cut on particles hitting the affected region was then compared with the effect of the time cut on particles hitting unaffected regions of the same subdetector.

The affected channel was located in the second layer of the LAr EM endcap calorimeter, on the C-side (along the negative z-axis). As an example, Fig. 16 shows a combination of six events in which an electron from a \(Z\rightarrow ee\) decay hit the affected area. Events were reconstructed with no cut, the Seed Extended cut and the Seed Extended plus \(X_{\textrm{UL}}=20\) cut. These events were identified first by applying a standard \(Z\rightarrow ee\) event selection. Events are required to pass photon triggers analogous to those used in diphoton resonance searches [48, 49] and to contain at least two electrons with \(p_{\textrm{T}\,\,}> 10\) GeV and \(|\eta | < 2.47\) (excluding the transition region \(1.37< |\eta | < 1.52\) between the LAr EM barrel and endcap calorimeters), satisfying the Medium identification selection and FCLoose isolation criteria [50]. The reconstructed track matched to each electron candidate must be consistent with having originated from the primary vertex: its longitudinal impact parameter \(z_0\) and transverse impact parameter \(d_0\) must satisfy \(|z_0 \cdot \sin {\theta }{|} < 0.5\) mm and \(|d_0|/\sigma _{d_0} < 5\) mm respectively. Furthermore, the dielectron pair is required to have an invariant mass \(m_{ee}\) satisfying \(68~\textrm{GeV}< m_{ee} < 108~\textrm{GeV}\) and a pseudorapidity separation \(|\Delta \eta (e_1, e_2)| > 0.1\). Electrons hitting the affected channel were then flagged by requiring that the most energetic cell in the second layer of the EM calorimeter has high enough energy (\(E_{\textrm{cell}}^{\textrm{max}}>5~\textrm{GeV}\)) and its measured time satisfies \({|}t_{\textrm{cell}}^{\textrm{max}}{|}>12.5~\textrm{ns}\). Figure 16 shows that the Seed Extended time cut does reject the energy from one spot in the centre of the topo-cluster. Despite this, the overall topo-cluster is not lost. Moreover, the addition of the Upper Limit restores most of the lost energy, so that no cell is entirely missing from the electron topo-cluster.

Fig. 16
figure 16

The \(\eta \times \phi \) distribution of cells belonging to topo-clusters in the second layer of the LAr EM endcap calorimeter, side C. The colour scale represents the recorded energy. Six events, in which \(Z\rightarrow ee\) electrons hit the affected area, are averaged. The three panels show results with a no time cut, b the Seed Extended cut and c the Seed Extended plus \(X_{\textrm{UL}}=20\) cut

To better quantify the effect, the average \(\eta \times \phi \) location of flagged \(Z\rightarrow ee\) electrons was used to define a \(3 \times 3\)-cell Test Region containing the affected channel in the C-side LAr EM endcap’s second layer. The \(\eta \times \phi \) ranges corresponding to this area are listed in Table 2. In order to compare the time cut’s effect in the Test Region with the normal time-cut behaviour, three Control Regions were also defined by inverting the sign of either \(\eta \), \(\phi \) or both in the Test Region’s definition. The Control Regions’ \(\eta \times \phi \) boundaries are also listed in Table 2.

Events from one Luminosity Block of 2018 data, during which the Test Region was known to be affected by cross-talk, were used to test the time cut’s behaviour. Rather than restricting the set of events to those passing a \(Z\rightarrow ee\) selection, a more generic event selection is applied: for each region, events are considered if at least one cell of that region has a recorded transverse momentum of \(p_{\textrm{T}\,\,}> 10~\textrm{GeV}\).

Four variables of interest were considered. The cell occupancy \(N^{\textrm{cells}}\) and the cells’ total energy \(E^{\textrm{cells}}\) are respectively defined as the number and total energy of cells in a given region that are part of a topo-cluster: they quantify whether single cells are lost due to the miscalibrated channel. The topo-cluster occupancy \(N^{\textrm{clus}}\) and topo-cluster total energy \(E^{\textrm{clus}}\) are the number and total energy of reconstructed topo-clusters that share at least one cell with a given region: they provide a way to evaluate whether entire topo-clusters are lost or their energy is significantly impacted by the time cut. In order to better quantify the effect of the time cut, the fraction \(\mathcal {F}_{\textrm{unchanged}}\) of events for which a given variable in a given region is unchanged by the cut is computed. For each event, \(N^{\textrm{cells}}\) (\(N^{\textrm{clus}}\)) is considered unchanged if the number of cells in the region (topo-clusters overlapping with the region) remains the same when applying the time cut. The total energies \(E^{\textrm{cells}}\) and \(E^{\textrm{clus}}\) are considered unchanged if the difference between the total energies computed with and without the time cut (\(E^{\textrm{cells}(\text {clus})}_{\textrm{cut}}\) and \(E^{\textrm{cells}(\text {clus})}_{\textrm{no cut}}\) respectively) satisfies:

$$\begin{aligned} |E^{\textrm{cells}(\text {clus})}_{\textrm{cut}} - E^{\textrm{cells} (\text {clus})}_{\textrm{no cut}}{|} < \sigma (E^{\textrm{cells} (\text {clus})}_{\textrm{no cut}}), \end{aligned}$$
Table 2 Definition of test and control regions
Fig. 17
figure 17

Comparison between the Test Region and three Control Regions for a the fraction of events in which the cell occupancy \(N^{\textrm{cells}}\) is left unchanged by the Seed Extended cut, b the fraction of events in which \(N^{\textrm{cells}}\) is left unchanged by the Seed Extended plus \(X_{\textrm{UL}}=20\) cut, c the fraction of events in which the cells’ total energy \(E^{\textrm{cells}}\) is left unchanged by the Seed Extended cut, and (d) the fraction of events in which \(E^{\textrm{cells}}\) is left unchanged by the Seed Extended plus \(X_{\textrm{UL}}=20\) cut. The cell occupancy (cells’ total energy) in a given region is defined as the number (energy sum) of cells in the region that are included in a topo-cluster. Uncertainties on the event fractions are computed as the standard deviation of a binomial distribution. The dashed line represents the average of the three Control Regions

Fig. 18
figure 18

Comparison between the Test Region and three Control Regions for a the fraction of events in which the topo-cluster occupancy \(N^{\textrm{clus}}\) is left unchanged by the Seed Extended cut, b the fraction of events in which \(N^{\textrm{clus}}\) is left unchanged by the Seed Extended plus \(X_{\textrm{UL}}=20\) cut, c the fraction of events in which the topo-cluster total energy \(E^{\textrm{clus}}\) is left unchanged by the Seed Extended cut, and d the fraction of events in which \(E^{\textrm{clus}}\) is left unchanged by the Seed Extended plus \(X_{\textrm{UL}}=20\) cut. The topo-cluster occupancy (total energy) in a given region is defined as the number (energy sum) of topo-clusters that contain at least one cell belonging to the region. Uncertainties on the event fractions are computed as the standard deviation of a binomial distribution. The dashed line represents the average of the three Control Regions

where \(\sigma (E^{\textrm{cells}(\text {clus})}_{\textrm{no cut}})\) is the expected calorimeter resolution computed for either the cells’ total energy (\(E^{\textrm{cells}}\)) or total cluster energy (\(E^{\textrm{clus}}\)) without any cut. The resolution is taken from Ref. [1] to be:

$$\begin{aligned} \frac{\sigma (E^{\textrm{cells}(\text {clus})}_{\textrm{no cut}})}{E^{\textrm{cells}(\text {clus})}_{\textrm{no cut}}} = \frac{10\%}{\sqrt{|E^{\textrm{cells}(\text {clus})}_{\textrm{no cut}}{|}}} \oplus 0.7\%. \end{aligned}$$

Figures 17 and 18 show \(\mathcal {F}_{\textrm{unchanged}}\) for the four variables of interest. Figure 17 is indicative of the time cut’s impact on the cell content for clusters in the Test Region compared to that for clusters in the Control Regions. The fraction of events with unchanged \(N^{\textrm{cells}}\) is shown in Fig. 17a, b: for the Seed Extended cut, \(\mathcal {F}_{\textrm{unchanged}}\) in the Test Region is smaller than the average of the control values by slightly more than \(1\sigma \). Even though the phenomenon has a small significance due to the limited sample size, the effect of the time cut can be seen to disappear when the same events are processed with the Seed Extended plus \(X_{\textrm{UL}}=20\) cut, confirming that cell losses are cured by the Upper Limit. The same behaviour is visible for \(E^{\textrm{cells}}\) (Fig. 17c, d), indicating that the loss of a cell, when it occurs, causes a significant energy variation, while no decrease in \(E^{\textrm{cells}}\) can be seen in the case of the Seed Extended plus \(X_{\textrm{UL}}=20\) cut. Figure 18 shows the fraction of events in which the topo-cluster occupancy and total energy are unchanged, thus investigating the effect of the time cut on topo-clusters as a whole, when they overlap the Test or Control Regions. The topo-cluster occupancy (Fig. 18a, b) remains consistent with the control values even for the Seed Extended cut, confirming that the signal topo-cluster is not entirely lost. The topo-cluster total energy \(E^{\textrm{clus}}\) is impacted by the Seed Extended cut (Fig. 18c), reflecting the loss of a cell in a way similar to that causing losses in the cells’ total energy, while also being restored to its original value by applying the Upper Limit (Fig. 18d).

Table 3 Collection size changes for data events on disk with the introduction of the time cut. Changes of \(\mathcal {O}(100~\textrm{b})\) are not significant for most categories since they can arise from changes in the alignment of the (otherwise unchanged) objects and the subsequent compression when writing to disk. Larger size changes are reported relative to the original size on disk. The “Cells in topo-clusters” line is limited to clusters matched to tracks, electrons and muons
Table 4 Collection size changes for MC events on disk with the introduction of the time cut. Changes of \(\mathcal {O}(100~\textrm{b})\) are not significant for most categories since they can arise from changes in the alignment of the (otherwise unchanged) objects and the subsequent compression when writing to disk. The “Cells in topo-clusters” line is limited to clusters matched to tracks, electrons and muons

7.2 Impact on the ATLAS event size

Calorimeter topo-clusters are used as input to the reconstruction of particles other than jets, most notably electrons, photons and \(\tau \)-leptons. The time cut will then have pile-up-suppressing effects on these particles as well. A complete discussion of the effects of the time cut on electrons, photons and \(\tau \)-leptons is beyond the scope of this paper. However, the widespread usage of topo-clusters in the ATLAS event reconstruction implies that the time cut can have a large beneficial effect on the overall consumption of computing resources by ATLAS data and MC samples.

The change in the ATLAS event size due to the time cut was evaluated for the introduction of the Seed Extended plus \(X_{\textrm{UL}}=20\) time cut into the default ATLAS reconstruction at the beginning of Run 3, before the start of 2023 data-taking. The test is performed by reconstructing one data sample and one MC sample (\(t\overline{t}\) production in the fully hadronic decay) before and after the introduction of the time cut and later comparing the average event sizes on disk. In both samples the detector conditions are those of Run 3. Results are shown in Tables 3 and 4. The principal particle collections are shown, together with their relative size change. As expected, the largest effects are observed for the particle-flow object and topo-cluster collections, which are reduced in size by \({\sim }16\%\) to \({\sim }18\%\), while smaller differences are present for electrons/photons and \(\tau \)-leptons.Footnote 7 The overall event size is reduced by about \(6\%\) in data and \(7\%\) in MC simulation.

8 Conclusion

This paper presents the design and evaluation of a cell-level time criterion used in the ATLAS topo-cluster reconstruction algorithm. This criterion (the “time cut”) removes cells compatible with out-of-time pile-up signals. A second requirement, the Upper Limit, is imposed to prevent cells with a large amount of energy from being removed, including those expected from long-lived particles. The impact of the time cut on the reconstruction of hadronic signals was studied at both cluster level and jet level, using a combination of MC samples and ATLAS data recorded in 2017 and 2018.

The time cut was found to significantly reduce the contribution from out-of-time pile-up while not hindering signal reconstruction. Studies of MC events indicate that the multiplicity of out-of-time pile-up jets is reduced by \({\sim }50\%\) at \(p_{\textrm{T}\,\,}\sim 20~\textrm{GeV}\) and by \({\sim }80\%\) above \(p_{\textrm{T}\,\,}\sim 50~\textrm{GeV}\), across the rapidity region \(|y|\lesssim 3.5\). Studies of data events found that the time cut has negligible effect on events accepted by a single-jet trigger, requiring one jet with \(p_{\textrm{T}\,\,}> 45~\textrm{GeV}\) and seeded by a single jet at L1. On the other hand, the time cut does affect events enriched in out-of-time pile-up, typically those accepted by a random L1 trigger. In this case, the effect of the time cut is largest for \(p_{\textrm{T}\,\,}\lesssim 30\) GeV and it is found to have a stronger effect on data events than on MC events, typical differences being of \(\mathcal {O}(10\%)\). The effect is believed to be caused by the absence of pile-up-only events in the MC simulation. Additional tests for signal inefficiency in data were carried out and show that the time cut would not cause a loss of clusters in the case of a calorimeter channel miscalibration. The addition of the Upper Limit is also found to prevent the loss of individual cells within clusters in such a case.

The time cut was also found to impact the jet calibration, a repetition of the pile-up correction step being necessary. The time cut provides percent-level improvement in the jet resolution at \(p_{t} \lesssim 30~\textrm{GeV}\). More extensive investigation is needed to evaluate in detail the effect of the time cut on the jet energy resolution and jet energy scale uncertainty, e.g. the pile-up-related JES uncertainty components. The full calibration chain will be re-evaluated after the introduction of the time cut into the ATLAS reconstruction software.

The Seed Extended plus \(X_{\textrm{UL}}=20\) cut was found to be the best option among those studied. This cut was adopted as the default for offline topo-cluster reconstruction in ATLAS Run 3 data. The introduction of the Seed Extended plus \(X_{\textrm{UL}}=20\) cut into the ATLAS offline reconstruction is also found to reduce the ATLAS event size on disk by \(6\%\) in data and by \(7\%\) in MC simulation. Further studies are necessary before the cut can be introduced at trigger level.