1 Introduction

Both existing high-energy physics experiments, such as those at the CERN LHC, and experiments at future colliders, like the Future Circular Collider (FCC), rely heavily on the performance of hadron calorimeters and their particle flow capabilities for measuring jet and missing transverse momentum (\(p_{T}\)) [1,2,3,4,5,6,7,8,9]. Hadron calorimeters are currently characterized not only in terms of their intrinsic energy resolution, but also by their imaging capabilities, which allow for offline corrections using smart algorithms. Due to the diverse composition of hadronic showers and the resulting differences in the calorimeter response, a correct energy measurement becomes challenging. In general, the components of hadronic showers can be divided into electromagnetic (EM) and hadronic parts. The hadronic part of the shower contains particles, such as neutrinos and neutrons, which are partially invisible to the detector. This can be affected by the chosen active detector material, where, e.g., plastic scintillators allow for neutron detection via the strong interaction with the atomic nucleus. The undetectable particles in the hadronic shower result in an unequal detector response; that is, \(e/h\ne 1\), where e and h are the calorimeter responses to the electromagnetic and hadronic shower fractions, respectively.

Many hadronic calorimeters currently in use and planned for future experiments are sampling calorimeters, which consist of alternating active and passive absorber layers [10,11,12,13]. The sampling of the hadronic shower allows for tuning of the hadronic and electromagnetic shower responses. In the past, the e/h ratio has been adjusted closer to 1 either by suppressing the electromagnetic response, e.g., by using high-Z absorbers, or by enhancing the hadronic response, e.g., by using neutron-sensitive active materials. Calorimeters with a ratio \(e/h\sim 1\) are called “compensating” calorimeters. These optimizations of the active and passive materials often require a decreased sampling fraction (ratio of active to passive material), which itself degrades the calorimeter energy resolution by increasing the stochastic term \(\alpha \) of

$$\begin{aligned} \frac{\sigma _{E}}{\langle E\rangle }=\frac{\alpha }{\sqrt{E}} \oplus c. \end{aligned}$$
(1)

For sampling calorimeters, the stochastic term is dominated by the sampling fraction (per layer) and the sampling frequency (the number of layers); it expresses the dependence of the calorimeter resolution on the fluctuations of the number of particles within the hadronic shower (which follow a Poisson distribution). The constant term c expresses uncertainties that scale linearly with energy, such as energy losses due to particles escaping the detector, caused by the limited calorimeter size. The fluctuations of the EM-to-hadronic shower fraction increase logarithmically with energy and can thus contribute to both terms. This contribution can be removed either by intrinsic compensation, or by an event-by-event measurement of the EM fraction, which is called software compensation.
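As an illustration of how the two terms of Eq. (1) are disentangled in practice, the following sketch fits the parametrisation to a set of resolution points with SciPy; the numerical values are placeholders for illustration only, not results of this paper.

```python
import numpy as np
from scipy.optimize import curve_fit

def resolution(E, alpha, c):
    # Eq. (1): quadrature sum of a stochastic and a constant term
    return np.sqrt((alpha / np.sqrt(E)) ** 2 + c ** 2)

# placeholder resolution measurements (sigma_E / <E>) at a few pion energies
E = np.array([5.0, 15.0, 25.0, 45.0, 65.0, 85.0, 105.0])
res = np.array([0.063, 0.037, 0.029, 0.023, 0.020, 0.018, 0.017])

(alpha, c), _ = curve_fit(resolution, E, res, p0=[0.1, 0.01])
print(f"stochastic term = {alpha:.3f}, constant term = {c:.3f}")
```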

Due to their cost and mechanical stability benefits, absorbers made of steel or lead are widely used. These materials have been found to require very small sampling fractions in, e.g., scintillator-steel calorimeters in order to achieve compensating behavior. Since such low sampling fractions would degrade the performance, especially for particles at low energies (\(<50\) GeV), a solution is to correct for fluctuations in the electromagnetic shower fraction with software compensation techniques.

The granularity of the calorimeter plays a key role in allowing algorithms to distinguish between the dense electromagnetic shower core and other shower parts, e.g., disappearing tracks. The first attempt at so-called imaging calorimetry was made by the CALICE collaboration, which started an R&D program of calorimeters for a future e\(^{-}\)e\(^{+}\) linear collider [14, 15], where the calorimeter designs have been optimised for particle flow algorithms [5]. These algorithms allow for jet energy measurements using the best-suited sub-detector to reconstruct each jet particle. The prototypes of these calorimeters have been realised with active layers made of silicon for the EM shower part and scintillator or resistive plate chambers for the measurement of hadronic showers. The active layers were tested interleaved within both steel and tungsten absorber stacks [16, 17] and achieved such good results in test beams [18] that the CMS Collaboration decided to adopt this concept in a full silicon-tungsten/scintillator-steel endcap calorimeter [12, 19]. Developments in, e.g., silicon photomultiplier (SiPM) technologies have been key to measuring the scintillation light produced in calorimeter cells as small as \(3\times 3\times 0.5\) cm\(^{3}\) [20]. The impact of software compensation techniques on the performance of particle flow algorithms has been studied for a specific detector design [9], and shown to provide a significant improvement to the jet energy measurement by using a corrected calorimeter cluster which is matched to tracks in the tracking system.

The next step towards a calorimeter design optimized for the use of software compensation techniques is to study the granularity needed for an algorithm to determine the hadronic shower energy most accurately.

In this paper, we discuss the performance of a software compensation technique using a deep neural network (DNN), with a specific focus on the dependence on the transverse and longitudinal granularity. For the purpose of this study, we consider a homogeneous, idealized calorimeter simulated with Geant4 [21]. The performance is evaluated in terms of energy resolution and linearity for single charged pions. The resolution of particle-flow algorithms is also limited by the accuracy of the association between charged particle tracks in the tracker and energy depositions in the calorimeter. In this context it has been shown that DNNs can provide a new avenue for particle flow in general [22, 23]; this, however, is beyond the scope of this paper. Here, we show how a DNN can be utilised to approximate a generic close-to-optimal reconstruction algorithm that can be optimised to the granularity in an automated fashion. This can help pave the way towards a more ambitious global optimisation of detector design parameters as suggested, e.g., in Ref. [24].

Table 1 Granularity configurations considered in this analysis

2 Calorimeter and dataset

The studied calorimeter is a homogeneous lead tungstate calorimeter, which follows the EM calorimeter concept of the CMS experiment [25]. In order to focus this study on the capability of the DNN to correct e/h fluctuations, we do not consider any passive absorber material in the following. However, the impact of sampling fluctuations has been tested, and the results are summarized in Appendix A for the example of a full PbWO\(_4\) calorimeter with a passive layer fraction of 95%. Qualitatively, the sampling calorimeter shows similar behavior to the homogeneous calorimeter studied here, but at present the effect of the sampling term in the resolution is not sufficiently well measured to form any precise conclusions.

The calorimeter dimensions are \(1\times 1\times 2.5\) m\(^3\), which ensures complete shower containment within the calorimeter volume and corresponds to \(10.4\,\lambda \) and \(280\,X_{0}\) of total depth. The transverse segmentation is varied from no segmentation up to \(30\times 30\) segments in x and y (designated stages A–F), and the longitudinal segmentation from 1 to 60 layers (designated stages 0–7). A list of the configurations can be found in Table 1.
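The per-layer thickness implied by a given longitudinal segmentation follows directly from the quoted total depth; a short check (rounded values) reads:

```python
# per-layer thickness implied by a total depth of 10.4 lambda / 280 X0
total_lambda, total_x0 = 10.4, 280.0
for n_layers in (1, 15, 20, 60):
    print(f"{n_layers:2d} layers: {total_lambda / n_layers:5.2f} lambda, "
          f"{total_x0 / n_layers:6.1f} X0 per layer")
# 60 layers correspond to ~0.17 lambda and ~4.7 X0 per layer,
# the finest longitudinal segmentation considered here
```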

The data set consists of approximately \(5\times 10^6\) charged pion events, generated using the FTFP_BERT physics list of Geant4 [21] 10.04 patch 0. The training data set comprises pions with energies sampled from a flat distribution between 1 and 110 GeV. The test data set covers 11 discrete energies from 5 to 105 GeV in 10 GeV steps. In both cases, the pions are shot at the calorimeter center at normal incidence. The training data set covers a slightly larger energy range to suppress bias effects caused by a difference between the mean and the expectation value of the reconstructed energy at the edges during training. The Geant4 simulation has been performed at the highest granularity, and the same dataset has been used for the training and tests of the different segmentation configurations. For this purpose, the energy deposits (the sum of total energy losses over the time of the event) in the cells have been merged according to the tested cell sizes. This method avoids inconsistencies that would otherwise be expected due to the different numbers of surfaces and material borders through which Geant4 propagates the particles.
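The merging of fine cells into coarser ones amounts to a simple re-binning of the deposited energy. A minimal sketch, assuming the shower is stored as a 3D NumPy array at the finest granularity and that the fine grid divides evenly into the coarser one, could look as follows (function and variable names are illustrative):

```python
import numpy as np

def merge_cells(e_fine, n_x, n_y, n_z):
    """Re-bin energy deposits from the finest granularity into a coarser
    segmentation by summing the contained cells."""
    fx, fy, fz = e_fine.shape
    return (e_fine
            .reshape(n_x, fx // n_x, n_y, fy // n_y, n_z, fz // n_z)
            .sum(axis=(1, 3, 5)))

# example: merge a 30 x 30 x 60 shower into 3 x 3 x 15 cells
e_fine = np.random.rand(30, 30, 60)   # stand-in for one simulated shower
e_coarse = merge_cells(e_fine, 3, 3, 15)
```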

3 Neural network architecture and training

At the core of the neural network architecture used here is a software compensation block that uses convolutional neural network (CNN) layers [26] to achieve local identification of the subshowers. Due to the regular grid-like structure of the calorimeter, graph neural networks, such as those discussed, e.g., in Ref. [27], are not necessary. The architecture chosen here is similar to the one introduced in Ref. [13], which is used as a subblock in the overall model. This subblock consists of 3 parallel paths: in the first path, the energy of all cells within the kernel range K is summed up and forwarded to the next block, while this kernel is moved with a stride of size K; the second path consists of a CNN layer with the same kernel size and \(F=16\) filters; and the third path contains in total three subsequent CNN layers, of which the first two have kernel sizes (in x, y, and depth) of \(K_a = (1,\ k,\ 3)\) and \(K_b=(k,\ 1,\ 3)\), with no stride and 32 filters each. Here, k is an adjustable parameter depending on the granularity, as described later. The final layer of this path is a CNN layer with a kernel size of K, a stride of K, and F filters, such that the output of all paths can be combined. This combination is done by adding the outputs of all paths feature by feature. All layers use a tanh activation function. The weights of the layers in the third path are initialised with a Gaussian distribution centred at 0 with a width of \(10^{-3}\), and receive a small L2 regularisation of \(10^{-5}\). This structure is optimised to derive small corrections to the simple energy sum by detecting the different shower shape of electromagnetic subshowers.
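To make this structure concrete, a minimal Keras sketch of such a three-path subblock is given below. The treatment of the per-cell energy feature, the depth component of the kernel K, and the broadcasting of the summed-energy path across the F filters are assumptions of this sketch, not the exact implementation used for the results in this paper.

```python
import tensorflow as tf
from tensorflow.keras import layers

def compensation_subblock(x, K=(2, 2, 2), F=16, k=3):
    """Sketch of a three-path software-compensation subblock.

    x: tensor of shape (batch, nx, ny, nz, features); feature 0 is assumed
    to be the per-cell energy.
    """
    # Path 1: sum the cell energies within the kernel range, moved with stride K
    energy = layers.Lambda(lambda t: t[..., :1])(x)
    e_sum = layers.AveragePooling3D(pool_size=K, strides=K, padding='same')(energy)
    e_sum = layers.Lambda(lambda t: t * float(K[0] * K[1] * K[2]))(e_sum)

    # Path 2: a single CNN layer with the same kernel size and F filters
    p2 = layers.Conv3D(F, kernel_size=K, strides=K, padding='same',
                       activation='tanh')(x)

    # Path 3: two CNN layers with kernels (1, k, 3) and (k, 1, 3), no stride,
    # followed by a CNN layer with kernel K and stride K to match the other paths
    init = tf.keras.initializers.RandomNormal(mean=0.0, stddev=1e-3)
    reg = tf.keras.regularizers.l2(1e-5)
    p3 = layers.Conv3D(32, kernel_size=(1, k, 3), padding='same', activation='tanh',
                       kernel_initializer=init, kernel_regularizer=reg)(x)
    p3 = layers.Conv3D(32, kernel_size=(k, 1, 3), padding='same', activation='tanh',
                       kernel_initializer=init, kernel_regularizer=reg)(p3)
    p3 = layers.Conv3D(F, kernel_size=K, strides=K, padding='same', activation='tanh',
                       kernel_initializer=init, kernel_regularizer=reg)(p3)

    # combine the paths feature by feature; the summed energy (one feature)
    # is repeated across the F filters so that the shapes match
    e_rep = layers.Lambda(lambda t: tf.tile(t, [1, 1, 1, 1, F]))(e_sum)
    return layers.Add()([e_rep, p2, p3])
```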

In the final model, the input is passed through a batch normalisation layer [28], normalising all inputs except for the per-cell energy. If fewer than 6 calorimeter layers are present or the transverse granularity in either direction is less than 6, the input is directly flattened and passed to 3 dense layers: the first two contain 128 and 64 nodes and use ELU activation [29], and the final energy prediction layer contains 1 node. In all other cases, the input is first passed through a set of the subblocks described above before being fed through the same dense-layer structure. These subblocks adapt to the input: if the corresponding granularity is less than \(6\times 6\) cells in the transverse directions, a stride of \(1\times 1\) is used, and the input k for the kernel size determination is set to \(k=1\). Otherwise, a stride of \(2\times 2\) and \(k=3\) are used in these directions. The subblock is repeated until the dimensionality in x, y, or depth is less than or equal to 6. At this point, the output is fed to the three final dense layers.
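The assembly logic can then be sketched as follows; the depth-wise stride of the subblock and the simplified batch normalisation are assumptions of this sketch, and the helper name build_model is illustrative.

```python
def build_model(nx, ny, nz, n_features=1):
    """Sketch of the model assembly described above."""
    inp = layers.Input(shape=(nx, ny, nz, n_features))
    # the paper normalises all inputs except the per-cell energy;
    # a plain batch normalisation is used here for simplicity
    x = layers.BatchNormalization()(inp)

    if nz >= 6 and min(nx, ny) >= 6:
        dx, dy, dz = nx, ny, nz
        # repeat the subblock until the size in x, y, or depth is <= 6
        while dx > 6 and dy > 6 and dz > 6:
            # stride and k adapt to the current transverse granularity
            s, k = (2, 3) if min(dx, dy) >= 6 else (1, 1)
            x = compensation_subblock(x, K=(s, s, 2), F=16, k=k)
            # ceiling division, matching padding='same'
            dx, dy, dz = -(-dx // s), -(-dy // s), -(-dz // 2)

    x = layers.Flatten()(x)
    x = layers.Dense(128, activation='elu')(x)
    x = layers.Dense(64, activation='elu')(x)
    out = layers.Dense(1)(x)  # predicted particle energy
    return tf.keras.Model(inp, out)
```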

The model is trained with the Adam optimiser [30] using TensorFlow [31] and Keras [32] within the DeepJetCore framework [33]. The training consists of five steps: the first four steps use a loss function \(L_\mathrm {calo}\) that follows the expected calorimeter resolution:

$$\begin{aligned} L_\mathrm {calo} = \frac{(E_\mathrm {true} - E_\mathrm {pred})^2}{E_\mathrm {true}} \mathrm {,} \end{aligned}$$
(2)

where \(E_{\mathrm {pred}}\) is the energy of the particle predicted by the DNN and \(E_{\mathrm {true}}\) is its true energy. These steps are trained for 1, 19, 60, and 20 epochs with learning rates of \(10^{-4}\), \(10^{-4}\), \(10^{-5}\), and \(10^{-5}\), and batch sizes of 256, 512, 1280, and 1280, respectively. Between the third and fourth step, the batch normalisation is frozen.
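A minimal sketch of the loss of Eq. (2) and of this staged schedule is shown below, assuming a compiled Keras model and training arrays x_train, y_train (placeholder names):

```python
import tensorflow as tf

def calo_loss(e_true, e_pred):
    """Eq. (2): squared error scaled by the true energy."""
    return tf.reduce_mean(tf.square(e_true - e_pred) / e_true)

# (epochs, learning rate, batch size) for the four initial training steps
stages = [(1, 1e-4, 256), (19, 1e-4, 512), (60, 1e-5, 1280), (20, 1e-5, 1280)]
for step, (epochs, lr, batch_size) in enumerate(stages):
    if step == 3:
        # freeze the batch normalisation between the third and fourth step
        for layer in model.layers:
            if isinstance(layer, tf.keras.layers.BatchNormalization):
                layer.trainable = False
    model.compile(optimizer=tf.keras.optimizers.Adam(learning_rate=lr),
                  loss=calo_loss)
    model.fit(x_train, y_train, epochs=epochs, batch_size=batch_size)
```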

The mean and the expectation value of \(E_\mathrm {true}\) differ at the edges of the training sample. This typically leads to edge effects, which introduce a bias towards higher predicted values at the low edge, and towards lower predicted values at the high edge. To mitigate this effect, we freeze all layers except for the last dense layers, and introduce a \(\chi ^2\)-like loss that takes the difference between the average predicted and true energy in bins of \(E_\mathrm {true}\), and accounts for the number of samples in each bin. The bin boundaries are chosen randomly for each batch to avoid a global bias. Using this loss, the model is trained for another 50 epochs with a learning rate of \(10^{-5}\) and a batch size of 1280.
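This bias-correction step can be sketched with a loss of the following form; the exact binning scheme and normalisation below are our assumptions rather than the implementation used for the results:

```python
import tensorflow as tf

def binned_bias_loss(e_true, e_pred, n_bins=10, e_min=1.0, e_max=110.0):
    """Chi2-like comparison of the mean predicted and true energy in
    randomly placed bins of E_true, weighted by the bin occupancy."""
    # random bin boundaries, redrawn for every batch
    edges = tf.sort(tf.random.uniform([n_bins - 1], e_min, e_max))
    edges = tf.concat([[e_min], edges, [e_max]], axis=0)
    loss = 0.0
    for i in range(n_bins):
        in_bin = tf.cast((e_true >= edges[i]) & (e_true < edges[i + 1]),
                         tf.float32)
        n = tf.reduce_sum(in_bin)
        mean_true = tf.reduce_sum(in_bin * e_true) / (n + 1e-6)
        mean_pred = tf.reduce_sum(in_bin * e_pred) / (n + 1e-6)
        loss += n * tf.square(mean_pred - mean_true) / (mean_true + 1e-6)
    return loss
```

Redrawing the bin boundaries for every batch prevents the network from adapting to a fixed binning and thus from reintroducing a structured bias.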

4 Results

The energy resolution is evaluated as the ratio of the width to the most probable value of the distribution of the reconstructed energy. These distributions, as shown for example in Fig. 1a, follow a Gaussian function, and the standard deviation can thus be extracted from a fit. The fit is restricted to a range of 2\(\sigma \) around the most probable value \(\mu \), following the procedure widely used in calorimeter performance studies. As a comparison and validation, the energy resolution has also been evaluated from the root mean square (RMS) and mean, which is more sensitive to the tails of the distribution. The energy resolution over the full available energy range is shown in Fig. 1b for stage 4, which corresponds to a granularity of 15 longitudinal layers. The points are fitted following Eq. (1), and the values of the stochastic and constant terms are shown in the legend. The constant term is set to 0 if the fitted value deviates from 0 by less than its uncertainty. An overall 10–20% degradation in energy resolution from the Gaussian fit to the RMS method is observed. In the following, the energy resolutions obtained for different granularities refer to the results obtained from the Gaussian fit.
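The width extraction from a fit restricted to 2\(\sigma \) around the peak can be sketched as a simple iterative procedure; the binning and starting values below are illustrative:

```python
import numpy as np
from scipy.optimize import curve_fit

def gauss(x, A, mu, sigma):
    return A * np.exp(-0.5 * ((x - mu) / sigma) ** 2)

def fit_resolution(e_reco, n_iter=3):
    """Fit a Gaussian to the reconstructed-energy distribution, restricting
    the fit range to within 2 sigma around the peak and iterating."""
    counts, edges = np.histogram(e_reco, bins=100)
    centers = 0.5 * (edges[:-1] + edges[1:])
    mu, sigma = np.mean(e_reco), np.std(e_reco)
    for _ in range(n_iter):
        sel = np.abs(centers - mu) < 2 * sigma
        (A, mu, sigma), _ = curve_fit(gauss, centers[sel], counts[sel],
                                      p0=[counts.max(), mu, sigma])
        sigma = abs(sigma)
    return mu, sigma  # resolution = sigma / mu
```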

Fig. 1 Results for a scenario with 15 longitudinal layers (stage 4) and no transverse segmentation (stage A). a Energy distribution for 45 GeV pions. The width as computed from the Gaussian fit (black line) and from the RMS are shown. b Energy resolution as a function of the particle energy. The resolution is computed two ways, using the Gaussian fit (open circles) and using the RMS (filled squares)

Table 2 Summary of the energy resolution fit results. The top set shows the different longitudinal segmentation scenarios with no transverse segmentation, while the other two sets show two specific longitudinal stages with different transverse segmentation scenarios, as described in Table 1. In the last row, the result of a simple energy sum is given as a reference

The results, in terms of the stochastic term \(\alpha \) and constant term c for all studied longitudinal and transverse granularities, are summarized in Table 2. The theory of the different contributions to the energy resolution of hadronic showers [34] considers the stochastic term to be in fact a quadratic sum of two major effects, \(\alpha = \alpha _{\mathrm {int}}\oplus \alpha _{\mathrm {sampl}}\), where the first, intrinsic term is irreducible and determined by the fluctuations of the fraction of the initial energy that is transformed into ionising shower particles, and the second term is due to the sampling fraction. These intrinsic fluctuations are material dependent, due to material-dependent nuclear binding energy losses, and have been found to be of the order of 19% and 11% in the ZEUS uranium- and lead-scintillator calorimeter prototypes, respectively [35].

We assume that the DNN is able to identify and re-weight the electromagnetic and hadronic shower fractions, due to the topological differences of EM and hadronic subshowers (\(\lambda _{\pi }/X_{0}\sim 27\)). Thus, we expect the stochastic and constant terms to improve with respect to an energy measurement based on a simple sum over calorimeter cells, and to decrease with increased granularity. Table 2 shows the resulting measured stochastic and constant terms (using both the Gaussian fit and the RMS to obtain the resolution) for three different sets of scenarios: first, the different longitudinal granularities with no transverse segmentation, the results for which are plotted in Fig. 2; second, longitudinal stage 0 with different transverse granularities (Fig. 3); and third, longitudinal stage 5 with different transverse granularities (Fig. 4). For reference, we also compare the results obtained with the DNN to a simple energy sum over all energy deposits in the calorimeter cells. The latter does not include any further energy calibration, which is visible as a significant deviation from unity in the linearity (compare Fig. 2b). This could, however, easily be recovered with standard methods of energy calibration. The observed increase in the response with energy corresponds to the increasing EM fraction within the hadronic shower. Overall, at the finest granularities, we observe that the constant term goes to zero, while the stochastic term decreases by approximately 50% with respect to the scenario with no segmentation, reaching a minimum of 8%, which can be considered as an upper limit on the intrinsic stochastic term \(\alpha _{\mathrm {int}}\). The difference between the parameters obtained from the RMS and from the Gaussian fit is indicative of a contribution from moderately pronounced tails, which are also visible in Fig. 1.

The constant term is consistently removed as soon as the first transverse segmentation into \(3\times 3\) cells is implemented. Figure 5 shows event displays of 35 GeV pion showers; the bottom panel illustrates the impact of a \(3\times 3\) transverse segmentation. Already at this stage, a significant energy fraction of about 12% (shown as \(\langle E_{\mathrm {out}}/E_{\mathrm {tot}}\rangle \) in the legend) is found in the outer cells. In comparison, a single shower is shown in 3D in the top panel, which visualises the imaging power of the finest chosen granularity of the homogeneous PbWO\(_4\) calorimeter.

Fig. 2 Energy resolution (a) and linearity (b) for different longitudinal granularities and no transverse segmentation, compared to a simple energy sum. The curves correspond to the fit with Eq. (1)

Fig. 3 Energy resolution (a) and linearity (b) for different transverse granularities with 1 longitudinal layer (stage 0). The curves show the fit to the form given in Eq. (1)

Fig. 4 Energy resolution (a) and linearity (b) for different transverse granularities with 20 longitudinal layers (stage 5). The curves show the fit to the form given in Eq. (1)

Fig. 5 a A 3D view of a 35 GeV pion shower in the homogeneous PbWO\(_4\) calorimeter at the finest granularity. The color code as well as the box sizes correspond to the amount of energy recorded in the calorimeter cells. b A front view of the average pion shower at 35 GeV over 2646 events, with a grid overlaid corresponding to the coarsest applied transverse segmentation and an original binning corresponding to the finest segmentation

Figure 6 summarizes the energy resolution as a function of longitudinal and transverse granularity. We observe that the behavior of the resolution as a function of granularity exhibits the same pattern regardless of the incident particle energy. For the transverse granularity, the resolution reaches an optimal value at a cell size of \(\approx 1\lambda _{\pi }\), and finer segmentation does not yield any appreciable further benefit. In the longitudinal direction, the energy resolution continues to improve as the layer size is decreased, reaching the minimum at the finest granularity considered (\(\approx 0.2\lambda _{\pi }\) or \(\approx 5 X_0\)).

Figure 7 summarizes the fitted parameters \(\alpha \) and c of the energy resolution function in Eq. (1) as a function of the longitudinal and transverse granularity. In the transverse direction, we observe that the constant term goes to zero at a cell size of 1.4 \(\lambda _{\pi }\) (\(37\,X_0\)), and a further decrease in the cell size does not further improve the stochastic term \(\alpha \). In the longitudinal direction, a layer width smaller than 10 \(X_0\) results in only a minor improvement of about 10%, which suggests that layer widths of about 10 \(X_0\) could offer a good balance between the obtained resolution and the detector complexity.

Fig. 6 Energy resolution as a function of the longitudinal (a) and transverse (b) granularity. Two different particle energies are considered: 15 GeV (black circles) and 85 GeV (red squares). In panel (a), no transverse segmentation is used, while in panel (b), two different longitudinal segmentations are shown: 1 layer (dashed lines) and 20 layers (solid lines)

Fig. 7 Values of the parameters \(\alpha \) (black line) and c (blue line) in the energy resolution function \(\frac{\sigma _{E}}{\langle E\rangle }=\frac{\alpha }{\sqrt{E}} \oplus c\) as a function of the a longitudinal and b transverse granularity. In panel (a), no transverse segmentation is used, while in panel (b), two different longitudinal segmentations are shown: 1 layer (dashed lines) and 20 layers (solid lines)

5 Conclusions

When calorimeters are designed for new high-energy physics experiments, the approach has often been to pick a technology before optimising the reconstruction of jet particles. From the perspective of testing various options, this not only requires significant computing power due to the details of signal processing (digitisation) introduced in the simulations, but also means that the simulations are unable to answer basic questions due to the high complexity. For example, a smaller cell size improves the spatial and pointing resolution, which should help the particle-flow algorithm to reconstruct the jet. However, the signal height per cell decreases, which can introduce an energy loss due to a lower signal-to-noise ratio. Thus, a high-level optimisation becomes blind to the individual impact of each effect. Instead, a different approach could be to first identify the inputs that allow reconstruction algorithms to reach optimal performance, before selecting the detector technology.

Moving towards that approach, we have used a model calorimeter to show how DNNs can be used to study the effect of the cell granularity on the hadronic energy reconstruction, without the need for manual optimisation of the algorithm for each granularity choice. In this model, the impact of the sampling fraction has been intentionally excluded. Even though we are aware that the choice of active and passive materials will impact the shower development, we believe that this study can pave the way towards a more global optimisation of calorimeter designs exploiting the versatility of DNN-based reconstruction algorithms.

For this particular detector setup (with \(\lambda _{\pi }/X_{0}\sim 27\)), we conclude that cell sizes of at most one nuclear interaction length and longitudinal layers of 5–10 \(X_{0}\) thickness are needed in order to optimize the software compensation, obtain an e/h response close to 1, and approach the intrinsic stochastic term of 8%. Following this approach, one could imagine further studies to determine the optimal cell and layer sizes as a function of the \(\lambda _{\pi }/X_{0}\) ratio; however, this exceeds the scope of this paper.