Electron-hadron shower discrimination in a liquid argon time projection chamber

Back, J. J.; Barker, G. J.; Bennieston, A. J.; Boyd, S. B.; Morgan, B.; Ramachers, Y. A.

doi:10.1140/epjc/s10052-013-2369-y

Electron-hadron shower discrimination in a liquid argon time projection chamber

Special Article - Tools for Experiment and Theory
Open access
Published: 21 March 2013

Volume 73, article number 2369, (2013)
Cite this article

Download PDF

You have full access to this open access article

The European Physical Journal C Aims and scope Submit manuscript

Electron-hadron shower discrimination in a liquid argon time projection chamber

Download PDF

J. J. Back¹,
G. J. Barker¹,
A. J. Bennieston¹,
S. B. Boyd¹,
B. Morgan¹ &
…
Y. A. Ramachers¹

1384 Accesses
2 Citations
Explore all metrics

Abstract

By exploiting structural differences between electromagnetic and hadronic showers in a multivariate analysis we present an efficient Electron-Hadron discrimination algorithm for liquid argon time projection chambers, validated using Geant4 simulated data.

Software Performance of the ATLAS Track Reconstruction for LHC Run 3

Article Open access 02 April 2024

Introduction to experimental nuclear astrophysics

Article 07 March 2019

FCC-ee: The Lepton Collider

Article Open access 04 June 2019

1 Introduction

Liquid Argon Time Projection Chambers (LAr-TPCs) are recognised as being a potential detector technology for a next-generation Neutrino oscillation experiment and their development is the subject of vibrant R&D programmes in Europe, Japan and the USA (see Refs. [1–3] and references therein). The simultaneous tracking and calorimetry capability, with millimetric granularity, makes LAr-TPCs ideal to image Neutrino interactions over a wide range of energies as demonstrated recently by ICARUS [4].

Despite the physics promise of LAr-TPCs, a fully automated (that is, with zero human interaction) software system for reconstructing events has proven difficult to achieve. With the availability of more and more event data, recent years have seen progress in limited clustering of structures and tracking. However, a general event is likely to contain a mixture of showers and tracks originating from an unknown vertex anywhere in the sensitive volume. One major challenge for automated reconstruction is to disentangle the fundamentally different shower and track structures and hence segment the event into its final state particles. Accurate segmentation is critical for subsequent particle identification on the substructures. Neutrino oscillation analyses proposed for next-generation facilities rely heavily on accurate interaction classification, motivating studies of this task.

In this article, we concentrate on the particle identification step, based on extensive Monte-Carlo simulations, assuming exact segmentation, i.e. all hits belonging to each single particle have been clustered with full efficiency and purity. We defer discussion of techniques for this more challenging task to later publications. We present a study using a set of effective and computationally simple variables which allow the discrimination of Electron shower structures from all relevant hadronic shower structures in a fine-grained tracking detector such as a LAr-TPC. For the intended physics application, the dominant Hadron shower structures result from Pions, Protons or Kaons in the final state. In this context, Electron-Hadron shower discrimination is important for Neutrino oscillation studies in order to measure Electron-neutrino events in a Muon-neutrino beam, resulting in an energetic Electron in the LAr medium after a charged-current interaction with a nucleus. Therefore, the Electron is considered to be the “signal” and anything non-electron, i.e. Hadron showers, is labelled as “background”, unless stated otherwise.

Additionally, the energy resolution of any LAr-TPC will depend on the nature of the particle depositing energy due to quenching factors, i.e. particle dependent ionization efficiencies in liquid argon. Electromagnetic and hadronic particle energy depositions will each require separate energy calibrations depending on their specific ionization as a function of energy, hence for their energy measurement particle identification would be beneficial.

The set of variables also prove to be useful for other particle identification tasks. Track structures associated with Protons or Muons can be identified by these variables as well as, to a lesser extent, hadronic final states such as neutral and charged Pions and Kaons.

In the following, we describe the Monte-Carlo data production in order to define our case studies, then the analysis algorithm by first introducing the discrimination variables before finally presenting their applicability to identifying particles in various Neutrino interaction event topologies.

2 Monte-Carlo data production

Several simulated single particle event classes were prepared as input to the analysis. We have tested the algorithm with a Muon–neutrino flux generated as part of the LAGUNA-LBNO FP7 programme [5, 6] to evaluate the physics potential of a long baseline experiment between CERN and an underground far detector location in Pyhasalmi Finland (CN2PY). The CN2PY Neutrino flux [7] (un-oscillated) from decaying negatively charged Pions (for a distance of 2300 km) provided the input for the first case study. Neutrino reaction final states were produced using the GENIE event generator (version 2.6.4) [8]. The scaled Neutrino energy distributions are shown in Fig. 1 with relative weighting factors given in Table 1.

Table 1 The relative neutrino flavour weighting factors for the CN2PY Neutrino beam (un-oscillated) from negatively charged Pions taken as input for this study. Note that the uniform Neutrino energy beam case study also takes these flavour weighting factors into account

Full size table

The GENIE simulated events were un-filtered, with all possible final state particles allowed. The total number of each individual particle final state from GENIE in each of the four Neutrino fluxes then results in individual weighting factors for that particle species during the subsequent analysis. The complete set of weighting factors used during the multivariate analysis, see below, is listed in Tables 2 and 3.

Table 2 The relative weighting factors for the final state particles resulting from the CN2PY Neutrino beam simulation containing particles given in the first column. These weighting factors are each taken into account during the multivariate analysis stage and calculated by multiplying relative simulated final state particle numbers with the corresponding neutrino flavour weighting factors (see the discrepancy on magnitude between \(\bar{\nu}_{\mu}\) values and all other weighting factors due to the small neutrino flavour weighting factors). Note that Muon and Electron energy spectra are taken from Genie simulations, also using the corresponding neutrino flavour weighting factor

Full size table

Table 3 Same as Table 2 but for the second case study of a uniform neutrino energy beam

Full size table

These single particle energy distributions from the GENIE simulations then serve as the particle source in the Geant4 (version 9.5.0) Monte-Carlo transport simulation toolkit [9]. The simulation models an homogeneous liquid argon detector as a cylinder with a radius and height of 100 m, in order to fully contain the showers. Particles start at the centre of the cylindrical volume and are tracked through the volume with interactions modelled by the QGSP_BIC_HP physics list, with all electromagnetic and hadronic processes enabled, as recommended by the Geant4 collaboration for our energy range of interest. The LAr volume is divided into (1×1×1) mm³ cubic voxels, and all primary and secondary particles are tracked through voxels down to zero energy, or until they leave the volume. Energy deposits by charged particles passing through voxels are tallied into a map between voxel coordinates and the total energy deposited. No attempt was made to model the readout system, but quenching of ionisation events specific to LAr detectors was taken into account following [10].

Results are collected and processed in a final step by four selected multivariate analysis algorithms (MVA) taken from the TMVA toolkit [11]. Each MVA is based on supervised learning, requiring separate training and test data samples. Each sample is chosen from a random split in half of the available data. The final evaluation delivers optimal cuts for each selected MVA method, producing efficiencies for signal selection, background selection and corresponding purities. Note that the complete set of weighting factors as listed in Tables 2 and 3 is used in each MVA method. The chosen methods are: the k-nearest neighbour search (KNN), two variations of boosted decision trees (BDT, BDTG) and a neural network (MLPBNN). These four represent the most successful out of a substantially larger number of methods in the TMVA toolkit during preliminary tests on our project.

Our second case study attempts a pure algorithm test by running the identical analysis as discussed above over a sample of final state particles sampled from a uniform Neutrino energy beam (same flavour weighting factors, same energy range as the CN2PY case). This case studies the impact of a less well constrained neutrino beam energy distribution.

3 Electron-shower discrimination algorithm

Below, we detail the calculation and physics motivation of four discriminating variables. The main assumption underpinning all of the following is that clustered energy deposits, hits, in the detector from a particular particle have been collected as a single object by the reconstruction, i.e. a cluster. Results are collected and processed in a final step by four selected multivariate analysis algorithms (MVA) taken from the TMVA toolkit [11]. This final step is discussed in more detail in Sect. 2.

This crucial clustering step for any LAr detector analysis is assumed to have taken place already. It should be stressed that a reliable, complete and automatic initial clustering step is, in fact, among the unsolved challenges of LAr data analysis, owing to the richness of topological event structures in such a detector, and will not be addressed here.

The first step of the discrimination algorithm consists of transforming the isolated cluster using a principal component analysis (PCA) [12]. The data at this stage consists purely of detector hits forming a point-cloud of unknown shape. Dealing with LAr detectors translates into the capability of assigning three spatial coordinates and a charge value to each individual hit (with corresponding uncertainties).

The reason for the initial PCA step is that it allows us to consider the unknown point-cloud to be aligned and ordered in a consistent way. The PCA is set up such that it calculates the three principal spatial components (ignoring the charge of the hit) as eigenvalues of the transformation matrix, which itself consists of the corresponding eigenvectors. Applying the transformation aligns the coordinate axes to the principal axes, using the convention that the dominant principle axis becomes the ‘x’-axis in a right-handed Cartesian coordinate system. This allows to utilise transverse and longitudinal geometric information due to the presence of a well defined axis for the structure.

3.1 The lateral projection variable “lat”

This discrimination variable is motivated by previous and ongoing accelerator experiments and their analysis of electromagnetic (EM) and Pion showers (see for example Ref. [13]). The entire point-cloud is projected onto the plane of minor principal components (the lateral plane to the major principal axis). A core region is defined around the projected mean value and the ratio of energy (charge) deposited in the core to the total energy (charge) yields the “lat” variable.

Charged Hadron showers feature either a dense core region, often resembling a track-like structure with only a thin “halo” around it, or dissolve into a wide-spread, loose cloud of hits. Proton tracks might or might not shower at all. EM showers, by contrast, tend to produce more constant halo-to-core ratios, as shown in Figs. 2, 3 and 4.

The precise definition of the core region turns out to be non-critical. For simplicity, a circular shape is chosen, which is centred on the mean value of the projected point-cloud and is defined with half the Molière radius in liquid argon, which is equal to 9.61 cm [14]. Each hit inside the core contributes to the sum of energy in the core, which is then divided by the total energy of the shower (the sum of the energies of all hits).

3.2 The hit concentration variable “con”

While the previous variable emphasises a subdivision of the shower structure into two main components (core and halo), the hit concentration variable “con” attempts to use the same lateral information globally. This is independent of any material constant, such as the Molière radius.

The variable first calculates the radial distance of each hit from the main principal axis, i.e. the radial distance from the origin in the lateral projection plane. A minimum threshold radius is set in order to avoid the Coulomb singularity (for practical purposes chosen as 1 mm in this analysis), and finally the hit concentration is calculated as the sum of charge (energy) of the hits divided by their radial distance.

The idea is to calculate the Coulomb potential energy of the point-cloud around the main principal axis. This approach delivers complementary information to the previous rigid division into two parts, since it emphasises the extremes such as track-like structures as being particularly highly concentrated, and very loose structures as being particularly low in concentration. Figures 2, 3 and 4 show the main benefit of this variable. The Hadron showers are dominated by less densely concentrated structures (they peak at low Coulomb energies around the main principal axis). A second population of Hadron events contain Proton tracks as well as sufficiently dense Pion tracks, contributing a larger value to this concentration variable. EM showers are rather non-specific but are significantly more concentrated than Hadron showers.

3.3 Spatial extent variables “extx, y”

The final two proposed discrimination variables are obtained by quantifying the spatial extent of the hit cloud in the three available principal component axes. A definition of the extent in purely geometrical terms turns out to be very useful. The algorithm calculates extent in all three available axes but one of the two transverse axes is discarded since it does not contribute any new information. This azimuthal symmetry assumption around the dominant principal axis was extensively tested and confirmed during this study.

The spatial extent is calculated by obtaining the convex hull in 3D around the hit cloud and finding the maximum distance (extent) of the hull on all three axes. Building the convex hull around an object is a standard algorithm in image analysis [16]. Even though its application in 3D is rather exotic, ready-made implementations of the code exist, for instance in the scientific python library, SciPy [17]. The meaning of the term convex hull can be visualised by considering stiff packaging paper, wrapped around an irregularly shaped object, i.e. the hit cloud, in order to enclose it completely. The paper would represent the convex hull, without the folds and overlaps real paper would produce.

The convex hull is obtained by calculating the Delaunay tessellation [16] of the hit cloud and finding the outermost Delaunay triangles, made of data members, enveloping the entire cloud. The difference between the maximum and minimum coordinate value on each (principal component) coordinate axis is calculated, giving the extent of the hull in all three dimensions.

4 Results

4.1 Electron to combined hadron/lepton background discrimination

Results in this section summarise the performance of the proposed discriminating variables when attempting to identify Electrons in a data sample consisting of all dominant final state particles, i.e. charged and neutral Pion, Protons, Kaons and Muons. The most straightforward way of illustrating the effectiveness of the proposed variables can be seen in Fig. 5, showing the background rejection as a function of signal efficiency for all four chosen methods, the nearest neighbour (KNN), two variations of boosted decision trees (BDT, BDTG) and a neural network (MLPBNN). These plots would ideally show a complete background rejection for all signal efficiencies up to 100 %. The closer the curves follow these ideal lines on the figure, the better the discrimination of the selected MVA method. The MVA methods mentioned above have been chosen as the most effective among a list of 24 possible methods offered in the TMVA toolkit.

Quantitative results for efficiencies and purities in EM shower discrimination are given in Table 4, and illustrate a successful (well above 90 % signal efficiency and background rejection) analysis technique in distinguishing EM showers from Hadron showers in a LAr detector at all interesting Neutrino energies. Finally, the method appears relatively insensitive to presence or absence of a priori beam knowledge.

Table 4 Particle identification efficiencies and contamination (background efficiency or ‘contamination’ is equal to 1.0—background rejection) on electron shower detection in a combined background of Hadron showers and Muons for all selected MVA methods. Weighting factors for each final state particle are always taken into account for signal as well as background particles. Efficiencies are calculated for the optimal decision boundary values defined by maximising the signal-to-total event ratio, \(S/\sqrt{(S+B)}\)

Full size table

4.2 Additional applications

The variables discussed above lend themselves to further applications simply because they are sensitive to only the geometrical shapes of clouds of hits in 3D space as opposed to kinematic variables. As a first application we consider neutral current (NC) interactions in LAr. NC interactions form an important background to Neutrino oscillation signals since they can produce neutral pions in an event. Due to their prompt decay into a pair of photons, with the inevitable EM showers, they constitute a problematic background to Electron-neutrino charged-current interactions. This means that identifying final state π ⁰ would be beneficial for several reconstruction tasks. One study [18] delivered a strong discrimination factor of 10⁻³ for mis-identified π ⁰ in a LArTPC, a number which is widely quoted. Our efforts here deliver a weaker π ⁰ discrimination, see below. However it does not rely on complex computational tasks such as shower separation and fitting and subsequent vertex finding in a potentially very busy event structure, i.e. the reason why [18] relies on visual event selection.

The optimal signal and background efficiencies for the π ⁰ selection are shown in Table 5 (see also Fig. 6). Just as for the study on Electron to Hadron discrimination, all subsequent studies on changing the signal, here to the π ⁰ particle, consider the complete set of background particles and weighting factors, listed in Table 2 and additionally the relevant leptons, i.e. Muons and Electrons. All final state particles other than the selected signal particle constitute background.

Table 5 Particle identification efficiencies and contamination (background efficiency or ‘contamination’ is equal to 1.0—background rejection) in a combined background of Hadron showers and Leptons for the best MVA method, boosted decision trees, BDT. Efficiencies are calculated for the optimal decision boundary values defined by maximising the signal-to-total event ratio, \(S/\sqrt{(S+B)}\). Signal particles are listed in the top row

Full size table

Other final state particles can also be identified as signals with varying degrees of success. In Table 5 we list results for Muons (see Fig. 7), Protons, Kaons and π ⁺, where π ⁻ results are practically identical to Kaon results, hence have been left out.

4.3 Detector threshold effects

Detector-specific effects depend heavily on our primary assumption of successful clustering of hits belonging to a single particle in an overall unspecified detector event. However, any automated analysis will at some point have to solve this challenge for finely-grained LAr detector devices.

Nevertheless, threshold variations would remove (or add) hits to the total simulated single particle event and potentially change geometrical structures. In order to test the robustness of our method against such a realistic detector effect, we studied two scenarios: one realistic hit energy cut at 120 keV/mm for quenched (measured) energies and one essentially without an hit energy cut (1 keV/mm). The threshold choice of 120 keV/mm originates from the simulated onset of the Muon energy loss peak which would be left practically fully intact by this cut. The minimal cut scenario at 1 keV/mm roughly doubles the total amount of hits for most single particles simulated.

The change from these threshold effects on all of the particle identification results shown earlier is negligible, typically less than 0.3 % on any signal or background efficiency quantity.

Similar robustness should be pointed out with respect to hit smearing effects, i.e. the case when individual hits are represented by more realistic volumes of three to six mm cubes (or even asymmetric dimensions). Our proposed variables are purely measures of the geometry of structures in a LArTPC. As long as the smallest spatial measure, i.e. a hit, is small compared with the extent of shower/track structures, we do not expect any of our analysis results to change significantly.

5 Conclusion

We have presented a study identifying particles in event structures obtained in high-granularity detectors such as a liquid argon TPC using four discriminating variables. Combining them using MVA algorithms from the TMVA toolkit gives signal efficiencies for Electron and Muon identification well above 90 % with high background rejection efficiencies on all Neutrino interaction channels. Protons, Kaons and charged Pions show anything from acceptable to poor background rejection but high signal efficiency. Neutral Pion identification can be made very efficient if a background contamination of around 16 % is not a concern. These results depend on the two specific case studies undertaken here: first for a possible neutrino beam from CERN to Finland and second, for a similar neutrino beam (same flavour weighting factors) but uniform neutrino energy distribution.

The data used in this work has been assumed to consist of hits containing three spatial coordinate values and one energy value, which should be the case for LAr TPC detectors. Furthermore, it is assumed that each single structure to be analysed has been clustered correctly which is still an outstanding challenge for automated data analysis on high granularity detector data. Nevertheless, if such a clustering can be achieved by some means, then this work proposes an efficient and reliable method to identify Electron (as well as Positron) events in a combined hadronic and leptonic background. These would be the primary signatures of charged current Electron–Neutrino interactions which constitute the main target of future long baseline experiments. It also allows the identification of neutral Pions without additional knowledge of decay vertices (or similar quantities) in a mixed background of other Hadrons and Leptons, and can be used for Muon identification and as an efficient first pass on Proton, Kaon and charged Pion identification.

References

A. Rubbia, J. Phys. Conf. Ser. 308, 012030 (2011)
Article ADS Google Scholar
B. Fleming, J. Phys. Conf. Ser. 308, 012007 (2011)
Article ADS Google Scholar
M. Tanaka, J. Phys. Conf. Ser. 308, 012022 (2011)
Article ADS Google Scholar
C. Rubbia et al., J. Instrum. 6, P07011 (2011)
Article Google Scholar
A. Ereditato, A. Rubbia, Nucl. Phys. B, Proc. Suppl. 154, 163 (2006)
Article ADS Google Scholar
E.F. Martinez, T. Li, S. Pascoli, O. Mena, Phys. Rev. D 81, 073010 (2010)
Article ADS Google Scholar
A. Longhin, PoS (ICHEP 2010) 325
C. Andreopoulos, Nucl. Instrum. Methods Phys. Res., Sect. A 614, 87 (2010)
Article ADS Google Scholar
S. Agostinelli et al., Nucl. Instrum. Methods Phys. Res., Sect. A 506, 250 (2003)
Article ADS Google Scholar
S. Amoruso et al., Nucl. Instrum. Methods Phys. Res., Sect. A 523, 275 (2004)
Article ADS Google Scholar
P. Speckmayer, A. Hoecker, J. Stelzer, H. Voss, J. Phys. Conf. Ser. 219, 032057 (2010)
Article ADS Google Scholar
I.T. Jolliffe, Principal Component Analysis, 2nd edn. (Springer, New York, 2002)
MATH Google Scholar
B. Andrieu et al., Nucl. Instrum. Methods Phys. Res., Sect. A 344, 492 (1994)
Article ADS Google Scholar
E. Aprile, A.E. Bolotnikov, A.I. Bolozdynya, T. Doke, Noble Gas Detectors (Wiley-VCH, Weinheim, 2006)
Book Google Scholar
P. Huber, M. Lindner, W. Winter, Comput. Phys. Commun. 167, 195 (2005)
Article ADS Google Scholar
C.B. Barber, D.P. Dobkin, H.T. Huhdanpaa, ACM Trans. Math. Softw. 22, 469 (1996)
Article MathSciNet MATH Google Scholar
T.E. Oliphant, Comput. Sci. Eng. 9, 10 (2007)
Article Google Scholar
A. Ankowski, et al. (ICARUS coll.), Acta Phys. Pol. B 41, 103 (2010)
Google Scholar

Download references

Acknowledgements

We acknowledge the contribution made to this study by Timothy Hughes and Laurence Woodward who worked with the corresponding author as final year project students on this topic. Support on the STFC consolidated grant ST/H00369X/1 is also acknowledged.

Open Access

This article is distributed under the terms of the Creative Commons Attribution License which permits any use, distribution, and reproduction in any medium, provided the original author(s) and the source are credited.

Author information

Authors and Affiliations

Dept. of Physics, University of Warwick, Coventry, CV4 7AL, UK
J. J. Back, G. J. Barker, A. J. Bennieston, S. B. Boyd, B. Morgan & Y. A. Ramachers

Authors

J. J. Back
View author publications
You can also search for this author in PubMed Google Scholar
G. J. Barker
View author publications
You can also search for this author in PubMed Google Scholar
A. J. Bennieston
View author publications
You can also search for this author in PubMed Google Scholar
S. B. Boyd
View author publications
You can also search for this author in PubMed Google Scholar
B. Morgan
View author publications
You can also search for this author in PubMed Google Scholar
Y. A. Ramachers
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Y. A. Ramachers.

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 2.0 International License (https://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Reprints and permissions

About this article

Cite this article

Back, J.J., Barker, G.J., Bennieston, A.J. et al. Electron-hadron shower discrimination in a liquid argon time projection chamber. Eur. Phys. J. C 73, 2369 (2013). https://doi.org/10.1140/epjc/s10052-013-2369-y

Download citation

Received: 26 October 2012
Revised: 18 February 2013
Published: 21 March 2013
DOI: https://doi.org/10.1140/epjc/s10052-013-2369-y

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Electron-hadron shower discrimination in a liquid argon time projection chamber

Abstract

Similar content being viewed by others

Software Performance of the ATLAS Track Reconstruction for LHC Run 3