Abstract
The basic goal of data analysis is to establish a link between a set of measurements, in the form of electronically stored data using some format, and a theoretical model, which is intended to describe the phenomena at the origin of these measurements and usually is summarized by a set of equations with some parameters. The key elements of data analysis are data abstraction and data reduction, followed by statistical inference such as hypothesis testing or interval estimation. Abstraction means that the original set of raw measurements, e.g., a collection of electronic pulses induced by a particle passing through a detector, is converted (“reconstructed”) into physical quantities and properties that can be assigned to the particle, such as its momentum or its energy. Typically this process is accompanied by data reduction, i.e., the overall data volume is reduced when going from the original set of measurements to a compilation of reconstructed physical quantities.
In this chapter, I will describe the steps involved in order to achieve the abovementioned data abstraction and reduction in the case of particle physics experiments and mention recent developments regarding the use of modern machine learning methods for the interpretation of the extremely large data sets. Examples will be given for measurements carried out at e+e− as well as hadron colliders. The basic concepts behind reconstruction algorithms, such as track finding in tracking detectors and energy measurements in calorimeters, will be discussed, along with higher-level algorithms such as particle jet reconstruction. Finally, the typical software and computing environment of large collider experiments will be described, which is necessary in order to achieve the outlined goals of data analysis.
References
Aaboud M et al [ATLAS Collaboration] (2017) Jet reconstruction and performance using particle flow with the ATLAS Detector. Eur Phys J C77(7):466, preprint [arXiv:1703.10485 [hep-ex]]
Berthold MR, Hand DJ (2003) Intelligent data analysis, 2nd edn. Springer-Verlag, Berlin Heidelberg
Blazey GC, Flaugher BL (1999) Jet and Dijet Production at the Tevatron, Ann Rev Nucl Part Sci 49:633
Brandt S (1998) Data analysis: statistical and computational methods for scientists and engineers, 4th edition 2014, Springer International Publishing Switzerland
Buskulic D et al [ALEPH Collaboration] (1995) Performance of the ALEPH detector at LEP. Nucl Instrum Methods A360:481
Cowan G (1998) Statistical data analysis. Clarendon Press, Oxford
Dissertori G, Knowles IG, Schmelling M (2003) Quantum chromodynamics: high energy experiments and theory, 2nd edn. Clarendon Press, Oxford
Fishman G (1996) Monte Carlo: concepts, algorithms and applications. Springer, New York
Frühwirth R, Regler M, Bock RK, Grote H, Notz D (2000) Data analysis techniques for high-energy physics, 2nd edn. Cambridge University Press, Cambridge
Griffiths D (2008) Introduction to elementary particles, 2nd edn. Wiley-VCH, Weinheim
Guest D, Cranmer K, Whiteson D. (2018) Deep learning and its application to LHC physics, preprint arXiv:1806.11484 [hep-ex]
Koop G (2009) Analysis of economic data, 3rd edn. John Wiley & Sons, Chichester, West Sussex
Lazar NA (2009) The statistical analysis of functional MRI data. Springer, New York
Lista L (2017) Statistical methods for data analysis in particle physics. Lecture notes in physics, vol 941, 2nd edn. Springer, Heidelberg
Lyman Ott R, Longnecker MT (2008) An introduction to statistical methods and data analysis, 6th edn. Brooks/Cole, Belmont
Marsh C, Elliott J (2009) Exploring data: an introduction to data analysis for social scientists, 2nd edn. Polity Press, Cambridge
Meschi E, Monteiro T, Seez C, Pratibha V (2001) Electron reconstruction in the CMS electromagnetic calorimeter, public CMS NOTE 2001-034, Jun 2001, http://cds.cern.ch/record/687345/?ln=pt; Seez C, private communication
Nisbet R, Elder J IV, Miner G (1997) Handbook of statistical analysis and data mining applications. Elsevier, Amsterdam
Quinn GP, Keough MJ (2002) Experimental design and data analysis for biologists. Cambridge University Press, Cambridge
Radovic A et al (2018) Machine learning at the energy and intensity frontiers of particle physics. Nature 560(7716):41
Roff DA (2006) Introduction to computer-intensive methods of data analysis in biology. Cambridge University Press, Cambridge, United Kingdom
Salam GP (2010) Towards jetography. Eur Phys J C67:637, preprint arXiv:0906.1833 [hep-ph]
Sirunyan AM et al [CMS Collaboration] (2017) Particle-flow reconstruction and global event description with the CMS detector. JINST 12(10):P10003, preprint [arXiv:1706.04965 [physics.ins-det]]
Sivia D, Skilling J (2006) Data analysis: a Bayesian tutorial, 2nd edn. Oxford University Press, Oxford
Thomson M (2013) Modern particle physics. Cambridge University Press, Cambridge
Wigmans R (2003) Calorimetry: energy measurement in particle physics. Clarendon Press, Oxford
Zaidi H (2006) Quantitative analysis in nuclear medicine imaging. Springer Science+Business Media, New York
Zupan B, Keravnou E, Lavrac N (1997) Intelligent data analysis in medicine and pharmacology. Springer, Berlin
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2021 Springer Nature Switzerland AG
About this entry
Cite this entry
Dissertori, G. (2021). Data Analysis. In: Fleck, I., Titov, M., Grupen, C., Buvat, I. (eds) Handbook of Particle Detection and Imaging. Springer, Cham. https://doi.org/10.1007/978-3-319-93785-4_4
Download citation
DOI: https://doi.org/10.1007/978-3-319-93785-4_4
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-93784-7
Online ISBN: 978-3-319-93785-4
eBook Packages: Physics and AstronomyReference Module Physical and Materials ScienceReference Module Chemistry, Materials and Physics