Abstract
High-multiplicity all-hadronic final states are an important but difficult target in searches for physics beyond the Standard Model. A powerful search method is to look for large jets with accidental substructure due to multiple hard partons falling within a single jet. One way to estimate the background in this search is to exploit an approximate factorization in quantum chromodynamics whereby the mass distribution of a jet is determined only by its kinematic properties. Traditionally, this approach has been implemented using histograms constructed in a background-rich region. We propose a new approach based on Generative Adversarial Networks (GANs). These neural network approaches are naturally unbinned and can be readily conditioned on multiple jet properties. In addition to using vanilla GANs for this purpose, we investigate a modification to the traditional WGAN approach in which weight clipping is replaced by drawing weights from a naturally compact set (in this case, the circle). Both the vanilla and modified WGAN approaches significantly outperform the histogram method, especially when modeling the dependence on features not used in the histogram construction. These results can help enhance the sensitivity of LHC searches to high-multiplicity final states involving many quarks and gluons, and they serve as a benchmark where GANs may have immediate benefit to the HEP community.
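To make the traditional histogram baseline concrete, the following is a minimal sketch of a binned jet mass template, not the implementation used in the paper. It assumes templates binned in jet pT only, and it uses toy data in place of a real background-rich region. The function names `build_templates` and `sample_mass`, the bin edges, and the gamma-distributed toy masses are all hypothetical choices made for illustration.

```python
import numpy as np

def build_templates(pt, mass, pt_edges, mass_edges):
    """Return one normalized jet mass histogram per pT bin (rows are pT bins)."""
    counts, _, _ = np.histogram2d(pt, mass, bins=[pt_edges, mass_edges])
    totals = counts.sum(axis=1, keepdims=True)
    # Empty pT bins are left as all-zero rows instead of dividing by zero.
    return np.divide(counts, totals, out=np.zeros_like(counts), where=totals > 0)

def sample_mass(templates, pt, pt_edges, mass_edges, rng):
    """Draw one mass per jet from the template of the pT bin the jet falls in.

    Assumes every pT bin that is sampled from has a populated template.
    """
    pt_bin = np.clip(np.digitize(pt, pt_edges) - 1, 0, len(pt_edges) - 2)
    out = np.empty(len(pt))
    for i, b in enumerate(pt_bin):
        m_bin = rng.choice(len(mass_edges) - 1, p=templates[b])
        # Smear uniformly inside the chosen mass bin.
        out[i] = rng.uniform(mass_edges[m_bin], mass_edges[m_bin + 1])
    return out

# Toy "background-rich region": the mass scale grows with pT (assumption).
rng = np.random.default_rng(0)
pt_edges = np.linspace(200.0, 1000.0, 5)    # 4 pT bins, in GeV
mass_edges = np.linspace(0.0, 400.0, 41)    # 40 mass bins, in GeV
train_pt = rng.uniform(200.0, 1000.0, 20000)
train_mass = rng.gamma(shape=2.0, scale=0.05 * train_pt)

templates = build_templates(train_pt, train_mass, pt_edges, mass_edges)

# Predict masses for jets in a different region, given only their pT.
low_pt = rng.uniform(200.0, 400.0, 2000)
high_pt = rng.uniform(800.0, 1000.0, 2000)
sampled_low = sample_mass(templates, low_pt, pt_edges, mass_edges, rng)
sampled_high = sample_mass(templates, high_pt, pt_edges, mass_edges, rng)
```

The binning is where this baseline is limited: any jet property that is not an axis of the histogram is averaged over, and each added axis thins out the statistics per bin. A conditional generative model avoids the explicit binning, which is the motivation stated above.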
Open Access
This article is distributed under the terms of the Creative Commons Attribution License (CC-BY 4.0), which permits any use, distribution and reproduction in any medium, provided the original author(s) and source are credited.
Additional information
ArXiv ePrint: 1903.02556
About this article
Cite this article
Lin, J., Bhimji, W. & Nachman, B. Machine learning templates for QCD factorization in the search for physics beyond the standard model. J. High Energ. Phys. 2019, 181 (2019). https://doi.org/10.1007/JHEP05(2019)181
DOI: https://doi.org/10.1007/JHEP05(2019)181