Abstract
There has been substantial progress in applying machine learning techniques to classification problems in collider and jet physics. But as these techniques grow in sophistication, they are becoming more sensitive to subtle features of jets that may not be well modeled in simulation. Therefore, relying on simulations for training will lead to sub-optimal performance in data, but the lack of true class labels makes it difficult to train on real data. To address this challenge we introduce a new approach, called Tag N’ Train (TNT), that can be applied to unlabeled data that has two distinct sub-objects. The technique uses a weak classifier for one of the objects to tag signal-rich and background-rich samples. These samples are then used to train a stronger classifier for the other object. We demonstrate the power of this method by applying it to a dijet resonance search. By starting with autoencoders trained directly on data as the weak classifiers, we use TNT to train substantially improved classifiers. We show that Tag N’ Train can be a powerful tool in model-agnostic searches and discuss other potential applications.
Article PDF
Similar content being viewed by others
Explore related subjects
Find the latest articles, discoveries, and news in related topics.Avoid common mistakes on your manuscript.
References
ATLAS collaboration, Exotic physics searches, https://twiki.cern.ch/twiki/bin/view/AtlasPublic/ExoticsPublicResults (2020).
ATLAS collaboration, Higgs and diboson searches, https://twiki.cern.ch/twiki/bin/view/AtlasPublic/ExoticsPublicResults (2020).
CMS collaboration, Cms exotica public physics results, https://twiki.cern.ch/twiki/bin/view/CMSPublic/PhysicsResultsEXO (2020).
CMS collaboration, Cms beyond-two-generations (b2g) public physics results, https://twiki.cern.ch/twiki/bin/view/CMSPublic/PhysicsResultsB2G (2020).
LHCb collaboration, Publications of the qcd, electroweak and exotica working group, http:// lhcbproject.web.cern.ch/lhcbproject/Publications/LHCbProjectPublic/Summary_QEE.html (2020).
S. Rappoccio, The experimental status of direct searches for exotic physics beyond the standard model at the Large Hadron Collider, Rev. Phys. 4 (2019) 100027 [arXiv:1810.10579] [INSPIRE].
J.H. Kim, K. Kong, B. Nachman and D. Whiteson, The motivation and status of two-body resonance decays after the LHC Run 2 and beyond, JHEP 04 (2020) 030 [arXiv:1907.06659] [INSPIRE].
N. Craig, P. Draper, K. Kong, Y. Ng and D. Whiteson, The unexplored landscape of two-body resonances, Acta Phys. Polon. B 50 (2019) 837 [arXiv:1610.09392] [INSPIRE].
D0 collaboration, Search for new physics in eμX data at DØ using SLEUTH: A quasi-model-independent search strategy for new physics, Phys. Rev. D 62 (2000) 092004 [hep-ex/0006011] [INSPIRE].
D0 collaboration, A quasi-model-independent search for new high pT physics at d0, Phys. Rev. Lett. 86 (2001) 3712 [hep-ex/0011071] [INSPIRE].
D0 collaboration, Quasi-model-independent search for new physics at large transverse momentum, Phys. Rev. D 64 (2001) 012004 [hep-ex/0011067] [INSPIRE].
H1 collaboration, A General Search for New Phenomena at HERA, Phys. Lett. B 674 (2009) 257 [arXiv:0901.0507] [INSPIRE].
H1 collaboration, A General search for new phenomena in ep scattering at HERA, Phys. Lett. B 602 (2004) 14 [hep-ex/0408044] [INSPIRE].
CDF collaboration, Model-Independent and Quasi-Model-Independent Search for New Physics at CDF, Phys. Rev. D 78 (2008) 012002 [arXiv:0712.1311] [INSPIRE].
CDF collaboration, Global Search for New Physics with 2.0 fb−1 at CDF, Phys. Rev. D 79 (2009) 011101 [arXiv:0809.3781] [INSPIRE].
CMS collaboration, MUSiC, a Model Unspecific Search for New Physics, in pp Collisions at \( \sqrt{s} \) = 8 TeV, CMS-PAS-EXO-14-016.
CMS collaboration, Model Unspecific Search for New Physics in pp Collisions at \( \sqrt{s} \) = 7 TeV, CMS-PAS-EXO-10-021.
ATLAS collaboration, A strategy for a general search for new phenomena using data-derived signal regions and its application within the ATLAS experiment, Eur. Phys. J. C 79 (2019) 120 [arXiv:1807.07447] [INSPIRE].
ATLAS collaboration, A general search for new phenomena with the ATLAS detector in pp collisions at \( \sqrt{s} \) = 8 TeV, ATLAS-CONF-2014-006.
ATLAS collaboration, A general search for new phenomena with the ATLAS detector in pp collisions at \( \sqrt{s} \) = 7 TeV, ATLAS-CONF-2012-107.
A.J. Larkoski, I. Moult and B. Nachman, Jet Substructure at the Large Hadron Collider: A Review of Recent Advances in Theory and Machine Learning, Phys. Rept. 841 (2020) 1 [arXiv:1709.04464] [INSPIRE].
L.G. Almeida, M. Backović, M. Cliche, S.J. Lee and M. Perelstein, Playing Tag with ANN: Boosted Top Identification with Pattern Recognition, JHEP 07 (2015) 086 [arXiv:1501.05968] [INSPIRE].
P. Baldi, K. Bauer, C. Eng, P. Sadowski and D. Whiteson, Jet Substructure Classification in High-Energy Physics with Deep Neural Networks, Phys. Rev. D 93 (2016) 094034 [arXiv:1603.09349] [INSPIRE].
G. Kasieczka, T. Plehn, M. Russell and T. Schell, Deep-learning Top Taggers or The End of QCD?, JHEP 05 (2017) 006 [arXiv:1701.08784] [INSPIRE].
G. Louppe, K. Cho, C. Becot and K. Cranmer, QCD-Aware Recursive Neural Networks for Jet Physics, JHEP 01 (2019) 057 [arXiv:1702.00748] [INSPIRE].
A. Butter et al., The Machine Learning Landscape of Top Taggers, SciPost Phys. 7 (2019) 014 [arXiv:1902.09914] [INSPIRE].
A. Butter, G. Kasieczka, T. Plehn and M. Russell, Deep-learned Top Tagging with a Lorentz Layer, SciPost Phys. 5 (2018) 028 [arXiv:1707.08966] [INSPIRE].
J. Cogan, M. Kagan, E. Strauss and A. Schwarztman, Jet-Images: Computer Vision Inspired Techniques for Jet Tagging, JHEP 02 (2015) 118 [arXiv:1407.5675] [INSPIRE].
L. de Oliveira, M. Kagan, L. Mackey, B. Nachman and A. Schwartzman, Jet-images — deep learning edition, JHEP 07 (2016) 069 [arXiv:1511.05190] [INSPIRE].
P.T. Komiske, E.M. Metodiev and M.D. Schwartz, Deep learning in color: towards automated quark/gluon jet discrimination, JHEP 01 (2017) 110 [arXiv:1612.01551] [INSPIRE].
S. Macaluso and D. Shih, Pulling Out All the Tops with Computer Vision and Deep Learning, JHEP 10 (2018) 121 [arXiv:1803.00107] [INSPIRE].
P.T. Komiske, E.M. Metodiev and J. Thaler, Energy Flow Networks: Deep Sets for Particle Jets, JHEP 01 (2019) 121 [arXiv:1810.05165] [INSPIRE].
J. Barnard, E.N. Dawe, M.J. Dolan and N. Rajcic, Parton Shower Uncertainties in Jet Substructure Analyses with Deep Neural Networks, Phys. Rev. D 95 (2017) 014018 [arXiv:1609.00607] [INSPIRE].
E.M. Metodiev, B. Nachman and J. Thaler, Classification without labels: Learning from mixed samples in high energy physics, JHEP 10 (2017) 174 [arXiv:1708.02949] [INSPIRE].
J.H. Collins, K. Howe and B. Nachman, Anomaly Detection for Resonant New Physics with Machine Learning, Phys. Rev. Lett. 121 (2018) 241803 [arXiv:1805.02664] [INSPIRE].
J.H. Collins, K. Howe and B. Nachman, Extending the search for new resonances with machine learning, Phys. Rev. D 99 (2019) 014038 [arXiv:1902.02634] [INSPIRE].
T. Heimel, G. Kasieczka, T. Plehn and J.M. Thompson, QCD or What?, SciPost Phys. 6 (2019) 030 [arXiv:1808.08979] [INSPIRE].
M. Farina, Y. Nakai and D. Shih, Searching for New Physics with Deep Autoencoders, Phys. Rev. D 101 (2020) 075021 [arXiv:1808.08992] [INSPIRE].
A. Blance, M. Spannowsky and P. Waite, Adversarially-trained autoencoders for robust unsupervised new physics searches, JHEP 10 (2019) 047 [arXiv:1905.10384] [INSPIRE].
T.S. Roy and A.H. Vijay, A robust anomaly finder based on autoencoders, arXiv:1903.02032 [INSPIRE].
R.T. D’Agnolo and A. Wulzer, Learning New Physics from a Machine, Phys. Rev. D 99 (2019) 015014 [arXiv:1806.02350] [INSPIRE].
R.T. D’Agnolo, G. Grosso, M. Pierini, A. Wulzer and M. Zanetti, Learning Multivariate New Physics, arXiv:1912.12155 [INSPIRE].
O. Cerri, T.Q. Nguyen, M. Pierini, M. Spiropulu and J.-R. Vlimant, Variational Autoencoders for New Physics Mining at the Large Hadron Collider, JHEP 05 (2019) 036 [arXiv:1811.10276] [INSPIRE].
J. Hajer, Y.-Y. Li, T. Liu and H. Wang, Novelty Detection Meets Collider Physics, Phys. Rev. D 101 (2020) 076015 [arXiv:1807.10261] [INSPIRE].
A. De Simone and T. Jacques, Guiding New Physics Searches with Unsupervised Learning, Eur. Phys. J. C 79 (2019) 289 [arXiv:1807.06038] [INSPIRE].
A. Mullin, H. Pacey, M. Parker, M. White and S. Williams, Does SUSY have friends? A new approach for LHC event analysis, arXiv:1912.10625 [INSPIRE].
A. Casa and G. Menardi, Nonparametric semisupervised classification for signal detection in high energy physics, arXiv:1809.02977 [INSPIRE].
B.M. Dillon, D.A. Faroughy and J.F. Kamenik, Uncovering latent jet substructure, Phys. Rev. D 100 (2019) 056002 [arXiv:1904.04200] [INSPIRE].
J.A. Aguilar-Saavedra, J.H. Collins and R.K. Mishra, A generic anti-QCD jet tagger, JHEP 11 (2017) 163 [arXiv:1709.01087] [INSPIRE].
B. Nachman and D. Shih, Anomaly Detection with Density Estimation, Phys. Rev. D 101 (2020) 075042 [arXiv:2001.04990] [INSPIRE].
A. Andreassen, B. Nachman and D. Shih, Simulation Assisted Likelihood-free Anomaly Detection, Phys. Rev. D 101 (2020) 095004 [arXiv:2001.05001] [INSPIRE].
CMS collaboration, Performance of the CMS muon detector and muon reconstruction with proton-proton collisions at \( \sqrt{s} \) = 13 TeV, 2018 JINST 13 P06015 [arXiv:1804.04528] [INSPIRE].
ATLAS collaboration, Muon reconstruction performance of the ATLAS detector in proton-proton collision data at \( \sqrt{s} \) = 13 TeV, Eur. Phys. J. C 76 (2016) 292 [arXiv:1603.05598] [INSPIRE].
G. Kasieczka, B. Nachman and D. Shih, R&D Dataset for LHC Olympics 2020 Anomaly Detection Challenge, (Apr., 2019) https://doi.org/10.5281/zenodo.2629073.
T. Sjöstrand, S. Mrenna and P.Z. Skands, PYTHIA 6.4 Physics and Manual, JHEP 05 (2006) 026 [hep-ph/0603175] [INSPIRE].
T. Sjöstrand, S. Mrenna and P.Z. Skands, A Brief Introduction to PYTHIA 8.1, Comput. Phys. Commun. 178 (2008) 852 [arXiv:0710.3820] [INSPIRE].
DELPHES 3 collaboration, DELPHES 3, A modular framework for fast simulation of a generic collider experiment, JHEP 02 (2014) 057 [arXiv:1307.6346] [INSPIRE].
M. Cacciari and G.P. Salam, Dispelling the N 3 myth for the kt jet-finder, Phys. Lett. B 641 (2006) 57 [hep-ph/0512210] [INSPIRE].
M. Cacciari, G.P. Salam and G. Soyez, FastJet User Manual, Eur. Phys. J. C 72 (2012) 1896 [arXiv:1111.6097] [INSPIRE].
M. Cacciari, G.P. Salam and G. Soyez, The anti-kt jet clustering algorithm, JHEP 04 (2008) 063 [arXiv:0802.1189] [INSPIRE].
F. Chollet et al., Keras, https://keras.io, (2015).
M. Abadi et al., TensorFlow: Large-Scale Machine Learning on Heterogeneous Distributed Systems, arXiv:1603.04467 [INSPIRE].
J. Dolen, P. Harris, S. Marzani, S. Rappoccio and N. Tran, Thinking outside the ROCs: Designing Decorrelated Taggers (DDT) for jet substructure, JHEP 05 (2016) 156 [arXiv:1603.00027] [INSPIRE].
G. Louppe, M. Kagan and K. Cranmer, Learning to Pivot with Adversarial Networks, arXiv:1611.01046 [INSPIRE].
C. Shimmin, P. Sadowski, P. Baldi, E. Weik, D. Whiteson, E. Goul et al., Decorrelated Jet Substructure Tagging using Adversarial Neural Networks, Phys. Rev. D 96 (2017) 074034 [arXiv:1703.03507] [INSPIRE].
L. Bradshaw, R.K. Mishra, A. Mitridate and B. Ostdiek, Mass Agnostic Jet Taggers, SciPost Phys. 8 (2020) 011 [arXiv:1908.08959] [INSPIRE].
G. Kasieczka and D. Shih, Robust Jet Classifiers through Distance Correlation, Phys. Rev. Lett. 125 (2020) 122001 [arXiv:2001.05310] [INSPIRE].
B. Kim et al., Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav), arXiv:1711.11279.
K. Simonyan, A. Vedaldi and A. Zisserman, Deep inside convolutional networks: Visualising image classification models and saliency maps, arXiv:1312.6034.
C. Olah, A. Mordvintsev and L. Schubert, Feature visualization, Distill (2017).
J. Thaler and K. Van Tilburg, Identifying Boosted Objects with N-subjettiness, JHEP 03 (2011) 015 [arXiv:1011.2268] [INSPIRE].
D.P. Kingma and M. Welling, Auto-Encoding Variational Bayes, arXiv:1312.6114 [INSPIRE].
D.P. Kingma, T. Salimans, R. Jozefowicz, X. Chen, I. Sutskever and M. Welling, Improving variational inference with inverse autoregressive flow, arXiv:1606.04934.
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher’s Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
ArXiv ePrint: 2002.12376
Rights and permissions
Open Access . This article is distributed under the terms of the Creative Commons Attribution License (CC-BY 4.0), which permits any use, distribution and reproduction in any medium, provided the original author(s) and source are credited.
About this article
Cite this article
Amram, O., Suarez, C.M. Tag N’ Train: a technique to train improved classifiers on unlabeled data. J. High Energ. Phys. 2021, 153 (2021). https://doi.org/10.1007/JHEP01(2021)153
Received:
Revised:
Accepted:
Published:
DOI: https://doi.org/10.1007/JHEP01(2021)153