Deep-learning top taggers or the end of QCD?

  • Gregor Kasieczka
  • Tilman Plehn
  • Michael Russell
  • Torben Schell
Open Access
Regular Article - Experimental Physics


Machine learning based on convolutional neural networks can be used to study jet images from the LHC. Top tagging in fat jets offers a well-defined framework to establish our DeepTop approach and compare its performance to QCD-based top taggers. We first optimize a network architecture to identify top quarks in Monte Carlo simulations of the Standard Model production channel. Using standard fat jets we then compare its performance to a multivariate QCD-based top tagger. We find that both approaches lead to comparable performance, establishing convolutional networks as a promising new approach for multivariate hypothesis-based top tagging.
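As a rough illustration of what a jet image is, the sketch below bins the constituents of a single jet into a coarse grid in the (η, φ) plane, with each pixel holding the summed transverse momentum. It is a minimal sketch under stated assumptions, not the preprocessing used for DeepTop: the function name `jet_image`, the 8×8 grid, the half-width of 1.0 and the toy constituents are all hypothetical choices made for this example.

```python
import numpy as np


def jet_image(eta, phi, pt, n_pixels=8, half_width=1.0):
    """Bin jet constituents into an n_pixels x n_pixels grid of summed pT.

    eta, phi: constituent coordinates relative to the jet axis
              (assumed to be centred already).
    pt:       constituent transverse momenta.
    The grid size and width are illustrative assumptions, not values
    taken from the article.
    """
    edges = np.linspace(-half_width, half_width, n_pixels + 1)
    # Each pixel accumulates the pT of the constituents that fall into it.
    image, _, _ = np.histogram2d(eta, phi, bins=(edges, edges), weights=pt)
    return image


# Toy jet with three constituents (made-up numbers).
eta = np.array([0.05, -0.40, 0.60])
phi = np.array([0.10, 0.30, -0.55])
pt = np.array([120.0, 60.0, 20.0])

image = jet_image(eta, phi, pt)
```

An array of this kind is what a convolutional network would take as input, in the same way it would take a grey-scale picture.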


Keywords: Jet substructure; QCD; Hadron-Hadron scattering (experiments); Top physics


Open Access

This article is distributed under the terms of the Creative Commons Attribution License (CC-BY 4.0), which permits any use, distribution and reproduction in any medium, provided the original author(s) and source are credited.



Copyright information

© The Author(s) 2017

Authors and Affiliations

  • Gregor Kasieczka (1)
  • Tilman Plehn (2)
  • Michael Russell (3)
  • Torben Schell (2)

  1. Institute for Particle Physics, ETH Zürich, Zürich, Switzerland
  2. Institut für Theoretische Physik, Universität Heidelberg, Heidelberg, Germany
  3. School of Physics and Astronomy, University of Glasgow, Glasgow, United Kingdom
