Weakly supervised classification in high energy physics
As machine learning algorithms become increasingly sophisticated in order to exploit subtle features of the data, they often become more dependent on simulations. This paper presents a new approach called weakly supervised classification, in which class proportions are the only input into the machine learning algorithm. Using one of the most challenging binary classification tasks in high energy physics, quark versus gluon tagging, we show that weakly supervised classification can match the performance of fully supervised algorithms. Furthermore, by design, the new algorithm is insensitive to any mis-modeling of discriminating features in the data by the simulation. Weakly supervised classification is a general procedure that can be applied to a wide variety of learning problems to boost performance and robustness when detailed simulations are not reliable or not available.
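To make the idea of training on class proportions concrete, the following is a minimal toy sketch, not the paper's implementation. It assumes synthetic one-dimensional Gaussian "events" in place of jet observables, a logistic model in place of a neural network, and a squared difference between the mean prediction in each sample and that sample's known signal fraction as the loss. The names `make_batch`, `train_weak` and `run_demo`, and all numerical settings, are hypothetical. Per-event labels are generated only so the result can be checked afterwards; the training step never sees them.

```python
import numpy as np


def make_batch(rng, n, frac_signal):
    """Draw n toy events; the labels y are returned for evaluation only."""
    y = (rng.random(n) < frac_signal).astype(float)
    # Assumption: signal is centred at +1 and background at -1, unit width.
    x = rng.normal(loc=np.where(y == 1.0, 1.0, -1.0), scale=1.0)
    return x, y


def train_weak(batches, fractions, lr=0.5, epochs=500):
    """Fit p(x) = sigmoid(w*x + b) using only the per-batch signal fractions."""
    w, b = 0.0, 0.0
    for _ in range(epochs):
        grad_w, grad_b = 0.0, 0.0
        for x, frac in zip(batches, fractions):
            p = 1.0 / (1.0 + np.exp(-(w * x + b)))
            # Loss for this batch: (mean prediction - known fraction)^2
            diff = p.mean() - frac
            dp = p * (1.0 - p)  # derivative of the sigmoid
            grad_w += 2.0 * diff * np.mean(dp * x)
            grad_b += 2.0 * diff * np.mean(dp)
        w -= lr * grad_w / len(batches)
        b -= lr * grad_b / len(batches)
    return w, b


def run_demo(seed=0):
    rng = np.random.default_rng(seed)
    fractions = [0.2, 0.8]  # assumed known proportions of the two samples
    batches = [make_batch(rng, 5000, f)[0] for f in fractions]  # labels dropped
    w, b = train_weak(batches, fractions)
    # Evaluate on a labelled, balanced test set (labels used only here).
    x_test, y_test = make_batch(rng, 5000, 0.5)
    accuracy = np.mean(((w * x_test + b) > 0.0) == (y_test == 1.0))
    return w, b, accuracy


w, b, accuracy = run_demo()
print(f"w = {w:.2f}, b = {b:.2f}, accuracy = {accuracy:.3f}")
```

In this toy setup the two samples need different signal fractions: if the proportions were equal, the loss would carry no information about which direction in `x` separates the classes. The classifier output is also not expected to be a calibrated probability, only a score that ranks events.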
This article is distributed under the terms of the Creative Commons Attribution License (CC-BY 4.0), which permits any use, distribution and reproduction in any medium, provided the original author(s) and source are credited.