The LHC Olympics 2020: a community challenge for anomaly detection in high energy physics
Abstract
A new paradigm for data-driven, model-agnostic new physics searches at colliders is emerging, and aims to leverage recent breakthroughs in anomaly detection and machine learning. In order to develop and benchmark new anomaly detection methods within this framework, it is essential to have standard datasets. To this end, we have created the LHC Olympics 2020, a community challenge accompanied by a set of simulated collider events. Participants in these Olympics have developed their methods using an R&D dataset and then tested them on black boxes: datasets with an unknown anomaly (or not). Furthermore, methods made use of modern machine learning tools and were based on unsupervised learning (autoencoders, generative adversarial networks, normalizing flows), weakly supervised learning, and semi-supervised learning. This paper will review the LHC Olympics 2020 challenge, including an overview of the competition, a description of methods deployed in the competition, lessons learned from the experience, and implications for data analyses with future datasets as well as future colliders.
- Publication Date:
- 2021-12-07
- Research Org.:
- Massachusetts Inst. of Technology (MIT), Cambridge, MA (United States); Lawrence Berkeley National Laboratory (LBNL), Berkeley, CA (United States); SLAC National Accelerator Laboratory (SLAC), Menlo Park, CA (United States)
- Sponsoring Org.:
- USDOE Office of Science (SC), High Energy Physics (HEP)
- Contributing Org.:
- LHC Olympics Challenge Team
- OSTI Identifier:
- 1909683
- Alternate Identifier(s):
- OSTI ID: 1863790; OSTI ID: 1867888
- Grant/Contract Number:
- SC0011090; SC0012567; AC02-05CH11231; AC02-76SF00515
- Resource Type:
- Accepted Manuscript
- Journal Name:
- Reports on Progress in Physics
- Additional Journal Information:
- Journal Volume: 84; Journal Issue: 12; Journal ID: ISSN 0034-4885
- Publisher:
- IOP Publishing
- Country of Publication:
- United States
- Language:
- English
- Subject:
- 72 PHYSICS OF ELEMENTARY PARTICLES AND FIELDS
Citation Formats
Kasieczka, Gregor, Nachman, Benjamin, Shih, David, Amram, Oz, Andreassen, Anders, Benkendorfer, Kees, Bortolato, Blaz, Brooijmans, Gustaaf, Canelli, Florencia, Collins, Jack H., Dai, Biwei, De Freitas, Felipe F., Dillon, Barry M., Dinu, Ioan-Mihail, Dong, Zhongtian, Donini, Julien, Duarte, Javier, Faroughy, D. A., Gonski, Julia, Harris, Philip, Kahn, Alan, Kamenik, Jernej F., Khosa, Charanjit K., Komiske, Patrick, Le Pottier, Luc, Martín-Ramiro, Pablo, Matevc, Andrej, Metodiev, Eric, Mikuni, Vinicius, Murphy, Christopher W., Ochoa, Inês, Park, Sang Eon, Pierini, Maurizio, Rankin, Dylan, Sanz, Veronica, Sarda, Nilai, Seljak, Uroš, Smolkovic, Aleks, Stein, George, Suarez, Cristina Mantilla, Szewc, Manuel, Thaler, Jesse, Tsan, Steven, Udrescu, Silviu-Marian, Vaslin, Louis, Vlimant, Jean-Roch, Williams, Daniel, and Yunus, Mikaeel. The LHC Olympics 2020: a community challenge for anomaly detection in high energy physics. United States: N. p., 2021. Web. doi:10.1088/1361-6633/ac36b9.
Kasieczka, Gregor, Nachman, Benjamin, Shih, David, Amram, Oz, Andreassen, Anders, Benkendorfer, Kees, Bortolato, Blaz, Brooijmans, Gustaaf, Canelli, Florencia, Collins, Jack H., Dai, Biwei, De Freitas, Felipe F., Dillon, Barry M., Dinu, Ioan-Mihail, Dong, Zhongtian, Donini, Julien, Duarte, Javier, Faroughy, D. A., Gonski, Julia, Harris, Philip, Kahn, Alan, Kamenik, Jernej F., Khosa, Charanjit K., Komiske, Patrick, Le Pottier, Luc, Martín-Ramiro, Pablo, Matevc, Andrej, Metodiev, Eric, Mikuni, Vinicius, Murphy, Christopher W., Ochoa, Inês, Park, Sang Eon, Pierini, Maurizio, Rankin, Dylan, Sanz, Veronica, Sarda, Nilai, Seljak, Uroš, Smolkovic, Aleks, Stein, George, Suarez, Cristina Mantilla, Szewc, Manuel, Thaler, Jesse, Tsan, Steven, Udrescu, Silviu-Marian, Vaslin, Louis, Vlimant, Jean-Roch, Williams, Daniel, & Yunus, Mikaeel. The LHC Olympics 2020: a community challenge for anomaly detection in high energy physics. United States. https://doi.org/10.1088/1361-6633/ac36b9
Kasieczka, Gregor, Nachman, Benjamin, Shih, David, Amram, Oz, Andreassen, Anders, Benkendorfer, Kees, Bortolato, Blaz, Brooijmans, Gustaaf, Canelli, Florencia, Collins, Jack H., Dai, Biwei, De Freitas, Felipe F., Dillon, Barry M., Dinu, Ioan-Mihail, Dong, Zhongtian, Donini, Julien, Duarte, Javier, Faroughy, D. A., Gonski, Julia, Harris, Philip, Kahn, Alan, Kamenik, Jernej F., Khosa, Charanjit K., Komiske, Patrick, Le Pottier, Luc, Martín-Ramiro, Pablo, Matevc, Andrej, Metodiev, Eric, Mikuni, Vinicius, Murphy, Christopher W., Ochoa, Inês, Park, Sang Eon, Pierini, Maurizio, Rankin, Dylan, Sanz, Veronica, Sarda, Nilai, Seljak, Uroš, Smolkovic, Aleks, Stein, George, Suarez, Cristina Mantilla, Szewc, Manuel, Thaler, Jesse, Tsan, Steven, Udrescu, Silviu-Marian, Vaslin, Louis, Vlimant, Jean-Roch, Williams, Daniel, and Yunus, Mikaeel. 2021. "The LHC Olympics 2020: a community challenge for anomaly detection in high energy physics". United States. https://doi.org/10.1088/1361-6633/ac36b9. https://www.osti.gov/servlets/purl/1909683.
@article{osti_1909683,
title = {The LHC Olympics 2020: a community challenge for anomaly detection in high energy physics},
author = {Kasieczka, Gregor and Nachman, Benjamin and Shih, David and Amram, Oz and Andreassen, Anders and Benkendorfer, Kees and Bortolato, Blaz and Brooijmans, Gustaaf and Canelli, Florencia and Collins, Jack H. and Dai, Biwei and De Freitas, Felipe F. and Dillon, Barry M. and Dinu, Ioan-Mihail and Dong, Zhongtian and Donini, Julien and Duarte, Javier and Faroughy, D. A. and Gonski, Julia and Harris, Philip and Kahn, Alan and Kamenik, Jernej F. and Khosa, Charanjit K. and Komiske, Patrick and Le Pottier, Luc and Martín-Ramiro, Pablo and Matevc, Andrej and Metodiev, Eric and Mikuni, Vinicius and Murphy, Christopher W. and Ochoa, Inês and Park, Sang Eon and Pierini, Maurizio and Rankin, Dylan and Sanz, Veronica and Sarda, Nilai and Seljak, Uroš and Smolkovic, Aleks and Stein, George and Suarez, Cristina Mantilla and Szewc, Manuel and Thaler, Jesse and Tsan, Steven and Udrescu, Silviu-Marian and Vaslin, Louis and Vlimant, Jean-Roch and Williams, Daniel and Yunus, Mikaeel},
abstractNote = {A new paradigm for data-driven, model-agnostic new physics searches at colliders is emerging, and aims to leverage recent breakthroughs in anomaly detection and machine learning. In order to develop and benchmark new anomaly detection methods within this framework, it is essential to have standard datasets. To this end, we have created the LHC Olympics 2020, a community challenge accompanied by a set of simulated collider events. Participants in these Olympics have developed their methods using an R&D dataset and then tested them on black boxes: datasets with an unknown anomaly (or not). Furthermore, methods made use of modern machine learning tools and were based on unsupervised learning (autoencoders, generative adversarial networks, normalizing flows), weakly supervised learning, and semi-supervised learning. This paper will review the LHC Olympics 2020 challenge, including an overview of the competition, a description of methods deployed in the competition, lessons learned from the experience, and implications for data analyses with future datasets as well as future colliders.},
doi = {10.1088/1361-6633/ac36b9},
journal = {Reports on Progress in Physics},
number = 12,
volume = 84,
place = {United States},
year = {2021},
month = {12}
}