End-to-end jet classification of boosted top quarks with the CMS open data
Abstract
Here we describe a novel application of the end-to-end deep learning technique to the task of discriminating top quark-initiated jets from those originating from the hadronization of a light quark or a gluon. The end-to-end deep learning technique uses low-level detector representation of high-energy collision event as inputs to deep learning algorithms. In this study, we use low-level detector information from the simulated Compact Muon Solenoid (CMS) open data samples to construct the top jet classifiers. To optimize classifier performance we progressively add low-level information from the CMS tracking detector, including pixel detector reconstructed hits and impact parameters, and demonstrate the value of additional tracking information even when no new spatial structures are added. Relying only on calorimeter energy deposits and reconstructed pixel detector hits, the end-to-end classifier achieves an area under the receiver operator curve (AUC) score of 0.975 ± 0.002 for the task of classifying boosted top quark jets. After adding derived track quantities, the classifier AUC score increases to 0.9824 ± 0.0013, serving as the first performance benchmark for these CMS open data samples.
- Authors:
-
- Carnegie Mellon University, Pittsburgh, PA (United States)
- Brown University, Providence, RI (United States)
- Google Inc., Mountain View, CA (United States)
- University of Alabama, Tuscaloosa, AL (United States)
- Publication Date:
- Research Org.:
- Brown Univ., Providence, RI (United States)
- Sponsoring Org.:
- USDOE Office of Science (SC)
- OSTI Identifier:
- 1980034
- Grant/Contract Number:
- SC0010010
- Resource Type:
- Accepted Manuscript
- Journal Name:
- Physical Review. D.
- Additional Journal Information:
- Journal Volume: 105; Journal Issue: 5; Journal ID: ISSN 2470-0010
- Publisher:
- American Physical Society (APS)
- Country of Publication:
- United States
- Language:
- English
- Subject:
- 79 ASTRONOMY AND ASTROPHYSICS; 71 CLASSICAL AND QUANTUM MECHANICS, GENERAL PHYSICS; Hadronic decays; quark and gluon jets; artificial neural networks; top quark
Citation Formats
Andrews, M., Burkle, B., Chen, Y., DiCroce, D., Gleyzer, S., Heintz, U., Narain, M., Paulini, M., Pervan, N., Shafi, Y., Sun, W., Usai, E., and Yang, K. End-to-end jet classification of boosted top quarks with the CMS open data. United States: N. p., 2022.
Web. doi:10.1103/physrevd.105.052008.
Andrews, M., Burkle, B., Chen, Y., DiCroce, D., Gleyzer, S., Heintz, U., Narain, M., Paulini, M., Pervan, N., Shafi, Y., Sun, W., Usai, E., & Yang, K. End-to-end jet classification of boosted top quarks with the CMS open data. United States. https://doi.org/10.1103/physrevd.105.052008
Andrews, M., Burkle, B., Chen, Y., DiCroce, D., Gleyzer, S., Heintz, U., Narain, M., Paulini, M., Pervan, N., Shafi, Y., Sun, W., Usai, E., and Yang, K. Tue .
"End-to-end jet classification of boosted top quarks with the CMS open data". United States. https://doi.org/10.1103/physrevd.105.052008. https://www.osti.gov/servlets/purl/1980034.
@article{osti_1980034,
title = {End-to-end jet classification of boosted top quarks with the CMS open data},
author = {Andrews, M. and Burkle, B. and Chen, Y. and DiCroce, D. and Gleyzer, S. and Heintz, U. and Narain, M. and Paulini, M. and Pervan, N. and Shafi, Y. and Sun, W. and Usai, E. and Yang, K.},
abstractNote = {Here we describe a novel application of the end-to-end deep learning technique to the task of discriminating top quark-initiated jets from those originating from the hadronization of a light quark or a gluon. The end-to-end deep learning technique uses low-level detector representation of high-energy collision event as inputs to deep learning algorithms. In this study, we use low-level detector information from the simulated Compact Muon Solenoid (CMS) open data samples to construct the top jet classifiers. To optimize classifier performance we progressively add low-level information from the CMS tracking detector, including pixel detector reconstructed hits and impact parameters, and demonstrate the value of additional tracking information even when no new spatial structures are added. Relying only on calorimeter energy deposits and reconstructed pixel detector hits, the end-to-end classifier achieves an area under the receiver operator curve (AUC) score of 0.975 ± 0.002 for the task of classifying boosted top quark jets. After adding derived track quantities, the classifier AUC score increases to 0.9824 ± 0.0013, serving as the first performance benchmark for these CMS open data samples.},
doi = {10.1103/physrevd.105.052008},
journal = {Physical Review. D.},
number = 5,
volume = 105,
place = {United States},
year = {Tue Mar 22 00:00:00 EDT 2022},
month = {Tue Mar 22 00:00:00 EDT 2022}
}
Works referenced in this record:
Identification of heavy, energetic, hadronically decaying particles using machine-learning techniques
journal, June 2020
- Sirunyan, A. M.; Tumasyan, A.; Adam, W.
- Journal of Instrumentation, Vol. 15, Issue 06
Identification of heavy-flavour jets with the CMS detector in pp collisions at 13 TeV
journal, May 2018
- Sirunyan, A. M.; Tumasyan, A.; Adam, W.
- Journal of Instrumentation, Vol. 13, Issue 05
Review of Particle Physics
journal, August 2018
- Tanabashi, M.; Hagiwara, K.; Hikasa, K.
- Physical Review D, Vol. 98, Issue 3
The anti- k t jet clustering algorithm
journal, April 2008
- Cacciari, Matteo; Salam, Gavin P.; Soyez, Gregory
- Journal of High Energy Physics, Vol. 2008, Issue 04
Performance of top-quark and $$\varvec{W}$$ W -boson tagging with ATLAS in Run 2 of the LHC
journal, April 2019
- Aaboud, M.; Aad, G.; Abbott, B.
- The European Physical Journal C, Vol. 79, Issue 5
Identifying boosted objects with N-subjettiness
journal, March 2011
- Thaler, Jesse; Van Tilburg, Ken
- Journal of High Energy Physics, Vol. 2011, Issue 3
Deep-learned Top Tagging with a Lorentz Layer
journal, January 2018
- Butter, Anja; Kasieczka, Gregor; Plehn, Tilman
- SciPost Physics, Vol. 5, Issue 3
Soft drop
journal, May 2014
- Larkoski, Andrew J.; Marzani, Simone; Soyez, Gregory
- Journal of High Energy Physics, Vol. 2014, Issue 5
Energy flow networks: deep sets for particle jets
journal, January 2019
- Komiske, Patrick T.; Metodiev, Eric M.; Thaler, Jesse
- Journal of High Energy Physics, Vol. 2019, Issue 1
End-to-end jet classification of quarks and gluons with the CMS Open Data
journal, October 2020
- Andrews, M.; Alison, J.; An, S.
- Nuclear Instruments and Methods in Physics Research Section A: Accelerators, Spectrometers, Detectors and Associated Equipment, Vol. 977
End-to-End Physics Event Classification with CMS Open Data: Applying Image-Based Deep Learning to Detector Data for the Direct Classification of Collision Events at the LHC
journal, March 2020
- Andrews, M.; Paulini, M.; Gleyzer, S.
- Computing and Software for Big Science, Vol. 4, Issue 1
PYTHIA 6.4 physics and manual
journal, May 2006
- Sjöstrand, Torbjörn; Mrenna, Stephen; Skands, Peter
- Journal of High Energy Physics, Vol. 2006, Issue 05
The Machine Learning landscape of top taggers
journal, January 2019
- Kasieczka, Gregor; Plehn, Tilman; Butter, Anja
- SciPost Physics, Vol. 7, Issue 1
Deep-learning top taggers or the end of QCD?
journal, May 2017
- Kasieczka, Gregor; Plehn, Tilman; Russell, Michael
- Journal of High Energy Physics, Vol. 2017, Issue 5
The automated computation of tree-level and next-to-leading order differential cross sections, and their matching to parton shower simulations
journal, July 2014
- Alwall, J.; Frederix, R.; Frixione, S.
- Journal of High Energy Physics, Vol. 2014, Issue 7
Particle-flow reconstruction and global event description with the CMS detector
journal, October 2017
- Sirunyan, A. M.; Tumasyan, A.; Adam, W.
- Journal of Instrumentation, Vol. 12, Issue 10