Classification without labels: learning from mixed samples in high energy physics

Metodiev, Eric M.; Nachman, Benjamin; Thaler, Jesse

doi:10.1007/JHEP10(2017)174

Journal Article · October 25, 2017 · Journal of High Energy Physics (Online)

DOI: https://doi.org/10.1007/JHEP10(2017)174 · OSTI ID: 1421837

Metodiev, Eric M. [1]; Nachman, Benjamin [2]; Thaler, Jesse [1]

[1] Massachusetts Inst. of Technology (MIT), Cambridge, MA (United States)
[2] Lawrence Berkeley National Lab. (LBNL), Berkeley, CA (United States)

Modern machine learning techniques can be used to construct powerful models for difficult collider physics problems. In many applications, however, these models are trained on imperfect simulations due to a lack of truth-level information in the data, which risks the model learning artifacts of the simulation. In this paper, we introduce the paradigm of classification without labels (CWoLa) in which a classifier is trained to distinguish statistical mixtures of classes, which are common in collider physics. Crucially, neither individual labels nor class proportions are required, yet we prove that the optimal classifier in the CWoLa paradigm is also the optimal classifier in the traditional fully-supervised case where all label information is available. After demonstrating the power of this method in an analytical toy example, we consider a realistic benchmark for collider physics: distinguishing quark- versus gluon-initiated jets using mixed quark/gluon training samples. More generally, CWoLa can be applied to any classification problem where labels or class proportions are unknown or simulations are unreliable, but statistical mixtures of the classes are available.
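The central claim of the abstract, that a classifier trained only to tell two mixed samples apart also separates the underlying classes, can be illustrated with a small toy sketch. Everything specific here is an assumption chosen for illustration and is not taken from the paper: the one-dimensional Gaussian signal and background, the signal fractions 0.8 and 0.2, the plain logistic regression, and the helper names `sample_mixture`, `train_logistic`, `auc`, and `run_cwola_toy`. The true class flags are generated only so the result can be scored afterwards; training sees nothing but which mixture each event came from.

```python
import math
import random


def sample_mixture(n, signal_fraction, rng):
    """Draw n events: signal ~ N(+1, 1) with probability signal_fraction,
    otherwise background ~ N(-1, 1). Returns (x, is_signal) pairs."""
    events = []
    for _ in range(n):
        is_signal = rng.random() < signal_fraction
        x = rng.gauss(1.0 if is_signal else -1.0, 1.0)
        events.append((x, is_signal))
    return events


def train_logistic(xs, ys, epochs=200, lr=0.5):
    """Fit a 1-D logistic regression with full-batch gradient descent."""
    w, b = 0.0, 0.0
    n = len(xs)
    for _ in range(epochs):
        grad_w = grad_b = 0.0
        for x, y in zip(xs, ys):
            p = 1.0 / (1.0 + math.exp(-(w * x + b)))
            grad_w += (p - y) * x
            grad_b += p - y
        w -= lr * grad_w / n
        b -= lr * grad_b / n
    return w, b


def auc(pos_scores, neg_scores):
    """Rank-based AUC: fraction of (positive, negative) pairs in which
    the positive event has the higher score."""
    labelled = [(s, 1) for s in pos_scores] + [(s, 0) for s in neg_scores]
    labelled.sort(key=lambda pair: pair[0])
    rank_sum = sum(rank for rank, (_, is_pos) in enumerate(labelled, start=1) if is_pos)
    n_pos, n_neg = len(pos_scores), len(neg_scores)
    return (rank_sum - n_pos * (n_pos + 1) / 2) / (n_pos * n_neg)


def run_cwola_toy(seed=0):
    rng = random.Random(seed)
    mixed_1 = sample_mixture(2000, 0.8, rng)  # signal-rich mixture (assumed fraction)
    mixed_2 = sample_mixture(2000, 0.2, rng)  # signal-poor mixture (assumed fraction)

    # Training uses only the mixture membership as the label, never is_signal.
    xs = [x for x, _ in mixed_1] + [x for x, _ in mixed_2]
    ys = [1.0] * len(mixed_1) + [0.0] * len(mixed_2)
    w, b = train_logistic(xs, ys)

    # Evaluation uses the true class flags of an independent test sample.
    test = sample_mixture(4000, 0.5, rng)
    sig = [w * x + b for x, is_signal in test if is_signal]
    bkg = [w * x + b for x, is_signal in test if not is_signal]
    return w, auc(sig, bkg)


if __name__ == "__main__":
    weight, score = run_cwola_toy()
    print(f"learned weight: {weight:.3f}, signal-vs-background AUC: {score:.3f}")
```

In this one-dimensional setting any positive weight orders events exactly as the feature itself does, so the mixture-trained classifier reaches the same signal-versus-background AUC a fully supervised one would. Swapping which mixture is labelled 1 would flip the sign of the weight, which reflects that the mixtures alone do not say which one is signal-rich.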

Research Organization: Lawrence Berkeley National Laboratory (LBNL), Berkeley, CA (United States)

Sponsoring Organization: USDOE Office of Science (SC)

Grant/Contract Number: AC02-05CH11231

OSTI ID: 1421837

Journal Information: Journal of High Energy Physics (Online), Vol. 2017, Issue 10; ISSN 1029-8479

Publisher: Springer Berlin

Country of Publication: United States

Language: English

Citation Metrics:

Cited by: 125 works (citation information provided by Web of Science)

Related Subjects

72 PHYSICS OF ELEMENTARY PARTICLES AND FIELDS
Jets
