Skip to main content
U.S. Department of Energy
Office of Scientific and Technical Information

Resimulation-based self-supervised learning for pretraining physics foundation models

Journal Article · · Physical Review. D.
 [1];  [2];  [3];  [4];  [5]
  1. Massachusetts Inst. of Technology (MIT), Cambridge, MA (United States); Institute for Artificial Intelligence and Fundamental Interactions, Cambridge, MA (United States)
  2. Massachusetts Inst. of Technology (MIT), Cambridge, MA (United States); Institute for Artificial Intelligence and Fundamental Interactions, Cambridge, MA (United States); SLAC National Accelerator Laboratory (SLAC), Stanford, CA (United States)
  3. SLAC National Accelerator Laboratory (SLAC), Stanford, CA (United States)
  4. Imperial College, London (United Kingdom)
  5. Massachusetts Inst. of Technology (MIT), Cambridge, MA (United States)

Self-supervised learning (SSL) is at the core of training modern large machine learning models, providing a scheme for learning powerful representations that can be used in a variety of downstream tasks. However, SSL strategies must be adapted to the type of training data and downstream tasks required. We propose resimulation-based self-supervised representation learning (RS3L), a novel simulation-based SSL strategy that employs a method of resimulation to drive data augmentation for contrastive learning in the physical sciences, particularly, in fields that rely on stochastic simulators. By intervening in the middle of the simulation process and rerunning simulation components downstream of the intervention, we generate multiple realizations of an event, thus producing a set of augmentations covering all physics-driven variations available in the simulator. Using experiments from high-energy physics, we explore how this strategy may enable the development of a foundation model; we show how RS3L pretraining enables powerful performance in downstream tasks such as discrimination of a variety of objects and uncertainty mitigation. In addition to our results, we make the RS3L dataset publicly available for further studies on how to improve SSL strategies.

Research Organization:
SLAC National Accelerator Laboratory (SLAC), Menlo Park, CA (United States)
Sponsoring Organization:
USDOE Office of Science (SC), Biological and Environmental Research (BER). Biological Systems Science (BSS); National Science Foundation (NSF)
Grant/Contract Number:
AC02-76SF00515
OSTI ID:
2575499
Journal Information:
Physical Review. D., Journal Name: Physical Review. D. Journal Issue: 3 Vol. 111; ISSN 2470-0010; ISSN 2470-0029
Publisher:
American Physical Society (APS)Copyright Statement
Country of Publication:
United States
Language:
English

References (33)

Search for low mass vector resonances decaying into quark-antiquark pairs in proton-proton collisions at s = 13 $$ \sqrt{s}=13 $$ TeV journal January 2018
DELPHES 3: a modular framework for fast simulation of a generic collider experiment journal February 2014
Search for dark matter particles produced in association with a Higgs boson in proton-proton collisions at s$$ \sqrt{\mathrm{s}} $$ = 13 TeV journal March 2020
Parton distributions for the LHC run II journal April 2015
The automated computation of tree-level and next-to-leading order differential cross sections, and their matching to parton shower simulations journal July 2014
New angles on energy correlation functions journal December 2016
Inclusive search for highly boosted Higgs bosons decaying to bottom quark-antiquark pairs in proton-proton collisions at $$ \sqrt{s} $$ = 13 TeV journal December 2020
An introduction to PYTHIA 8.2 journal June 2015
Observation of H → b b ¯ decays and VH production with the ATLAS detector journal November 2018
Measurement of the associated production of a Higgs boson decaying into b-quarks with a vector boson at high transverse momentum in pp collisions at s = 13  TeV with the ATLAS detector journal May 2021
Large-scale chemical language representations capture molecular structure and properties journal December 2022
Large language model for molecular chemistry journal January 2023
Biological structure and function emerge from scaling unsupervised learning to 250 million protein sequences journal April 2021
The anti- k t jet clustering algorithm journal April 2008
LHC Machine journal August 2008
The ATLAS Experiment at the CERN Large Hadron Collider journal August 2008
The CMS experiment at the CERN LHC journal August 2008
Graph neural networks in particle physics journal January 2021
Chemformer: a pre-trained transformer for computational chemistry journal January 2022
Pile-up mitigation using attention journal June 2022
Finetuning foundation models for joint analysis optimization in High Energy Physics journal June 2024
Masked particle modeling on sets: towards self-supervised high energy physics foundation models journal September 2024
OmniJet-α: the first cross-task foundation model for particle physics journal August 2024
Data compression and inference in cosmology with self-supervised machine learning journal November 2023
Search for low mass vector resonances decaying into quark-antiquark pairs in proton-proton collisions at s = 13     TeV journal December 2019
Measurement of simplified template cross sections of the Higgs boson produced in association with W or Z bosons in the H→bb¯ decay channel in proton-proton collisions at s=13  TeV journal May 2024
Observation of Higgs Boson Decay to Bottom Quarks journal September 2018
Search for dark matter produced in association with a Higgs boson decaying to a pair of bottom quarks in proton–proton collisions at $$\sqrt{s}=13\,\text {Te}\text {V} $$ s = 13 Te journal March 2019
Extraction and validation of a new set of CMS pythia8 tunes from underlying-event measurements journal January 2020
Measurements of WH and ZH production in the $$H \rightarrow b\bar{b}$$ decay channel in pp collisions at $$13\,\text {Te}\text {V}$$ with the ATLAS detector journal February 2021
Mathematical Methods of Organizing and Planning Production journal July 1960
Symmetries, safety, and self-supervision journal June 2022
Event generators for high-energy physics experiments journal May 2024

Similar Records

Data compression and inference in cosmology with self-supervised machine learning
Journal Article · Sun Nov 26 23:00:00 EST 2023 · Monthly Notices of the Royal Astronomical Society · OSTI ID:2578898

Masked Particle Modeling on Sets: Towards Self-Supervised High Energy Physics Foundation Models
Journal Article · Wed Jul 17 00:00:00 EDT 2024 · Machine Learning: Science and Technology · OSTI ID:2403614

Self-supervised Representation Learning for Astronomical Images
Journal Article · Mon Apr 26 00:00:00 EDT 2021 · The Astrophysical Journal. Letters · OSTI ID:1813371