Title: Modeling performance of data collection systems for high-energy physics

Journal Article · APL Machine Learning
DOI: https://doi.org/10.1063/5.0232456 · OSTI ID:2481625

Exponential increases in scientific experimental data are outpacing progress in silicon technology, necessitating heterogeneous computing systems, particularly those utilizing machine learning (ML), to meet future scientific computing demands. The growing importance and complexity of heterogeneous computing systems require systematic modeling to understand and predict the effective roles for ML. We present a model that addresses this need by framing the key aspects of data collection pipelines and their constraints, combining them with the important technology vectors that shape alternatives, and computing metrics that allow complex alternatives to be compared. For instance, a data collection pipeline may be characterized by parameters such as sensor sampling rates and the overall relevancy of retrieved samples. Alternatives to this pipeline are enabled by development vectors including ML, parallelization, advancing CMOS, and neuromorphic computing. By calculating metrics for each alternative, such as overall F1 score, power, hardware cost, and energy expended per relevant sample, our model allows alternative data collection systems to be rigorously compared. We apply this model to the Compact Muon Solenoid (CMS) experiment and its planned High-Luminosity Large Hadron Collider (HL-LHC) upgrade, evaluating novel technologies for the data acquisition (DAQ) system, including ML-based filtering and parallelized software. The results demonstrate that improvements to early DAQ stages significantly reduce the resources required later, with a power reduction of 60% and increased relevant data retrieval per unit energy (from 0.065 to 0.31 samples/kJ). However, we predict that further advances will be required to meet overall power and cost constraints for the DAQ.
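The kind of comparison the abstract describes can be illustrated with a minimal Python sketch. It is not the authors' model: the `Stage` fields, the `evaluate` function, the assumption that each stage spends a fixed energy per incoming sample, and every number in the usage example are hypothetical and chosen only to show how an overall F1 score, total power, and relevant samples per kJ might be derived for a multi-stage filtering pipeline.

```python
from dataclasses import dataclass


@dataclass
class Stage:
    """One filtering stage of a hypothetical data collection pipeline."""
    name: str
    recall: float               # fraction of relevant samples the stage keeps
    false_pos_rate: float       # fraction of irrelevant samples the stage keeps
    energy_per_sample_j: float  # assumed joules spent per sample entering the stage


def evaluate(stages, sample_rate_hz, relevancy):
    """Return overall precision, recall, F1, power (W), and relevant samples per kJ."""
    relevant = sample_rate_hz * relevancy   # relevant samples/s entering the pipeline
    irrelevant = sample_rate_hz - relevant  # irrelevant samples/s entering the pipeline
    relevant_in = relevant
    power_w = 0.0
    for stage in stages:
        # Assumption: a stage's power scales with the rate of samples it receives,
        # so stronger early filtering lowers the power of every later stage.
        power_w += (relevant + irrelevant) * stage.energy_per_sample_j
        relevant *= stage.recall
        irrelevant *= stage.false_pos_rate
    kept = relevant + irrelevant
    precision = relevant / kept if kept else 0.0
    recall = relevant / relevant_in if relevant_in else 0.0
    denom = precision + recall
    f1 = 2 * precision * recall / denom if denom else 0.0
    # samples/s divided by J/s gives samples/J; multiply by 1000 for samples/kJ.
    relevant_per_kj = 1000.0 * relevant / power_w if power_w else 0.0
    return {
        "precision": precision,
        "recall": recall,
        "f1": f1,
        "power_w": power_w,
        "relevant_per_kj": relevant_per_kj,
    }


if __name__ == "__main__":
    # Illustrative numbers only; none of them come from the article.
    baseline = [
        Stage("hardware filter", recall=0.9, false_pos_rate=0.10, energy_per_sample_j=1e-3),
        Stage("software filter", recall=0.8, false_pos_rate=0.10, energy_per_sample_j=1.0),
    ]
    improved = [
        Stage("ML hardware filter", recall=0.9, false_pos_rate=0.01, energy_per_sample_j=2e-3),
        Stage("software filter", recall=0.8, false_pos_rate=0.10, energy_per_sample_j=1.0),
    ]
    for label, pipeline in (("baseline", baseline), ("improved", improved)):
        print(label, evaluate(pipeline, sample_rate_hz=1000.0, relevancy=0.1))
```

Under these assumptions, a more selective first stage passes fewer irrelevant samples to the expensive second stage, which is the mechanism by which early-stage improvements could reduce later resource use. Hardware cost, which the abstract also lists as a metric, is left out of the sketch.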

Sponsoring Organization:
USDOE
Grant/Contract Number:
AC02-06CH11357
Journal Information:
APL Machine Learning, Vol. 2, Issue 4; ISSN 2770-9019
Publisher:
American Institute of Physics
Country of Publication:
United States
Language:
English
