FPGA-Accelerated Machine Learning Inference as a Service for Particle Physics Computing
Abstract
Large-scale particle physics experiments face challenging demands for high-throughput computing resources both now and in the future. New heterogeneous computing paradigms on dedicated hardware with increased parallelization, such as Field Programmable Gate Arrays (FPGAs), offer exciting solutions with large potential gains. The growing applications of machine learning algorithms in particle physics for simulation, reconstruction, and analysis are naturally deployed on such platforms. We demonstrate that the acceleration of machine learning inference as a web service represents a heterogeneous computing solution for particle physics experiments that potentially requires minimal modification to the current computing model. As examples, we retrain the ResNet-50 convolutional neural network to demonstrate state-of-the-art performance for top quark jet tagging at the LHC and apply a ResNet-50 model with transfer learning for neutrino event classification. Using Project Brainwave by Microsoft to accelerate the ResNet-50 image classification model, we achieve average inference times of 60 (10) ms with our experimental physics software framework using Brainwave as a cloud (edge or on-premises) service, representing an improvement by a factor of approximately 30 (175) in model inference latency over traditional CPU inference in current experimental hardware. A single FPGA service accessed by many CPUs achieves a throughput of 600–700 inferences permore »
- Authors:
-
more »
- Fermi National Accelerator Lab. (FNAL), Batavia, IL (United States)
- Massachusetts Inst. of Technology (MIT), Cambridge, MA (United States)
- Univ. of Washington, Seattle, WA (United States)
- Microsoft, Redmond, WA (United States)
- European Organization for Nuclear Research (CERN), Geneva (Switzerland); Univ. of Belgrade (Serbia)
- European Organization for Nuclear Research (CERN), Geneva (Switzerland)
- Univ. of Illinois, Chicago, IL (United States)
- Publication Date:
- Research Org.:
- Fermi National Accelerator Laboratory (FNAL), Batavia, IL (United States)
- Sponsoring Org.:
- USDOE Office of Science (SC), High Energy Physics (HEP)
- OSTI Identifier:
- 1565955
- Report Number(s):
- arXiv:1904.08986; FERMILAB-PUB-19-170-CD-CMS-E-ND
Journal ID: ISSN 2510-2036; oai:inspirehep.net:1730403; TRN: US2000947
- Grant/Contract Number:
- AC02-07CH11359
- Resource Type:
- Accepted Manuscript
- Journal Name:
- Computing and Software for Big Science
- Additional Journal Information:
- Journal Volume: 3; Journal Issue: 1; Journal ID: ISSN 2510-2036
- Publisher:
- Springer
- Country of Publication:
- United States
- Language:
- English
- Subject:
- 46 INSTRUMENTATION RELATED TO NUCLEAR SCIENCE AND TECHNOLOGY; 72 PHYSICS OF ELEMENTARY PARTICLES AND FIELDS
Citation Formats
Duarte, Javier, Harris, Philip, Hauck, Scott, Holzman, Burt, Hsu, Shih-Chieh, Jindariani, Sergo, Khan, Suffian, Kreis, Benjamin, Lee, Brian, Liu, Mia, Lončar, Vladimir, Ngadiuba, Jennifer, Pedro, Kevin, Perez, Brandon, Pierini, Maurizio, Rankin, Dylan, Tran, Nhan, Trahms, Matthew, Tsaris, Aristeidis, Versteeg, Colin, Way, Ted W., Werran, Dustin, and Wu, Zhenbin. FPGA-Accelerated Machine Learning Inference as a Service for Particle Physics Computing. United States: N. p., 2019.
Web. doi:10.1007/s41781-019-0027-2.
Duarte, Javier, Harris, Philip, Hauck, Scott, Holzman, Burt, Hsu, Shih-Chieh, Jindariani, Sergo, Khan, Suffian, Kreis, Benjamin, Lee, Brian, Liu, Mia, Lončar, Vladimir, Ngadiuba, Jennifer, Pedro, Kevin, Perez, Brandon, Pierini, Maurizio, Rankin, Dylan, Tran, Nhan, Trahms, Matthew, Tsaris, Aristeidis, Versteeg, Colin, Way, Ted W., Werran, Dustin, & Wu, Zhenbin. FPGA-Accelerated Machine Learning Inference as a Service for Particle Physics Computing. United States. https://doi.org/10.1007/s41781-019-0027-2
Duarte, Javier, Harris, Philip, Hauck, Scott, Holzman, Burt, Hsu, Shih-Chieh, Jindariani, Sergo, Khan, Suffian, Kreis, Benjamin, Lee, Brian, Liu, Mia, Lončar, Vladimir, Ngadiuba, Jennifer, Pedro, Kevin, Perez, Brandon, Pierini, Maurizio, Rankin, Dylan, Tran, Nhan, Trahms, Matthew, Tsaris, Aristeidis, Versteeg, Colin, Way, Ted W., Werran, Dustin, and Wu, Zhenbin. Mon .
"FPGA-Accelerated Machine Learning Inference as a Service for Particle Physics Computing". United States. https://doi.org/10.1007/s41781-019-0027-2. https://www.osti.gov/servlets/purl/1565955.
@article{osti_1565955,
title = {FPGA-Accelerated Machine Learning Inference as a Service for Particle Physics Computing},
author = {Duarte, Javier and Harris, Philip and Hauck, Scott and Holzman, Burt and Hsu, Shih-Chieh and Jindariani, Sergo and Khan, Suffian and Kreis, Benjamin and Lee, Brian and Liu, Mia and Lončar, Vladimir and Ngadiuba, Jennifer and Pedro, Kevin and Perez, Brandon and Pierini, Maurizio and Rankin, Dylan and Tran, Nhan and Trahms, Matthew and Tsaris, Aristeidis and Versteeg, Colin and Way, Ted W. and Werran, Dustin and Wu, Zhenbin},
abstractNote = {Large-scale particle physics experiments face challenging demands for high-throughput computing resources both now and in the future. New heterogeneous computing paradigms on dedicated hardware with increased parallelization, such as Field Programmable Gate Arrays (FPGAs), offer exciting solutions with large potential gains. The growing applications of machine learning algorithms in particle physics for simulation, reconstruction, and analysis are naturally deployed on such platforms. We demonstrate that the acceleration of machine learning inference as a web service represents a heterogeneous computing solution for particle physics experiments that potentially requires minimal modification to the current computing model. As examples, we retrain the ResNet-50 convolutional neural network to demonstrate state-of-the-art performance for top quark jet tagging at the LHC and apply a ResNet-50 model with transfer learning for neutrino event classification. Using Project Brainwave by Microsoft to accelerate the ResNet-50 image classification model, we achieve average inference times of 60 (10) ms with our experimental physics software framework using Brainwave as a cloud (edge or on-premises) service, representing an improvement by a factor of approximately 30 (175) in model inference latency over traditional CPU inference in current experimental hardware. A single FPGA service accessed by many CPUs achieves a throughput of 600–700 inferences per second using an image batch of one, comparable to large batch-size GPU throughput and significantly better than small batch-size GPU throughput. Deployed as an edge or cloud service for the particle physics computing model, coprocessor accelerators can have a higher duty cycle and are potentially much more cost-effective.},
doi = {10.1007/s41781-019-0027-2},
journal = {Computing and Software for Big Science},
number = 1,
volume = 3,
place = {United States},
year = {Mon Oct 14 00:00:00 EDT 2019},
month = {Mon Oct 14 00:00:00 EDT 2019}
}
Figures / Tables:
Works referenced in this record:
Energy calibration and resolution of the CMS electromagnetic calorimeter in pp collisions at √s= 7 TeV
journal, September 2013
- Collaboration, Cms
- Journal of Instrumentation, Vol. 8, Issue 09, p. P09009-P09009
Densely Connected Convolutional Networks
conference, July 2017
- Huang, Gao; Liu, Zhuang; Maaten, Laurens van der
- 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)
The anti- k t jet clustering algorithm
journal, April 2008
- Cacciari, Matteo; Salam, Gavin P.; Soyez, Gregory
- Journal of High Energy Physics, Vol. 2008, Issue 04
Deep-learned Top Tagging with a Lorentz Layer
journal, January 2018
- Butter, Anja; Kasieczka, Gregor; Plehn, Tilman
- SciPost Physics, Vol. 5, Issue 3
Fast inference of deep neural networks in FPGAs for particle physics
journal, July 2018
- Duarte, J.; Han, S.; Harris, P.
- Journal of Instrumentation, Vol. 13, Issue 07
Reionization and the Cosmic Dawn with the Square Kilometre Array
journal, April 2013
- Mellema, Garrelt; Koopmans, Léon V. E.; Abdalla, Filipe A.
- Experimental Astronomy, Vol. 36, Issue 1-2
Convolutional neural networks applied to neutrino events in a liquid argon time projection chamber
journal, March 2017
- Acciarri, R.; Adams, C.; An, R.
- Journal of Instrumentation, Vol. 12, Issue 03
CaloGAN: Simulating 3D high energy particle showers in multilayer electromagnetic calorimeters with generative adversarial networks
journal, January 2018
- Paganini, Michela; de Oliveira, Luke; Nachman, Benjamin
- Physical Review D, Vol. 97, Issue 1
Observation of a new boson at a mass of 125 GeV with the CMS experiment at the LHC
journal, September 2012
- Chatrchyan, S.; Khachatryan, V.; Sirunyan, A. M.
- Physics Letters B, Vol. 716, Issue 1
DELPHES 3: a modular framework for fast simulation of a generic collider experiment
journal, February 2014
- de Favereau, J.; Delaere, C.; Demin, P.
- Journal of High Energy Physics, Vol. 2014, Issue 2
Observation of a new particle in the search for the Standard Model Higgs boson with the ATLAS detector at the LHC
journal, September 2012
- Aad, G.; Abajyan, T.; Abbott, B.
- Physics Letters B, Vol. 716, Issue 1
A convolutional neural network neutrino event classifier
journal, September 2016
- Aurisano, A.; Radovic, A.; Rocco, D.
- Journal of Instrumentation, Vol. 11, Issue 09
FastJet user manual: (for version 3.0.2)
journal, March 2012
- Cacciari, Matteo; Salam, Gavin P.; Soyez, Gregory
- The European Physical Journal C, Vol. 72, Issue 3
Machine learning at the energy and intensity frontiers of particle physics
journal, August 2018
- Radovic, Alexander; Williams, Mike; Rousseau, David
- Nature, Vol. 560, Issue 7716
Constraints on Oscillation Parameters from Appearance and Disappearance in NOvA
journal, June 2017
- Adamson, P.; Aliaga, L.; Ambrose, D.
- Physical Review Letters, Vol. 118, Issue 23
ImageNet: A large-scale hierarchical image database
conference, June 2009
- Deng, Jia; Dong, Wei; Socher, Richard
- 2009 IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops (CVPR Workshops), 2009 IEEE Conference on Computer Vision and Pattern Recognition
Deep Residual Learning for Image Recognition
conference, June 2016
- He, Kaiming; Zhang, Xiangyu; Ren, Shaoqing
- 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)
The HEP.TrkX Project: deep neural networks for HL-LHC online and offline tracking
journal, January 2017
- Farrell, Steven; Anderson, Dustin; Calafiura, Paolo
- EPJ Web of Conferences, Vol. 150
An introduction to PYTHIA 8.2
journal, June 2015
- Sjöstrand, Torbjörn; Ask, Stefan; Christiansen, Jesper R.
- Computer Physics Communications, Vol. 191
A cloud-scale acceleration architecture
conference, October 2016
- Caulfield, Adrian M.; Chung, Eric S.; Putnam, Andrew
- 2016 49th Annual IEEE/ACM International Symposium on Microarchitecture (MICRO)
Tuning PYTHIA 8.1: the Monash 2013 tune
journal, August 2014
- Skands, P.; Carrazza, S.; Rojo, J.
- The European Physical Journal C, Vol. 74, Issue 8
Machine Learning in High Energy Physics Community White Paper
journal, September 2018
- Albertsson, Kim; Altoe, Piero; Anderson, Dustin
- Journal of Physics: Conference Series, Vol. 1085
The Future of Computing Performance
journal, July 2011
- Post, Douglass
- Computing in Science & Engineering, Vol. 13, Issue 4
Observation of a new Boson at a mass of 125 gev with the cms Experiment at the lhc
conference, March 2015
- Mariotti, Chiara
- Proceedings of the MG13 Meeting on General Relativity, The Thirteenth Marcel Grossmann Meeting
Energy calibration and resolution of the CMS electromagnetic calorimeter in pp collisions at √s = 7 TeV
text, January 2013
- Collaboration, Cms; Chatrchyan, Serguei; Bachmair, Felix
- ETH Zurich
Deep-learned Top Tagging with a Lorentz Layer
text, January 2018
- Butter, Anja; Kasieczka, Gregor; Plehn, Tilman
- ETH Zurich
Observation of a new boson at a mass of 125 GeV with the CMS experiment at the LHC
text, January 2018
- Akgun, B.; Azzolini, V.; Calamba, A.
- Figshare
Energy calibration and resolution of the CMS electromagnetic calorimeter in pp collisions at √s = 7 TeV
text, January 2013
- Collaboration, Cms; Chatrchyan, S.; Khachatryan, V.
- IOP Publishing
Convolutional Neural Networks Applied to Neutrino Events in a Liquid Argon Time Projection Chamber
text, January 2017
- Weber, Michael; Ereditato, Antonio; Kreslo, Igor
- Institute of Physics Publishing IOP
The anti-k_t jet clustering algorithm
text, January 2008
- Cacciari, Matteo; Salam, Gavin P.; Soyez, Gregory
- arXiv
DELPHES 3, A modular framework for fast simulation of a generic collider experiment
text, January 2013
- de Favereau, J.; Delaere, C.; Demin, P.
- arXiv
Densely Connected Convolutional Networks
preprint, January 2016
- Huang, Gao; Liu, Zhuang; van der Maaten, Laurens
- arXiv
Convolutional Neural Networks Applied to Neutrino Events in a Liquid Argon Time Projection Chamber
text, January 2016
- Collaboration, MicroBooNE; Acciarri, R.; Adams, C.
- arXiv
CaloGAN: Simulating 3D High Energy Particle Showers in Multi-Layer Electromagnetic Calorimeters with Generative Adversarial Networks
text, January 2017
- Paganini, Michela; de Oliveira, Luke; Nachman, Benjamin
- arXiv
Machine Learning in High Energy Physics Community White Paper
preprint, January 2018
- Albertsson, Kim; Altoe, Piero; Anderson, Dustin
- arXiv
Dispelling the N^3 myth for the Kt jet-finder
text, January 2005
- Cacciari, Matteo; Salam, Gavin P.
- arXiv
Figures / Tables found in this record: