Skip to main content
U.S. Department of Energy
Office of Scientific and Technical Information

GPU COMPUTING FOR PARTICLE TRACKING

Conference ·
OSTI ID:1022725
This is a feasibility study of using a modern Graphics Processing Unit (GPU) to parallelize the accelerator particle tracking code. To demonstrate the massive parallelization features provided by GPU computing, a simplified TracyGPU program is developed for dynamic aperture calculation. Performances, issues, and challenges from introducing GPU are also discussed. General purpose Computation on Graphics Processing Units (GPGPU) bring massive parallel computing capabilities to numerical calculation. However, the unique architecture of GPU requires a comprehensive understanding of the hardware and programming model to be able to well optimize existing applications. In the field of accelerator physics, the dynamic aperture calculation of a storage ring, which is often the most time consuming part of the accelerator modeling and simulation, can benefit from GPU due to its embarrassingly parallel feature, which fits well with the GPU programming model. In this paper, we use the Tesla C2050 GPU which consists of 14 multi-processois (MP) with 32 cores on each MP, therefore a total of 448 cores, to host thousands ot threads dynamically. Thread is a logical execution unit of the program on GPU. In the GPU programming model, threads are grouped into a collection of blocks Within each block, multiple threads share the same code, and up to 48 KB of shared memory. Multiple thread blocks form a grid, which is executed as a GPU kernel. A simplified code that is a subset of Tracy++ [2] is developed to demonstrate the possibility of using GPU to speed up the dynamic aperture calculation by having each thread track a particle.
Research Organization:
Ernest Orlando Lawrence Berkeley National Laboratory, Berkeley, CA (US)
Sponsoring Organization:
Advanced Light Source Division; Information Technology Division
DOE Contract Number:
AC02-05CH11231
OSTI ID:
1022725
Report Number(s):
LBNL-4638E
Country of Publication:
United States
Language:
English

Similar Records

Balar: A SST GPU Component for Performance Modeling and Profiling
Technical Report · Tue Sep 03 00:00:00 EDT 2019 · OSTI ID:1560919

CUDA Computation of the Feynman Distribution
Journal Article · Sat Jul 01 00:00:00 EDT 2017 · Transactions of the American Nuclear Society · OSTI ID:23050325

SST-GPU: An Execution -Driven CUDA Kernel Scheduler and Streaming-Multiprocessor Compute Model
Technical Report · Thu Feb 21 23:00:00 EST 2019 · OSTI ID:1497416