SU(2) lattice gauge theory simulations on Fermi GPUs
Journal Article · Journal of Computational Physics
- CFTP, Departamento de Fisica, Instituto Superior Tecnico, Av. Rovisco Pais, 1049-001 Lisboa (Portugal)
In this work we explore the performance of CUDA in quenched lattice SU(2) simulations. CUDA (Compute Unified Device Architecture) is a hardware and software architecture developed by NVIDIA for computing on the GPU. We present an analysis and performance comparison between the GPU and the CPU in single and double precision. Analyses with multiple GPUs and two different architectures (G200 and Fermi) are also presented. To obtain high performance, the code must be optimized for the GPU architecture, i.e., the implementation must exploit the memory hierarchy of the CUDA programming model. We produce codes for the Monte Carlo generation of SU(2) lattice gauge configurations, for the mean plaquette, for the Polyakov loop at finite T and for the Wilson loop. We also present results for the static quark-antiquark potential using many configurations (50,000) without smearing and almost 2000 configurations with APE smearing. With two Fermi GPUs we achieve, in single precision, a speedup of 200x over one CPU, at around 110 Gflop/s. We also find that, on the Fermi architecture, double precision computations of the static quark-antiquark potential are less than 2x slower than single precision computations.
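To make the "mean plaquette" observable named in the abstract concrete, here is a minimal CPU-side sketch in plain Python. It is not the authors' CUDA code. It assumes each SU(2) link is stored as four real numbers (a0, a1, a2, a3) with U = a0·1 + i(a1σ1 + a2σ2 + a3σ3), that the lattice is periodic, and that the plaquette is normalized as (1/2) Tr of the product of four links around an elementary square. All function names are hypothetical and chosen for illustration only.

```python
import itertools
import math
import random


def su2_mul(a, b):
    """Product of two SU(2) elements in the assumed 4-real parametrization."""
    a0, a1, a2, a3 = a
    b0, b1, b2, b3 = b
    return (
        a0 * b0 - a1 * b1 - a2 * b2 - a3 * b3,
        a0 * b1 + b0 * a1 - (a2 * b3 - a3 * b2),
        a0 * b2 + b0 * a2 - (a3 * b1 - a1 * b3),
        a0 * b3 + b0 * a3 - (a1 * b2 - a2 * b1),
    )


def su2_dag(a):
    """Hermitian conjugate (the inverse, for a unit-norm element)."""
    return (a[0], -a[1], -a[2], -a[3])


def random_su2(rng):
    """Random SU(2) element: a normalized 4-vector of Gaussian numbers."""
    v = [rng.gauss(0.0, 1.0) for _ in range(4)]
    norm = math.sqrt(sum(x * x for x in v))
    return tuple(x / norm for x in v)


def shift(site, mu, dims):
    """Neighbouring site one step in direction mu, with periodic wrap-around."""
    s = list(site)
    s[mu] = (s[mu] + 1) % dims[mu]
    return tuple(s)


def make_links(dims, rng=None):
    """Cold start (all identity) if rng is None, otherwise a random start."""
    links = {}
    for site in itertools.product(*(range(n) for n in dims)):
        for mu in range(4):
            links[site, mu] = random_su2(rng) if rng else (1.0, 0.0, 0.0, 0.0)
    return links


def mean_plaquette(links, dims):
    """Average of (1/2) Tr U_mu(x) U_nu(x+mu) U_mu(x+nu)^dag U_nu(x)^dag."""
    total, count = 0.0, 0
    for site in itertools.product(*(range(n) for n in dims)):
        for mu in range(4):
            for nu in range(mu + 1, 4):
                u = su2_mul(links[site, mu], links[shift(site, mu, dims), nu])
                u = su2_mul(u, su2_dag(links[shift(site, nu, dims), mu]))
                u = su2_mul(u, su2_dag(links[site, nu]))
                total += u[0]  # (1/2) Tr U equals a0 in this parametrization
                count += 1
    return total / count


dims = (4, 4, 4, 4)
cold = mean_plaquette(make_links(dims), dims)
hot = mean_plaquette(make_links(dims, random.Random(1)), dims)
print(cold, hot)
```

On a cold start every link is the identity, so the mean plaquette is exactly 1; on a random start it fluctuates around zero. Each site and plane is independent of the others, which is the kind of loop that maps naturally onto one GPU thread per site.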
- OSTI ID: 21499744
- Journal Information: Journal Name: Journal of Computational Physics; Journal Volume: 230; Journal Issue: 10; ISSN JCTPAH; ISSN 0021-9991
- Country of Publication: United States
- Language: English
Related Subjects
72 PHYSICS OF ELEMENTARY PARTICLES AND FIELDS
97 MATHEMATICS AND COMPUTING
CALCULATION METHODS
COMPUTER ARCHITECTURE
COMPUTER CODES
COMPUTERIZED SIMULATION
GAUGE INVARIANCE
INTERACTIONS
INVARIANCE PRINCIPLES
LIE GROUPS
MATHEMATICAL MODELS
MONTE CARLO METHOD
PARTICLE INTERACTIONS
PERFORMANCE
PROGRAMMING
QUARK-ANTIQUARK INTERACTIONS
SIMULATION
SU GROUPS
SU-2 GROUPS
SYMMETRY GROUPS
WILSON LOOP