Targeting GPUs with OpenMP directives on Summit: A simple and effective Fortran experience
Journal Article
·
· Parallel Computing
- Oak Ridge National Lab. (ORNL), Oak Ridge, TN (United States). National Center for Computational Sciences; Univ. of Tennessee, Knoxville, TN (United States). Dept. of Physics and Astronomy
- Oak Ridge National Lab. (ORNL), Oak Ridge, TN (United States). Physics Division; Univ. of Tennessee, Knoxville, TN (United States). Dept. of Physics and Astronomy
We use OpenMP to target hardware accelerators (GPUs) on Summit, a newly deployed supercomputer at the Oak Ridge Leadership Computing Facility (OLCF), demonstrating simplified access to GPU devices for users of our astrophysics code GenASiS and useful speedup on a sample fluid dynamics problem. We modify our workhorse class for data storage to include members and methods that significantly streamline the persistent allocation of and association to GPU memory. Users offload computational kernels with OpenMP target directives that are rather similar to constructs already familiar from multi-core parallelization. In this initial example we ask, “With a given number of Summit nodes, how fast can we compute with and without GPUs?”, and find total wall time speedups of ~ 12X. We also find reasonable weak scaling up to 8000 GPUs (1334 Summit nodes). We make available the source code from this work at https://github.com/GenASiS/GenASiS_Basics.
- Research Organization:
- Oak Ridge National Laboratory (ORNL), Oak Ridge, TN (United States)
- Sponsoring Organization:
- USDOE; USDOE Office of Science (SC), Advanced Scientific Computing Research (ASCR) (SC-21)
- Grant/Contract Number:
- AC05-00OR22725
- OSTI ID:
- 1569391
- Alternate ID(s):
- OSTI ID: 1776465
- Journal Information:
- Parallel Computing, Journal Name: Parallel Computing Journal Issue: C Vol. 88; ISSN 0167-8191
- Publisher:
- ElsevierCopyright Statement
- Country of Publication:
- United States
- Language:
- English
Similar Records
Early experiences evaluating the HPE/Cray ecosystem for AMD GPUs
Integer Sum Reduction with OpenMP on an AMD MI100 GPU
OpenMP Target Task: Tasking and Target Offloading on Heterogeneous Systems
Journal Article
·
Wed Apr 10 20:00:00 EDT 2024
· Concurrency and Computation. Practice and Experience
·
OSTI ID:2336800
Integer Sum Reduction with OpenMP on an AMD MI100 GPU
Conference
·
Sun May 01 00:00:00 EDT 2022
·
OSTI ID:1883905
OpenMP Target Task: Tasking and Target Offloading on Heterogeneous Systems
Conference
·
Wed Jun 01 00:00:00 EDT 2022
·
OSTI ID:1885285