Static Graphs for Coding Productivity in OpenACC
- Barcelona Supercomputing Center
- ORNL
The main contribution of this work is to increase the coding productivity for GPU programming by using the concept of Static Graphs. To do so, we have combined the new CUDA Graph API with the OpenACC programming model. We use as test cases a well-known and widely used problems in HPC and AI: the Particle Swarm Optimization. We complement the OpenACC functionality with the use of CUDA Graph, achieving accelerations of more than one order of magnitude, and a performance very close to a reference and optimized CUDA code. Finally, we propose a new specification to incorporate the concept of Static Graphs into the OpenACC specification.
- Research Organization:
- Oak Ridge National Lab. (ORNL), Oak Ridge, TN (United States)
- Sponsoring Organization:
- USDOE Office of Science (SC)
- DOE Contract Number:
- AC05-00OR22725
- OSTI ID:
- 1883754
- Resource Relation:
- Conference: 2021 IEEE 28th International Conference on High Performance Computing, Data, and Analytics (HiPC) - Virtual, , India - 12/17/2021 5:00:00 AM-8/20/2022 4:00:00 AM
- Country of Publication:
- United States
- Language:
- English
Similar Records
Towards Enhancing Coding Productivity for GPU Programming Using Static Graphs
OpenACC unified programming environment for GPU and FPGA multi-hybrid acceleration
Performance Analysis of a High-Level Abstractions-Based Hydrocode on Future Computing Systems
Journal Article
·
2022
· Electronics
·
OSTI ID:1883753
+1 more
OpenACC unified programming environment for GPU and FPGA multi-hybrid acceleration
Conference
·
2020
·
OSTI ID:2000432
+6 more
Performance Analysis of a High-Level Abstractions-Based Hydrocode on Future Computing Systems
Journal Article
·
· Lecture Notes in Computer Science
·
OSTI ID:1567379