OpenACC acceleration for the algorithm in Nek5000
Journal Article
·
· Journal of Parallel and Distributed Computing
- KTH Royal Inst. of Technology, Stockholm (Sweden)
- Argonne National Lab. (ANL), Argonne, IL (United States)
Due to its high performance and throughput capabilities, GPU-accelerated computing is becoming a popular technology in scientific computing, in particular using programming models such as CUDA and OpenACC. The main advantage with OpenACC is that it enables to simply port codes in their ‘‘original’’ form to GPU systems through compiler directives, thus allowing an incremental approach. An OpenACC implementation is applied to the CFD code Nek5000 for simulation of incompressible flows, based on the spectral-element method. The work follows up previous implementations and focuses now on the PN- PN-2 method for the spatial discretization of the Navier–Stokes equations. Performance results of the ported code show a speed-up of up to 3.1 on multi-GPU for a polynomial order N > 11.
- Research Organization:
- Argonne National Laboratory (ANL), Argonne, IL (United States)
- Sponsoring Organization:
- Swedish Foundation for Strategic Research (SSF); USDOE
- Grant/Contract Number:
- AC02-06CH11357
- OSTI ID:
- 1571263
- Journal Information:
- Journal of Parallel and Distributed Computing, Journal Name: Journal of Parallel and Distributed Computing Journal Issue: C Vol. 132; ISSN 0743-7315
- Publisher:
- ElsevierCopyright Statement
- Country of Publication:
- United States
- Language:
- English
Similar Records
Nek5000 with OpenACC
OpenACC acceleration of the Nek5000 spectral element code
Nekbone performance on GPUs with OpenACC and CUDA Fortran implementations
Journal Article
·
· Lecture Notes in Computer Science
·
OSTI ID:1567377
OpenACC acceleration of the Nek5000 spectral element code
Journal Article
·
Mon Mar 30 00:00:00 EDT 2015
· International Journal of High Performance Computing Applications
·
OSTI ID:1565389
Nekbone performance on GPUs with OpenACC and CUDA Fortran implementations
Journal Article
·
Mon Jul 18 00:00:00 EDT 2016
· Journal of Supercomputing
·
OSTI ID:1565549