A case study of CUDA FORTRAN and OpenACC for an atmospheric climate kernel
Abstract
The porting of a key kernel in the tracer advection routines of the Community Atmosphere Model – Spectral Element (CAM-SE) to use Graphics Processing Units (GPUs) using OpenACC is considered in comparison to an existing CUDA FORTRAN port. The development of the OpenACC kernel for GPUs was substantially simpler than that of the CUDA port. Also, OpenACC performance was about 1.5× slower than the optimized CUDA version. Particular focus is given to compiler maturity regarding OpenACC implementation for modern FORTRAN, and it is found that the Cray implementation is currently more mature than the PGI implementation. Still, for the case that ran successfully on PGI, the PGI OpenACC runtime was slightly faster than Cray. The results show encouraging performance for OpenACC implementation compared to CUDA while also exposing some issues that may be necessary before the implementations are suitable for porting all of CAM-SE. Furthermore, most notable are that GPU shared memory should be used by future OpenACC implementations and that derived type support should be expanded.
- Authors:
-
- Oak Ridge National Lab. (ORNL), Oak Ridge, TN (United States)
- Nvidia, Santa Clara, CA (United States)
- Cray, Seattle, WA (United States)
- Publication Date:
- Research Org.:
- Oak Ridge National Lab. (ORNL), Oak Ridge, TN (United States). Oak Ridge Leadership Computing Facility (OLCF)
- Sponsoring Org.:
- USDOE Office of Science (SC)
- OSTI Identifier:
- 1462913
- Alternate Identifier(s):
- OSTI ID: 1251654
- Grant/Contract Number:
- AC05-00OR22725
- Resource Type:
- Accepted Manuscript
- Journal Name:
- Journal of Computational Science
- Additional Journal Information:
- Journal Volume: 9; Journal Issue: C; Journal ID: ISSN 1877-7503
- Publisher:
- Elsevier
- Country of Publication:
- United States
- Language:
- English
- Subject:
- 97 MATHEMATICS AND COMPUTING; 54 ENVIRONMENTAL SCIENCES; OpenACC; Climate; CUDA; GPU; HPC
Citation Formats
Norman, Matthew R., Larkin, Jeffrey, Vose, Aaron, and Evans, Katherine J. A case study of CUDA FORTRAN and OpenACC for an atmospheric climate kernel. United States: N. p., 2015.
Web. doi:10.1016/j.jocs.2015.04.022.
Norman, Matthew R., Larkin, Jeffrey, Vose, Aaron, & Evans, Katherine J. A case study of CUDA FORTRAN and OpenACC for an atmospheric climate kernel. United States. https://doi.org/10.1016/j.jocs.2015.04.022
Norman, Matthew R., Larkin, Jeffrey, Vose, Aaron, and Evans, Katherine J. Sat .
"A case study of CUDA FORTRAN and OpenACC for an atmospheric climate kernel". United States. https://doi.org/10.1016/j.jocs.2015.04.022. https://www.osti.gov/servlets/purl/1462913.
@article{osti_1462913,
title = {A case study of CUDA FORTRAN and OpenACC for an atmospheric climate kernel},
author = {Norman, Matthew R. and Larkin, Jeffrey and Vose, Aaron and Evans, Katherine J.},
abstractNote = {The porting of a key kernel in the tracer advection routines of the Community Atmosphere Model – Spectral Element (CAM-SE) to use Graphics Processing Units (GPUs) using OpenACC is considered in comparison to an existing CUDA FORTRAN port. The development of the OpenACC kernel for GPUs was substantially simpler than that of the CUDA port. Also, OpenACC performance was about 1.5× slower than the optimized CUDA version. Particular focus is given to compiler maturity regarding OpenACC implementation for modern FORTRAN, and it is found that the Cray implementation is currently more mature than the PGI implementation. Still, for the case that ran successfully on PGI, the PGI OpenACC runtime was slightly faster than Cray. The results show encouraging performance for OpenACC implementation compared to CUDA while also exposing some issues that may be necessary before the implementations are suitable for porting all of CAM-SE. Furthermore, most notable are that GPU shared memory should be used by future OpenACC implementations and that derived type support should be expanded.},
doi = {10.1016/j.jocs.2015.04.022},
journal = {Journal of Computational Science},
number = C,
volume = 9,
place = {United States},
year = {Sat Apr 18 00:00:00 EDT 2015},
month = {Sat Apr 18 00:00:00 EDT 2015}
}
Web of Science
Works referenced in this record:
Efficient estimation and ensemble generation in climate modelling
journal, June 2007
- Annan, J. D.; Hargreaves, J. C.
- Philosophical Transactions of the Royal Society A: Mathematical, Physical and Engineering Sciences, Vol. 365, Issue 1857
Atmospheric blocking in a high resolution climate model: influences of mean state, orography and eddy forcing: Atmospheric blocking in a high resolution climate model
journal, January 2013
- Berckmans, Julie; Woollings, Tim; Demory, Marie-Estelle
- Atmospheric Science Letters, Vol. 14, Issue 1
Simulated Climate and Climate Change in the GFDL CM2.5 High-Resolution Coupled Climate Model
journal, April 2012
- Delworth, Thomas L.; Rosati, Anthony; Anderson, Whit
- Journal of Climate, Vol. 25, Issue 8
CAM-SE: A scalable spectral element dynamical core for the Community Atmosphere Model
journal, November 2011
- Dennis, John M.; Edwards, Jim; Evans, Katherine J.
- The International Journal of High Performance Computing Applications, Vol. 26, Issue 1
High-Resolution Global Climate Simulations with the ECMWF Model in Project Athena: Experimental Design, Model Climate, and Seasonal Forecast Skill
journal, May 2012
- Jung, T.; Miller, M. J.; Palmer, T. N.
- Journal of Climate, Vol. 25, Issue 9
Simulations of the Atmospheric General Circulation Using a Cloud-Resolving Model as a Superparameterization of Physical Processes
journal, July 2005
- Khairoutdinov, Marat; Randall, David; DeMott, Charlotte
- Journal of the Atmospheric Sciences, Vol. 62, Issue 7
Emulation to simulate low-resolution atmospheric data
journal, January 2014
- Rao, Vishwas; Archibald, Rick; Evans, Katherine J.
- International Journal of Computer Mathematics, Vol. 91, Issue 4
Effects of vertical resolution and nonorographic gravity wave drag on the simulated climate in the Community Atmosphere Model, version 5
journal, May 2014
- Richter, Jadwiga H.; Solomon, Abraham; Bacmeister, Julio T.
- Journal of Advances in Modeling Earth Systems, Vol. 6, Issue 2
Sensitivity of Simulated Climate to Horizontal and Vertical Resolution in the ECHAM5 Atmosphere Model
journal, August 2006
- Roeckner, E.; Brokopf, R.; Esch, M.
- Journal of Climate, Vol. 19, Issue 16
A mass and energy conserving spectral element atmospheric dynamical core on the cubed-sphere grid
journal, July 2007
- Taylor, M. A.; Edwards, J.; Thomas, S.
- Journal of Physics: Conference Series, Vol. 78