skip to main content
OSTI.GOV title logo U.S. Department of Energy
Office of Scientific and Technical Information

Title: HOMMEXX 1.0: A Performance Portable Atmospheric Dynamical Core for the Energy Exascale Earth System Model

Abstract

Abstract. We present an architecture-portable and performant implementation of the atmospheric dynamical core (HOMME) of the Energy Exascale Earth System Model (E3SM). The original Fortran implementation is highly performant and scalable on conventional architectures using MPI and OpenMP. We rewrite the model in C++ and use the Kokkos library to express on-node parallelism in a largely architecture-independent implementation. Kokkos provides an abstraction of a compute node or device, layout-polymorphic multidimensional arrays, and parallel execution constructs. Here, the new implementation achieves the same or better performance on conventional multicore computers and is portable to GPUs. We present performance data for the original and new implementations on multiple platforms, on up to 5400 compute nodes, and study several aspects of the single- and multi-node performance characteristics of the new implementation on conventional CPU, Intel Xeon Phi Knights Landing, and Nvidia V100 GPU.

Authors:
ORCiD logo [1]; ORCiD logo [1];  [1];  [1];  [1];  [1]; ORCiD logo [1];  [1]
  1. Sandia National Lab. (SNL-NM), Albuquerque, NM (United States)
Publication Date:
Research Org.:
Sandia National Lab. (SNL-NM), Albuquerque, NM (United States); Sandia National Laboratories, Livermore, CA (United States)
Sponsoring Org.:
USDOE Office of Science (SC), Biological and Environmental Research (BER) (SC-23)
OSTI Identifier:
1497003
Report Number(s):
SAND-2019-1304J
Journal ID: ISSN 1991-962X; 672318
Grant/Contract Number:  
AC04-94AL85000
Resource Type:
Journal Article: Accepted Manuscript
Journal Name:
Geoscientific Model Development Discussions (Online)
Additional Journal Information:
Journal Volume: 12; Journal Issue: 4; Journal ID: ISSN 1991-962X
Publisher:
European Geosciences Union
Country of Publication:
United States
Language:
English
Subject:
54 ENVIRONMENTAL SCIENCES

Citation Formats

Bertagna, Luca, Deakin, Michael, Guba, Oksana, Sunderland, Daniel, Bradley, Andrew M., Tezaur, Irina K., Taylor, Mark A., and Salinger, Andrew G.. HOMMEXX 1.0: A Performance Portable Atmospheric Dynamical Core for the Energy Exascale Earth System Model. United States: N. p., 2018. Web. doi:10.5194/gmd-2018-218.
Bertagna, Luca, Deakin, Michael, Guba, Oksana, Sunderland, Daniel, Bradley, Andrew M., Tezaur, Irina K., Taylor, Mark A., & Salinger, Andrew G.. HOMMEXX 1.0: A Performance Portable Atmospheric Dynamical Core for the Energy Exascale Earth System Model. United States. doi:10.5194/gmd-2018-218.
Bertagna, Luca, Deakin, Michael, Guba, Oksana, Sunderland, Daniel, Bradley, Andrew M., Tezaur, Irina K., Taylor, Mark A., and Salinger, Andrew G.. Fri . "HOMMEXX 1.0: A Performance Portable Atmospheric Dynamical Core for the Energy Exascale Earth System Model". United States. doi:10.5194/gmd-2018-218. https://www.osti.gov/servlets/purl/1497003.
@article{osti_1497003,
title = {HOMMEXX 1.0: A Performance Portable Atmospheric Dynamical Core for the Energy Exascale Earth System Model},
author = {Bertagna, Luca and Deakin, Michael and Guba, Oksana and Sunderland, Daniel and Bradley, Andrew M. and Tezaur, Irina K. and Taylor, Mark A. and Salinger, Andrew G.},
abstractNote = {Abstract. We present an architecture-portable and performant implementation of the atmospheric dynamical core (HOMME) of the Energy Exascale Earth System Model (E3SM). The original Fortran implementation is highly performant and scalable on conventional architectures using MPI and OpenMP. We rewrite the model in C++ and use the Kokkos library to express on-node parallelism in a largely architecture-independent implementation. Kokkos provides an abstraction of a compute node or device, layout-polymorphic multidimensional arrays, and parallel execution constructs. Here, the new implementation achieves the same or better performance on conventional multicore computers and is portable to GPUs. We present performance data for the original and new implementations on multiple platforms, on up to 5400 compute nodes, and study several aspects of the single- and multi-node performance characteristics of the new implementation on conventional CPU, Intel Xeon Phi Knights Landing, and Nvidia V100 GPU.},
doi = {10.5194/gmd-2018-218},
journal = {Geoscientific Model Development Discussions (Online)},
issn = {1991-962X},
number = 4,
volume = 12,
place = {United States},
year = {2018},
month = {10}
}

Journal Article:
Free Publicly Available Full Text
Publisher's Version of Record

Save / Share: