Skip to main content
U.S. Department of Energy
Office of Scientific and Technical Information

Acceleration of the IMplicit–EXplicit nonhydrostatic unified model of the atmosphere on manycore processors

Journal Article · · International Journal of High Performance Computing Applications
 [1];  [1];  [2];  [1];  [1];  [3]
  1. Naval Postgraduate School, Monterey, CA (United States)
  2. Argonne National Lab. (ANL), Argonne, IL (United States)
  3. Virginia Polytechnic Inst. and State Univ. (Virginia Tech), Blacksburg, VA (United States)
We present the acceleration of an IMplicit-EXplicit (IMEX) nonhydrostatic atmospheric model on manycore processors such as graphic processing units (GPUs) and Intel's Many Integrated Core (MIC) architecture. IMEX time integration methods sidestep the constraint imposed by the Courant-Friedrichs-Lewy condition on explicit methods through corrective implicit solves within each time step. In this work, we implement and evaluate the performance of IMEX on manycore processors relative to explicit methods. Using 3D-IMEX at Courant number C = 15, we obtained a speedup of about 4x relative to an explicit time stepping method run with the maximum allowable C = 1. Moreover, the unconditional stability of IMEX with respect to the fast waves means the speedup can increase significantly with the Courant number as long as the accuracy of the resulting solution is acceptable. We show a speedup of 100x at C = 150 using 1D-IMEX to demonstrate this point. Several improvements on the IMEX procedure were necessary in order to outperform our results with explicit methods: (a) reducing the number of degrees of freedom of the IMEX formulation by forming the Schur complement, (b) formulating a horizontally explicit vertically implicit 1D-IMEX scheme that has a lower workload and better scalability than 3D-IMEX, (c) using high-order polynomial preconditioners to reduce the condition number of the resulting system, and (d) using a direct solver for the 1D-IMEX method by performing and storing LU factorizations once to obtain a constant cost for any Courant number. Without all of these improvements, explicit time integration methods turned out to be difficult to beat. We discuss in detail the IMEX infrastructure required for formulating and implementing efficient methods on manycore processors. Several parametric studies are conducted to demonstrate the gain from each of the abovementioned improvements. Lastly, we validate our results with standard benchmark problems in numerical weather prediction and evaluate the performance and scalability of the IMEX method using up to 4192 GPUs and 16 Knights Landing processors.
Research Organization:
Argonne National Lab. (ANL), Argonne, IL (United States); Oak Ridge National Laboratory (ORNL), Oak Ridge, TN (United States). Oak Ridge Leadership Computing Facility (OLCF)
Sponsoring Organization:
Office of Naval Research; USDOE Office of Science (SC), Advanced Scientific Computing Research (ASCR) (SC-21)
Grant/Contract Number:
AC02-06CH11357
OSTI ID:
1498290
Journal Information:
International Journal of High Performance Computing Applications, Journal Name: International Journal of High Performance Computing Applications Journal Issue: 2 Vol. 33; ISSN 1094-3420
Publisher:
SAGECopyright Statement
Country of Publication:
United States
Language:
English

References (36)

Numerical Methods for Ordinary Differential Equations book January 2003
Numerical Methods for Ordinary Differential Equations book January 2008
Numerical Methods for Ordinary Differential Equations book July 2016
A split explicit integration scheme for numerical weather prediction journal July 1978
A High-Order Accurate Discontinuous Finite Element Method for the Numerical Solution of the Compressible Navier–Stokes Equations journal March 1997
Targeting Atmospheric Simulation Algorithms for Large, Distributed-Memory, GPU-Accelerated Computers book January 2013
Matrix-Free Polynomial-Based Nonlinear Least Squares Optimized Preconditioning and Its Application to Discontinuous Galerkin Discretizations of the Euler Equations journal June 2015
High-order splitting methods for the incompressible Navier-Stokes equations journal December 1991
Additive Runge–Kutta schemes for convection–diffusion–reaction equations journal January 2003
Hybrid Eulerian-Lagrangian Semi-Implicit Time-Integrators journal October 2006
An element-based displacement preconditioner for linear elasticity problems journal December 2006
A new dynamical framework of nonhydrostatic global model using the icosahedral grid journal June 2004
A study of spectral element and discontinuous Galerkin methods for the Navier–Stokes equations in nonhydrostatic mesoscale atmospheric modeling: Equation sets and test cases journal April 2008
Runge–Kutta IMEX schemes for the Horizontally Explicit/Vertically Implicit (HEVI) solution of wave equations journal November 2013
Efficient construction of unified continuous and discontinuous Galerkin formulations for the 3D Euler equations journal September 2016
Petascale solvers for anisotropic PDEs in atmospheric modelling on GPU clusters journal December 2015
Accelerating Time Integration for the Shallow Water Equations on the Sphere Using GPUs journal January 2015
A Conservative Discontinuous Galerkin Semi-Implicit Formulation for the Navier–Stokes Equations in Nonhydrostatic Mesoscale Modeling journal January 2009
Semi-Implicit Formulations of the Navier–Stokes Equations: Application to Nonhydrostatic Atmospheric Modeling journal January 2010
An Element-Based Spectrally Optimized Approximate Inverse Preconditioner for the Euler Equations journal January 2012
Implicit-Explicit Formulations of a Three-Dimensional Nonhydrostatic Unified Model of the Atmosphere (NUMA) journal January 2013
A Scalable Fully Implicit Compressible Euler Solver for Mesoscale Nonhydrostatic Simulation of Atmospheric Flows journal January 2014
A New Class of Optimal High-Order Strong-Stability-Preserving Time Discretization Methods journal January 2002
A Fast and Accurate Numerical Scheme for the Primitive Equations of the Atmosphere journal January 1999
A peta-scalable CPU-GPU algorithm for global atmospheric simulations conference January 2013
A peta-scalable CPU-GPU algorithm for global atmospheric simulations journal August 2013
The Influence of Gravity Wave Breaking on the General Circulation of the Middle Atmosphere journal October 1983
Improving the Anelastic Approximation journal June 1989
A Semi-Implicit Scheme for grid Point Atmospheric Models of the Primitive Equations journal January 1971
An Implicit Time Integration Scheme for Baroclinic Models of the Atmosphere journal May 1972
A Semi-Lagrangian and Semi-Implicit Numerical Integration Scheme for Multilevel Atmospheric Models journal March 1985
Accuracy Analysis of a Spectral Element Atmospheric Model Using a Fully Implicit Solution Framework journal August 2010
Horizontally Explicit and Vertically Implicit (HEVI) Time Discretization Scheme for a Discontinuous Galerkin Nonhydrostatic Model journal March 2015
Conservative Split-Explicit Time Integration Methods for the Compressible Nonhydrostatic Equations journal August 2007
A split explicit integration scheme for numerical weather prediction journal July 1978
Comparison of Some Schemes for Nonhydrostatic Models with Orography [山を取り扱える非静水圧モデルの各種計算スキームの比較] journal January 1988

Cited By (3)

Current and Emerging Time-Integration Strategies in Global Numerical Weather and Climate Prediction journal February 2018
NCAR Release of CAM‐SE in CESM2.0: A Reformulation of the Spectral Element Dynamical Core in Dry‐Mass Vertical Coordinates With Comprehensive Treatment of Condensates and Energy journal July 2018
HOMMEXX 1.0: a performance-portable atmospheric dynamical core for the Energy Exascale Earth System Model journal January 2019

Similar Records

Efficient IMEX Runge-Kutta Methods for Nonhydrostatic Dynamics
Journal Article · Mon Jun 10 00:00:00 EDT 2019 · SIAM Journal on Scientific Computing · OSTI ID:1837626

A framework to evaluate IMEX schemes for atmospheric models
Journal Article · Mon Dec 21 19:00:00 EST 2020 · Geoscientific Model Development (Online) · OSTI ID:1765755

An optimal, parallel, fully implicit Newton-Krylov solver for three-dimensional viscoresistive magnetohydrodynamics
Journal Article · Thu May 15 00:00:00 EDT 2008 · Physics of Plasmas · OSTI ID:21120358