DOE PAGES title logo U.S. Department of Energy
Office of Scientific and Technical Information

Title: Acceleration of the IMplicit–EXplicit nonhydrostatic unified model of the atmosphere on manycore processors

Journal Article · · International Journal of High Performance Computing Applications
 [1];  [1];  [2];  [1];  [1];  [3]
  1. Naval Postgraduate School, Monterey, CA (United States)
  2. Argonne National Lab. (ANL), Argonne, IL (United States)
  3. Virginia Polytechnic Inst. and State Univ. (Virginia Tech), Blacksburg, VA (United States)

We present the acceleration of an IMplicit-EXplicit (IMEX) nonhydrostatic atmospheric model on manycore processors such as graphic processing units (GPUs) and Intel's Many Integrated Core (MIC) architecture. IMEX time integration methods sidestep the constraint imposed by the Courant-Friedrichs-Lewy condition on explicit methods through corrective implicit solves within each time step. In this work, we implement and evaluate the performance of IMEX on manycore processors relative to explicit methods. Using 3D-IMEX at Courant number C = 15, we obtained a speedup of about 4x relative to an explicit time stepping method run with the maximum allowable C = 1. Moreover, the unconditional stability of IMEX with respect to the fast waves means the speedup can increase significantly with the Courant number as long as the accuracy of the resulting solution is acceptable. We show a speedup of 100x at C = 150 using 1D-IMEX to demonstrate this point. Several improvements on the IMEX procedure were necessary in order to outperform our results with explicit methods: (a) reducing the number of degrees of freedom of the IMEX formulation by forming the Schur complement, (b) formulating a horizontally explicit vertically implicit 1D-IMEX scheme that has a lower workload and better scalability than 3D-IMEX, (c) using high-order polynomial preconditioners to reduce the condition number of the resulting system, and (d) using a direct solver for the 1D-IMEX method by performing and storing LU factorizations once to obtain a constant cost for any Courant number. Without all of these improvements, explicit time integration methods turned out to be difficult to beat. We discuss in detail the IMEX infrastructure required for formulating and implementing efficient methods on manycore processors. Several parametric studies are conducted to demonstrate the gain from each of the abovementioned improvements. Lastly, we validate our results with standard benchmark problems in numerical weather prediction and evaluate the performance and scalability of the IMEX method using up to 4192 GPUs and 16 Knights Landing processors.

Research Organization:
Oak Ridge National Laboratory (ORNL), Oak Ridge, TN (United States). Oak Ridge Leadership Computing Facility (OLCF); Argonne National Laboratory (ANL), Argonne, IL (United States)
Sponsoring Organization:
USDOE Office of Science (SC), Advanced Scientific Computing Research (ASCR); US Department of the Navy, Office of Naval Research (ONR)
Grant/Contract Number:
AC02-06CH11357
OSTI ID:
1498290
Journal Information:
International Journal of High Performance Computing Applications, Vol. 33, Issue 2; ISSN 1094-3420
Publisher:
SAGECopyright Statement
Country of Publication:
United States
Language:
English
Citation Metrics:
Cited by: 13 works
Citation information provided by
Web of Science

References (35)

Comparison of Some Schemes for Nonhydrostatic Models with Orography [山を取り扱える非静水圧モデルの各種計算スキームの比較] journal January 1988
A High-Order Accurate Discontinuous Finite Element Method for the Numerical Solution of the Compressible Navier–Stokes Equations journal March 1997
An Element-Based Spectrally Optimized Approximate Inverse Preconditioner for the Euler Equations journal January 2012
Improving the Anelastic Approximation journal June 1989
A Conservative Discontinuous Galerkin Semi-Implicit Formulation for the Navier–Stokes Equations in Nonhydrostatic Mesoscale Modeling journal January 2009
A Fast and Accurate Numerical Scheme for the Primitive Equations of the Atmosphere journal January 1999
A new dynamical framework of nonhydrostatic global model using the icosahedral grid journal June 2004
Accuracy Analysis of a Spectral Element Atmospheric Model Using a Fully Implicit Solution Framework journal August 2010
A Semi-Lagrangian and Semi-Implicit Numerical Integration Scheme for Multilevel Atmospheric Models journal March 1985
Semi-Implicit Formulations of the Navier–Stokes Equations: Application to Nonhydrostatic Atmospheric Modeling journal January 2010
An element-based displacement preconditioner for linear elasticity problems journal December 2006
Additive Runge–Kutta schemes for convection–diffusion–reaction equations journal January 2003
A split explicit integration scheme for numerical weather prediction journal July 1978
A Scalable Fully Implicit Compressible Euler Solver for Mesoscale Nonhydrostatic Simulation of Atmospheric Flows journal January 2014
Matrix-Free Polynomial-Based Nonlinear Least Squares Optimized Preconditioning and Its Application to Discontinuous Galerkin Discretizations of the Euler Equations journal June 2015
Runge–Kutta IMEX schemes for the Horizontally Explicit/Vertically Implicit (HEVI) solution of wave equations journal November 2013
Targeting Atmospheric Simulation Algorithms for Large, Distributed-Memory, GPU-Accelerated Computers book January 2013
Numerical Methods for Ordinary Differential Equations book January 2003
Hybrid Eulerian-Lagrangian Semi-Implicit Time-Integrators journal October 2006
Implicit-Explicit Formulations of a Three-Dimensional Nonhydrostatic Unified Model of the Atmosphere (NUMA) journal January 2013
The Influence of Gravity Wave Breaking on the General Circulation of the Middle Atmosphere journal October 1983
A peta-scalable CPU-GPU algorithm for global atmospheric simulations journal August 2013
Horizontally Explicit and Vertically Implicit (HEVI) Time Discretization Scheme for a Discontinuous Galerkin Nonhydrostatic Model journal March 2015
A New Class of Optimal High-Order Strong-Stability-Preserving Time Discretization Methods journal January 2002
Efficient construction of unified continuous and discontinuous Galerkin formulations for the 3D Euler equations journal September 2016
Petascale solvers for anisotropic PDEs in atmospheric modelling on GPU clusters journal December 2015
Accelerating Time Integration for the Shallow Water Equations on the Sphere Using GPUs journal January 2015
A Semi-Implicit Scheme for grid Point Atmospheric Models of the Primitive Equations journal January 1971
High-order splitting methods for the incompressible Navier-Stokes equations journal December 1991
A study of spectral element and discontinuous Galerkin methods for the Navier–Stokes equations in nonhydrostatic mesoscale atmospheric modeling: Equation sets and test cases journal April 2008
Conservative Split-Explicit Time Integration Methods for the Compressible Nonhydrostatic Equations journal August 2007
A peta-scalable CPU-GPU algorithm for global atmospheric simulations conference January 2013
A split explicit integration scheme for numerical weather prediction journal July 1978
Numerical Methods for Ordinary Differential Equations book January 2008
Numerical Methods for Ordinary Differential Equations book July 2016

Cited By (2)

Current and Emerging Time-Integration Strategies in Global Numerical Weather and Climate Prediction journal February 2018
HOMMEXX 1.0: a performance-portable atmospheric dynamical core for the Energy Exascale Earth System Model journal January 2019