skip to main content
OSTI.GOV title logo U.S. Department of Energy
Office of Scientific and Technical Information

Title: Hierarchical Krylov and nested Krylov methods for extreme-scale computing

Journal Article · · Parallel Computing
 [1];  [1];  [2];  [3]
  1. Argonne National Lab. (ANL), Argonne, IL (United States)
  2. Argonne National Lab. (ANL), Argonne, IL (United States); Illinois Inst. of Technology, Chicago, IL (United States)
  3. Oak Ridge National Lab. (ORNL), Oak Ridge, TN (United States); Univ. of Tennessee, Knoxville, TN (United States)

The solution of large, sparse linear systems is typically a dominant phase of computation for simulations based on partial differential equations, which are ubiquitous in scientific and engineering applications. While preconditioned Krylov methods are widely used and provide many advantages for solving sparse linear systems that do not have highly convergent, geometric multigrid solvers or specialized fast solvers, Krylov methods encounter well-known scaling difficulties for over 10,000 processor cores because each iteration requires at least one vector inner product, which in turn requires a global synchronization that scales poorly because of internode latency. To aid in overcoming these difficulties, we have developed hierarchical Krylov methods and nested Krylov methods in the PETSc library that reduce the number of global inner products required across the entire system (where they are expensive), though freely allow vector inner products across smaller subsets of the entire system (where they are inexpensive) or use inner iterations that do not invoke vector inner products at all. Nested Krylov methods are a generalization of inner-outer iterative methods with two or more layers. Hierarchical Krylov methods are a generalization of block Jacobi and overlapping additive Schwarz methods, where each block itself is solved by Krylov methods on smaller blocks. Conceptually, the hierarchy can continue recursively to an arbitrary number of levels of smaller and smaller blocks. As a specific case, we introduce the hierarchical FGMRES method, or h-FGMRES, and we demonstrate the impact of two-level h-FGMRES with a variable preconditioner on the PFLOTRAN subsurface flow application. We also demonstrate the impact of nested FGMRES, BiCGStab and Chebyshev methods. These hierarchical Krylov methods and nested Krylov methods significantly reduced overall PFLOTRAN simulation time on the Cray XK6 when using 10,000 through 224,000 cores through the combined effects of reduced global synchronization due to fewer global inner products and stronger inner hierarchical or nested preconditioners.

Research Organization:
Argonne National Laboratory (ANL), Argonne, IL (United States); Oak Ridge National Laboratory (ORNL), Oak Ridge, TN (United States). Oak Ridge Leadership Computing Facility (OLCF)
Sponsoring Organization:
USDOE Office of Science (SC), Advanced Scientific Computing Research (ASCR); Office of Science (SC), Biological and Environmental Research (BER). Earth and Environmental Systems Science Division
Grant/Contract Number:
AC02-05CH11231; AC02-06CH11357; AC05-00OR22725
OSTI ID:
1565143
Journal Information:
Parallel Computing, Vol. 40, Issue 1; ISSN 0167-8191
Publisher:
ElsevierCopyright Statement
Country of Publication:
United States
Language:
English
Citation Metrics:
Cited by: 24 works
Citation information provided by
Web of Science

References (18)

GMRES: A Generalized Minimal Residual Algorithm for Solving Nonsymmetric Linear Systems journal July 1986
Parallel iterative S-step methods for unsymmetric linear systems journal August 1996
Block s-step Krylov iterative methods journal January 2010
Reducing the effect of global communication in GMRES(m) and CG on parallel distributed memory computers journal October 1995
Hiding Global Communication Latency in the GMRES Algorithm on Massively Parallel Machines journal January 2013
Bi-CGSTAB: A Fast and Smoothly Converging Variant of Bi-CG for the Solution of Nonsymmetric Linear Systems journal March 1992
The Design of Data-Structure-Neutral Libraries for the Iterative Solution of Sparse Linear Systems journal January 1996
Enabling high-fidelity neutron transport simulations on petascale architectures conference January 2009
Inner and Outer Iterations for the Chebyshev Algorithm journal February 1998
An efficient hierarchical preconditioner for quadratic discretizations of finite element problems: AN EFFICIENT HIERARCHICAL PRECONDITIONER journal November 2010
A Flexible Inner-Outer Preconditioned GMRES Algorithm journal March 1993
Trilinos users guide report August 2003
Analysis of iterative methods for saddle point problems: a unified approach journal May 2001
Using multiple levels of parallelism to enhance the performance of domain decomposition solvers journal June 2010
Algebraic Multilevel Preconditioning Methods, II journal December 1990
A Hybrid Chebyshev Krylov Subspace Algorithm for Solving Nonsymmetric Systems of Linear Equations journal July 1986
A Closed-form Equation for Predicting the Hydraulic Conductivity of Unsaturated Soils1 journal January 1980
Multiphysics simulations: Challenges and opportunities journal February 2013

Cited By (3)

Multilevel approaches for FSAI preconditioning: MULTILEVEL APPROACHES FOR FSAI PRECONDITIONING
  • Magri, Victor A. P.; Franceschini, Andrea; Ferronato, Massimiliano
  • Numerical Linear Algebra with Applications, Vol. 25, Issue 5 https://doi.org/10.1002/nla.2183
journal April 2018
Convergence analysis of Anderson‐type acceleration of Richardson's iteration journal April 2019
A Hierarchical Space-Time Spectral Element and Moment-of-Fluid Method for Improved Capturing of Vortical Structures in Incompressible Multi-phase/Multi-material Flows journal November 2019