Skip to main content
U.S. Department of Energy
Office of Scientific and Technical Information

An Efficient Multicore Implementation of a Novel HSS-Structured Multifrontal Solver Using Randomized Sampling

Journal Article · · SIAM Journal on Scientific Computing
DOI:https://doi.org/10.1137/15M1010117· OSTI ID:1378736
 [1];  [1];  [1];  [1];  [2]
  1. Lawrence Berkeley National Lab. (LBNL), Berkeley, CA (United States)
  2. Univ. Libre de Bruxelles, Brussels (Belgium)

Here, we present a sparse linear system solver that is based on a multifrontal variant of Gaussian elimination and exploits low-rank approximation of the resulting dense frontal matrices. We use hierarchically semiseparable (HSS) matrices, which have low-rank off-diagonal blocks, to approximate the frontal matrices. For HSS matrix construction, a randomized sampling algorithm is used together with interpolative decompositions. The combination of the randomized compression with a fast ULV HSS factoriz ation leads to a solver with lower computational complexity than the standard multifrontal method for many applications, resulting in speedups up to 7 fold for problems in our test suite. The implementation targets many-core systems by using task parallelism with dynamic runtime scheduling. Numerical experiments show performance improvements over state-of-the-art sparse direct solvers. The implementation achieves high performance and good scalability on a range of modern shared memory parallel systems, including the Intel Xeon Phi (MIC). The code is part of a software package called STRUMPACK - STRUctured Matrices PACKage, which also has a distributed memory component for dense rank-structured matrices.

Research Organization:
Lawrence Berkeley National Laboratory (LBNL), Berkeley, CA (United States)
Sponsoring Organization:
USDOE Office of Science (SC), Advanced Scientific Computing Research (ASCR) (SC-21)
Grant/Contract Number:
AC02-05CH11231
OSTI ID:
1378736
Alternate ID(s):
OSTI ID: 1439185
Journal Information:
SIAM Journal on Scientific Computing, Journal Name: SIAM Journal on Scientific Computing Journal Issue: 5 Vol. 38; ISSN 1064-8275
Publisher:
SIAMCopyright Statement
Country of Publication:
United States
Language:
English

References (39)

Sweeping preconditioner for the Helmholtz equation: Hierarchical matrix representation journal February 2011
StarPU: a unified platform for task scheduling on heterogeneous multicore architectures journal November 2010
Achieving numerical accuracy and high performance using recursive tile LU factorization with partial pivoting: HIGH-PERFORMANCE RECURSIVE TILE LU FACTORIZATION journal September 2013
An algebraic multifrontal preconditioner that exploits the low-rank property: Algebraic Multifrontal Preconditioner that Exploits Low-Rank Property journal July 2015
Fast algorithms for hierarchically semiseparable matrices journal November 2010
Approximation of boundary element matrices journal October 2000
$${{\fancyscript{H}}} $$ H -LU factorization on many-core systems journal June 2013
A bibliography on semiseparable matrices* journal December 2005
PaStiX: a high-performance parallel direct solver for sparse symmetric positive definite systems journal February 2002
Introduction to hierarchical matrices with applications journal May 2003
Fast construction of hierarchical matrix representation from matrix–vector multiplication journal May 2011
A fast block low-rank dense solver with applications to finite-element matrices journal January 2016
A class of parallel tiled linear algebra algorithms for multicore architectures journal January 2009
DAGuE: A generic distributed DAG engine for High Performance Computing journal January 2012
Efficient Sparse LU Factorization with Left-Right Looking Strategy on Shared Memory Multiprocessors journal March 2000
Numerical linear algebra on emerging architectures: The PLASMA and MAGMA projects journal July 2009
On the Automatic Scaling of Matrices for Gaussian Elimination journal January 1972
On 3D modeling of seismic wave propagation via a structured parallel multifrontal direct Helmholtz solver journal August 2011
On the Compression of Low Rank Matrices journal January 2005
Design of a Multicore Sparse Cholesky Factorization Using DAGs journal January 2010
Finding Structure with Randomness: Probabilistic Algorithms for Constructing Approximate Matrix Decompositions journal January 2011
On the Numerical Rank of the Off-Diagonal Blocks of Schur Complements of Discretized Elliptic PDEs journal January 2010
Efficient Algorithms for Computing a Strong Rank-Revealing QR Factorization journal July 1996
A Fast Randomized Algorithm for Computing a Hierarchically Semiseparable Representation of a Matrix journal October 2011
The Multifrontal Method for Sparse Matrix Solution: Theory and Practice journal March 1992
A Superfast Structured Solver for Toeplitz Linear Systems via Randomized Sampling journal January 2012
Randomized Sparse Direct Solvers journal January 2013
Improving Multifrontal Methods by Means of Block Low-Rank Representations journal January 2015
A Fast $ULV$ Decomposition Solver for Hierarchically Semiseparable Representations journal January 2006
An Approximate Minimum Degree Ordering Algorithm journal October 1996
The Design and Use of Algorithms for Permuting Large Entries to the Diagonal of Sparse Matrices journal January 1999
A Fully Asynchronous Multifrontal Solver Using Distributed Dynamic Scheduling journal January 2001
A Fast and High Quality Multilevel Scheme for Partitioning Irregular Graphs journal January 1998
A BLAS-3 Version of the QR Factorization with Column Pivoting journal September 1998
OmpSs: A PROPOSAL FOR PROGRAMMING HETEROGENEOUS MULTI-CORE ARCHITECTURES journal June 2011
A Parallel Sparse Direct Solver via Hierarchical DAG Scheduling journal October 2014
A Parallel Geometric Multifrontal Solver Using Hierarchically Semiseparable Structure journal May 2016
Scheduling multithreaded computations by work stealing journal September 1999
The Multifrontal Solution of Indefinite Sparse Symmetric Linear journal September 1983

Cited By (4)

Geometry-oblivious FMM for compressing dense SPD matrices
  • Yu, Chenhan D.; Levitt, James; Reiz, Severin
  • Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis on - SC '17 https://doi.org/10.1145/3126908.3126921
conference January 2017
Direct frequency-domain 3D acoustic solver with intermediate data compression benchmarked against time-domain modeling for full-waveform inversion applications journal July 2019
Many-body localization in a quasiperiodic Fibonacci chain journal January 2019
Many-body localization in a quasiperiodic Fibonacci chain text January 2018

Similar Records

High performance sparse multifrontal solvers on modern GPUs
Journal Article · Fri Feb 04 23:00:00 EST 2022 · Parallel Computing · OSTI ID:1960514

Efficient scalable algorithms for hierarchically semiseparable matrices
Journal Article · Wed Sep 14 00:00:00 EDT 2011 · SIAM J. Scientific Computing · OSTI ID:1052181