
Title: An Efficient Multicore Implementation of a Novel HSS-Structured Multifrontal Solver Using Randomized Sampling

Journal Article · SIAM Journal on Scientific Computing
DOI: https://doi.org/10.1137/15M1010117 · OSTI ID: 1378736
 [1];  [1];  [1];  [1];  [2]
  1. Lawrence Berkeley National Lab. (LBNL), Berkeley, CA (United States)
  2. Univ. Libre de Bruxelles, Brussels (Belgium)

We present a sparse linear system solver that is based on a multifrontal variant of Gaussian elimination and exploits low-rank approximation of the resulting dense frontal matrices. We use hierarchically semiseparable (HSS) matrices, which have low-rank off-diagonal blocks, to approximate the frontal matrices. For HSS matrix construction, a randomized sampling algorithm is used together with interpolative decompositions. The combination of the randomized compression with a fast ULV HSS factorization leads to a solver with lower computational complexity than the standard multifrontal method for many applications, resulting in speedups up to sevenfold for problems in our test suite. The implementation targets many-core systems by using task parallelism with dynamic runtime scheduling. Numerical experiments show performance improvements over state-of-the-art sparse direct solvers. The implementation achieves high performance and good scalability on a range of modern shared memory parallel systems, including the Intel Xeon Phi (MIC). The code is part of a software package called STRUMPACK (STRUctured Matrices PACKage), which also has a distributed memory component for dense rank-structured matrices.
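The compression step named in the abstract (randomized sampling followed by an interpolative decomposition) can be illustrated on a single low-rank block. The following is a minimal sketch, not STRUMPACK code: the function name `randomized_row_id`, the `oversample` parameter, and the synthetic test matrix are assumptions made for illustration, and a full HSS construction would apply such a step hierarchically to the off-diagonal blocks.

```python
import numpy as np
from scipy.linalg import qr


def randomized_row_id(A, rank, oversample=10, seed=0):
    """Hypothetical helper: row interpolative decomposition A ~= X @ A[rows].

    The row subset is chosen from random samples A @ omega rather than
    from A itself, so A is only touched through one matrix product.
    """
    rng = np.random.default_rng(seed)
    m, n = A.shape
    # Random sampling: the columns of `samples` span (approximately)
    # the column space of A when A has numerical rank <= rank.
    omega = rng.standard_normal((n, rank + oversample))
    samples = A @ omega
    # Column-pivoted QR of samples.T selects `rank` dominant rows of samples.
    _, R, piv = qr(samples.T, mode="economic", pivoting=True)
    # Express the remaining rows as combinations of the selected rows.
    T = np.linalg.solve(R[:rank, :rank], R[:rank, rank:])
    X = np.empty((m, rank))
    X[piv[:rank]] = np.eye(rank)  # selected rows are reproduced exactly
    X[piv[rank:]] = T.T           # other rows are interpolated
    return piv[:rank], X


# Synthetic block of exact rank 8 (an assumption for this demo).
rng = np.random.default_rng(1)
A = rng.standard_normal((200, 8)) @ rng.standard_normal((8, 150))
rows, X = randomized_row_id(A, rank=8)
rel_err = np.linalg.norm(A - X @ A[rows]) / np.linalg.norm(A)
```

In this sketch the block is stored as the interpolation matrix `X` plus a subset of its own rows, which is the property an interpolative decomposition offers over a generic low-rank factorization: the selected rows are entries of the original matrix.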

Research Organization:
Lawrence Berkeley National Laboratory (LBNL), Berkeley, CA (United States)
Sponsoring Organization:
USDOE Office of Science (SC), Advanced Scientific Computing Research (ASCR)
Grant/Contract Number:
AC02-05CH11231
OSTI ID:
1378736
Alternate ID(s):
OSTI ID: 1439185
Journal Information:
SIAM Journal on Scientific Computing, Vol. 38, Issue 5; ISSN 1064-8275
Publisher:
SIAM
Country of Publication:
United States
Language:
English
Citation Metrics:
Cited by: 68 works
Citation information provided by
Web of Science

Cited By (3)

Direct frequency-domain 3D acoustic solver with intermediate data compression benchmarked against time-domain modeling for full-waveform inversion applications journal July 2019
Geometry-oblivious FMM for compressing dense SPD matrices (Yu, Chenhan D.; Levitt, James; Reiz, Severin; Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis on - SC '17, https://doi.org/10.1145/3126908.3126921) conference January 2017
Many-body localization in a quasiperiodic Fibonacci chain text January 2018
