DOE PAGES title logo U.S. Department of Energy
Office of Scientific and Technical Information

Title: PSelInv—A Distributed Memory Parallel Algorithm for Selected Inversion: The Symmetric Case

Abstract

We describe an efficient parallel implementation of the selected inversion algorithm for distributed memory computer systems, which we call PSelInv. The PSelInv method computes selected elements of a general sparse matrix Athat can be decomposed as A= LU, where L is lower triangular and U is upper triangular. The implementation described in this article focuses on the case of sparse symmetric matrices. It contains an interface that is compatible with the distributed memory parallel sparse direct factorization SuperLU-DIST. Yet, the underlying data structure and design of PSelInv allows it to be easily combined with other factorization routines, such as PARDISO. We discuss general parallelization strategies such as data and task distribution schemes. Specifically, we describe how to exploit the concurrency exposed by the elimination tree associated with the LU factorization of A. We demonstrate the efficiency and accuracy of PSelInv by presenting several numerical experiments. In particular, we show that PSelInv can run efficiently on more than 4,000 cores for a modestly sized matrix. We also show how PSelInv can be used to accelerate large-scale electronic structure calculations.

Authors:
 [1];  [2];  [1]
  1. Lawrence Berkeley National Lab. (LBNL), Berkeley, CA (United States)
  2. Univ. of California, Berkeley, CA (United States); Lawrence Berkeley National Lab. (LBNL), Berkeley, CA (United States)
Publication Date:
Research Org.:
Lawrence Berkeley National Laboratory (LBNL), Berkeley, CA (United States). National Energy Research Scientific Computing Center (NERSC)
Sponsoring Org.:
USDOE Office of Science (SC), Advanced Scientific Computing Research (ASCR). Scientific Discovery through Advanced Computing (SciDAC); USDOE Office of Science (SC), Basic Energy Sciences (BES)
OSTI Identifier:
1525182
Grant/Contract Number:  
AC02-05CH11231
Resource Type:
Accepted Manuscript
Journal Name:
ACM Transactions on Mathematical Software
Additional Journal Information:
Journal Volume: 43; Journal Issue: 3; Journal ID: ISSN 0098-3500
Publisher:
Association for Computing Machinery
Country of Publication:
United States
Language:
English
Subject:
97 MATHEMATICS AND COMPUTING; selected inversion; sparse direct method; distributed memory parallel algorithm; high performance computation; electronic structure theory

Citation Formats

Jacquelin, Mathias, Lin, Lin, and Yang, Chao. PSelInv—A Distributed Memory Parallel Algorithm for Selected Inversion: The Symmetric Case. United States: N. p., 2017. Web. doi:10.1145/2786977.
Jacquelin, Mathias, Lin, Lin, & Yang, Chao. PSelInv—A Distributed Memory Parallel Algorithm for Selected Inversion: The Symmetric Case. United States. https://doi.org/10.1145/2786977
Jacquelin, Mathias, Lin, Lin, and Yang, Chao. Mon . "PSelInv—A Distributed Memory Parallel Algorithm for Selected Inversion: The Symmetric Case". United States. https://doi.org/10.1145/2786977. https://www.osti.gov/servlets/purl/1525182.
@article{osti_1525182,
title = {PSelInv—A Distributed Memory Parallel Algorithm for Selected Inversion: The Symmetric Case},
author = {Jacquelin, Mathias and Lin, Lin and Yang, Chao},
abstractNote = {We describe an efficient parallel implementation of the selected inversion algorithm for distributed memory computer systems, which we call PSelInv. The PSelInv method computes selected elements of a general sparse matrix Athat can be decomposed as A= LU, where L is lower triangular and U is upper triangular. The implementation described in this article focuses on the case of sparse symmetric matrices. It contains an interface that is compatible with the distributed memory parallel sparse direct factorization SuperLU-DIST. Yet, the underlying data structure and design of PSelInv allows it to be easily combined with other factorization routines, such as PARDISO. We discuss general parallelization strategies such as data and task distribution schemes. Specifically, we describe how to exploit the concurrency exposed by the elimination tree associated with the LU factorization of A. We demonstrate the efficiency and accuracy of PSelInv by presenting several numerical experiments. In particular, we show that PSelInv can run efficiently on more than 4,000 cores for a modestly sized matrix. We also show how PSelInv can be used to accelerate large-scale electronic structure calculations.},
doi = {10.1145/2786977},
journal = {ACM Transactions on Mathematical Software},
number = 3,
volume = 43,
place = {United States},
year = {Mon Jan 16 00:00:00 EST 2017},
month = {Mon Jan 16 00:00:00 EST 2017}
}

Journal Article:
Free Publicly Available Full Text
Publisher's Version of Record

Citation Metrics:
Cited by: 12 works
Citation information provided by
Web of Science

Save / Share:

Works referenced in this record:

On Computing Inverse Entries of a Sparse Matrix in an Out-of-Core Environment
journal, January 2012

  • Amestoy, Patrick R.; Duff, Iain S.; L'Excellent, Jean-Yves
  • SIAM Journal on Scientific Computing, Vol. 34, Issue 4
  • DOI: 10.1137/100799411

ScaLAPACK Users' Guide
book, January 1997

  • Blackford, L. S.; Choi, J.; Cleary, A.
  • Society for Industrial and Applied Mathematics
  • DOI: 10.1137/1.9780898719642

On computing certain elements of the inverse of a sparse matrix
journal, March 1975


Solving fluctuation-enhanced Poisson–Boltzmann equations
journal, October 2014


Accelerating atomic orbital-based electronic structure calculation via pole expansion and selected inversion
journal, June 2013


The influence of relaxed supernode partitions on the multifrontal method
journal, December 1989

  • Ashcraft, Cleve; Grimes, Roger
  • ACM Transactions on Mathematical Software, Vol. 15, Issue 4
  • DOI: 10.1145/76909.76910

SelInv---An Algorithm for Selected Inversion of a Sparse Symmetric Matrix
journal, February 2011

  • Lin, Lin; Yang, Chao; Meza, Juan C.
  • ACM Transactions on Mathematical Software, Vol. 37, Issue 4
  • DOI: 10.1145/1916461.1916464

ScaLAPACK's MRRR algorithm
journal, January 2010


A Fast Parallel Algorithm for Selected Inversion of Structured Sparse Matrices with Application to 2D Electronic Structure Calculations
journal, January 2011

  • Lin, Lin; Yang, Chao; Lu, Jianfeng
  • SIAM Journal on Scientific Computing, Vol. 33, Issue 3
  • DOI: 10.1137/09077432X

A hybrid method for the parallel computation of Green’s functions
journal, August 2009

  • Petersen, Dan Erik; Li, Song; Stokbro, Kurt
  • Journal of Computational Physics, Vol. 228, Issue 14
  • DOI: 10.1016/j.jcp.2009.03.035

SuperLU_DIST: A scalable distributed-memory sparse direct solver for unsymmetric linear systems
journal, June 2003

  • Li, Xiaoye S.; Demmel, James W.
  • ACM Transactions on Mathematical Software, Vol. 29, Issue 2
  • DOI: 10.1145/779359.779361

Pole-Based approximation of the Fermi-Dirac function
journal, August 2009

  • Lin, Lin; Lu, Jianfeng; Ying, Lexing
  • Chinese Annals of Mathematics, Series B, Vol. 30, Issue 6
  • DOI: 10.1007/s11401-009-0201-7

A Parallel Algorithm for Multilevel Graph Partitioning and Sparse Matrix Ordering
journal, January 1998

  • Karypis, George; Kumar, Vipin
  • Journal of Parallel and Distributed Computing, Vol. 48, Issue 1
  • DOI: 10.1006/jpdc.1997.1403

The SIESTA method for ab initio order- N materials simulation
journal, March 2002

  • Soler, José M.; Artacho, Emilio; Gale, Julian D.
  • Journal of Physics: Condensed Matter, Vol. 14, Issue 11
  • DOI: 10.1088/0953-8984/14/11/302

Some stable methods for calculating inertia and solving symmetric linear systems
journal, January 1977


A Fully Asynchronous Multifrontal Solver Using Distributed Dynamic Scheduling
journal, January 2001

  • Amestoy, Patrick R.; Duff, Iain S.; L'Excellent, Jean-Yves
  • SIAM Journal on Matrix Analysis and Applications, Vol. 23, Issue 1
  • DOI: 10.1137/S0895479899358194

The university of Florida sparse matrix collection
journal, November 2011

  • Davis, Timothy A.; Hu, Yifan
  • ACM Transactions on Mathematical Software, Vol. 38, Issue 1
  • DOI: 10.1145/2049662.2049663

A probing method for computing the diagonal of a matrix inverse
journal, April 2011

  • Tang, Jok M.; Saad, Yousef
  • Numerical Linear Algebra with Applications, Vol. 19, Issue 3
  • DOI: 10.1002/nla.779

A two-dimensional domain decomposition technique for the simulation of quantum-scale devices
journal, February 2012

  • Cauley, Stephen; Balakrishnan, Venkataramanan; Klimeck, Gerhard
  • Journal of Computational Physics, Vol. 231, Issue 4
  • DOI: 10.1016/j.jcp.2011.10.006

Inhomogeneous Electron Gas
journal, November 1964


An iteration method for the solution of the eigenvalue problem of linear differential and integral operators
journal, October 1950

  • Lanczos, C.
  • Journal of Research of the National Bureau of Standards, Vol. 45, Issue 4
  • DOI: 10.6028/jres.045.026

Extension and optimization of the FIND algorithm: Computing Green’s and less-than Green’s functions
journal, February 2012


Low cost high performance uncertainty quantification
conference, January 2009

  • Bekas, C.; Curioni, A.; Fedulova, I.
  • Proceedings of the 2nd Workshop on High Performance Computational Finance - WHPCF '09
  • DOI: 10.1145/1645413.1645421

Rational approximation to the Fermi–Dirac function with applications in density functional theory
journal, May 2010


Electronic structure calculations with dynamical mean-field theory
journal, August 2006


The Role of Elimination Trees in Sparse Factorization
journal, January 1990

  • Liu, Joseph W. H.
  • SIAM Journal on Matrix Analysis and Applications, Vol. 11, Issue 1
  • DOI: 10.1137/0611010

An estimator for the diagonal of a matrix
journal, November 2007


Finding off-diagonal entries of the inverse of a large symmetric sparse matrix: FINDING OFF-DIAGONAL ENTRIES OF THE INVERSE OF A LARGE SPARSE MATRIX
journal, February 2012

  • Eastwood, Shawn; Wan, Justin W. L.
  • Numerical Linear Algebra with Applications, Vol. 20, Issue 1
  • DOI: 10.1002/nla.1826

A nested dissection approach to modeling transport in nanodevices: Algorithms and applications: NESTED DISSECTION APPROACH TO MODELING TRANSPORT IN NANODEVICES
journal, June 2013

  • Hetmaniuk, U.; Zhao, Y.; Anantram, M. P.
  • International Journal for Numerical Methods in Engineering, Vol. 95, Issue 7
  • DOI: 10.1002/nme.4518

Adaptive local basis set for Kohn–Sham density functional theory in a discontinuous Galerkin framework I: Total energy calculation
journal, February 2012


A fast algorithm for sparse matrix computations related to inversion
journal, June 2013


An Efficient Block-Oriented Approach to Parallel Sparse Cholesky Factorization
journal, November 1994

  • Rothberg, Edward; Gupta, Anoop
  • SIAM Journal on Scientific Computing, Vol. 15, Issue 6
  • DOI: 10.1137/0915085

A Shifted Block Lanczos Algorithm for Solving Sparse Symmetric Generalized Eigenproblems
journal, January 1994

  • Grimes, Roger G.; Lewis, John G.; Simon, Horst D.
  • SIAM Journal on Matrix Analysis and Applications, Vol. 15, Issue 1
  • DOI: 10.1137/S0895479888151111

Computing entries of the inverse of a sparse matrix using the FIND algorithm
journal, November 2008


PT-Scotch: A tool for efficient parallel graph ordering
journal, July 2008


Self-Consistent Equations Including Exchange and Correlation Effects
journal, November 1965


Works referencing / citing this record: