Deflation as a method of variance reduction for estimating the trace of a matrix inverse

Gambhir, Arjun Singh; Stathopoulos, Andreas; Orginos, Kostas

doi:10.1137/16M1066361

Title: Deflation as a method of variance reduction for estimating the trace of a matrix inverse

Abstract

Many fields require computing the trace of the inverse of a large, sparse matrix. The typical method used for such computations is the Hutchinson method which is a Monte Carlo (MC) averaging over matrix quadratures. To improve its convergence, several variance reductions techniques have been proposed. In this paper, we study the effects of deflating the near null singular value space. We make two main contributions. First, we analyze the variance of the Hutchinson method as a function of the deflated singular values and vectors. Although this provides good intuition in general, by assuming additionally that the singular vectors are random unitary matrices, we arrive at concise formulas for the deflated variance that include only the variance and mean of the singular values. We make the remarkable observation that deflation may increase variance for Hermitian matrices but not for non-Hermitian ones. This is a rare, if not unique, property where non-Hermitian matrices outperform Hermitian ones. The theory can be used as a model for predicting the benefits of deflation. Second, we use deflation in the context of a large scale application of "disconnected diagrams" in Lattice QCD. On lattices, Hierarchical Probing (HP) has previously provided an order of magnitude ofmore »« less

Authors:

Gambhir, Arjun Singh ^[1]; Stathopoulos, Andreas ^[1]; Orginos, Kostas ^[1]

College of William and Mary, Williamsburg, VA (United States)

Publication Date:: Thu Apr 06 00:00:00 EDT 2017

Research Org.:: Thomas Jefferson National Accelerator Facility (TJNAF), Newport News, VA (United States)

Sponsoring Org.:: USDOE Office of Science (SC), Nuclear Physics (NP)

OSTI Identifier:: 1362120

Report Number(s):: JLAB-THY-16-2291; DOE/OR/23177-3870; arXiv:1603.05988
Journal ID: ISSN 1064-8275

Grant/Contract Number:: AC05-06OR23100; CCF 1218349; ACI S12-SSE 1440700; FC02-12ER41890; FG02-04ER41302; AC05-06OR23177

Resource Type:: Accepted Manuscript

Journal Name:: SIAM Journal on Scientific Computing

Additional Journal Information:: Journal Volume: 39; Journal Issue: 2; Journal ID: ISSN 1064-8275

Publisher:: SIAM

Country of Publication:: United States

Language:: English

Subject:: 97 MATHEMATICS AND COMPUTING; deflation; random unitary matrices; Monte Carlo; trace of matrix inverse; Hutchinson; singular values

Citation Formats


                    Gambhir, Arjun Singh, Stathopoulos, Andreas, and Orginos, Kostas. Deflation as a method of variance reduction for estimating the trace of a matrix inverse.  United States: N. p., 2017. 
Web.  doi:10.1137/16M1066361.

Copy to clipboard


                    Gambhir, Arjun Singh, Stathopoulos, Andreas, & Orginos, Kostas. Deflation as a method of variance reduction for estimating the trace of a matrix inverse.  United States.  https://doi.org/10.1137/16M1066361

Copy to clipboard


                    Gambhir, Arjun Singh, Stathopoulos, Andreas, and Orginos, Kostas. Thu .  
"Deflation as a method of variance reduction for estimating the trace of a matrix inverse".  United States.  https://doi.org/10.1137/16M1066361.  https://www.osti.gov/servlets/purl/1362120.

Copy to clipboard


                    
@article{osti_1362120,

  title        = {Deflation as a method of variance reduction for estimating the trace of a matrix inverse},

  author       = {Gambhir, Arjun Singh and Stathopoulos, Andreas and Orginos, Kostas},

  abstractNote = {Many fields require computing the trace of the inverse of a large, sparse matrix. The typical method used for such computations is the Hutchinson method which is a Monte Carlo (MC) averaging over matrix quadratures. To improve its convergence, several variance reductions techniques have been proposed. In this paper, we study the effects of deflating the near null singular value space. We make two main contributions. First, we analyze the variance of the Hutchinson method as a function of the deflated singular values and vectors. Although this provides good intuition in general, by assuming additionally that the singular vectors are random unitary matrices, we arrive at concise formulas for the deflated variance that include only the variance and mean of the singular values. We make the remarkable observation that deflation may increase variance for Hermitian matrices but not for non-Hermitian ones. This is a rare, if not unique, property where non-Hermitian matrices outperform Hermitian ones. The theory can be used as a model for predicting the benefits of deflation. Second, we use deflation in the context of a large scale application of "disconnected diagrams" in Lattice QCD. On lattices, Hierarchical Probing (HP) has previously provided an order of magnitude of variance reduction over MC by removing "error" from neighboring nodes of increasing distance in the lattice. Although deflation used directly on MC yields a limited improvement of 30% in our problem, when combined with HP they reduce variance by a factor of over 150 compared to MC. For this, we pre-computated 1000 smallest singular values of an ill-conditioned matrix of size 25 million. Furthermore, using PRIMME and a domain-specific Algebraic Multigrid preconditioner, we perform one of the largest eigenvalue computations in Lattice QCD at a fraction of the cost of our trace computation.},

  doi          = {10.1137/16M1066361},

  journal      = {SIAM Journal on Scientific Computing},

  number       = 2,

  volume       = 39,

  place        = {United States},

  year         = {Thu Apr 06 00:00:00 EDT 2017},

  month        = {Thu Apr 06 00:00:00 EDT 2017}

}

Copy to clipboard

Journal Article:

Free Publicly Available Full Text

Accepted Manuscript (DOE)

Publisher's Version of Record

https://doi.org/10.1137/16M1066361

Other availability

Search WorldCat to find libraries that may hold this journal

Citation Metrics:

Cited by: 17 works

Citation information provided by
Web of Science

Save / Share:

Export Metadata

Save to My Library

Works referenced in this record:

Randomized algorithms for estimating the trace of an implicit symmetric positive semi-definite matrix
journal, April 2011

Avron, Haim; Toledo, Sivan
Journal of the ACM, Vol. 58, Issue 2
DOI: 10.1145/1944345.1944349

Adaptive Multigrid Algorithm for the Lattice Wilson-Dirac Operator
journal, November 2010

Babich, R.; Brannick, J.; Brower, R. C.
Physical Review Letters, Vol. 105, Issue 20
DOI: 10.1103/PhysRevLett.105.201602

An estimator for the diagonal of a matrix
journal, November 2007

Bekas, C.; Kokiopoulou, E.; Saad, Y.
Applied Numerical Mathematics, Vol. 57, Issue 11-12
DOI: 10.1016/j.apnum.2007.01.003

High-precision calculation of the strange nucleon electromagnetic form factors
journal, August 2015

Green, Jeremy; Meinel, Stefan; Engelhardt, Michael
Physical Review D, Vol. 92, Issue 3
DOI: 10.1103/PhysRevD.92.031501

A stochastic estimator of the trace of the influence matrix for laplacian smoothing splines
journal, January 1990

Hutchinson, M. F.
Communications in Statistics - Simulation and Computation, Vol. 19, Issue 2
DOI: 10.1080/03610919008812866

Improved stochastic estimation of quark propagation with Laplacian Heaviside smearing in lattice QCD
journal, June 2011

Morningstar, C.; Bulava, J.; Foley, J.
Physical Review D, Vol. 83, Issue 11
DOI: 10.1103/PhysRevD.83.114505

Random matrix theory and spectral sum rules for the Dirac operator in QCD
journal, July 1993

Shuryak, E. V.; Verbaarschot, J. J. M.
Nuclear Physics A, Vol. 560, Issue 1
DOI: 10.1016/0375-9474(93)90098-I

Hierarchical Probing for Estimating the Trace of the Matrix Inverse on Toroidal Lattices
journal, January 2013

Stathopoulos, Andreas; Laeuchli, Jesse; Orginos, Kostas
SIAM Journal on Scientific Computing, Vol. 35, Issue 5
DOI: 10.1137/120881452

PRIMME: preconditioned iterative multimethod eigensolver—methods and software description
journal, April 2010

Stathopoulos, Andreas; McCombs, James R.
ACM Transactions on Mathematical Software, Vol. 37, Issue 2
DOI: 10.1145/1731022.1731031

Computing and Deflating Eigenvalues While Solving Multiple Right-Hand Side Linear Systems with an Application to Quantum Chromodynamics
journal, January 2010

Stathopoulos, Andreas; Orginos, Konstantinos
SIAM Journal on Scientific Computing, Vol. 32, Issue 1
DOI: 10.1137/080725532

Domain-Decomposition-Type Methods for Computing the Diagonal of a Matrix Inverse
journal, January 2011

Tang, Jok M.; Saad, Yousef
SIAM Journal on Scientific Computing, Vol. 33, Issue 5
DOI: 10.1137/100799939

Random Matrix Theory and Chiral Symmetry in QCD
journal, December 2000

Verbaarschot, J. J. M.; Wettig, T.
Annual Review of Nuclear and Particle Science, Vol. 50, Issue 1
DOI: 10.1146/annurev.nucl.50.1.343

Estimating the trace of the matrix inverse by interpolating from the diagonal of an approximate inverse
journal, December 2016

Wu, Lingfei; Laeuchli, Jesse; Kalantzis, Vassilis
Journal of Computational Physics, Vol. 326
DOI: 10.1016/j.jcp.2016.09.001

Controlling excited-state contamination in nucleon matrix elements
journal, June 2016

Yoon, Boram; Gupta, Rajan; Bhattacharya, Tanmoy
Physical Review D, Vol. 93, Issue 11
DOI: 10.1103/PhysRevD.93.114506

Works referencing / citing this record:

Proton and neutron electromagnetic form factors from lattice QCD
text, January 2018

Alexandrou, C.; Bacchio, S.; Constantinou, M.
Deutsches Elektronen-Synchrotron, DESY, Hamburg
DOI: 10.3204/pubdb-2019-00443

Complete flavor decomposition of the spin and momentum fraction of the proton using lattice QCD simulations at physical pion mass
text, January 2020

Alexandrou, C.; Bacchio, S.; Constantinou, M.
Deutsches Elektronen-Synchrotron, DESY, Hamburg
DOI: 10.3204/pubdb-2020-02369

Proton and neutron electromagnetic form factors from lattice QCD
text, January 2019

Alexandrou, C.; Bacchio, S.; Constantinou, M.
Deutsches Elektronen-Synchrotron, DESY, Hamburg
DOI: 10.3204/pubdb-2020-00239

Similar Records in DOE PAGES and OSTI.GOV collections:

Multigrid deflation for Lattice QCD

Journal Article Romero, Eloy ; Stathopoulos, Andreas ; Orginos, Kostas - Journal of Computational Physics

Computing the trace of the inverse of large matrices is typically addressed through statistical methods. Deflating out the lowest eigenvectors or singular vectors of the matrix reduces the variance of the trace estimator. This work summarizes our efforts to reduce the computational cost of computing the deflation space while achieving the desired variance reduction for Lattice QCD applications. Previous efforts computed the lower part of the singular spectrum of the Dirac operator by using an eigensolver preconditioned with a multigrid linear system solver. Despite the improvement in performance in those applications, as the problem size grows the runtime and storagemore »« less
Cited by 3
https://doi.org/10.1016/j.jcp.2020.109356

Full Text Available
Disconnected Diagrams in Lattice QCD

Thesis/Dissertation Gambhir, Arjun

In this work, we present state-of-the-art numerical methods and their applications for computing a particular class of observables using lattice quantum chromodynamics (Lattice QCD), a discretized version of the fundamental theory of quarks and gluons. These observables require calculating so called \disconnected diagrams" and are important for understanding many aspects of hadron structure, such as the strange content of the proton. We begin by introducing the reader to the key concepts of Lattice QCD and rigorously define the meaning of disconnected diagrams through an example of the Wick contractions of the nucleon. Subsequently, the calculation of observables requiring disconnected diagramsmore »« less
https://doi.org/10.2172/1422713

Full Text Available
Probing for the Trace Estimation of a Permuted Matrix Inverse Corresponding to a Lattice Displacement

Journal Article Switzer, Heather M. ; Stathopoulos, Andreas ; Romero, Eloy ; ... - SIAM Journal on Scientific Computing

We report thatpProbing is a general technique that is used to reduce the variance of the Hutchinson stochastic estimator for the trace of the inverse of a large, sparse matrix A. The variance of the estimator is the sum of the squares of the off-diagonal elements of A^-1. Therefore, this technique computes probing vectors that when used in the estimator annihilate the largest off-diagonal elements. For matrices that display decay of the magnitude of |Amore »« less
https://doi.org/10.1137/21m1422495

Full Text Available
Optimizing shift selection in multilevel Monte Carlo for disconnected diagrams in lattice QCD

Journal Article Whyte, Travis ; Stathopoulos, Andreas ; Romero, Eloy ; ... - Computer Physics Communications

The calculation of disconnected diagram contributions to physical signals is a computationally expensive task in Lattice QCD. To extract the physical signal, the trace of the inverse Lattice Dirac operator, a large sparse matrix, must be stochastically estimated. Because the variance of the stochastic estimator is typically large, variance reduction techniques must be employed. Multilevel Monte Carlo (MLMC) methods reduce the variance of the trace estimator by utilizing a telescoping sequence of estimators. Frequency Splitting is one such method that uses a sequence of inverses of shifted operators to estimate the trace of the inverse lattice Dirac operator, however theremore »« less
https://doi.org/10.1016/j.cpc.2023.108928
Deflation for inversion with multiple right-hand sides in QCD

Conference A. Stathopoulos, A.M. Abdel-Rehim, K. Orginos, - J. Phys., Conf. Ser.

Most calculations in lattice Quantum Chromodynamics (QCD) involve the solution of a series of linear systems of equations with exceedingly large matrices and a large number of right hand sides. Iterative methods for these problems can be sped up significantly if we deflate approximations of appropriate invariant spaces from the initial guesses. Recently we have developed eigCG, a modification of the Conjugate Gradient (CG) method, which while solving a linear system can reuse a window of the CG vectors to compute eigenvectors almost as accurately as the Lanczos method. The number of approximate eigenvectors can increase as more systems aremore »« less
https://doi.org/10.1088/1742-6596/180/1/012073

Similar Records

Title: Deflation as a method of variance reduction for estimating the trace of a matrix inverse

Abstract

Citation Formats

Randomized algorithms for estimating the trace of an implicit symmetric positive semi-definite matrix journal, April 2011

Adaptive Multigrid Algorithm for the Lattice Wilson-Dirac Operator journal, November 2010

An estimator for the diagonal of a matrix journal, November 2007

High-precision calculation of the strange nucleon electromagnetic form factors journal, August 2015

A stochastic estimator of the trace of the influence matrix for laplacian smoothing splines journal, January 1990

Improved stochastic estimation of quark propagation with Laplacian Heaviside smearing in lattice QCD journal, June 2011

Random matrix theory and spectral sum rules for the Dirac operator in QCD journal, July 1993

Hierarchical Probing for Estimating the Trace of the Matrix Inverse on Toroidal Lattices journal, January 2013

PRIMME: preconditioned iterative multimethod eigensolver—methods and software description journal, April 2010

Computing and Deflating Eigenvalues While Solving Multiple Right-Hand Side Linear Systems with an Application to Quantum Chromodynamics journal, January 2010

Domain-Decomposition-Type Methods for Computing the Diagonal of a Matrix Inverse journal, January 2011

Random Matrix Theory and Chiral Symmetry in QCD journal, December 2000

Estimating the trace of the matrix inverse by interpolating from the diagonal of an approximate inverse journal, December 2016

Controlling excited-state contamination in nucleon matrix elements journal, June 2016

Proton and neutron electromagnetic form factors from lattice QCD text, January 2018

Complete flavor decomposition of the spin and momentum fraction of the proton using lattice QCD simulations at physical pion mass text, January 2020

Proton and neutron electromagnetic form factors from lattice QCD text, January 2019

Randomized algorithms for estimating the trace of an implicit symmetric positive semi-definite matrix
journal, April 2011

Adaptive Multigrid Algorithm for the Lattice Wilson-Dirac Operator
journal, November 2010

An estimator for the diagonal of a matrix
journal, November 2007

High-precision calculation of the strange nucleon electromagnetic form factors
journal, August 2015

A stochastic estimator of the trace of the influence matrix for laplacian smoothing splines
journal, January 1990

Improved stochastic estimation of quark propagation with Laplacian Heaviside smearing in lattice QCD
journal, June 2011

Random matrix theory and spectral sum rules for the Dirac operator in QCD
journal, July 1993

Hierarchical Probing for Estimating the Trace of the Matrix Inverse on Toroidal Lattices
journal, January 2013

PRIMME: preconditioned iterative multimethod eigensolver—methods and software description
journal, April 2010

Computing and Deflating Eigenvalues While Solving Multiple Right-Hand Side Linear Systems with an Application to Quantum Chromodynamics
journal, January 2010

Domain-Decomposition-Type Methods for Computing the Diagonal of a Matrix Inverse
journal, January 2011

Random Matrix Theory and Chiral Symmetry in QCD
journal, December 2000

Estimating the trace of the matrix inverse by interpolating from the diagonal of an approximate inverse
journal, December 2016

Controlling excited-state contamination in nucleon matrix elements
journal, June 2016

Proton and neutron electromagnetic form factors from lattice QCD
text, January 2018

Complete flavor decomposition of the spin and momentum fraction of the proton using lattice QCD simulations at physical pion mass
text, January 2020

Proton and neutron electromagnetic form factors from lattice QCD
text, January 2019