Skip to main content
U.S. Department of Energy
Office of Scientific and Technical Information

Sparse matrix‐vector and matrix‐multivector products for the truncated SVD on graphics processors

Journal Article · · Concurrency and Computation. Practice and Experience
DOI:https://doi.org/10.1002/cpe.7871· OSTI ID:1993862
 [1];  [2];  [3];  [4]
  1. Depto. de Ingeniería y Ciencia de Computadores Universitat Jaume I Castellón de la Plana Spain
  2. Steinbuch Centre for Computing Karlsruhe Institute of Technology Karlsruhe Germany, Innovative Computing Lab University of Tennessee Knoxville (Tennessee) USA
  3. Depto. de Informática de Sistemas y Computadores Universitat Politècnica de València Valencia Spain
  4. Depto. de Ingeniería y Ciencia de Computadores Universitat Jaume I Castellón de la Plana Spain, Depto. de Informática Universitat de València Valencia Spain
Summary

Many practical algorithms for numerical rank computations implement an iterative procedure that involves repeated multiplications of a vector, or a collection of vectors, with both a sparse matrix and its transpose. Unfortunately, the realization of these sparse products on current high performance libraries often deliver much lower arithmetic throughput when the matrix involved in the product is transposed. In this work, we propose a hybrid sparse matrix layout, named CSRC, that combines the flexibility of some well‐known sparse formats to offer a number of appealing properties: (1) CSRC can be obtained at low cost from the popular CSR (compressed sparse row) format; (2) CSRC has similar storage requirements as CSR; and especially, (3) the implementation of the sparse product kernels delivers high performance for both the direct product and its transposed variant on modern graphics accelerators thanks to a significant reduction of atomic operations compared to a conventional implementation based on CSR. This solution thus renders considerably higher performance when integrated into an iterative algorithm for the truncated singular value decomposition (SVD), such as the randomized SVD or, as demonstrated in the experimental results, the block Golub–Kahan–Lanczos algorithm.

Sponsoring Organization:
USDOE
OSTI ID:
1993862
Alternate ID(s):
OSTI ID: 1995857
Journal Information:
Concurrency and Computation. Practice and Experience, Journal Name: Concurrency and Computation. Practice and Experience Journal Issue: 28 Vol. 35; ISSN 1532-0626
Publisher:
Wiley Blackwell (John Wiley & Sons)Copyright Statement
Country of Publication:
United Kingdom
Language:
English

References (11)

Balanced and Compressed Coordinate Layout for the Sparse Matrix-Vector Product on GPUs book January 2021
Reduced-Bandwidth Multithreaded Algorithms for Sparse Matrix-Vector Multiplication
  • Buluç, Aydin; Williams, Samuel; Oliker, Leonid
  • Distributed Processing Symposium (IPDPS), 2011 IEEE International Parallel & Distributed Processing Symposium https://doi.org/10.1109/IPDPS.2011.73
conference May 2011
GMRES: A Generalized Minimal Residual Algorithm for Solving Nonsymmetric Linear Systems journal July 1986
Finding Structure with Randomness: Probabilistic Algorithms for Constructing Approximate Matrix Decompositions journal January 2011
Templates for the Solution of Linear Systems: Building Blocks for Iterative Methods book January 1994
Toward the Optimal Preconditioned Eigensolver: Locally Optimal Block Preconditioned Conjugate Gradient Method journal January 2001
Model-driven autotuning of sparse matrix-vector multiply on GPUs
  • Choi, Jee W.; Singh, Amik; Vuduc, Richard W.
  • Proceedings of the 15th ACM SIGPLAN symposium on Principles and practice of parallel programming - PPoPP '10 https://doi.org/10.1145/1693453.1693471
conference January 2010
The university of Florida sparse matrix collection journal November 2011
CSR5: An Efficient Storage Format for Cross-Platform Sparse Matrix-Vector Multiplication conference January 2015
Sparse Matrix-Vector Multiplication on GPGPUs journal January 2017
A Block Lanczos Method for Computing the Singular Values and Corresponding Singular Vectors of a Matrix journal June 1981

Similar Records

Fast truncated SVD of sparse and dense matrices on graphics processors
Journal Article · Wed Jun 07 00:00:00 EDT 2023 · International Journal of High Performance Computing Applications · OSTI ID:2424934

Fast truncated SVD of sparse and dense matrices on graphics processors
Journal Article · Wed Jun 07 00:00:00 EDT 2023 · International Journal of High Performance Computing Applications · OSTI ID:1984302

Related Subjects