Fast synchronization-free algorithms for parallel sparse triangular solves with multiple right-hand sides
- University of Copenhagen
- BATTELLE (PACIFIC NW LAB)
- STFC Rutherford Appleton Laboratory, UK
The sparse triangular solve kernels, SpTRSV and SpTRSM, are important building blocks for a number of numerical linear algebra routines. Parallelizing SpTRSV and SpTRSM on today's many-core platforms, such as GPUs, is not an easy task since computing a component of the solution may depend on previously computed components, enforcing a degree of sequential processing. As a consequence, most existing work introduces a preprocessing stage to partition the components into a group of level-sets or colour-sets so that components within a set are independent and can be processed simultaneously during the subsequent solution stage. However, this class of methods requires a long preprocessing time as well as significant runtime synchronization over-heads between the sets. To address this, we
- Research Organization:
- Pacific Northwest National Laboratory (PNNL), Richland, WA (United States)
- Sponsoring Organization:
- USDOE
- DOE Contract Number:
- AC05-76RL01830
- OSTI ID:
- 1557091
- Report Number(s):
- PNNL-SA-130501
- Journal Information:
- Concurrency and Computation: Practice and Experience, Vol. 29, Issue 21
- Country of Publication:
- United States
- Language:
- English
Similar Records
Fast and Scalable Sparse Triangular Solver for Multi-GPU Based HPC Architectures
Unified Communication Optimization Strategies for Sparse Triangular Solver on CPU and GPU Clusters