OSTI.GOV: U.S. Department of Energy
Office of Scientific and Technical Information

Title: A GEMM interface and implementation on NVIDIA GPUs for multiple small matrices

Journal Article · Journal of Parallel and Distributed Computing

Sponsoring Organization:
USDOE
Grant/Contract Number:
SC0004439
OSTI ID:
1250088
Journal Information:
Journal of Parallel and Distributed Computing, Vol. 75, Issue C; ISSN 0743-7315
Publisher:
Elsevier
Country of Publication:
United States
Language:
English
Citation Metrics:
Cited by: 12 works
Citation information provided by Web of Science

Similar Records

Performance Analysis of Memory Transfers and GEMM Subroutines on NVIDIA Tesla GPU Cluster
Conference · Aug 31, 2009 · OSTI ID:1250088

Implementation and Tuning of Batched Cholesky Factorization and Solve for NVIDIA GPUs
Journal Article · Jul 1, 2016 · IEEE Transactions on Parallel and Distributed Systems · OSTI ID:1250088

Performance Portable Sparse Matrix-Matrix Multiplication on Intel Knights Landing and NVIDIA GPUs.
Conference · Nov 1, 2016 · OSTI ID:1250088

Related Subjects