OSTI.GOV, U.S. Department of Energy
Office of Scientific and Technical Information

Title: A GEMM interface and implementation on NVIDIA GPUs for multiple small matrices

Resource Type: Journal Article (Publisher's Accepted Manuscript)
Journal Name: Journal of Parallel and Distributed Computing
Journal Volume: 75; Journal Issue: C; Journal ID: ISSN 0743-7315
Country of Publication: United States

Citation Formats

Jhurani, Chetan, and Mullowney, Paul. A GEMM interface and implementation on NVIDIA GPUs for multiple small matrices. United States: N. p., 2015. Web. doi:10.1016/j.jpdc.2014.09.003.
Jhurani, Chetan, & Mullowney, Paul. A GEMM interface and implementation on NVIDIA GPUs for multiple small matrices. United States. doi:10.1016/j.jpdc.2014.09.003.
Jhurani, Chetan, and Mullowney, Paul. 2015. "A GEMM interface and implementation on NVIDIA GPUs for multiple small matrices". United States. doi:10.1016/j.jpdc.2014.09.003.
@article{jhurani_mullowney_2015,
  title   = {A GEMM interface and implementation on NVIDIA GPUs for multiple small matrices},
  author  = {Jhurani, Chetan and Mullowney, Paul},
  journal = {Journal of Parallel and Distributed Computing},
  volume  = {75},
  number  = {C},
  doi     = {10.1016/j.jpdc.2014.09.003},
  place   = {United States},
  year    = {2015},
  month   = {1}
}

Journal Article: Free Publicly Available Full Text (Publisher's Version of Record at doi:10.1016/j.jpdc.2014.09.003)

Citation Metrics:
Cited by: 4 works
Citation information provided by
Web of Science

Similar Records:
  • Wang-Landau sampling is implemented on the Graphics Processing Unit (GPU) with the Compute Unified Device Architecture (CUDA). Performance on three different GPU cards, including a card based on the new-generation Fermi architecture, is compared with that on a Central Processing Unit (CPU). The parameters for massively parallel Wang-Landau sampling are tuned to achieve fast convergence. For simulations of water cluster systems, we obtain an average speedup of over 50 times for a given workload.
  • This paper presents parallelization strategies for the radial basis function-finite difference (RBF-FD) method. As a generalized finite-differencing scheme, the RBF-FD method works without underlying meshes to structure the nodes. It offers high-order accuracy and scales as O(N) per time step, with N the total number of nodes. To our knowledge, this is the first implementation of the RBF-FD method to leverage GPU accelerators for the solution of PDEs, and the first to span both multiple CPUs and multiple GPUs. OpenCL kernels target the GPUs, and inter-processor communication and synchronization are managed by the Message Passing Interface (MPI). We verify our implementation of the RBF-FD method with two hyperbolic PDEs on the sphere, and demonstrate up to 9x speedup on a commodity GPU with unoptimized kernel implementations. On a high-performance cluster, the method achieves up to 7x speedup for the maximum problem size of 27,556 nodes.
  • The ultrafast decay dynamics of 4-(N,N-dimethylamino)benzonitrile (DMABN) following photoexcitation was studied with the ab initio multiple spawning (AIMS) method, combined with GPU-accelerated linear-response time-dependent density functional theory (LR-TDDFT). We validate the LR-TDDFT method for this case and then present a detailed analysis of the first ≈200 fs of DMABN excited-state dynamics. Almost complete nonadiabatic population transfer from S2 (the initially populated bright state) to S1 takes place in less than 50 fs, without significant torsion of the dimethylamino (DMA) group. Significant torsion of the DMA group is only observed after the nuclear wavepacket reaches S1 and acquires locally excited electronic character. Our results show that torsion of the DMA group is not a prerequisite for nonadiabatic transitions in DMABN, although such motion is indeed relevant on the lowest excited state (S1).
  • Commodity clusters augmented with application accelerators are evolving into competitive high-performance computing systems. The Graphics Processing Unit (GPU), with its very high arithmetic density and performance-per-price ratio, is a good platform for scientific application acceleration. In addition to the interconnect bottlenecks among the cluster compute nodes, the cost of memory copies between the host and the GPU device has to be carefully amortized to improve the overall efficiency of the application. Scientific applications also rely on efficient implementations of the Basic Linear Algebra Subroutines (BLAS), among which the General Matrix Multiply (GEMM) is considered the workhorse subroutine. In this paper, we study the performance of the memory copies and GEMM subroutines that are critical to porting computational chemistry algorithms to GPU clusters. To that end, a benchmark based on the NetPIPE framework is developed to evaluate the latency and bandwidth of memory copies between the host and the GPU device. The performance of the single- and double-precision GEMM subroutines from the NVIDIA CUBLAS 2.0 library is studied. The results are compared with those of the BLAS routines from the Intel Math Kernel Library (MKL) to understand the computational trade-offs. The test bed is an Intel Xeon cluster equipped with NVIDIA Tesla GPUs.
  • Kepler is the newest GPU architecture from NVIDIA, and the GTX 680 is the first commercially available graphics card based on that architecture. Matrix multiplication is a canonical computational kernel, and often the main target of initial optimization efforts for a new chip. This article presents preliminary results of automatically tuning matrix multiplication kernels for the Kepler architecture using the GTX 680 card.
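The Wang-Landau record above names the algorithm without showing its update rule. A minimal sketch, assuming a toy system (the sum of two six-sided dice, not the paper's water clusters): a move is accepted with probability min(1, g(E_old)/g(E_new)), the visited energy's ln g is raised by ln f, and ln f is halved whenever the visit histogram is roughly flat.

```python
# Hedged sketch of Wang-Landau sampling on a toy two-dice system; the
# batch size, flatness threshold, and stopping value of ln f are
# illustrative choices, not values from the cited work.
import math
import random

random.seed(0)
energies = range(2, 13)               # possible sums of two dice
ln_g = {E: 0.0 for E in energies}     # running estimate of ln g(E)
hist = {E: 0 for E in energies}       # visit histogram for flatness test
state = [1, 1]                        # current die faces
ln_f = 1.0                            # modification factor ln f

while ln_f > 1e-4:
    for _ in range(20000):
        die = random.randrange(2)
        new_face = random.randint(1, 6)
        E_old = sum(state)
        E_new = E_old - state[die] + new_face
        # Wang-Landau acceptance: biases the walk toward rare energies
        if ln_g[E_new] <= ln_g[E_old] or \
           random.random() < math.exp(ln_g[E_old] - ln_g[E_new]):
            state[die] = new_face
        E = sum(state)
        ln_g[E] += ln_f
        hist[E] += 1
    # crude flatness check: refine ln f once every energy is well visited
    if min(hist.values()) > 0.8 * (sum(hist.values()) / len(hist)):
        ln_f /= 2.0
        hist = {E: 0 for E in energies}
```

The converged ln_g values are defined only up to an additive constant, so only ratios such as g(7)/g(2) (exactly 6 for two dice) are meaningful; the GPU versions discussed above parallelize many such walkers at once.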
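The RBF-FD record above can be made concrete with a small sketch: derivative weights at a node come from solving one small dense system A w = b per stencil, where A_jk = phi(|x_j - x_k|) and b holds the target operator applied to phi at the stencil center. The Gaussian basis, 1-D five-point stencil, spacing h, and shape parameter eps below are all illustrative assumptions, not the paper's setup.

```python
# Hedged sketch of computing RBF-FD weights for d^2/dx^2 on a 1-D stencil
# with a Gaussian basis phi(r) = exp(-(eps*r)^2); stencil and eps are
# arbitrary illustrative choices.
import numpy as np

def rbf_fd_weights_d2(nodes, center, eps):
    """Weights w such that w @ f(nodes) approximates f''(center)."""
    r = nodes[:, None] - nodes[None, :]
    A = np.exp(-(eps * r) ** 2)               # RBF interpolation matrix
    d = center - nodes
    # d^2/dx^2 of exp(-(eps*(x - x_j))^2), evaluated at x = center
    b = (4 * eps**4 * d**2 - 2 * eps**2) * np.exp(-(eps * d) ** 2)
    return np.linalg.solve(A, b)

h = 0.1
nodes = h * np.arange(-2, 3, dtype=float)     # 5-point stencil at 0
w = rbf_fd_weights_d2(nodes, 0.0, eps=0.5)
approx = w @ nodes**2                         # apply to f(x) = x^2, f'' = 2
```

Because each stencil yields an independent small solve and an independent sparse row, the method parallelizes naturally across nodes, which is what the GPU/MPI implementation above exploits.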
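The batched-GEMM idea behind the title article, and the GEMM measurements in the CUBLAS benchmarking record above, can be sketched in NumPy: stacking many small matrices lets one call over a 3-D array replace a loop of tiny GEMMs, which is the same pattern batched GPU interfaces expose. The sizes and batch count below are arbitrary.

```python
# Hedged sketch: many independent small products C_i = A_i @ B_i computed
# once per pair versus as a single batched call over stacked operands.
import numpy as np

rng = np.random.default_rng(0)
batch, m, k, n = 1000, 8, 8, 8                  # 1000 small 8x8 problems
A = rng.standard_normal((batch, m, k))
B = rng.standard_normal((batch, k, n))

# One GEMM per pair: correct, but per-call overhead dominates on a GPU
C_loop = np.stack([A[i] @ B[i] for i in range(batch)])

# Batched form: matmul broadcasts over the leading (batch) axis
C_batched = np.matmul(A, B)
```

On a GPU the per-call launch cost is far larger than an 8x8 multiply, which is why a single batched interface over all the small matrices is the performance-relevant design.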
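The auto-tuning loop described in the Kepler record above can be sketched in miniature: enumerate kernel variants, time each on the target, and keep the fastest. Here the "variants" are tile sizes for a blocked matrix multiply in NumPy; a real tuner for the GTX 680 would instead sweep launch geometry, unrolling, and shared-memory usage on the GPU.

```python
# Hedged sketch of empirical auto-tuning: brute-force search over tile
# sizes for a blocked matmul; candidate tiles and problem size are
# illustrative choices.
import time
import numpy as np

def blocked_matmul(A, B, tile):
    """Blocked C = A @ B; `tile` must divide the matrix dimension."""
    n = A.shape[0]
    C = np.zeros((n, n))
    for i in range(0, n, tile):
        for j in range(0, n, tile):
            for k in range(0, n, tile):
                C[i:i+tile, j:j+tile] += (
                    A[i:i+tile, k:k+tile] @ B[k:k+tile, j:j+tile]
                )
    return C

rng = np.random.default_rng(0)
n = 128
A, B = rng.standard_normal((n, n)), rng.standard_normal((n, n))

best_tile, best_time = None, float("inf")
for tile in (16, 32, 64, 128):        # candidate kernel variants
    t0 = time.perf_counter()
    C = blocked_matmul(A, B, tile)
    dt = time.perf_counter() - t0
    if dt < best_time:
        best_tile, best_time = tile, dt
```

Every variant computes the same result, so the tuner only trades time, not correctness; that invariant is what makes exhaustive empirical search safe for a new architecture.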