U.S. Department of Energy
Office of Scientific and Technical Information

An Automated Tool for Analysis and Tuning of GPU-Accelerated Code in HPC Applications

Journal Article · IEEE Transactions on Parallel and Distributed Systems

Not provided.

Research Organization:
Lawrence Berkeley National Laboratory (LBNL), Berkeley, CA (United States). National Energy Research Scientific Computing Center (NERSC)
Sponsoring Organization:
USDOE
DOE Contract Number:
AC02-05CH11231
OSTI ID:
1982884
Journal Information:
IEEE Transactions on Parallel and Distributed Systems, Vol. 33, Issue 4; ISSN 1045-9219
Publisher:
IEEE
Country of Publication:
United States
Language:
English
