Comparative evaluation of deep learning workloads for leadership-class systems
- ORNL
Deep learning (DL) workloads and their performance at scale are becoming important factors to consider as we design, develop and deploy next-generation high-performance computing systems. Since DL applications rely heavily on DL frameworks and underlying compute (CPU/GPU) stacks, it is essential to gain a holistic understanding from compute kernels, models, and frameworks of popular DL stacks, and to assess their impact on science-driven, mission-critical applications. At Oak Ridge Leadership Computing Facility (OLCF), we employ a set of micro and macro DL benchmarks established through the Collaboration of Oak Ridge, Argonne, and Livermore (CORAL) to evaluate the AI readiness of our next-generation supercomputers. In this paper, we present our early observations and performance benchmark comparisons between the Nvidia V100 based Summit system with its CUDA stack and an AMD MI100 based testbed system with its ROCm stack. We take a layered perspective on DL benchmarking and point to opportunities for future optimizations in the technologies that we consider.
- Research Organization:
- Oak Ridge National Laboratory (ORNL), Oak Ridge, TN (United States)
- Sponsoring Organization:
- USDOE; USDOE Office of Science (SC), Advanced Scientific Computing Research (ASCR) (SC-21)
- DOE Contract Number:
- AC05-00OR22725
- OSTI ID:
- 1838972
- Country of Publication:
- United States
- Language:
- English
Similar Records
Studying Performance Portability of LAMMPS across Diverse GPU-based Platforms
Studying performance portability of LAMMPS across diverse GPU-based platforms
Workload Characterization of a Leadership Class Storage Cluster
Conference
·
Sun May 01 00:00:00 EDT 2022
·
OSTI ID:1869068
Studying performance portability of LAMMPS across diverse GPU-based platforms
Journal Article
·
Sat Sep 23 20:00:00 EDT 2023
· Concurrency and Computation. Practice and Experience
·
OSTI ID:2076176
Workload Characterization of a Leadership Class Storage Cluster
Conference
·
Thu Dec 31 23:00:00 EST 2009
·
OSTI ID:993463