Skip to main content
U.S. Department of Energy
Office of Scientific and Technical Information

Educating HPC Users in the use of advanced computing technology

Conference · · 2021 IEEE/ACM Ninth Workshop on Education for High Performance Computing (EduHPC)

We examine a multi-modal approach to educating and training users of an advanced computing technology testbed at the Institute for Advanced Computational Science at Stony Brook University. Ookami provides researchers worldwide with access to 176 Fujitsu A64FX compute nodes, this being the same processor technology powering the Japanese Fugaku supercomputer, the fastest computer in the world since June 2020. However, achieving high-performance on this Arm-based, leadership computing technology requires that users be familiar with details of computer architecture, performance analysis and modeling, and high-performance programming models that are commonly omitted in introductory programming courses. Indeed, regardless of their seniority, many of the testbed users are surprisingly unfamiliar with basic concepts such as vectorization, pipelining, latency/bandwidth, roofline models, computing energy/power, threads, and non-uniform memory access. These same concepts also pervade mainstream x86 technologies, so this is of widespread concern. Due to the national/global nature of our user community that is also very diverse in both discipline and experience, the inability to offer formal classes, and our experience that most people do not tend to read online documentation or training materials in sufficient depth, we have consciously employed multiple approaches that heavily emphasize (online) personal interactions and transfer of skills. Online documentation has been organized around best-practices and FAQs; twice-weekly hackathons and office hours via Zoom enable deep dives by both the team and the user community with multiple broad benefits; a Slack channel provides both real time and archived answers and discussions; and workshops, training and webinars target community needs as they arise. Furthermore, the perspective that these tools are being used in an educational setting rather than just for project communication makes them more effective and contributes to community success.

Research Organization:
Stony Brook Univ., NY (United States)
Sponsoring Organization:
USDOE Office of Science (SC), Nuclear Physics (NP)
DOE Contract Number:
FG02-87ER40317
OSTI ID:
1907878
Journal Information:
2021 IEEE/ACM Ninth Workshop on Education for High Performance Computing (EduHPC), Conference: 2021 IEEE/ACM Ninth Workshop on Education for High Performance Computing (EduHPC), St. Louis, MO (United States), 14 Nov 2021
Country of Publication:
United States
Language:
English

References (20)

Pragmatic optimizations for better scientific utilization of large supercomputers November 2012
Experience report: refactoring the mesh interface in FLASH, a multiphysics software October 2018
FLASH: An Adaptive Mesh Hydrodynamics Code for Modeling Astrophysical Thermonuclear Flashes November 2000
Open XDMoD: A Tool for the Comprehensive Management of High-Performance Computing Resources July 2015
Comparing the behavior of OpenMP Implementations with various Applications on two different Fujitsu A64FX platforms July 2021
On Validating an Astrophysical Simulation Code November 2002
The Response of Model and Astrophysical Thermonuclear Flames to Curvature and Stretch October 2003
The Tau Parallel Performance System May 2006
Helium Detonations on Neutron Stars March 2001
Automated Instruction Stream Throughput Prediction for Intel and AMD Microarchitectures November 2018
On the nonlinear evolution of wind-driven gravity waves July 2004
Terascale turbulence computation using the FLASH3 application framework on the IBM Blue Gene/L system January 2008
Evolution of FLASH, a multi-physics scientific simulation code for high-performance computing October 2013
On Heavy Element Enrichment in Classical Novae February 2004
Automatic Throughput and Critical Path Analysis of x86 and ARM Assembly Kernels November 2019
High-Performance Reactive Fluid Flow Simulations Using Adaptive Mesh Refinement on Thousands of Processors January 2000
The Software development process of FLASH, a multiphysics simulation code May 2013
Flash code: studying astrophysical thermonuclear flashes March 2000
FLASH MHD simulations of experiments that study shock-generated magnetic fields December 2015
Programming abstractions for orchestration of HPC scientific computing (keynote) June 2019

Similar Records

Ookami: Deployment and Initial Experiences
Conference · 2021 · Practice and Experience in Advanced Research Computing · OSTI ID:1907882

Experiences with Porting the FLASH Code to Ookami, an HPE Apollo 80 A64FX Platform
Conference · 2022 · International Conference on High Performance Computing in Asia-Pacific Region Workshops · OSTI ID:1907875

Performance of an Astrophysical Radiation Hydrodynamics Code under Scalable Vector Extension Optimization
Journal Article · 2022 · Proceedings - IEEE International Conference on Cluster Computing · OSTI ID:1907870