Skip to main content
U.S. Department of Energy
Office of Scientific and Technical Information

Experiences Readying Applications for Exascale

Conference ·

The advent of Exascale computing invites an assessment of existing best practices for developing application readiness on the world's largest supercomputers. This work details observations from the last four years in preparing scientific applications to run on the Oak Ridge Leadership Computing Facility's (OLCF) Frontier system. This paper addresses a range of topics in software including programmability, tuning, and portability considerations that are key to moving applications from existing systems to future installations. A set of representative workloads provides case studies for general system and software testing. We evaluate the use of early access systems for development across several generations of hardware. Finally, we discuss how best practices were identified and disseminated to the community through a wide range of activities including user-guides and trainings. We conclude with recommendations for ensuring application readiness on future leadership computing systems.

Research Organization:
National Renewable Energy Laboratory (NREL), Golden, CO (United States)
Sponsoring Organization:
USDOE Office of Science (SC), Advanced Scientific Computing Research (ASCR)
DOE Contract Number:
AC36-08GO28308
OSTI ID:
2283029
Report Number(s):
NREL/CP-2C00-85977; MainId:86750; UUID:84d7a168-ae61-48be-bc84-cca7e7ce19f8; MainAdminID:70599
Resource Relation:
Conference: Presented at the International Conference for High Performance Computing, Networking, Storage, and Analysis, 12-17 November 2023, Denver, Colorado
Country of Publication:
United States
Language:
English

References (42)

Progressive Optimization of Batched LU Factorization on GPUs September 2019
GridTools: A framework for portable weather and climate applications July 2021
Parallel reactive molecular dynamics: Numerical methods and algorithmic techniques April 2012
Nyx: A MASSIVELY PARALLEL AMR CODE FOR COMPUTATIONAL COSMOLOGY February 2013
Enabling GPU accelerated computing in the SUNDIALS time integration library December 2021
Scaling Correlated Fragment Molecular Orbital Calculations on Summit November 2022
Recent developments in the general atomic and molecular electronic structure system April 2020
RAJA: Portable Performance for Large-Scale Scientific Applications November 2019
The DOE E3SM Coupled Model Version 1: Description and Results at High Resolution December 2019
ReaxFF Reactive Force Field for Molecular Dynamics Simulations of Hydrocarbon Oxidation February 2008
OpenMP: an industry standard API for shared-memory programming January 1998
Kokkos: Enabling manycore performance portability through polymorphic memory access patterns December 2014
GPU acceleration of the Locally Selfconsistent Multiple Scattering code for first principles calculation of the ground state and statistical physics of materials February 2017
Real-Space Multiple-Scattering Theory and Its Applications at Exascale November 2017
Simulating Hydrodynamics in Cosmology with CRK-HACC January 2023
Farpoint: A High-resolution Cosmology Simulation at the Gigaparsec Scale February 2022
Novel Computer Architectures and Quantum Chemistry May 2020
HACC: extreme scaling and performance across diverse architectures December 2016
PeleC: An adaptive mesh refinement solver for compressible reacting flows September 2022
Attacking the Opioid Epidemic: Determining the Epistatic and Pleiotropic Genetic Architectures for Chronic Pain and Opioid Addiction November 2018
Climatic Clustering and Longitudinal Analysis with Impacts on Food, Bioenergy, and Pandemics June 2023
Petascale direct numerical simulation of turbulent channel flow on up to 786K cores January 2013
Experiences from Leadership Computing in Simulations of Turbulent Fluid Flows September 2014
ReaxFF- l g: Correction of the ReaxFF Reactive Force Field for London Dispersion, with Applications to the Equations of State for Energetic Materials October 2011
Examining recent many-core architectures and programming models using SHOC
  • No authors listed
  • Proceedings of the 6th International Workshop on Performance Modeling, Benchmarking, and Simulation of High Performance Computing Systems https://doi.org/10.1145/2832087.2832090
November 2015
Pre-exascale accelerated application development: The ORNL Summit experience May 2020
Evaluating GPU Programming Models for the LUMI Supercomputer January 2022
The Parallel C++ Statistical Library for Bayesian Inference: QUESO January 2017
The Trinity Center of Excellence Co-Design Best Practices January 2017
Application Modernization at LLNL and the Sierra Center of Excellence January 2017
Knowledge Network Embedding of Transcriptomic Data from Spaceflown Mice Uncovers Signs and Symptoms Associated with Terrestrial Diseases January 2021
A Holistic Algorithmic Approach to Improving Accuracy, Robustness, and Computational Efficiency for Atmospheric Dynamics January 2020
Portable C++ Code that can Look and Feel Like Fortran Code with Yet Another Kernel Launcher (YAKL) December 2022
Unprecedented cloud resolution in a GPU-enabled full-physics atmospheric climate simulation on OLCF’s summit supercomputer July 2021
CFDNet June 2020
SURFNet: Super-Resolution of Turbulent Flows with Transfer Learning using Small Datasets September 2021
GPU acceleration of extreme scale pseudo-spectral simulations of turbulence using asynchronism November 2019
OpenCL: A Parallel Programming Standard for Heterogeneous Computing Systems May 2010
LAMMPS - a flexible simulation tool for particle-based materials modeling at the atomic, meso, and continuum scales February 2022
Kokkos 3: Programming Model Extensions for the Exascale Era January 2021
Order- N Multiple Scattering Approach to Electronic Structure Calculations October 1995
AMReX: a framework for block-structured adaptive mesh refinement May 2019