Harnessing billions of tasks for a scalable portable hydrodynamic simulation of the merger of two stars
- Univ. of Erlangen-Nuremberg (FAU), Bavaria (Germany)
- NVIDIA, Santa Clara, CA (United States)
- Univ. of Oregon, Eugene, OR (United States)
- National Supercomputing Centre, Lugano (Switzerland)
- Los Alamos National Lab. (LANL), Los Alamos, NM (United States)
- Lawrence Berkeley National Lab. (LBNL), Berkeley, CA (United States)
- GSI-Helmholtzzentrum fur Schwerionenforschung, Darmstadt (Germany)
- Louisiana State Univ., Baton Rouge, LA (United States)
- Univ. of Stuttgart, Baden-Württemberg (Germany)
- Lawrence Livermore National Lab. (LLNL), Livermore, CA (United States)
We propose a highly scalable demonstration of a portable asynchronous many-task programming model and runtime system applied to a grid-based adaptive mesh refinement hydrodynamic simulation of a double white dwarf merger with 14 levels of refinement that spans 17 orders of magnitude in astrophysical densities. The code uses the portable C++ parallel programming model that is embodied in the HPX library and being incorporated into the ISO C++ standard. The model reflects a significant shift from existing bulk synchronous parallel programming models under consideration for exascale systems. Through the use of the Futurization technique, seemingly sequential code is transformed into wait-free asynchronous tasks. We demonstrate the potential of our model by showing results from strong scaling runs on National Energy Research Scientific Computing Center’s Cori system (658,784 Intel Knight’s Landing cores) that achieve a parallel efficiency of 96.8% using billions of asynchronous tasks.
- Research Organization:
- Los Alamos National Lab. (LANL), Los Alamos, NM (United States)
- Sponsoring Organization:
- USDOE Office of Science (SC), Advanced Scientific Computing Research (ASCR) (SC-21)
- Contributing Organization:
- The STE||AR Group
- Grant/Contract Number:
- 89233218CNA000001; AC02-05CH11231; SC0008638; SC0008714; AC52-07NA27344
- OSTI ID:
- 1524389
- Report Number(s):
- LA-UR--17-31311
- Journal Information:
- International Journal of High Performance Computing Applications, Journal Name: International Journal of High Performance Computing Applications Journal Issue: 4 Vol. 33; ISSN 1094-3420
- Publisher:
- SAGECopyright Statement
- Country of Publication:
- United States
- Language:
- English
Preparing NERSC users for Cori, a Cray XC40 system with Intel many integrated cores
|
journal | August 2017 |
Vc: A C++ library for explicit vectorization: VC: A C++ LIBRARY FOR EXPLICIT VECTORIZATION
|
journal | December 2011 |
New High-Resolution Central Schemes for Nonlinear Conservation Laws and Convection–Diffusion Equations
|
journal | May 2000 |
Angular momentum preserving cell-centered Lagrangian and Eulerian schemes on arbitrary grids
|
journal | June 2015 |
Legion: Expressing locality and independence with logical regions
|
conference | November 2012 |
X10: an object-oriented approach to non-uniform cluster computing
|
conference | January 2005 |
X10: an object-oriented approach to non-uniform cluster computing
|
journal | October 2005 |
Continuation-passing, closure-passing style
|
conference | January 1989 |
The Tau Parallel Performance System
|
journal | May 2006 |
Parallel Programmability and the Chapel Language
|
journal | August 2007 |
Similar Records
Demonstrating GPU code portability and scalability for radiative heat transfer computations
Demonstrating GPU code portability and scalability for radiative heat transfer computations