Skip to main content
U.S. Department of Energy
Office of Scientific and Technical Information

Accelerating Flash-X Simulations with Asynchronous I/O

Conference ·
 [1];  [2];  [1];  [3];  [4]
  1. Argonne National Laboratory (ANL)
  2. Lawrence Berkeley Laboratory, CA
  3. ORNL
  4. Lawrence Berkeley National Laboratory (LBNL)
Most high-fidelity physics simulation codes, such as Flash-X, need to save intermediate results (checkpoint files) to restart or gain insights into the evolution of the simulation. These simulation codes save such intermediate files synchronously, where computation is stalled while the data is written to storage. Depending on the problem size and computational requirements, this file write time can be a substantial portion of the total simulation time. In order to hide the I/O latency of checkpointing, asynchronous I/O methods have been introduced. These methods use background threads for performing I/O while the main threads continue with the simulation. The usage of background threads can compete for resources on the node as well as with communication. In this paper, we evaluate the overheads and the overall benefit of asynchronous I/O in HDF5 to simulations. Results from real-world high-fidelity simulations on the Summit supercomputer show that I/O operation is overlapped with application communication or computation or both, effectively hiding some or all of the I/O latency. Our evaluation shows that while using asynchronous I/O adds overhead to the application, the I/O time reduction is more significant, resulting in overall up to 1.5X performance speedup.
Research Organization:
Oak Ridge National Laboratory (ORNL), Oak Ridge, TN (United States)
Sponsoring Organization:
USDOE; USDOE Office of Science (SC)
DOE Contract Number:
AC05-00OR22725
OSTI ID:
1959631
Country of Publication:
United States
Language:
English

References (7)

An overview of the HDF5 technology suite and its applications conference January 2011
An investigation of the gravity effects on pool boiling heat transfer via high-fidelity simulations journal December 2021
Extreme scale multi-physics simulations of the tsunamigenic 2004 sumatra megathrust earthquake
  • Uphoff, Carsten; Rettenberger, Sebastian; Bader, Michael
  • Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis https://doi.org/10.1145/3126908.3126948
conference November 2017
Flash-X: A multiphysics simulation software instrument journal July 2022
Modeling resource-coupled computations conference November 2009
Enabling Transparent Asynchronous I/O using Background Threads conference November 2019
ExaHDF5: Delivering Efficient Parallel I/O on Exascale Computing Systems journal January 2020

Similar Records

Asynchronous Checkpoint Migration with MRNet in the Scalable Checkpoint / Restart Library
Conference · Tue Mar 20 00:00:00 EDT 2012 · OSTI ID:1047769

Asynchronous I/O VOL Connector (AsyncVOL) v0.1
Software · Fri Jul 31 20:00:00 EDT 2020 · OSTI ID:code-41629

DeepFreeze: Towards Scalable Asynchronous Checkpointing of Deep Learning Models
Conference · Tue Dec 31 23:00:00 EST 2019 · OSTI ID:1770321

Related Subjects