Skip to main content
U.S. Department of Energy
Office of Scientific and Technical Information

Efficient Asynchronous I/O with Request Merging

Conference ·

With the advancement of exascale computing, the amount of scientific data is increasing day by day. Efficient data access is necessary for scientific discoveries. Unfortunately, the I/O performance is not improved, like the CPU and network speed. So, I/O operations take longer time than data generation or analysis. Asynchronous I/O has been proposed to extenuate the I/O bottleneck by overlapping I/O and computation time. However, multiple small write operations can diminish the benefits of asynchronous I/O, as the I/O time becomes significantly longer than the compute time, with little time to overlap with. To overcome these issues, we present an optimization technique to merge small contiguous write operations. We integrated our solution into the HDF5 asynchronous I/O VOL connector and demonstrated the effectiveness of merging HDF5 write operations automatically and transparently without requiring any code change from the application.

Research Organization:
Lawrence Berkeley National Laboratory (LBNL), Berkeley, CA (United States)
Sponsoring Organization:
USDOE Office of Science (SC), Advanced Scientific Computing Research (ASCR)
DOE Contract Number:
AC02-05CH11231
OSTI ID:
2228867
Resource Relation:
Conference: 2023 IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW), St. Petersburg, FL (United States), 15-19 May 2023
Country of Publication:
United States
Language:
English

References (17)

EQSIM—A multidisciplinary framework for fault-to-structure earthquake simulations on exascale computers part I: Computational models and workflow journal December 2020
Accelerating Flash-X Simulations with Asynchronous I/O conference November 2022
Toward Scalable and Asynchronous Object-Centric Data Management for HPC conference May 2018
EQSIM—A multidisciplinary framework for fault-to-structure earthquake simulations on exascale computers, part II: Regional simulations of building response journal December 2020
Transparent Asynchronous Parallel I/O Using Background Threads journal April 2022
PHDFS: Optimizing I/O performance of HDFS in deep learning cloud computing platform journal October 2020
Optimization Techniques at the I/O Forwarding Layer conference September 2010
Preemptible I/O Scheduling of Garbage Collection for Solid State Drives journal February 2013
Dynamic merging/splitting for better responsiveness in mobile devices conference August 2016
Optimizing the Block I/O Subsystem for Fast Storage Devices journal June 2014
Coordinating Computation and I/O in Massively Parallel Sequence Search journal April 2011
Parallel I/O, analysis, and visualization of a trillion particle simulation
  • Byna, Surendra; Chou, Jerry; Rubel, Oliver
  • 2012 SC - International Conference for High Performance Computing, Networking, Storage and Analysis, 2012 International Conference for High Performance Computing, Networking, Storage and Analysis https://doi.org/10.1109/SC.2012.92
conference November 2012
ExaHDF5: Delivering Efficient Parallel I/O on Exascale Computing Systems journal January 2020
Efficient I/O Merging Scheme for Distributed File Systems journal February 2023
Tuning Parallel Data Compression and I/O for Large-scale Earthquake Simulation conference December 2021
Enabling Transparent Asynchronous I/O using Background Threads conference November 2019
Nyx: A MASSIVELY PARALLEL AMR CODE FOR COMPUTATIONAL COSMOLOGY journal February 2013

Similar Records

h5bench: A unified benchmark suite for evaluating HDF5 I/O performance on pre-exascale platforms
Journal Article · 2024 · Concurrency and Computation. Practice and Experience · OSTI ID:2340936

Related Subjects