skip to main content
OSTI.GOV title logo U.S. Department of Energy
Office of Scientific and Technical Information

Title: Tuning HDF5 subfiling performance on parallel file systems

Conference ·
OSTI ID:1398484
 [1];  [2];  [1];  [3];  [3]
  1. Lawrence Berkeley National Lab. (LBNL), Berkeley, CA (United States)
  2. Intel Corp. (United States)
  3. The HDF Group (United States)

Subfiling is a technique used on parallel file systems to reduce locking and contention issues when multiple compute nodes interact with the same storage target node. Subfiling provides a compromise between the single shared file approach that instigates the lock contention problems on parallel file systems and having one file per process, which results in generating a massive and unmanageable number of files. In this paper, we evaluate and tune the performance of recently implemented subfiling feature in HDF5. In specific, we explain the implementation strategy of subfiling feature in HDF5, provide examples of using the feature, and evaluate and tune parallel I/O performance of this feature with parallel file systems of the Cray XC40 system at NERSC (Cori) that include a burst buffer storage and a Lustre disk-based storage. We also evaluate I/O performance on the Cray XC30 system, Edison, at NERSC. Our results show performance benefits of 1.2X to 6X performance advantage with subfiling compared to writing a single shared HDF5 file. We present our exploration of configurations, such as the number of subfiles and the number of Lustre storage targets to storing files, as optimization parameters to obtain superior I/O performance. Based on this exploration, we discuss recommendations for achieving good I/O performance as well as limitations with using the subfiling feature.

Research Organization:
Lawrence Berkeley National Lab. (LBNL), Berkeley, CA (United States)
Sponsoring Organization:
USDOE Office of Science (SC), Advanced Scientific Computing Research (ASCR)
DOE Contract Number:
AC02-05CH11231
OSTI ID:
1398484
Resource Relation:
Conference: Cray User Group Meeting, Redmond, WA (United States), 8-11 May 2017
Country of Publication:
United States
Language:
English

Similar Records

Interfacing HDF5 with a scalable object‐centric storage system on hierarchical storage
Journal Article · Mon Mar 09 00:00:00 EDT 2020 · Concurrency and Computation. Practice and Experience · OSTI ID:1398484

Battle of the Defaults: Extracting Performance Characteristics of HDF5 under Production Load
Conference · Sat May 01 00:00:00 EDT 2021 · OSTI ID:1398484

h5bench: A unified benchmark suite for evaluating HDF5 I/O performance on pre‐exascale platforms
Journal Article · Mon Apr 15 00:00:00 EDT 2024 · Concurrency and Computation. Practice and Experience · OSTI ID:1398484

Related Subjects