Access Patterns and Performance Behaviors of Multi-layer Supercomputer I/O Subsystems under Production Load
Scientific computing workloads at HPC facilities have been shifting from traditional numerical simulations to AI/ML applications for training and inference while processing and producing ever-increasing amounts of scientific data. To address the growing need for increased storage capacity, lower access latency, and higher bandwidth, emerging technologies such as non-volatile memory are integrated into supercomputer I/O subsystems. With these emerging trends, we need a better understanding of the multilayer supercomputer I/O systems and ways to use these subsystems efficiently. In this work, we study the I/O access patterns and performance characteristics of two representative supercomputer I/O subsystems. Through an extensive analysis of year-long I/O logs on each system, we report new observations in I/O reads and writes, unbalanced use of storage system layers, and new trends in user behaviors at the HPC I/O middleware stack.
- Research Organization:
- Lawrence Berkeley National Laboratory (LBNL), Berkeley, CA (United States)
- Sponsoring Organization:
- USDOE Office of Science (SC), Advanced Scientific Computing Research (ASCR) (SC-21)
- DOE Contract Number:
- AC02-05CH11231
- OSTI ID:
- 1959026
- Country of Publication:
- United States
- Language:
- English
Revisiting I/O behavior in large-scale storage systems
|
conference | November 2019 |
End-to-end I/O portfolio for the summit supercomputing ecosystem
|
conference | November 2019 |
Diving into petascale production file systems through large scale profiling and analysis
|
conference | January 2017 |
Parallel netCDF: A High-Performance Scientific I/O Interface
|
conference | January 2003 |
Flexible IO and integration for scientific codes through the adaptable IO system (ADIOS)
|
conference | January 2008 |
Write amplification analysis in flash-based solid state drives
|
conference | January 2009 |
SCTuner: An Autotuner Addressing Dynamic I/O Needs on Supercomputer I/O Subsystems
|
conference | November 2021 |
Towards HPC I/O Performance Prediction through Large-scale Log Analysis
|
conference | June 2020 |
Systematically inferring I/O performance variability by examining repetitive job behavior
|
conference | November 2021 |
Applying Machine Learning to Understand Write Performance of Large-scale Parallel Filesystems
|
conference | November 2019 |
Understanding I/O Bottlenecks and Tuning for High Performance I/O on Large HPC Systems
|
conference | July 2018 |
Tuning Parallel Data Compression and I/O for Large-scale Earthquake Simulation
|
conference | December 2021 |
Understanding and improving computational science storage access through continuous characterization
|
conference | May 2011 |
A Multiplatform Study of I/O Behavior on Petascale Supercomputers
|
conference | January 2015 |
I/O Bottleneck Detection and Tuning: Connecting the Dots using Interactive Log Analysis
|
conference | November 2021 |
Battle of the Defaults: Extracting Performance Characteristics of HDF5 under Production Load
|
conference | May 2021 |
Predicting Output Performance of a Petascale Supercomputer
|
conference | January 2017 |
Similar Records
Access Patterns and Performance Behaviors of Multi-layer Supercomputer I/O Subsystems under Production Load
Characterizing output bottlenecks in a supercomputer
Characterizing output bottlenecks in a supercomputer
Conference
·
Wed Jun 01 00:00:00 EDT 2022
·
OSTI ID:1885368
Characterizing output bottlenecks in a supercomputer
Conference
·
Sat Dec 31 23:00:00 EST 2011
·
OSTI ID:1063838
Characterizing output bottlenecks in a supercomputer
Conference
·
Sat Dec 31 23:00:00 EST 2011
·
OSTI ID:1096349