OSTI.GOV, U.S. Department of Energy
Office of Scientific and Technical Information

Title: ...And Eat it Too: High Read Performance in Write-Optimized HPC I/O Middleware File Formats

Conference

As HPC applications run at increasingly high process counts on larger and larger machines, both the frequency of checkpoints needed for fault tolerance and the resolution and size of data analysis dumps are expected to increase proportionally. To maintain an acceptable ratio of time spent on useful computation to time spent on I/O, write bandwidth to the underlying storage system must grow in proportion to this increase in checkpoint and output size. Unfortunately, popular scientific self-describing file formats such as netCDF and HDF5 are designed with a focus on portability and flexibility; optimizing write performance with these APIs requires careful crafting of the output structure and of the API calls themselves. To provide sufficient write bandwidth for the continuing demands of scientific applications, the HPC community has developed a number of I/O middleware layers that structure output into write-optimized file formats. The obvious concern with any write-optimized file format, however, is a corresponding penalty on reads. In a log-structured filesystem, for example, a file generated by random writes can be written efficiently, but reading it back sequentially later can perform very poorly. Simulation results require efficient read-back for visualization and analytics, and although most checkpoint files are never used, restart efficiency matters greatly in the face of inevitable failures. Middleware that improves write speed would be far less useful if it sacrificed acceptable read performance.

In this paper we examine the read performance of two write-optimized middleware layers on large parallel machines and compare it to reading data natively in popular file formats. The two I/O middleware layers examined are the Adaptable IO System (ADIOS), a library-based approach developed at Oak Ridge National Laboratory that provides a high-level I/O API which can be used in place of netCDF or HDF5 to perform much more aggressive write-behind and efficient reordering of data locations within the file; and the Parallel Log-structured Filesystem (PLFS), a stackable FUSE filesystem approach developed at Los Alamos National Laboratory that decouples concurrent writes to improve the speed of checkpoints.

Because ADIOS is an I/O componentization that allows different I/O methods to be selected at or during runtime, users gain access to MPI-IO, POSIX I/O, HDF5, netCDF, and staging methods through a single API. The ADIOS BP file format is a new log-based file format that provides a superset of the features of both HDF5 and netCDF, yet is designed to remain portable and flexible while being optimized for writing. PLFS, on the other hand, is mounted as a stackable filesystem on top of an existing parallel filesystem. Reads and writes to the PLFS mount are transparently translated into operations on per-process log files stored in the underlying parallel filesystem. Because PLFS performs this translation without application modification, users can write in HDF5, netCDF, or application-specific file formats; PLFS stores the writes in a set of efficiently written log-formatted files while presenting a logical 'flat' file on reads. Despite their different approaches, both middleware systems share a common trait: they write to a log-structured file format.
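
As a rough illustration of the shared log-structured idea (a minimal sketch, not code from ADIOS or PLFS), the following C program appends out-of-order logical writes sequentially to a per-process log file and records an index entry for each extent, so that a later read can be served by an index lookup plus a seek into the log. The file name, struct layout, and sample data are hypothetical.

    /* Sketch of the log-structured idea both middleware layers share:
     * each writer appends data to its own log and records where each
     * logical extent landed; reads consult the index instead of
     * assuming a single contiguous flat file. */
    #include <stdio.h>
    #include <string.h>

    typedef struct {                 /* one index record per logical write */
        long logical_off;            /* offset in the logical "flat" file   */
        long length;                 /* number of bytes written             */
        long log_off;                /* where the bytes sit in the log file */
    } index_entry;

    int main(void)
    {
        const char *payloads[] = { "checkpoint-block-A", "checkpoint-block-B" };
        long logical_offsets[] = { 4096, 0 };   /* writes arrive out of order */
        index_entry idx[2];

        /* Write phase: append-only, so even random logical offsets become
         * sequential log writes (this is the source of the write speedup). */
        FILE *log = fopen("rank0.log", "wb");
        long cursor = 0;
        for (int i = 0; i < 2; i++) {
            long len = (long)strlen(payloads[i]);
            fwrite(payloads[i], 1, (size_t)len, log);
            idx[i] = (index_entry){ logical_offsets[i], len, cursor };
            cursor += len;
        }
        fclose(log);

        /* Read phase: to serve a logical offset, look it up in the index
         * and read the corresponding extent from the log file. */
        log = fopen("rank0.log", "rb");
        for (int i = 0; i < 2; i++) {
            char buf[64] = {0};
            fseek(log, idx[i].log_off, SEEK_SET);
            fread(buf, 1, (size_t)idx[i].length, log);
            printf("logical offset %ld -> \"%s\"\n", idx[i].logical_off, buf);
        }
        fclose(log);
        return 0;
    }

The write path never seeks, which is where the write speedup comes from; the read path pays only an index lookup and one seek per extent, which is why read performance is the open question the paper examines.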
As shown in previous publications, writes are fully optimized in both systems, sometimes yielding 100x improvements over writing data in popular file formats. But, as noted above, writing is only one part of the story. In this paper we examine the read performance of these middleware layers on large parallel machines and compare it to reading data either natively or from other popular file formats. We compare reading performance in two scenarios: (1) reading back restart data on the same number of processors that wrote it, and (2) reading back restart data on a different number of processors. We observe not only that write-optimized I/O middleware can be built without penalizing read speeds, but also that, for important workloads, techniques that improve write performance can, perhaps counterintuitively, improve read speeds over reading from a contiguously organized file format. In the remainder of this paper, we investigate this further through case studies of PLFS and ADIOS on simulation checkpoint restarts.
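
To make scenario (2) concrete, the sketch below (an assumption for illustration, not taken from the paper, and presuming a simple contiguous block decomposition) computes which writers' blocks each reader must touch when a checkpoint written by N processes is restarted on M processes; this is the case where readers cannot simply reopen "their own" data.

    /* Hypothetical N-to-M restart example: data written as contiguous
     * blocks by N processes is read back by M processes, so each reader
     * may have to touch extents originating from several writers. */
    #include <stdio.h>

    int main(void)
    {
        const long G = 1000000;      /* total elements in the checkpoint  */
        const int  N = 8;            /* processes that wrote the restart  */
        const int  M = 3;            /* processes reading it back         */

        for (int r = 0; r < M; r++) {
            long lo = r * G / M, hi = (r + 1) * G / M;       /* reader r's slice */
            printf("reader %d needs [%ld, %ld) and overlaps writers:", r, lo, hi);
            for (int w = 0; w < N; w++) {
                long wlo = w * G / N, whi = (w + 1) * G / N; /* writer w's block */
                if (wlo < hi && lo < whi)                    /* ranges intersect */
                    printf(" %d", w);
            }
            printf("\n");
        }
        return 0;
    }

Under this decomposition, each of the M readers overlaps several of the N writers' blocks, so a format (or index) that lets a reader locate and fetch only the extents it needs matters as much for restart speed as raw write bandwidth does for checkpointing.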

Research Organization:
Oak Ridge National Lab. (ORNL), Oak Ridge, TN (United States)
Sponsoring Organization:
USDOE
DOE Contract Number:
DE-AC05-00OR22725
OSTI ID:
982187
Resource Relation:
Conference: Supercomputing 2009, Portland, OR, USA, Nov 10, 2009
Country of Publication:
United States
Language:
English