skip to main content
OSTI.GOV title logo U.S. Department of Energy
Office of Scientific and Technical Information

Title: SCORPIO: A Scalable Two-Phase Parallel I/O Library With Application To A Large Scale Subsurface Simulator

Conference ·
OSTI ID:1115367
 [1];  [2];  [1];  [3];  [4]
  1. ORNL
  2. Intel Corporation
  3. Pacific Northwest National Laboratory (PNNL)
  4. North Carolina State University

Inefficient parallel I/O is known to be a major bottleneck among scientific applications employed on supercomputers as the number of processor cores grows into the thousands. Our prior experience indicated that parallel I/O libraries such as HDF5 that rely on MPI-IO do not scale well beyond 10K processor cores, especially on parallel file systems (like Lustre) with single point of resource contention. Our previous optimization efforts for a massively parallel multi-phase and multi-component subsurface simulator (PFLOTRAN) led to a two-phase I/O approach at the application level where a set of designated processes participate in the I/O process by splitting the I/O operation into a communication phase and a disk I/O phase. The designated I/O processes are created by splitting the MPI global communicator into multiple sub-communicators. The root process in each sub-communicator is responsible for performing the I/O operations for the entire group and then distributing the data to rest of the group. This approach resulted in over 25X speedup in HDF I/O read performance and 3X speedup in write performance for PFLOTRAN at over 100K processor cores on the ORNL Jaguar supercomputer. This research describes the design and development of a general purpose parallel I/O library, SCORPIO (SCalable block-ORiented Parallel I/O) that incorporates our optimized two-phase I/O approach. The library provides a simplified higher level abstraction to the user, sitting atop existing parallel I/O libraries (such as HDF5) and implements optimized I/O access patterns that can scale on larger number of processors. Performance results with standard benchmark problems and PFLOTRAN indicate that our library is able to maintain the same speedups as before with the added flexibility of being applicable to a wider range of I/O intensive applications.

Research Organization:
Oak Ridge National Lab. (ORNL), Oak Ridge, TN (United States)
Sponsoring Organization:
USDOE Office of Science (SC)
DOE Contract Number:
DE-AC05-00OR22725
OSTI ID:
1115367
Resource Relation:
Conference: HiPC: 20th Annual International Conference on High Performance Computing, Hyderabad, India, 20131218, 20131221
Country of Publication:
United States
Language:
English

Similar Records

SCORPIO: A scalable two-phase parallel I/O library with application to a large scale subsurface simulator
Conference · Sun Dec 01 00:00:00 EST 2013 · 20th Annual International Conference on High Performance Computing; 17 April 2014; Bangalore, India · OSTI ID:1115367

Scalable I/O Tracing and Analysis
Conference · Thu Jan 01 00:00:00 EST 2009 · OSTI ID:1115367

...And Eat it Too: High Read Performance in Write-Optimized HPC I/O Middleware File Formats
Conference · Thu Jan 01 00:00:00 EST 2009 · OSTI ID:1115367