Improving I/O Performance for Exascale Applications through Online Data Layout Reorganization
Journal Article
·
· IEEE Transactions on Parallel and Distributed Systems
- Oak Ridge National Lab. (ORNL), Oak Ridge, TN (United States)
- Lawrence Berkeley National Lab. (LBNL), Berkeley, CA (United States)
- Center for Advanced Systems Understanding (CASUS), Görlitz (Germany)
- Missouri Univ. of Science and Technology, Rolla, MO (United States)
- Argonne National Lab. (ANL), Lemont, IL (United States)
The applications being developed within the U.S. Exascale Computing Project (ECP) to run on imminent Exascale computers will generate scientific results with unprecedented fidelity and record turn-around time. Many of these codes are based on particle-mesh methods and use advanced algorithms, especially dynamic load-balancing and mesh-refinement, to achieve high performance on Exascale machines. Yet, as such algorithms improve parallel application efficiency, they raise new challenges for I/O logic due to their irregular and dynamic data distributions. Thus, while the enormous data rates of Exascale simulations already challenge existing file system write strategies, the need for efficient read and processing of generated data introduces additional constraints on the data layout strategies that can be used when writing data to secondary storage. We review these I/O challenges and introduce two online data layout reorganization approaches for achieving good tradeoffs between read and write performance. We demonstrate the benefits of using these two approaches for the ECP particle-in-cell simulation WarpX, which serves as a motif for a large class of important Exascale applications. Here, we show that by understanding application I/O patterns and carefully designing data layouts we can increase read performance by more than 80 percent.
- Research Organization:
- Lawrence Berkeley National Laboratory (LBNL), Berkeley, CA (United States)
- Sponsoring Organization:
- USDOE Office of Science (SC), Advanced Scientific Computing Research (ASCR)
- Grant/Contract Number:
- AC02-05CH11231
- OSTI ID:
- 1855220
- Journal Information:
- IEEE Transactions on Parallel and Distributed Systems, Journal Name: IEEE Transactions on Parallel and Distributed Systems Journal Issue: 4 Vol. 33; ISSN 1045-9219
- Publisher:
- IEEECopyright Statement
- Country of Publication:
- United States
- Language:
- English
Similar Records
Usage Pattern-Driven Dynamic Data Layout Reorganization
Expediting Scientific Data Analysis with Reorganization of Data
SDS: A Framework for Scientific Data Services
Conference
·
Sun May 01 00:00:00 EDT 2016
·
OSTI ID:1567419
Expediting Scientific Data Analysis with Reorganization of Data
Conference
·
Mon Aug 19 00:00:00 EDT 2013
·
OSTI ID:1165204
SDS: A Framework for Scientific Data Services
Conference
·
Thu Oct 31 00:00:00 EDT 2013
·
OSTI ID:1164907