Improving I/O Performance for Exascale Applications through Online Data Layout Reorganization

Wan, Lipeng; Huebl, Axel; Gu, Junmin; Poeschel, Franz; Gainaru, Ana; Wang, Ruonan; Chen, Jieyang; Liang, Xin; Ganyushin, Dmitry; Munson, Todd; Foster, Ian; Vay, Jean-Luc; Podhorszki, Norbert; Wu, Kesheng; Klasky, Scott

doi:10.1109/TPDS.2021.3100784

Improving I/O Performance for Exascale Applications through Online Data Layout Reorganization

Journal Article · Thu Jul 29 00:00:00 EDT 2021 · IEEE Transactions on Parallel and Distributed Systems

DOI:https://doi.org/10.1109/TPDS.2021.3100784· OSTI ID:1855220

^[1]; ^[2]; ^[2]; ^[3]; Gainaru, Ana ^[1]; Wang, Ruonan ^[1]; ^[1]; ^[4]; Ganyushin, Dmitry ^[1]; ^[5]; ^[5]; ^[2]; Podhorszki, Norbert ^[1]; ^[2]; Klasky, Scott ^[1]

Oak Ridge National Lab. (ORNL), Oak Ridge, TN (United States)
Lawrence Berkeley National Lab. (LBNL), Berkeley, CA (United States)
Center for Advanced Systems Understanding (CASUS), Görlitz (Germany)
Missouri Univ. of Science and Technology, Rolla, MO (United States)
Argonne National Lab. (ANL), Lemont, IL (United States)

The applications being developed within the U.S. Exascale Computing Project (ECP) to run on imminent Exascale computers will generate scientific results with unprecedented fidelity and record turn-around time. Many of these codes are based on particle-mesh methods and use advanced algorithms, especially dynamic load-balancing and mesh-refinement, to achieve high performance on Exascale machines. Yet, as such algorithms improve parallel application efficiency, they raise new challenges for I/O logic due to their irregular and dynamic data distributions. Thus, while the enormous data rates of Exascale simulations already challenge existing file system write strategies, the need for efficient read and processing of generated data introduces additional constraints on the data layout strategies that can be used when writing data to secondary storage. We review these I/O challenges and introduce two online data layout reorganization approaches for achieving good tradeoffs between read and write performance. We demonstrate the benefits of using these two approaches for the ECP particle-in-cell simulation WarpX, which serves as a motif for a large class of important Exascale applications. Here, we show that by understanding application I/O patterns and carefully designing data layouts we can increase read performance by more than 80 percent.

View Accepted Manuscript (DOE)

Research Organization:: Lawrence Berkeley National Laboratory (LBNL), Berkeley, CA (United States)

Sponsoring Organization:: USDOE Office of Science (SC), Advanced Scientific Computing Research (ASCR)

Grant/Contract Number:: AC02-05CH11231

OSTI ID:: 1855220

Journal Information:: IEEE Transactions on Parallel and Distributed Systems, Journal Name: IEEE Transactions on Parallel and Distributed Systems Journal Issue: 4 Vol. 33; ISSN 1045-9219

Publisher:: IEEECopyright Statement

Country of Publication:: United States

Language:: English

References (25)

Interfacing HDF5 with a scalable object‐centric storage system on hierarchical storage Mu, Jingqing; Soumagne, Jerome; Byna, Suren Concurrency and Computation: Practice and Experience, Vol. 32, Issue 20 https://doi.org/10.1002/cpe.5715	journal	March 2020
Querying Large Scientific Data Sets with Adaptable IO System ADIOS Gu, Junmin; Klasky, Scott; Podhorszki, Norbert Supercomputing Frontiers https://doi.org/10.1007/978-3-319-69953-0_4	book	January 2018
Optimizing checkpoint data placement with guaranteed burst buffer endurance in large-scale hierarchical storage systems Wan, Lipeng; Cao, Qing; Wang, Feiyi Journal of Parallel and Distributed Computing, Vol. 100 https://doi.org/10.1016/j.jpdc.2016.10.002	journal	February 2017
ADIOS 2: The Adaptable Input Output System. A framework for high-performance data management Godoy, William F.; Podhorszki, Norbert; Wang, Ruonan SoftwareX, Vol. 12 https://doi.org/10.1016/j.softx.2020.100561	journal	July 2020
Modeling of a chain of three plasma accelerator stages with the WarpX electromagnetic PIC code on GPUs Vay, J. -L.; Huebl, A.; Almgren, A. Physics of Plasmas, Vol. 28, Issue 2 https://doi.org/10.1063/5.0028512	journal	February 2021
An algorithm for point clustering and grid generation Berger, M.; Rigoutsos, I. IEEE Transactions on Systems, Man, and Cybernetics, Vol. 21, Issue 5 https://doi.org/10.1109/21.120081	journal	January 1991
Apply Block Index Technique to Scientific Data Analysis and I/O Systems Wu, Tzuhsien; Chou, Jerry; Podhorszki, Norbert 2017 17th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (CCGRID) https://doi.org/10.1109/CCGRID.2017.37	conference	May 2017
Usage Pattern-Driven Dynamic Data Layout Reorganization Tang, Houjun; Byna, Suren; Harenberg, Steve 2016 16th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (CCGrid) https://doi.org/10.1109/CCGrid.2016.15	conference	May 2016
Improving Parallel I/O Performance with Data Layout Awareness Chen, Yong; Sun, Xian-He; Thakur, Rajeev 2010 IEEE International Conference on Cluster Computing (CLUSTER) https://doi.org/10.1109/CLUSTER.2010.35	conference	September 2010
EDO: Improving Read Performance for Scientific Applications through Elastic Data Organization Tian, Yuan; Klasky, Scott; Abbasi, Hasan 2011 IEEE International Conference on Cluster Computing (CLUSTER) https://doi.org/10.1109/CLUSTER.2011.18	conference	September 2011
TAPIOCA: An I/O Library for Optimized Topology-Aware Data Aggregation on Large-Scale Supercomputers Tessier, Francois; Vishwanath, Venkatram; Jeannot, Emmanuel 2017 IEEE International Conference on Cluster Computing (CLUSTER) https://doi.org/10.1109/CLUSTER.2017.80	conference	September 2017
Analysis and Modeling of the End-to-End I/O Performance on OLCF's Titan Supercomputer Wan, Lipeng; Wolf, Matthew; Wang, Feiyi 2017 IEEE 19th International Conference on High Performance Computing and Communications, IEEE 15th International Conference on Smart City and IEEE 3rd International Conference on Data Science and Systems (HPCC/SmartCity/DSS), 2017 IEEE 19th International Conference on High Performance Computing and Communications; IEEE 15th International Conference on Smart City; IEEE 3rd International Conference on Data Science and Systems (HPCC/SmartCity/DSS) https://doi.org/10.1109/HPCC-SmartCity-DSS.2017.1	conference	December 2017
Computing Just What You Need: Online Data Analysis and Reduction at Extreme Scales Foster, Ian 2017 IEEE 24th International Conference on High Performance Computing (HiPC) https://doi.org/10.1109/HiPC.2017.00042	conference	December 2017
Comprehensive Measurement and Analysis of the User-Perceived I/O Performance in a Production Leadership-Class Storage System Wan, Lipeng; Wolf, Matthew; Wang, Feiyi 2017 IEEE 37th International Conference on Distributed Computing Systems (ICDCS) https://doi.org/10.1109/ICDCS.2017.257	conference	June 2017
Model-Driven Data Layout Selection for Improving Read Performance Liu, Jialin; Byna, Surendra; Dong, Bin 2014 IEEE International Parallel & Distributed Processing Symposium Workshops (IPDPSW) https://doi.org/10.1109/IPDPSW.2014.190	conference	May 2014
A Plugin for HDF5 Using PLFS for Improved I/O Performance and Semantic Analysis Mehta, Kshitij; Bent, John; Torres, Aaron https://doi.org/10.1109/SC.Companion.2012.102	conference	November 2012
Optimizing Parallel I/O Accesses through Pattern-Directed and Layout-Aware Replication He, Shuibing; Yin, Yanlong; Sun, Xian-He IEEE Transactions on Computers, Vol. 69, Issue 2 https://doi.org/10.1109/TC.2019.2946135	journal	February 2020
DataStager: scalable data staging services for petascale applications Abbasi, Hasan; Wolf, Matthew; Eisenhauer, Greg Proceedings of the 18th ACM international symposium on High performance distributed computing - HPDC '09 https://doi.org/10.1145/1551609.1551618	conference	January 2009
Six degrees of scientific data: reading patterns for extreme scale science IO Lofstead, Jay; Polte, Milo; Gibson, Garth Proceedings of the 20th international symposium on High performance distributed computing - HPDC '11 https://doi.org/10.1145/1996130.1996139	conference	January 2011
Using active NVRAM for I/O staging Kannan, Sudarsun; Gavrilovska, Ada; Schwan, Karsten Proceedings of the 2nd international workshop on Petascal data analytics: challenges and opportunities - PDAC '11 https://doi.org/10.1145/2110205.2110209	conference	January 2011
Disk-directed I/O for MIMD multiprocessors Kotz, David ACM Transactions on Computer Systems, Vol. 15, Issue 1 https://doi.org/10.1145/244764.244766	journal	February 1997
Improving Collective MPI-IO Using Topology-Aware Stepwise Data Aggregation with I/O Throttling Tsujita, Yuichi; Hori, Atsushi; Kameyama, Toyohisa HPC Asia 2018: International Conference on High Performance Computing in Asia-Pacific Region, Proceedings of the International Conference on High Performance Computing in Asia-Pacific Region https://doi.org/10.1145/3149457.3149464	conference	January 2018
Spatially-aware Parallel I/O for Particle Data Kumar, Sidharth; Petruzza, Steve; Usher, Will ICPP 2019: 48th International Conference on Parallel Processing, Proceedings of the 48th International Conference on Parallel Processing https://doi.org/10.1145/3337821.3337875	conference	August 2019
AMReX: a framework for block-structured adaptive mesh refinement Zhang, Weiqun; Almgren, Ann; Beckner, Vince Journal of Open Source Software, Vol. 4, Issue 37 https://doi.org/10.21105/joss.01370	journal	May 2019
Modeling of a chain of three plasma accelerator stages with the WarpX electromagnetic PIC code on GPUs Vay, Jean-Luc Zenodo https://doi.org/10.5281/zenodo.4429367	dataset	January 2021

Similar Records

Usage Pattern-Driven Dynamic Data Layout Reorganization

Conference · Sun May 01 00:00:00 EDT 2016 · OSTI ID:1567419

Expediting Scientific Data Analysis with Reorganization of Data

Conference · Mon Aug 19 00:00:00 EDT 2013 · OSTI ID:1165204

SDS: A Framework for Scientific Data Services

Conference · Thu Oct 31 00:00:00 EDT 2013 · OSTI ID:1164907

Related Subjects

97 MATHEMATICS AND COMPUTING
IO performance
Parallel IO
WarpX
data access optimization
data layout

Improving I/O Performance for Exascale Applications through Online Data Layout Reorganization

Citation Formats

References (25)

Similar Records

Related Subjects