Skip to main content
U.S. Department of Energy
Office of Scientific and Technical Information

Usage Pattern-Driven Dynamic Data Layout Reorganization

Conference ·
As scientific simulations and experiments move toward extremely large scales and generate massive amounts of data, the data access performance of analytic applications becomes crucial. A mismatch often happens between write and read patterns of data accesses, typically resulting in poor read performance. Data layout reorganization has been used to improve the locality of data accesses. However, current data reorganizations are static and focus on generating a single (or set of) optimized layouts that rely on prior knowledge of exact future access patterns. We propose a framework that dynamically recognizes the data usage patterns, replicates the data of interest in multiple reorganized layouts that would benefit common read patterns, and makes runtime decisions on selecting a favorable layout for a given read pattern. This framework supports reading individual elements and chunks of a multi-dimensional array of variables. Our pattern-driven layout selection strategy achieves multi-fold speedups compared to reading from the original dataset.
Research Organization:
Oak Ridge National Laboratory (ORNL), Oak Ridge, TN (United States). Oak Ridge Leadership Computing Facility (OLCF)
Sponsoring Organization:
USDOE Office of Science (SC)
OSTI ID:
1567419
Country of Publication:
United States
Language:
English

Similar Records

Expediting Scientific Data Analysis with Reorganization of Data
Conference · Mon Aug 19 00:00:00 EDT 2013 · OSTI ID:1165204

SDS: A Framework for Scientific Data Services
Conference · Thu Oct 31 00:00:00 EDT 2013 · OSTI ID:1164907

Improving I/O Performance for Exascale Applications through Online Data Layout Reorganization
Journal Article · Wed Jul 28 20:00:00 EDT 2021 · IEEE Transactions on Parallel and Distributed Systems · OSTI ID:1855220