U.S. Department of Energy
Office of Scientific and Technical Information

Error-controlled, progressive, and adaptable retrieval of scientific data with multilevel decomposition

Conference ·
Extreme-scale simulations and high-resolution instruments are generating ever-larger volumes of data, which poses significant challenges not only for data storage during the run but also for post-processing, where data are repeatedly retrieved and analyzed over long periods. The challenge of satisfying a wide range of post-hoc analysis needs while minimizing the I/O overhead caused by inappropriate or excessive data retrieval should not be left unmanaged. In this paper, we propose a data refactoring, compression, and retrieval framework capable of 1) fine-grained data refactoring with regard to precision; 2) incrementally retrieving and recomposing the data under various error bounds; and 3) adaptively retrieving data in multiple precisions and resolutions to suit different analyses. With progressive data recomposition and adaptable retrieval algorithms, our framework significantly reduces the amount of data retrieved when multiple incremental precisions are requested, and/or the downstream analysis time when a coarse resolution is used. Experiments show that the amount of data retrieved under the same progressively requested error bound using our framework is 64% less than that using state-of-the-art single-error-bounded approaches. Parallel experiments with up to 1,024 cores and ~600 GB of data in total show that our approach yields 1.36× and 2.52× the performance of existing approaches in writing to and reading from persistent storage systems, respectively.
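To make the idea of incremental, error-bounded retrieval concrete, here is a toy sketch in Python. It is not the paper's algorithm: the framework described above builds on multilevel decomposition, whereas this example only splits fixed-point magnitudes into bit-planes. The names `refactor`, `ProgressiveReader`, and `BITS` are hypothetical and chosen purely for illustration. What it shows is that a reader holding the planes it has already fetched needs only the missing planes when a tighter error bound is requested later.

```python
BITS = 16  # hypothetical fixed-point precision for this toy example


def refactor(values, bits=BITS):
    """Split values into signs plus bit-planes, most significant plane first."""
    scale = max((abs(v) for v in values), default=0.0) or 1.0
    top = (1 << bits) - 1
    # Quantize each magnitude to an integer in [0, 2**bits - 1].
    mags = [min(int(abs(v) / scale * (1 << bits)), top) for v in values]
    signs = [-1 if v < 0 else 1 for v in values]
    planes = [[(m >> b) & 1 for m in mags] for b in range(bits - 1, -1, -1)]
    return {"scale": scale, "bits": bits, "signs": signs, "planes": planes}


class ProgressiveReader:
    """Keeps already-fetched planes so later requests read only the remainder."""

    def __init__(self, store):
        self.store = store
        self.mags = [0] * len(store["signs"])
        self.fetched = 0  # number of planes read so far

    def error_bound(self):
        # With k planes read, the absolute error is at most scale * 2**-k.
        return self.store["scale"] * 2.0 ** (-self.fetched)

    def retrieve(self, eb):
        """Return (reconstructed data, number of planes newly fetched)."""
        new = 0
        while self.error_bound() > eb and self.fetched < self.store["bits"]:
            shift = self.store["bits"] - 1 - self.fetched
            plane = self.store["planes"][self.fetched]
            self.mags = [m | (bit << shift) for m, bit in zip(self.mags, plane)]
            self.fetched += 1
            new += 1
        unit = self.store["scale"] / (1 << self.store["bits"])
        data = [s * m * unit for s, m in zip(self.store["signs"], self.mags)]
        return data, new


original = [0.5, -1.25, 3.0, 0.1]
reader = ProgressiveReader(refactor(original))
coarse, n_first = reader.retrieve(0.5)    # loose bound: a few leading planes
fine, n_second = reader.retrieve(0.01)    # tighter bound: only the missing planes
```

In this sketch the requested bound can be met only down to `scale * 2**-BITS`; below that, `retrieve` returns the best available reconstruction. A single-error-bounded scheme would instead re-read a complete copy for every new bound, which is the overhead progressive retrieval is meant to avoid.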
Research Organization:
Oak Ridge National Laboratory (ORNL), Oak Ridge, TN (United States)
Sponsoring Organization:
USDOE; USDOE Office of Science (SC)
DOE Contract Number:
AC05-00OR22725
OSTI ID:
1855631
Country of Publication:
United States
Language:
English

Similar Records

Improving Progressive Retrieval for HPC Scientific Data using Deep Neural Network
Conference · 2023 · OSTI ID:2000261

A General Framework for Progressive Data Compression and Retrieval
Journal Article · 2023 · IEEE Transactions on Visualization and Computer Graphics · OSTI ID:2204923

Error-controlled Progressive Retrieval of Scientific Data under Derivable Quantities of Interest
Conference · 2024 · OSTI ID:2538586
