Skip to main content
U.S. Department of Energy
Office of Scientific and Technical Information

Online and Scalable Data Compression Pipeline with Guarantees on Quantities of Interest

Conference ·
Data compression is becoming critical for data-intensive scientific applications. Scientists require compression techniques that accurately preserve derived quantities of interest (QoIs). Prior work has shown that a pipeline can be built to guarantee error on the primary data (PD) within user-defined bounds and achieve near-floating point QoI errors. In this paper, we present novel computational approaches for accelerating the pipeline and demonstrate results that enable concurrent execution of compression in parallel with the simulation nodes. This allows compression, including the writing of the required compression data, for the previous time step to be completed while the simulation proceeds with the current time step. Overall, the approach presented in this paper results in a 6–8 times improvement in computational overhead compared to previous work. These results were obtained using data generated by a large-scale fusion code called XGC, which produces hundreds of terabytes of data in a single day.
Research Organization:
Oak Ridge National Laboratory (ORNL), Oak Ridge, TN (United States)
Sponsoring Organization:
USDOE
DOE Contract Number:
AC05-00OR22725
OSTI ID:
2251609
Country of Publication:
United States
Language:
English

Similar Records

An Algorithmic and Software Pipeline for Very Large Scale Scientific Data Compression with Error Guarantees
Conference · Wed Nov 30 23:00:00 EST 2022 · OSTI ID:2000257

Maintaining Trust in Reduction: Preserving the Accuracy of Quantities of Interest for Lossy Compression
Conference · Mon Feb 28 23:00:00 EST 2022 · OSTI ID:1855632

Fast Algorithms for Scientific Data Compression
Conference · Thu Nov 30 23:00:00 EST 2023 · OSTI ID:2438668

Related Subjects