U.S. Department of Energy
Office of Scientific and Technical Information

Performance Optimization for Relative-Error-Bounded Lossy Compression on Scientific Data

Journal Article · IEEE Transactions on Parallel and Distributed Systems
Scientific simulations in high-performance computing (HPC) environments generate vast volumes of data, which may cause a severe I/O bottleneck at runtime and a huge burden on storage space for postanalysis. Unlike traditional data reduction schemes such as deduplication or lossless compression, error-controlled lossy compression can not only significantly reduce the data size but also holds the promise of satisfying user demands on error control. Pointwise relative error bounds (i.e., compression errors depend on the data values) are widely used by many scientific applications with lossy compression, since error control can adapt to the error bound in the dataset automatically. Pointwise relative-error-bounded compression, however, is complicated and time consuming. In this article, we develop efficient precomputation-based mechanisms based on the SZ lossy compression framework. Our mechanisms can avoid costly logarithmic transformation and identify quantization factor values via a fast table lookup, greatly accelerating relative-error-bounded compression while retaining excellent compression ratios. In addition, we reduce traversing operations for Huffman decoding, significantly accelerating the decompression process in SZ. Experiments with eight well-known real-world scientific simulation datasets show that our solution can improve the compression and decompression rates (i.e., the speed) by about 40 and 80 percent, respectively, in most cases, making our designed lossy compression strategy the best-in-class solution in most cases.
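For readers unfamiliar with the logarithmic transformation mentioned in the abstract, the following is a minimal illustrative sketch, not the authors' implementation and not the SZ API. It assumes strictly positive inputs and shows the general idea of enforcing a pointwise relative error bound by quantizing uniformly in the log domain; the function names (quantize_log, dequantize_log, max_relative_error) and the sample data are hypothetical. The precomputation and table-lookup mechanisms described in the abstract are meant to avoid exactly the per-value log/exp cost visible here.

```python
import math

def quantize_log(values, rel_bound):
    """Quantize strictly positive values in the log domain (illustrative only).

    An absolute error of at most log(1 + rel_bound) on log(v) implies a
    pointwise relative error of at most rel_bound on v.
    """
    abs_bound = math.log1p(rel_bound)   # absolute bound in log space
    step = 2.0 * abs_bound              # bin width: centers are within abs_bound
    codes = [round(math.log(v) / step) for v in values]  # one log per value
    return codes, step

def dequantize_log(codes, step):
    """Map integer codes back to approximate values (one exp per value)."""
    return [math.exp(c * step) for c in codes]

def max_relative_error(original, restored):
    """Largest pointwise relative error between two sequences."""
    return max(abs(r - o) / abs(o) for o, r in zip(original, restored))

# Hypothetical sample spanning many orders of magnitude.
data = [1e-6, 0.003, 1.5, 42.0, 9.9e7]
eps = 0.01
codes, step = quantize_log(data, eps)
restored = dequantize_log(codes, step)
worst = max_relative_error(data, restored)
print(f"max pointwise relative error: {worst:.6f} (bound {eps})")
```

Real compressors also have to handle zeros and negative values and follow quantization with prediction and entropy coding such as Huffman coding; those steps are omitted from this sketch.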
Research Organization:
Argonne National Laboratory (ANL), Argonne, IL (United States)
Sponsoring Organization:
National Science Foundation (NSF); USDOE Office of Science (SC), Advanced Scientific Computing Research (ASCR) (SC-21)
Contributing Organization:
National Key Research and Development Program of China
Grant/Contract Number:
AC02-06CH11357
OSTI ID:
1603491
Journal Information:
IEEE Transactions on Parallel and Distributed Systems, Vol. 31, Issue 7; ISSN 1045-9219
Publisher:
IEEE
Country of Publication:
United States
Language:
English

Similar Records

Accelerating Relative-error Bounded Lossy Compression for HPC datasets with Precomputation-Based Mechanisms
Conference · Mon May 20 00:00:00 EDT 2019 · OSTI ID:1515554

TopoSZ: Preserving Topology in Error-Bounded Lossy Compression
Journal Article · Sun Nov 05 19:00:00 EST 2023 · IEEE Transactions on Visualization and Computer Graphics · OSTI ID:2369454

Ultrafast Error-bounded Lossy Compression for Scientific Datasets
Conference · Fri Dec 31 23:00:00 EST 2021 · OSTI ID:1903841