Skip to main content
U.S. Department of Energy
Office of Scientific and Technical Information

Data Reduction Techniques for Simulation, Visualization and Data Analysis

Journal Article · · Computer Graphics Forum
DOI:https://doi.org/10.1111/cgf.13336· OSTI ID:1463451
 [1];  [2];  [3];  [4];  [5];  [2]
  1. National Center for Atmospheric Research, Boulder, CO (United States); Univ. of Oregon, Eugene, OR (United States); University of Oregon
  2. Univ. of Oregon, Eugene, OR (United States)
  3. Univ. of Kaiserslautern (Germany)
  4. Los Alamos National Lab. (LANL), Los Alamos, NM (United States)
  5. National Center for Atmospheric Research, Boulder, CO (United States)

Data reduction is increasingly being applied to scientific data for numerical simulations, scientific visualizations, and data analyses. It is most often used to lower I/O and storage costs, and sometimes to lower in-memory data size as well. With this work, we consider five categories of data reduction techniques based on their information loss: 1) truly lossless, 2) near lossless, 3) lossy, 4) mesh reduction, and 5) derived representations. We then survey available techniques in each of these categories, summarize their properties from a practical point of view, and discuss relative merits within a category. We believe, in total, this work will enable simulation scientists and visualization/data analysis scientists to decide which data reduction techniques will be most helpful for their needs.

Research Organization:
Univ. of Oregon, Eugene, OR (United States); Los Alamos National Laboratory (LANL), Los Alamos, NM (United States)
Sponsoring Organization:
USDOE Office of Science (SC), Advanced Scientific Computing Research (ASCR) (SC-21)
Grant/Contract Number:
SC0010652
OSTI ID:
1463451
Journal Information:
Computer Graphics Forum, Journal Name: Computer Graphics Forum Journal Issue: 6 Vol. 37; ISSN 0167-7055
Publisher:
WileyCopyright Statement
Country of Publication:
United States
Language:
English

References (106)

Biorthogonal bases of compactly supported wavelets journal June 1992
ISABELA for effective in situ compression of scientific data: ISABELA FOR EFFECTIVE
  • Lakshminarasimhan, Sriram; Shah, Neil; Ethier, Stephane
  • Concurrency and Computation: Practice and Experience, Vol. 25, Issue 4 https://doi.org/10.1002/cpe.2887
journal July 2012
A Mathematical Theory of Communication journal October 1948
A Mathematical Theory of Communication journal July 1948
Wavelet Transforms That Map Integers to Integers journal July 1998
Three-Dimensional Embedded Subband Coding with Optimized Truncation (3-D ESCOT) journal May 2001
Organization and maintenance of large ordered indexes journal January 1972
Ueber die stetige Abbildung einer Line auf ein Fl�chenst�ck journal September 1891
Factoring wavelet transforms into lifting steps journal May 1998
A method for the construction of minimum-redundancy codes journal February 2006
Lossy volume compression using Tucker truncation and thresholding journal May 2015
Lossless compression of predicted floating-point geometry journal July 2005
Advanced techniques for high-quality multi-resolution volume rendering journal February 2004
Reducing disk storage of full-3D seismic waveform tomography (F3DT) through lossy online compression journal August 2016
Survey and analysis of multiresolution methods for turbulence data journal February 2016
A parallel multiresolution volume rendering algorithm for large data visualization journal February 2005
Adaptive Multiresolution Methods: Practical issues on Data Structures, Implementation and Parallelization journal December 2011
Partitioning a Large Simulation as It Runs journal July 2016
Interactive desktop analysis of high resolution simulations: application to turbulent plume dynamics and current sheet formation journal August 2007
The Visible Human Project: a resource for education journal January 1999
Volume rendering of DCT-based compressed 3D scalar data journal March 1995
Simplification of tetrahedral meshes with error bounds journal January 1999
Compressed progressive meshes journal January 2000
Generalized unstructured decimation [computer graphics] journal January 1996
The Visible Human Project journal March 1998
Embedded image coding using zerotrees of wavelet coefficients journal January 1993
High performance scalable image compression with EBCOT journal July 2000
Frequency domain volume rendering by the wavelet X-ray transform journal July 2000
Transparent in Situ Data Transformations in ADIOS conference May 2014
MPC: A Massively Parallel Compression Algorithm for Scientific Data conference September 2015
Spatiotemporal Wavelet Compression for Visualization of Scientific Simulation Data conference September 2017
Seismic data compression using high-dimensional wavelet transforms conference January 1996
Differential FCM: increasing value prediction accuracy by improving table usage efficiency
  • Goeman, B.; Vandierendonck, H.; de Bosschere, K.
  • HPCA-7 - 7th IEEE Symposium on High Performance Computer Architecture, Proceedings HPCA Seventh International Symposium on High-Performance Computer Architecture https://doi.org/10.1109/HPCA.2001.903264
conference January 2001
ISOBAR Preconditioner for Effective and High-throughput Lossless Data Compression
  • Schendel, Eric R.; Jin, Ye; Shah, Neil
  • 2012 IEEE International Conference on Data Engineering (ICDE 2012), 2012 IEEE 28th International Conference on Data Engineering https://doi.org/10.1109/ICDE.2012.114
conference April 2012
Parallel Tensor Compression for Large-Scale Scientific Data conference May 2016
Significantly Improving Lossy Compression for Scientific Data Sets Based on Multidimensional Prediction and Error-Controlled Quantization conference May 2017
A Method for the Construction of Minimum-Redundancy Codes journal September 1952
Revisiting wavelet compression for large-scale climate data using JPEG 2000 and ensuring data precision conference October 2011
Salient time steps selection from large scale time-varying data sets with dynamic time warping conference October 2012
Evaluating the efficacy of wavelet configurations on turbulent-flow data conference October 2015
Using feature importance metrics to detect events of interest in scientific computing applications conference October 2017
The predictability of data values conference January 1997
An Image-Based Approach to Extreme Scale in Situ Visualization and Analysis
  • Ahrens, James; Jourdain, Sebastien; OLeary, Patrick
  • SC14: International Conference for High Performance Computing, Networking, Storage and Analysis https://doi.org/10.1109/SC.2014.40
conference November 2014
Real-Time Synthesis of Compression Algorithms for Scientific Data
  • Burtscher, Martin; Mukka, Hari; Yang, Annie
  • SC16: International Conference for High Performance Computing, Networking, Storage and Analysis https://doi.org/10.1109/SC.2016.22
conference November 2016
Explorable Volumetric Depth Images from Raycasting
  • Frey, Steffen; Sadlo, Filip; Ertl, Thomas
  • 2013 XXVI SIBGRAPI - Conference on Graphics, Patterns and Images (SIBGRAPI), 2013 XXVI Conference on Graphics, Patterns and Images https://doi.org/10.1109/SIBGRAPI.2013.26
conference August 2013
Discrete Cosine Transform journal January 1974
FPC: A High-Speed Compressor for Double-Precision Floating-Point Data journal January 2009
An Algorithm for Vector Quantizer Design journal January 1980
Efficient, Low-Complexity Image Coding With a Set-Partitioning Embedded Block Coder journal November 2004
Multidimensional Directional Filter Banks and Surfacelets journal April 2007
A universal algorithm for sequential data compression journal May 1977
Compression of individual sequences via variable-rate coding journal September 1978
Least squares quantization in PCM journal March 1982
Four-dimensional wavelet compression of arbitrarily sized echocardiographic data journal September 2002
Fast and Efficient Compression of Floating-Point Data journal September 2006
Streaming Simplification of Tetrahedral Meshes journal January 2007
Interactive, Internet Delivery of Visualization via Structured Prerendered Multiresolution Imagery journal March 2008
Transform Coding for Hardware-accelerated Volume Rendering journal November 2007
Query-Driven Visualization of Time-Varying Adaptive Mesh Refinement Data journal November 2008
Visualization by Proxy: A Novel Framework for Deferred Interaction with Volume Data journal November 2010
Interactive Exploration and Analysis of Large-Scale Simulations Using Topology-Based Data Segmentation journal September 2011
An Application of Multivariate Statistical Analysis for Query-Driven Visualization journal March 2011
Feature-Based Statistical Analysis of Combustion Simulation Data journal December 2011
An Adaptive Prediction-Based Approach to Lossless Compression of Floating-Point Volume Data journal December 2012
An Information-Aware Framework for Exploring Multivariate Data Sets journal December 2013
Fixed-Rate Compressed Floating-Point Arrays journal December 2014
Adaptive Multilinear Tensor Product Wavelets journal January 2016
A Combined Eulerian-Lagrangian Data Representation for Large-Scale Applications journal October 2017
Fast volume rendering of compressed data conference January 1993
Direct rendering of Laplacian pyramid compressed volume data conference January 1995
Wavelets applied to lossless compression and progressive transmission of floating point data in 3-D curvilinear grids conference January 1996
High performance scalable image compression with EBCOT conference January 1999
Wavelet-Based 3D Compression Scheme for Interactive Visualization of Very Large Volume Data journal March 1999
Rapid High Quality Compression of Volume Data for Visualization journal September 2001
Out-of-core compression and decompression of large n-dimensional scalar fields journal September 2003
In Situ Methods, Infrastructures, and Applications on High Performance Computing Platforms journal June 2016
A Survey of Topology-based Methods in Visualization journal June 2016
Simplex and Diamond Hierarchies: Models and Applications journal November 2011
In-situ Sampling of a Large-Scale Particle Simulation for Interactive Visualization and Analysis journal June 2011
Three-dimensional subband coding of video using the zero-tree method conference February 1996
QccPack: an open-source software library for quantization, compression, and coding conference December 2000
A prototype discovery environment for analyzing and visualizing terascale turbulent fluid flow simulations conference March 2005
Fast Discrete Curvelet Transforms journal January 2006
Tensor Decompositions and Applications journal August 2009
Discrete Cosine Transform book April 2001
Adaptive tetrapuzzles: efficient out-of-core construction and visualization of gigantic multiresolution polygonal models journal August 2004
Quadric-based simplification in any dimension journal April 2005
Efficient query processing on unstructured tetrahedral meshes
  • Papadomanolakis, Stratos; Ailamaki, Anastassia; Lopez, Julio C.
  • Proceedings of the 2006 ACM SIGMOD international conference on Management of data - SIGMOD '06 https://doi.org/10.1145/1142473.1142535
conference January 2006
Adaptive tetrapuzzles: efficient out-of-core construction and visualization of gigantic multiresolution polygonal models conference January 2004
Decimation of triangle meshes
  • Schroeder, William J.; Zarge, Jonathan A.; Lorensen, William E.
  • Proceedings of the 19th annual conference on Computer graphics and interactive techniques - SIGGRAPH '92 https://doi.org/10.1145/133994.134010
conference January 1992
Decimation of triangle meshes journal July 1992
Vector quantization for volume rendering conference January 1992
Adaptive TetraPuzzles: efficient out-of-core construction and visualization of gigantic multiresolution polygonal models conference January 2008
Lossless compression of volume data conference January 1994
Arithmetic coding for data compression journal June 1987
BTRFS: The Linux B-Tree Filesystem journal August 2013
Bitmap index design and evaluation conference January 1998
Bitmap index design and evaluation journal June 1998
Enabling Adaptive Scientific Workflows Via Trigger Detection
  • Salloum, Maher; Bennett, Janine C.; Pinar, Ali
  • Proceedings of the First Workshop on In Situ Infrastructures for Enabling Extreme-Scale Analysis and Visualization - ISAV2015 https://doi.org/10.1145/2828612.2828619
conference January 2015
Image-driven simplification journal July 2000
A mathematical theory of communication journal January 2001
R-trees: a dynamic index structure for spatial searching conference January 1984
The R*-tree: an efficient and robust access method for points and rectangles
  • Beckmann, Norbert; Kriegel, Hans-Peter; Schneider, Ralf
  • Proceedings of the 1990 ACM SIGMOD international conference on Management of data - SIGMOD '90 https://doi.org/10.1145/93597.98741
conference January 1990
The R*-tree: an efficient and robust access method for points and rectangles journal May 1990
DEFLATE Compressed Data Format Specification version 1.3 report May 1996
Lossy volume compression using Tucker truncation and thresholding text January 2016

Cited By (4)

Is Smaller Always Better? - Evaluating Video Compression Techniques for Simulation Ensembles text January 2021
Visitation Graphs: Interactive Ensemble Visualization with Visitation Maps text January 2021
Use cases of lossy compression for floating-point data in scientific data sets journal May 2019
VAPOR: A Visualization Package Tailored to Analyze Simulation Data in Earth System Science journal August 2019

Similar Records

Understanding and Modeling Lossy Compression Schemes on HPC Scientific Data
Conference · Tue May 01 00:00:00 EDT 2018 · OSTI ID:1468061

Evaluating lossy data compression on climate simulation data within a large ensemble
Journal Article · Tue Dec 06 23:00:00 EST 2016 · Geoscientific Model Development (Online) · OSTI ID:1389988

Understanding Performance-Quality Trade-offs in Scientific Visualization Workflows with Lossy Compression
Conference · Fri Nov 01 00:00:00 EDT 2019 · OSTI ID:1657917