U.S. Department of Energy
Office of Scientific and Technical Information

Parallel Tensor Compression for Large-Scale Scientific Data.

Technical Report · DOI: https://doi.org/10.2172/1226255 · OSTI ID: 1226255
 [1];  [1];  [2]
  1. Sandia National Lab. (SNL-CA), Livermore, CA (United States)
  2. Univ. of Texas, Austin, TX (United States)
As parallel computing trends towards the exascale, scientific data produced by high-fidelity simulations are growing increasingly massive. For instance, a simulation on a three-dimensional spatial grid with 512 points per dimension that tracks 64 variables per grid point for 128 time steps yields 8 TB of data. By viewing the data as a dense five-way tensor, we can compute a Tucker decomposition to find inherent low-dimensional multilinear structure, achieving compression ratios of up to 10,000 on real-world data sets with negligible loss in accuracy. To operate on such massive data, we present the first distributed-memory parallel implementation of the Tucker decomposition, whose key computations correspond to parallel linear algebra operations, albeit with nonstandard data layouts. Our approach specifies a data distribution for tensors that avoids any tensor data redistribution, either locally or in parallel. We provide accompanying analysis of the computation and communication costs of the algorithms. To demonstrate the compression and accuracy of the method, we apply our approach to real-world data sets from combustion science simulations. We also provide detailed performance results, including parallel performance in both weak and strong scaling experiments.
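The storage arithmetic in the abstract can be checked with a short sketch. It assumes 8-byte double-precision values (the abstract does not state the precision) and uses the standard storage count for a Tucker decomposition: one core tensor plus one factor matrix per mode. The function names and the `ranks` tuple are hypothetical, chosen only for illustration; they are not values or code from the report.

```python
from math import prod

def dense_bytes(dims, bytes_per_value=8):
    """Bytes needed to store a dense tensor (assumes 8-byte doubles by default)."""
    return prod(dims) * bytes_per_value

def tucker_values(dims, ranks):
    """Number of stored values in a Tucker decomposition:
    a core tensor of size R_1 x ... x R_N plus one I_n x R_n factor matrix per mode."""
    core = prod(ranks)
    factors = sum(i * r for i, r in zip(dims, ranks))
    return core + factors

def compression_ratio(dims, ranks):
    """Ratio of dense value count to Tucker value count."""
    return prod(dims) / tucker_values(dims, ranks)

# Five-way tensor from the abstract: 512^3 spatial grid, 64 variables, 128 time steps.
dims = (512, 512, 512, 64, 128)
full_bytes = dense_bytes(dims)
full_tib = full_bytes / 2**40  # size in units of 2^40 bytes

# Hypothetical multilinear ranks, for illustration only (not from the report).
ranks = (32, 32, 32, 16, 16)
ratio = compression_ratio(dims, ranks)

print(f"dense size: {full_tib} TiB")
print(f"compression ratio for ranks {ranks}: {ratio:.0f}")
```

Under the double-precision assumption, the dense tensor holds 2^40 values, which is exactly 8 × 2^40 bytes and matches the 8 TB quoted above. The achievable ratio on real data depends on the ranks needed to meet a given error tolerance, which this sketch does not compute.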
Research Organization:
Sandia National Laboratories (SNL-CA), Livermore, CA (United States)
Sponsoring Organization:
USDOE Office of Science (SC), Advanced Scientific Computing Research (ASCR) (SC-21)
DOE Contract Number:
AC04-94AL85000
OSTI ID:
1226255
Report Number(s):
SAND2015--9205R; 608108
Country of Publication:
United States
Language:
English

Similar Records

TuckerCompressMPI v. 1.0
Software · Tue Sep 20 20:00:00 EDT 2016 · OSTI ID:code-45231

TuckerMPI: A Parallel C++/MPI Software Package for Large-scale Data Compression via the Tucker Tensor Decomposition
Journal Article · Wed Jun 10 20:00:00 EDT 2020 · ACM Transactions on Mathematical Software · OSTI ID:1639093