Skip to main content
U.S. Department of Energy
Office of Scientific and Technical Information

GentenMPI: Distributed Memory Sparse Tensor Decomposition

Technical Report ·
DOI:https://doi.org/10.2172/1656940· OSTI ID:1656940
 [1];  [1]
  1. Sandia National Laboratories (SNL), Albuquerque, NM, and Livermore, CA (United States)

GentenMPl is a toolkit of sparse canonical polyadic (CP) tensor decomposition algorithms that is designed to run effectively on distributed-memory high-performance computers. Its use of distributed-memory parallelism enables it to efficiently decompose tensors that are too large for a single compute node's memory. GentenMPl leverages Sandia's decades-long investment in the Trilinos solver framework for much of its parallel-computation capability. Trilinos contains numerical algorithms and linear algebra classes that have been optimized for parallel simulation of complex physical phenomena. This work applies these tools to the data science problem of sparse tensor decomposition. In this report, we describe the use of Trilinos in GentenMPl, extensions needed for sparse tensor decomposition, and implementations of the CP-ALS (CP via alternating least squares) and GCP-SGD (generalized CP via stochastic gradient descent) sparse tensor decomposition algorithms. We show that GentenMPl can decompose sparse tensors of extreme size, e.g., a 12.6-terabyte tensor on 8192 computer cores. We demonstrate that the Trilinos backbone provides good strong and weak scaling of the tensor decomposition algorithms.

Research Organization:
Sandia National Laboratories (SNL-NM), Albuquerque, NM (United States)
Sponsoring Organization:
USDOE National Nuclear Security Administration (NNSA)
DOE Contract Number:
AC04-94AL85000; NA0003525
OSTI ID:
1656940
Report Number(s):
SAND--2020-8515; 690118
Country of Publication:
United States
Language:
English

Similar Records

Streaming Generalized Canonical Polyadic Tensor Decompositions
Technical Report · Fri Oct 01 00:00:00 EDT 2021 · OSTI ID:1832304

HPC formulations of optimization algorithms for tensor completion
Journal Article · Fri Nov 03 00:00:00 EDT 2017 · Parallel Computing · OSTI ID:1478749

Stochastic Gradients for Large-Scale Tensor Decomposition
Journal Article · Tue Oct 27 00:00:00 EDT 2020 · SIAM Journal on Mathematics of Data Science · OSTI ID:1738932

Related Subjects