OSTI.GOV, U.S. Department of Energy
Office of Scientific and Technical Information

Title: Reverse-mode differentiation in arbitrary tensor network format: with application to supervised learning.

Journal Article · Journal of Machine Learning Research
OSTI ID: 1872019
 [1];  [2];  [3]
  1. Univ. of Michigan, Ann Arbor, MI (United States)
  2. Sandia National Lab. (SNL-CA), Livermore, CA (United States)
  3. Sandia National Lab. (SNL-NM), Albuquerque, NM (United States)

This paper describes an efficient reverse-mode differentiation algorithm for contraction operations of tensor networks that may have arbitrary and unconventional network topologies. The approach leverages the tensor contraction tree of Evenbly and Pfeifer (2014), which provides an instruction set for the contraction sequence of a network. We show that this tree can be efficiently leveraged for differentiation of a full tensor network contraction using a recursive scheme that exploits (1) the bilinear property of contraction and (2) the property that trees have a single path from the root to each leaf. While differentiation of tensor-tensor contraction is already possible in most automatic differentiation packages, we show that exploiting these two additional properties in the specific context of contraction sequences can improve efficiency. Following a description of the algorithm and a computational complexity analysis, we investigate its utility for gradient-based supervised learning, both for low-rank function recovery and for fitting real-world unstructured datasets. We demonstrate improved performance over alternating least-squares optimization approaches and the capability to handle heterogeneous and arbitrary tensor network formats. Compared to alternating minimization algorithms, we find that the gradient-based approach requires a smaller oversampling ratio (number of samples relative to the number of model parameters) for recovery. This increased efficiency extends to fitting unstructured data of varying dimensionality and to a variety of tensor network formats. In particular, we show improved learning with the hierarchical Tucker format over the tensor-train in high-dimensional settings on a number of benchmark problems.
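The bilinearity exploited by the algorithm can be illustrated on a small example. Because a tensor network contraction is linear in each leaf tensor separately, the gradient of a scalar contraction with respect to any one leaf is simply the contraction of all the other leaves (that leaf's "environment"). The following sketch, which assumes only NumPy and is not the paper's implementation, shows this for a three-tensor chain and verifies one entry with a finite difference:

```python
import numpy as np

rng = np.random.default_rng(0)
# Three tensors contracted in a chain: scalar f = sum_{ij} a_i * M_ij * b_j
a = rng.standard_normal(4)
M = rng.standard_normal((4, 5))
b = rng.standard_normal(5)

f = np.einsum("i,ij,j->", a, M, b)

# Reverse mode via bilinearity: the gradient w.r.t. each leaf tensor is
# the contraction of all *other* leaves, i.e. that leaf's environment.
grad_a = np.einsum("ij,j->i", M, b)   # environment of a
grad_M = np.einsum("i,j->ij", a, b)   # environment of M (outer product)
grad_b = np.einsum("i,ij->j", a, M)   # environment of b

# Finite-difference sanity check on one entry of M
eps = 1e-6
Mp = M.copy()
Mp[1, 2] += eps
fd = (np.einsum("i,ij,j->", a, Mp, b) - f) / eps
assert abs(fd - grad_M[1, 2]) < 1e-4
```

A contraction tree generalizes this: walking from the root down the unique path to each leaf, one reuses intermediate contractions shared among environments instead of recomputing each environment from scratch.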

Research Organization:
Sandia National Lab. (SNL-NM), Albuquerque, NM (United States)
Sponsoring Organization:
USDOE Office of Science (SC), Advanced Scientific Computing Research (ASCR)
Grant/Contract Number:
NA0003525
Report Number(s):
SAND2022-6112J; 706378
Journal Information:
Journal of Machine Learning Research, Vol. 23, Issue 143; ISSN 1532-4435
Publisher:
JMLR
Country of Publication:
United States
Language:
English