Skip to main content
U.S. Department of Energy
Office of Scientific and Technical Information

A Survey of Singular Value Decomposition Methods for Distributed Tall/Skinny Data

Conference ·
The Singular Value Decomposition (SVD) is one of the most important matrix factorizations, enjoying a wide variety of applications across numerous application domains. In statistics and data analysis, the common applications of SVD inclue Principal Components Analysis (PCA) and regression. Usually these applications arise on data that has far more rows than columns, so-called "tall/skinny" matrices. In the big data analytics context, this may take the form of hundreds of millions to billions of rows with only a few hundred columns. There is a need, therefore, for fast, accurate, and scalable tall/skinny SVD implementations which can fully utilize modern computing resources. To that end, we present a survey of three different algorithms for computing the SVD for these kinds of tall/skinny data layouts using MPI for communication. We contextualize these with common big data analytics techniques. Finally, we present both CPU and GPU timing results from the Summit supercomputer, and discuss possible alternative approaches.
Research Organization:
Oak Ridge National Laboratory (ORNL), Oak Ridge, TN (United States)
Sponsoring Organization:
USDOE; USDOE Office of Science (SC), Advanced Scientific Computing Research (ASCR) (SC-21)
DOE Contract Number:
AC05-00OR22725
OSTI ID:
1772867
Country of Publication:
United States
Language:
English

Similar Records

Reconstructing householder vectors from Tall-Skinny QR
Journal Article · Wed Aug 05 00:00:00 EDT 2015 · Journal of Parallel and Distributed Computing · OSTI ID:1236219

A parallel strategy for density functional theory computations on accelerated nodes
Journal Article · Thu Oct 15 00:00:00 EDT 2020 · Parallel Computing · OSTI ID:1731060

Related Subjects