DOE PAGES title logo U.S. Department of Energy
Office of Scientific and Technical Information

Title: Randomized Algorithms for Symmetric Nonnegative Matrix Factorization

Journal Article · · SIAM Journal on Matrix Analysis and Applications
DOI: https://doi.org/10.1137/24m1638355 · OSTI ID:2526199

Symmetric Nonnegative Matrix Factorization (SymNMF) is a technique in data analysis and machine learning that approximates a matrix with a product of a nonnegative, low-rank matrix and it transpose. To design faster and more scalable algorithms for SymNMF we develop two randomized algorithms for its computation. The first method uses randomized matrix sketching to compute an initial low-rank approximation to the input matrix and proceeds to uses this as a low-rank input to rapidly compute a SymNMF. The second methods uses randomized leverage score sampling to approximately solve constrained least squares problems. Many successful methods for SymNMF rely on (approximately) solving sequences of constrained least squares problems. Here, we prove theoretically that leverage score sampling can approximately solve constrained least squares problems to e-accuracy. Finally we demonstrate both methods work in practice by applying them to graph clustering tasks on large real world data sets. These experiments show that our methods approximately maintain solution quality and achieve significant speed ups for both large dense and large sparse problems.

Research Organization:
Pacific Northwest National Laboratory (PNNL), Richland, WA (United States)
Sponsoring Organization:
National Science Foundation (NSF); USDOE
Grant/Contract Number:
AC05-76RL01830; SC0020347
OSTI ID:
2526199
Report Number(s):
PNNL-SA--193926
Journal Information:
SIAM Journal on Matrix Analysis and Applications, Journal Name: SIAM Journal on Matrix Analysis and Applications Journal Issue: 1 Vol. 46; ISSN 0895-4798
Publisher:
Society for Industrial and Applied Mathematics (SIAM)Copyright Statement
Country of Publication:
United States
Language:
English

References (31)

Stability of the solution of definite quadratic programs journal December 1973
Faster least squares approximation journal October 2010
Algorithms for nonnegative matrix and tensor factorizations: a unified view based on block coordinate descent framework journal March 2013
SymNMF: nonnegative low-rank approximation of a similarity matrix for graph clustering journal November 2014
DC-NMF: nonnegative matrix factorization based on divide-and-conquer for fast clustering and topic modeling journal April 2017
Hybrid clustering based on content and connection structure using joint nonnegative matrix factorization journal October 2017
Silhouettes: A graphical aid to the interpretation and validation of cluster analysis journal November 1987
Random projections for the nonnegative least-squares problem journal August 2009
SVD based initialization: A head start for nonnegative matrix factorization journal April 2008
Randomized nonnegative matrix factorization journal March 2018
Randomized numerical linear algebra: Foundations and algorithms journal May 2020
Learning the parts of objects by non-negative matrix factorization journal October 1999
Randomized CP tensor decomposition journal May 2020
MPI-FAUN: An MPI-Based Framework for Alternating-Updating Nonnegative Matrix Factorization journal March 2018
Fast Nonnegative Matrix/Tensor Factorization Based on Low-Rank Approximation journal June 2012
Compressed Nonnegative Matrix Factorization Is Fast and Accurate journal May 2016
Efficient and Non-Convex Coordinate Descent for Symmetric Nonnegative Matrix Factorization journal November 2016
Finding Structure with Randomness: Probabilistic Algorithms for Constructing Approximate Matrix Decompositions journal January 2011
Efficient Nonnegative Matrix Factorization with Random Projections conference April 2010
Nonnegative Matrix Factorization book January 2020
Fast Nonnegative Matrix Factorization: An Active-Set-Like Method and Comparisons journal January 2011
Subspace Iteration Randomization and Singular Value Problems journal January 2015
A Practical Randomized CP Tensor Decomposition journal January 2018
Practical Leverage-Based Sampling for Low-Rank Tensor Decomposition journal August 2022
Fast Monte Carlo Algorithms for Matrices I: Approximating Matrix Multiplication journal January 2006
Planc journal June 2021
Accelerated Multiplicative Updates and Hierarchical ALS Algorithms for Nonnegative Matrix Factorization journal April 2012
Projected Gradient Methods for Nonnegative Matrix Factorization journal October 2007
Hierarchical community detection via rank-2 symmetric nonnegative matrix factorization journal September 2017
Mega journal January 2020
Fast Local Algorithms for Large Scale Nonnegative Matrix and Tensor Factorizations journal January 2009