Randomized Algorithms for Symmetric Nonnegative Matrix Factorization
- Georgia Institute of Technology, Atlanta, GA (United States)
- Pacific Northwest National Laboratory (PNNL), Richland, WA (United States)
- Wake Forest Univ., Winston-Salem, NC (United States)
Symmetric Nonnegative Matrix Factorization (SymNMF) is a technique in data analysis and machine learning that approximates a matrix with a product of a nonnegative, low-rank matrix and it transpose. To design faster and more scalable algorithms for SymNMF we develop two randomized algorithms for its computation. The first method uses randomized matrix sketching to compute an initial low-rank approximation to the input matrix and proceeds to uses this as a low-rank input to rapidly compute a SymNMF. The second methods uses randomized leverage score sampling to approximately solve constrained least squares problems. Many successful methods for SymNMF rely on (approximately) solving sequences of constrained least squares problems. Here, we prove theoretically that leverage score sampling can approximately solve constrained least squares problems to e-accuracy. Finally we demonstrate both methods work in practice by applying them to graph clustering tasks on large real world data sets. These experiments show that our methods approximately maintain solution quality and achieve significant speed ups for both large dense and large sparse problems.
- Research Organization:
- Pacific Northwest National Laboratory (PNNL), Richland, WA (United States)
- Sponsoring Organization:
- National Science Foundation (NSF); USDOE
- Grant/Contract Number:
- AC05-76RL01830; SC0020347
- OSTI ID:
- 2526199
- Report Number(s):
- PNNL-SA--193926
- Journal Information:
- SIAM Journal on Matrix Analysis and Applications, Journal Name: SIAM Journal on Matrix Analysis and Applications Journal Issue: 1 Vol. 46; ISSN 0895-4798
- Publisher:
- Society for Industrial and Applied Mathematics (SIAM)Copyright Statement
- Country of Publication:
- United States
- Language:
- English
Similar Records
Algorithm for linear least squares problems with equality and nonnegativity constraints
Tensor Decompositions for Count Data that Leverage Stochastic and Deterministic Optimization