Skip to main content
U.S. Department of Energy
Office of Scientific and Technical Information

A Class of Sparse Johnson–Lindenstrauss Transforms and Analysis of their Extreme Singular Values

Journal Article · · SIAM Journal on Matrix Analysis and Applications
DOI:https://doi.org/10.1137/23m1605661· OSTI ID:2563384
 [1];  [2]
  1. Argonne National Laboratory (ANL), Argonne, IL (United States)
  2. Lawrence Berkeley National Laboratory (LBNL), Berkeley, CA (United States)
The Johnson–Lindenstrauss (JL) lemma is a powerful tool for dimensionality reduction in modern algorithm design. The lemma states that any set of high-dimensional points in a Euclidean space can be projected into lower dimensions while approximately preserving pairwise Euclidean distances. Random matrices satisfying this lemma are called JL transforms (JLTs). Inspired by existing $$s$$-hashing JLTs with exactly $$s$$ nonzero elements on each column, the present work introduces an ensemble of sparse matrices encompassing so-called $$s$$-hashing-like matrices whose expected number of nonzero elements on each column is $$s$$. The independence of the sub-Gaussian entries of these matrices and the knowledge of their exact distribution play an important role in their analyses. Using properties of independent sub-Gaussian random variables, these matrices are demonstrated to be JLTs, and their smallest nontrivial singular values and largest singular values are estimated nonasymptotically using a technique from geometric functional analysis. As the dimensions of the matrix grow to infinity, these singular values are proved to converge almost surely to fixed quantities (by using the universal Bai–Yin law) and in distribution to the Gaussian orthogonal ensemble Tracy–Widom law after proper rescalings. Understanding the behaviors of extreme singular values is important in general because they are often used to define a measure of stability of matrix algorithms. For example, JLTs were recently used in derivative-free optimization algorithmic frameworks to select random subspaces in which are constructed random models or poll directions to achieve scalability, and hence estimating their smallest singular value in particular helps determine the dimension of these subspaces.
Research Organization:
Lawrence Berkeley National Laboratory (LBNL), Berkeley, CA (United States)
Sponsoring Organization:
USDOE Office of Science (SC), Advanced Scientific Computing Research (ASCR). Scientific Discovery through Advanced Computing (SciDAC)
Grant/Contract Number:
AC02-05CH11231
OSTI ID:
2563384
Journal Information:
SIAM Journal on Matrix Analysis and Applications, Journal Name: SIAM Journal on Matrix Analysis and Applications Journal Issue: 1 Vol. 46; ISSN 0895-4798
Publisher:
Society for Industrial and Applied Mathematics (SIAM)Copyright Statement
Country of Publication:
United States
Language:
English

References (31)

Smallest singular value of a random rectangular matrix journal December 2009
An elementary proof of a theorem of Johnson and Lindenstrauss journal November 2002
On variants of the Johnson–Lindenstrauss lemma journal April 2008
Derivative-Free and Blackbox Optimization book January 2017
On the limit of the largest eigenvalue of the large dimensional sample covariance matrix journal August 1988
A Universality Result for the Smallest Eigenvalues of Certain Sample Covariance Matrices journal April 2010
Universality at the Edge of the Spectrum¶in Wigner Random Matrices journal November 1999
Universality results for the largest eigenvalues of some sample covariance matrix ensembles journal January 2008
Scalable subspace methods for derivative-free nonlinear least-squares optimization journal June 2022
Dimensionality reduction via the Johnson–Lindenstrauss Lemma: theoretical and empirical bounds on embedding dimension journal May 2018
Database-friendly random projections: Johnson-Lindenstrauss with binary coins journal June 2003
Smallest singular value of random matrices and geometry of random polytopes journal August 2005
The Littlewood–Offord problem and invertibility of random matrices journal June 2008
High-Dimensional Probability: An Introduction with Applications in Data Science book September 2018
Probability: Theory and Examples book January 2019
A Note on Universality of the Distribution of the Largest Eigenvalues in Certain Sample Covariance Matrices journal September 2002
Distribution of Eigenvalues for some sets of Random Matrices journal April 1967
Extensions of Lipschitz mappings into a Hilbert space book January 1984
Topics in Random Matrix Theory book January 2012
Recent developments in non-asymptotic theory of random matrices book January 2014
Introduction to Derivative-Free Optimization book January 2009
Direct Search Based on Probabilistic Descent in Reduced Spaces journal November 2023
Stochastic Trust-Region Algorithm in Random Subspaces with Convergence and Expected Complexity Analyses journal July 2024
Non-asymptotic Theory of Random Matrices: Extreme Singular Values conference June 2011
Sparser Johnson-Lindenstrauss Transforms journal January 2014
Sharp nonasymptotic bounds on the norm of random matrices with independent entries journal July 2016
Limit of the Smallest Eigenvalue of a Large Dimensional Sample Covariance Matrix journal July 1993
The Smallest Eigenvalue of a Large Dimensional Wishart Matrix journal November 1985
A Limit Theorem for the Norm of Random Matrices journal April 1980
Computational Advertising: Techniques for Targeting Relevant Ads journal January 2014
On the Distribution of the Roots of Certain Symmetric Matrices journal March 1958

Similar Records

An Analysis of the Johnson-Lindenstrauss Lemma with the Bivariate Gamma Distribution
Technical Report · Tue Jan 24 23:00:00 EST 2023 · OSTI ID:1959476

Faster Johnson–Lindenstrauss transforms via Kronecker products
Journal Article · Thu Oct 22 20:00:00 EDT 2020 · Information and Inference (Online) · OSTI ID:1738933

Large Deviations of Extreme Eigenvalues of Random Matrices
Journal Article · Fri Oct 20 00:00:00 EDT 2006 · Physical Review Letters · OSTI ID:20861003