e RPCA : Robust Principal Component Analysis for Exponential Family Distributions
- Department of Statistical Science Duke University Durham North Carolina USA
- School of Data Science The Chinese University of Hong Kong Shenzhen China
- H. Milton Stewart School of Industrial and Systems Engineering (ISyE) Georgia Institute of Technology Atlanta Georgia USA
Abstract Robust principal component analysis (RPCA) is a widely used method for recovering low‐rank structure from data matrices corrupted by significant and sparse outliers. These corruptions may arise from occlusions, malicious tampering, or other causes for anomalies, and the joint identification of such corruptions with low‐rank background is critical for process monitoring and diagnosis. However, existing RPCA methods and their extensions largely do not account for the underlying probabilistic distribution for the data matrices, which in many applications are known and can be highly non‐Gaussian. We thus propose a new method called RPCA for exponential family distributions (), which can perform the desired decomposition into low‐rank and sparse matrices when such a distribution falls within the exponential family. We present a novel alternating direction method of multiplier optimization algorithm for efficient decomposition, under either its natural or canonical parametrization. The effectiveness of is then demonstrated in two applications: the first for steel sheet defect detection and the second for crime activity monitoring in the Atlanta metropolitan area.
- Sponsoring Organization:
- USDOE
- OSTI ID:
- 2329329
- Journal Information:
- Statistical Analysis and Data Mining, Journal Name: Statistical Analysis and Data Mining Journal Issue: 2 Vol. 17; ISSN 1932-1864
- Publisher:
- Wiley Blackwell (John Wiley & Sons)Copyright Statement
- Country of Publication:
- United States
- Language:
- English
Similar Records
Spy the Lie: Detecting Malicious Insiders
21 cm Signal Recovery via Robust Principal Component Analysis