|
Anomaly detection tools have been increasingly used in recent years to generate predictive insights on rare events. The typical challenges encountered in such applications include a large number of data dimensions and absence of labeled data. An anomaly detection strategy for these scenarios is dimensionality reduction followed by clustering in the reduced space, with the degree of anomaly of an event or observation quantified by statistical distance from the clusters. However, most research efforts so far are focused on single abrupt anomalies, while the correlation between observations is completely ignored. In this paper, we address the problem of detection of both abrupt and sustained anomalies with high dimensions. The task becomes more challenging than only detecting abrupt outliers because of the gradual and indiscriminant changes in sustained anomalies. We utilize a mixture model of probabilistic principal component analyzers to quantify each observation by probabilistic measures. A statistical process control method is then used to monitor both abrupt and gradual changes. On the other hand, the mixture model can be regarded as a trade-off strategy between linear and nonlinear dimensionality reductions in terms of computational efficiency. This compromise is particularly important in real-time deployment. The proposed method is evaluated on simulated and benchmark data, as well as on data from wide-area sensors at a truck weigh station test-bed.
|
| Authors: |
Fang, Yi [ORNL];
Ganguly, Auroop R [ORNL]
|
| Publication Date: | 2007 Aug 01 |
| OSTI Identifier: | 932067 |
| DOE Contract Number: | DE-AC05-00OR22725 |
| Resource Type: | Conference/Event |
| Resource Relation: | ACM KDD 2007 - ACM Workshop on Knowledge Discovery from Sensor Data, San Jose, CA, USA, 20070812, 20070812 |
| Research Org: | Oak Ridge National Laboratory (ORNL) |
| Sponsoring Org: | ORNL LDRD Director's R&D |
| Country of Publication: | United States |
| Language: | English |
| Format: | Size: 10 |
| Other Number(s): | TRN: US200901%%1242 |
| Subject: | 99 GENERAL AND MISCELLANEOUS//MATHEMATICS, COMPUTING, AND INFORMATION SCIENCE; VARIATIONS; DETECTION; DATA ANALYSIS; PROBABILITY; SENSORS; WEIGHT; TRUCKS |
| Update Date: | 2011 Jan 27 |
Top |
|