skip to main content
OSTI.GOV title logo U.S. Department of Energy
Office of Scientific and Technical Information

Title: Anomaly detection in scientific data using joint statistical moments

Journal Article · · Journal of Computational Physics
 [1];  [1];  [1];  [2];  [3];  [2]
  1. Sandia National Lab. (SNL-CA), Livermore, CA (United States)
  2. Sandia National Lab. (SNL-NM), Albuquerque, NM (United States)
  3. Citrine Informatics, Redwood City, CA (United States)

We propose an anomaly detection method for multi-variate scientific data based on analysis of high-order joint moments. Using kurtosis as a reliable measure of outliers, we suggest that principal kurtosis vectors, by analogy to principal component analysis (PCA) vectors, signify the principal directions along which outliers appear. The inception of an anomaly, then, manifests as a change in the principal values and vectors of kurtosis. Obtaining the principal kurtosis vectors requires decomposing a fourth order joint cumulant tensor for which we use a simple, computationally less expensive approach that involves performing a singular value decomposition (SVD) over the matricized tensor. We demonstrate the efficacy of this approach on synthetic data, and develop an algorithm to identify the occurrence of a spatial and/or temporal anomalous event in scientific phenomena. The algorithm decomposes the data into several spatial sub-domains and time steps to identify regions with such events. Feature moment metrics, based on the alignments of the principal kurtosis vectors, are computed at each sub-domain and time step for all features to quantify their relative importance towards the overall kurtosis in the data. Accordingly, spatial and temporal anomaly metrics for each sub-domain are proposed using the Hellinger distance of the feature moment metric distribution from a suitable nominal distribution. Finally, we apply the algorithm to two turbulent auto-ignition combustion cases and demonstrate that the anomaly metrics reliably capture the occurrence of auto-ignition in relevant spatial sub-domains at the right time steps.

Research Organization:
Sandia National Lab. (SNL-CA), Livermore, CA (United States); Sandia National Lab. (SNL-NM), Albuquerque, NM (United States)
Sponsoring Organization:
USDOE Office of Science (SC), Advanced Scientific Computing Research (ASCR); USDOE National Nuclear Security Administration (NNSA)
Grant/Contract Number:
NA0003525; AC04-94AL85000; FWP16-019471
OSTI ID:
1502973
Alternate ID(s):
OSTI ID: 1502456; OSTI ID: 1636004
Report Number(s):
SAND-2019-2948J; SAND-2018-8923J; 673503
Journal Information:
Journal of Computational Physics, Vol. 387; ISSN 0021-9991
Publisher:
ElsevierCopyright Statement
Country of Publication:
United States
Language:
English
Citation Metrics:
Cited by: 5 works
Citation information provided by
Web of Science

References (19)

Anomaly detection: A survey journal July 2009
Using feature importance metrics to detect events of interest in scientific computing applications conference October 2017
Numerically stable, scalable formulas for parallel and online computation of higher-order multivariate central moments with arbitrary weights journal March 2016
Procedures for Detecting Outlying Observations in Samples journal February 1969
On the evaluation of unsupervised outlier detection: measures, datasets, and an empirical study journal January 2016
A Comparative Evaluation of Unsupervised Anomaly Detection Algorithms for Multivariate Data journal April 2016
Kurtosis as Peakedness, 1905–2014. R.I.P. journal July 2014
Tensor Decompositions and Applications journal August 2009
Symmetric Tensors and Symmetric Tensor Rank journal January 2008
Independent component analysis and (simultaneous) third-order tensor diagonalization journal January 2001
Advanced compression-ignition engines—understanding the in-cylinder processes journal January 2009
The Reheat Concept: The Proven Pathway to Ultralow Emissions and High Efficiency and Flexibility journal December 2008
Direct numerical simulation of flame stabilization assisted by autoignition in a reheat gas turbine combustor journal January 2019
Trigger Detection for Adaptive Scientific Workflows Using Percentile Sampling journal January 2016
Three-dimensional direct numerical simulation of a turbulent lifted hydrogen jet flame in heated coflow: a chemical explosive mode analysis journal May 2010
Terascale direct numerical simulations of turbulent combustion using S3D journal January 2009
Scalar mixing in direct numerical simulations of temporally evolving plane jet flames with skeletal CO/H2 kinetics journal January 2007
Direct numerical simulations of HCCI/SACI with ethanol journal July 2014
A Multilinear Singular Value Decomposition journal January 2000