Skip to main content
U.S. Department of Energy
Office of Scientific and Technical Information

Anomaly detection in scientific data using joint statistical moments

Journal Article · · Journal of Computational Physics
 [1];  [1];  [1];  [2];  [3];  [2]
  1. Sandia National Lab. (SNL-CA), Livermore, CA (United States)
  2. Sandia National Lab. (SNL-NM), Albuquerque, NM (United States)
  3. Citrine Informatics, Redwood City, CA (United States)

We propose an anomaly detection method for multi-variate scientific data based on analysis of high-order joint moments. Using kurtosis as a reliable measure of outliers, we suggest that principal kurtosis vectors, by analogy to principal component analysis (PCA) vectors, signify the principal directions along which outliers appear. The inception of an anomaly, then, manifests as a change in the principal values and vectors of kurtosis. Obtaining the principal kurtosis vectors requires decomposing a fourth order joint cumulant tensor for which we use a simple, computationally less expensive approach that involves performing a singular value decomposition (SVD) over the matricized tensor. We demonstrate the efficacy of this approach on synthetic data, and develop an algorithm to identify the occurrence of a spatial and/or temporal anomalous event in scientific phenomena. The algorithm decomposes the data into several spatial sub-domains and time steps to identify regions with such events. Feature moment metrics, based on the alignments of the principal kurtosis vectors, are computed at each sub-domain and time step for all features to quantify their relative importance towards the overall kurtosis in the data. Accordingly, spatial and temporal anomaly metrics for each sub-domain are proposed using the Hellinger distance of the feature moment metric distribution from a suitable nominal distribution. Finally, we apply the algorithm to two turbulent auto-ignition combustion cases and demonstrate that the anomaly metrics reliably capture the occurrence of auto-ignition in relevant spatial sub-domains at the right time steps.

Research Organization:
Sandia National Laboratories (SNL-CA), Livermore, CA (United States); Sandia National Laboratories (SNL-NM), Albuquerque, NM (United States)
Sponsoring Organization:
USDOE National Nuclear Security Administration (NNSA)
Grant/Contract Number:
AC04-94AL85000
OSTI ID:
1502456
Alternate ID(s):
OSTI ID: 1502973
OSTI ID: 1636004
Report Number(s):
SAND--2018-8923J; 667074
Journal Information:
Journal of Computational Physics, Journal Name: Journal of Computational Physics Vol. 387; ISSN 0021-9991
Publisher:
ElsevierCopyright Statement
Country of Publication:
United States
Language:
English

References (20)

A Multilinear Singular Value Decomposition journal January 2000
Numerically stable, scalable formulas for parallel and online computation of higher-order multivariate central moments with arbitrary weights journal March 2016
On the evaluation of unsupervised outlier detection: measures, datasets, and an empirical study journal January 2016
Direct numerical simulations of HCCI/SACI with ethanol journal July 2014
Scalar mixing in direct numerical simulations of temporally evolving plane jet flames with skeletal CO/H2 kinetics journal January 2007
Advanced compression-ignition engines—understanding the in-cylinder processes journal January 2009
Direct numerical simulation of flame stabilization assisted by autoignition in a reheat gas turbine combustor journal January 2019
Three-dimensional direct numerical simulation of a turbulent lifted hydrogen jet flame in heated coflow: a chemical explosive mode analysis journal May 2010
Kurtosis as Peakedness, 1905–2014. R.I.P. journal July 2014
Procedures for Detecting Outlying Observations in Samples journal February 1969
Terascale direct numerical simulations of turbulent combustion using S3D journal January 2009
Independent component analysis and (simultaneous) third-order tensor diagonalization journal January 2001
Using feature importance metrics to detect events of interest in scientific computing applications conference October 2017
The Reheat Concept: The Proven Pathway to Ultralow Emissions and High Efficiency and Flexibility journal December 2008
Symmetric Tensors and Symmetric Tensor Rank journal January 2008
Tensor Decompositions and Applications journal August 2009
Trigger Detection for Adaptive Scientific Workflows Using Percentile Sampling journal January 2016
A Multilinear Singular Value Decomposition journal January 2000
Anomaly detection: A survey journal July 2009
A Comparative Evaluation of Unsupervised Anomaly Detection Algorithms for Multivariate Data journal April 2016

Similar Records

In situ multi-tier auto-ignition detection applied to dual-fuel combustion simulations
Journal Article · Tue Jun 17 00:00:00 EDT 2025 · Combustion and Flame · OSTI ID:2574694

Fast and efficient identification of anomalous galaxy spectra with neural density estimation
Journal Article · Tue Sep 12 00:00:00 EDT 2023 · Monthly Notices of the Royal Astronomical Society · OSTI ID:2008397