DOE PAGES title logo U.S. Department of Energy
Office of Scientific and Technical Information

Title: Visualisation and outlier detection for probability density function ensembles

Journal Article · · Stat
DOI: https://doi.org/10.1002/sta4.662 · OSTI ID:2323357
ORCiD logo [1];  [1];  [1];  [2];  [2]
  1. Statistical Sciences (CCS‐6), Computer, Computational, and Statistical Sciences Division Los Alamos National Laboratory Los Alamos New Mexico USA
  2. Energy and Natural Resources Security (EES‐16), Earth and Environmental Sciences Division Los Alamos National Laboratory Los Alamos New Mexico USA

Abstract Exploratory data analysis (EDA) for functional data—data objects where observations are entire functions—is a difficult problem that has seen significant attention in recent literature. This surge in interest is motivated by the ubiquitous nature of functional data, which are prevalent in applications across fields such as meteorology, biology, medicine and engineering. Empirical probability density functions (PDFs) can be viewed as constrained functional data objects that must integrate to one and be nonnegative. They show up in contexts such as yearly income distributions, zooplankton size structure in oceanography and in connectivity patterns in the brain, among others. While PDF data are certainly common in modern research, little attention has been given to EDA specifically for PDFs. In this paper, we extend several methods for EDA on functional data for PDFs and compare them on simulated data that exhibit different types of variation, designed to mimic that seen in real‐world applications. We then use our new methods to perform EDA on the breakthrough curves observed in gas transport simulations for underground fracture networks.

Sponsoring Organization:
USDOE
OSTI ID:
2323357
Alternate ID(s):
OSTI ID: 2340908; OSTI ID: 2346235
Journal Information:
Stat, Journal Name: Stat Journal Issue: 2 Vol. 13; ISSN 2049-1573
Publisher:
Wiley Blackwell (John Wiley & Sons)Copyright Statement
Country of Publication:
United Kingdom
Language:
English

References (50)

Riemannian Analysis of Probability Density Functions with Applications in Vision conference June 2007
From Fluid Flow to Coupled Processes in Fractured Rock: Recent Advances and New Frontiers journal February 2022
Characterizing flow and transport in fractured geological media: A review journal August 2002
Fracture network flow prediction with uncertainty using physics-informed graph features journal October 2023
Bayes Hilbert Spaces
  • van den Boogaart, Karl Gerald; Egozcue, Juan José; Pawlowsky‐Glahn, Vera
  • Australian & New Zealand Journal of Statistics, Vol. 56, Issue 2 https://doi.org/10.1111/anzs.12074
journal June 2014
Economic Applications of Quantile Regression book January 2002
Functional Outlier Detection for Density-Valued Data with Application to Robustify Distribution-to-Distribution Regression journal January 2023
A Probabilistic Clustering Approach for Identifying Primary Subnetworks of Discrete Fracture Networks with Quantified Uncertainty journal January 2020
Contour Boxplots: A Method for Characterizing Uncertainty in Feature Sets from Simulation Ensembles journal December 2013
Functional Boxplots journal January 2011
Scaling of fracture systems in geological media journal August 2001
Functional data analysis characterizes the shapes of the first COVID-19 epidemic wave in Italy journal August 2021
Introduction to Nonparametric Estimation book January 2009
Efficient Monte Carlo With Graph‐Based Subsurface Flow and Transport Models journal May 2018
Comparison of alternative modelling approaches for groundwater flow in fractured rock journal February 2002
Simplicial band depth for multivariate functional data journal March 2014
Fracture size and transmissivity correlations: Implications for transport simulations in sparse three-dimensional discrete fracture networks following a truncated power law distribution of fracture size: FRACTURE SIZE AND TRANSMISSIVITY CORRELATIONS journal August 2016
Modeling Probability Density Functions as Data Objects journal January 2022
Curve Boxplot: Generalization of Boxplot for Ensembles of Curves journal December 2014
Understanding hydraulic fracturing: a multi-scale problem
  • Hyman, J. D.; Jiménez-Martínez, J.; Viswanathan, H. S.
  • Philosophical Transactions of the Royal Society A: Mathematical, Physical and Engineering Sciences, Vol. 374, Issue 2078 https://doi.org/10.1098/rsta.2015.0426
journal October 2016
The state of the art in monitoring and verification—Ten years on journal September 2015
Visualization and Outlier Detection for Multivariate Elastic Curve Data journal November 2020
The Behavior of Dense, Nonaqueous Phase Liquids in Fractured Clay and Rock journal September 1991
dfnWorks: A discrete fracture network framework for modeling subsurface flow and transport journal November 2015
Elastic Depths for Detecting Shape Anomalies in Functional Data journal October 2020
LQD-RKHS-based distribution-to-distribution regression methodology for restoring the probability distributions of missing SHM data journal April 2019
The shale gas revolution: Barriers, sustainability, and emerging opportunities journal August 2017
Investigating Concurrency in Online Auctions Through Visualization journal August 2006
Trajectory functional boxplots journal January 2020
Singular Value Decomposition and Its Visualization journal December 2007
On the Concept of Depth for Functional Data journal June 2009
Classifying densities using functional regression trees: Applications in oceanology journal June 2007
Trends, prospects and challenges in quantifying flow and transport through fractured rocks journal February 2005
Hilbert Space of Probability Density Functions Based on Aitchison Geometry journal January 2006
Sensitivity Analysis in the Presence of Intrinsic Stochasticity for Discrete Fracture Network Simulations journal August 2024
Functional Data Analysis book January 2005
Dissolution of non-aqueous-phase liquids and aqueous-phase contaminant transport in discretely-fractured porous media journal June 1996
Quantifying Transport Uncertainty in Unsaturated Rock using Monte Carlo Sampling of Retention Curves journal October 2012
Functional data analysis for density functions by transformation to a Hilbert space journal February 2016
Short-Term Spatio-Temporal Wind Power Forecast in Robust Look-ahead Power System Dispatch journal January 2014
Forecasting of density functions with an application to cross-sectional and intraday returns journal October 2019
A methodology to constrain the parameters of a hydrogeological discrete fracture network model for sparsely fractured crystalline rock, exemplified by data from the proposed high-level nuclear waste repository site at Forsmark, Sweden journal December 2013
A Geometric Approach to Visualization of Variability in Functional Data journal July 2016
Conforming Delaunay Triangulation of Stochastically Generated Three Dimensional Discrete Fracture Networks: A Feature Rejection Algorithm for Meshing Strategy journal January 2014
Optimal Transport: Theory and Applications book August 2014
Functional data analysis of the dynamics of the monthly index of nondurable goods production journal March 2002
Outlier detection in functional data by depth measures, with application to identify abnormal NOx levels journal August 2007
A functional analysis of NOx levels: location and scale estimation and outlier detection journal March 2007
Inference for Density Families Using Functional Principal Component Analysis journal June 2001
Rainbow Plots, Bagplots, and Boxplots for Functional Data journal January 2010