Skip to main content
U.S. Department of Energy
Office of Scientific and Technical Information

Dimensionality reduction using elastic measures

Journal Article · · Stat
DOI:https://doi.org/10.1002/sta4.551· OSTI ID:2311587
 [1];  [2];  [3]
  1. Sandia National Lab. (SNL-NM), Albuquerque, NM (United States)
  2. Boeing, Albuquerque, NM (United States)
  3. Moffitt Cancer Center, Tampa, FL (United States)

With the recent surge in big data analytics for hyperdimensional data, there is a renewed interest in dimensionality reduction techniques. In order for these methods to improve performance gains and understanding of the underlying data, a proper metric needs to be identified. This step is often overlooked, and metrics are typically chosen without consideration of the underlying geometry of the data. Here, in this paper, we present a method for incorporating elastic metrics into the t-distributed stochastic neighbour embedding (t-SNE) and Uniform Manifold Approximation and Projection (UMAP). We apply our method to functional data, which is uniquely characterized by rotations, parameterization and scale. If these properties are ignored, they can lead to incorrect analysis and poor classification performance. Through our method, we demonstrate improved performance on shape identification tasks for three benchmark data sets (MPEG-7, Car data set and Plane data set of Thankoor), where we achieve 0.77, 0.95 and 1.00 F1 score, respectively.

Research Organization:
Sandia National Laboratories (SNL-NM), Albuquerque, NM (United States)
Sponsoring Organization:
USDOE Laboratory Directed Research and Development (LDRD) Program; USDOE National Nuclear Security Administration (NNSA)
Grant/Contract Number:
NA0003525
OSTI ID:
2311587
Alternate ID(s):
OSTI ID: 1970420
Report Number(s):
SAND--2023-04438J
Journal Information:
Stat, Journal Name: Stat Journal Issue: 1 Vol. 12; ISSN 2049-1573
Publisher:
WileyCopyright Statement
Country of Publication:
United States
Language:
English

References (13)

Statistical Shape Analysis, with Applications in R book September 2016
Functional and Shape Data Analysis book January 2016
Embedding to reference t-SNE space addresses batch effects in single-cell classification journal August 2021
Generative models for functional data using phase and amplitude separation journal May 2013
Stochastic neighbor embedding (SNE) for dimension reduction and visualization using arbitrary divergences journal August 2012
Random Forests journal January 2001
Bayesian sensitivity analysis with the Fisher–Rao metric journal July 2015
Hidden Markov Model-Based Weighted Likelihood Discriminant for 2-D Shape Classification journal November 2007
Nearest neighbor pattern classification journal January 1967
Shape Analysis of Elastic Curves in Euclidean Spaces journal July 2011
Statistical analysis of trajectories on Riemannian manifolds: Bird migration, hurricane tracking and video surveillance journal March 2014
machine. journal October 2001
UMAP: Uniform Manifold Approximation and Projection journal September 2018

Similar Records

Improved Treatment of Host-galaxy Correlations in Cosmological Analyses with Type Ia Supernovae
Journal Article · Mon May 24 00:00:00 EDT 2021 · The Astrophysical Journal · OSTI ID:1821611