DOE PAGES title logo U.S. Department of Energy
Office of Scientific and Technical Information

Title: PCAfold 2.0—Novel tools and algorithms for low-dimensional manifold assessment and optimization

Journal Article · · SoftwareX
ORCiD logo [1];  [2];  [1];  [2]
  1. Université Libre de Bruxelles, Brussels (Belgium); BRITE: BRussels Institute for Thermal-fluid systems and clean Energy (Belgium)
  2. University of Utah, Salt Lake City, UT (United States)

We describe an update to our open-source Python package, PCAfold, designed to help researchers generate, analyze and improve low-dimensional data manifolds. In the current version, PCAfold 2.0, we introduce novel tools and algorithms for assessing and optimizing low-dimensional manifolds. This includes a method that generates a “map” of local feature sizes that can help pinpoint researchers to problematic regions on a manifold. We introduce a novel cost function that characterizes the quality of a manifold topology with a single number. We develop two algorithms for feature selection based on principal component analysis (PCA) that use the cost function as an objective function to minimize. We introduce a quantity of interest (QoI)-aware dimensionality reduction strategy where data projections are computed using an artificial neural network and are directly optimized towards representing various projection-independent and projection-dependent QoIs. We also introduce an implementation of partition of unity networks (POUnets) for efficient reconstruction of QoIs from low-dimensional manifolds based on combining neural network classification with localized polynomial regression. Our software can be broadly applicable in all domains of science and engineering that aim to reduce data dimensionality, as well as in the fundamental research on representation learning.

Research Organization:
Sandia National Laboratories (SNL-NM), Albuquerque, NM (United States)
Sponsoring Organization:
European Union’s Horizon 2020 research and innovation program; National Science Foundation (NSF); USDOE National Nuclear Security Administration (NNSA)
Grant/Contract Number:
NA0003525
OSTI ID:
2424309
Journal Information:
SoftwareX, Journal Name: SoftwareX Journal Issue: C Vol. 23; ISSN 2352-7110
Publisher:
ElsevierCopyright Statement
Country of Publication:
United States
Language:
English

References (8)

Reduced-order modeling of supersonic fuel–air mixing in a multi-strut injection scramjet engine using machine learning techniques journal January 2023
Co-optimized machine-learned manifold models for large eddy simulation of turbulent combustion journal October 2022
Manifold-informed state vector subset for reduced-order modeling journal July 2022
PCAfold: Python software to generate, analyze and improve PCA-derived low-dimensional manifolds journal July 2020
Cost function for low-dimensional manifold topology assessment journal August 2022
Accurate Compression of Tabulated Chemistry Models with Partition of Unity Networks journal August 2022
A technique for characterising feature size and quality of manifolds journal June 2021
Data-driven framework for input/output lookup tables reduction: Application to hypersonic flows in chemical nonequilibrium journal February 2023