Skip to main content
U.S. Department of Energy
Office of Scientific and Technical Information

Probabilistic partition of unity networks for high–dimensional regression problems

Journal Article · · International Journal for Numerical Methods in Engineering
DOI:https://doi.org/10.1002/nme.7207· OSTI ID:2305532
 [1];  [2];  [3];  [4]
  1. Stanford University, CA (United States); Stanford University
  2. Sandia National Laboratories (SNL-NM), Albuquerque, NM (United States)
  3. Sandia National Laboratories (SNL-CA), Livermore, CA (United States)
  4. Stanford University, CA (United States)

We explore the probabilistic partition of unity network (PPOU-Net) model in the context of high-dimensional regression problems and propose a general framework focusing on adaptive dimensionality reduction. With the proposed framework, the target function is approximated by a mixture of experts model on a low-dimensional manifold, where each cluster is associated with a fixed-degree polynomial. We present a training strategy that leverages the expectation maximization (EM) algorithm. During the training, we alternate between (i) applying gradient descent to update the DNN coefficients; and (ii) using closed-form formulae derived from the EM algorithm to update the mixture of experts model parameters. Under the probabilistic formulation, step (ii) admits the form of embarrassingly paralleliazable weighted least-squares solves. The PPOU-Nets consistently outperform the baseline fully-connected neural networks of comparable sizes in numerical experiments of various data dimensions. Here, we also explore the proposed model in applications of quantum computing, where the PPOU-Nets act as surrogate models for cost landscapes associated with variational quantum circuits.

Research Organization:
Stanford University, CA (United States)
Sponsoring Organization:
USDOE National Nuclear Security Administration (NNSA); USDOE Office of Science (SC), Advanced Scientific Computing Research (ASCR)
Grant/Contract Number:
SC0019205; NA0003525
OSTI ID:
2305532
Journal Information:
International Journal for Numerical Methods in Engineering, Journal Name: International Journal for Numerical Methods in Engineering Journal Issue: 10 Vol. 124; ISSN 0029-5981
Publisher:
WileyCopyright Statement
Country of Publication:
United States
Language:
English

References (48)

Mixture of experts: a literature survey journal May 2012
Why and when can deep-but not shallow-networks avoid the curse of dimensionality: A review journal March 2017
Learning generative neural networks with physics knowledge journal May 2022
Convergence results for the EM approach to mixtures of experts architectures journal January 1995
Filling the gaps: Gaussian mixture models from noisy, truncated or incomplete samples journal October 2018
Non-intrusive low-rank separated approximation of high-dimensional stochastic models journal August 2013
Data-driven computational mechanics journal June 2016
Exploiting active subspaces to quantify uncertainty in the numerical simulation of the HyShot II scramjet journal December 2015
Bayesian deep convolutional encoder–decoder networks for surrogate modeling and uncertainty quantification journal August 2018
Gradient-based optimization for regression in the functional tensor-train format journal December 2018
DGM: A deep learning algorithm for solving partial differential equations journal December 2018
Deep UQ: Learning deep neural network surrogate models for high dimensional uncertainty quantification journal December 2018
Physics-informed neural networks: A deep learning framework for solving forward and inverse problems involving nonlinear partial differential equations journal February 2019
Physics-constrained deep learning for high-dimensional surrogate modeling and uncertainty quantification without labeled data journal October 2019
Learning constitutive relations from indirect observations using deep neural networks journal September 2020
Physics constrained learning for data-driven inverse modeling from sparse observations journal March 2022
A unified deep artificial neural network approach to partial differential equations in complex geometries journal November 2018
Man vs. computer: Benchmarking machine learning algorithms for traffic sign recognition journal August 2012
Error bounds for approximations with deep ReLU networks journal October 2017
Kernel PCA for novelty detection journal March 2007
Reliability analysis of high-dimensional models using low-rank tensor approximations journal October 2016
Deep learning for high-dimensional reliability analysis journal May 2020
Radial Basis Functions book January 2009
Solving parametric PDE problems with artificial neural networks journal July 2020
Physics-informed machine learning journal May 2021
Learning nonlinear operators via DeepONet based on the universal approximation theorem of operators journal March 2021
Evaluation of machine learning algorithms for prediction of regions of high Reynolds averaged Navier Stokes uncertainty journal August 2015
Solving high-dimensional partial differential equations using deep learning journal August 2018
Machine learning classification can reduce false positives in structure-based virtual screening journal July 2020
Splines in Statistics journal June 1983
Variational Inference: A Review for Statisticians journal July 2016
Artificial neural networks for solving ordinary and partial differential equations journal January 1998
Distributed Signal Processing via Chebyshev Polynomial Approximation journal December 2018
Reducing the Dimensionality of Data with Neural Networks journal July 2006
A Sparse Grid Stochastic Collocation Method for Partial Differential Equations with Random Input Data journal January 2008
Active Subspaces book January 2015
Approximation Theory and Approximation Practice, Extended Edition book January 2019
Spectral Tensor-Train Decomposition journal January 2016
Deep learning in high dimension: Neural network expression rates for generalized polynomial chaos expansions in UQ journal December 2018
Deep ReLU networks and high-order finite element methods journal February 2020
Adaptive Mixtures of Local Experts journal February 1991
Hierarchical Mixtures of Experts and the EM Algorithm journal March 1994
Lebesgue functions and Lebesgue constants in polynomial interpolation journal March 2016
Projection-Based Approximation and a Duality with Kernel Methods journal March 1989
An Introduction to Variational Autoencoders journal January 2019
Extending Classical Surrogate Modeling to high Dimensions Through Supervised Dimensionality Reduction: a Data-Driven Approach journal January 2020
Workshop Report on Basic Research Needs for Scientific Machine Learning: Core Technologies for Artificial Intelligence report February 2019
Active Manifold and Model-Order Reduction to Accelerate Multidisciplinary Analysis and Optimization journal November 2021