Skip to main content
U.S. Department of Energy
Office of Scientific and Technical Information

B-DeepONet: An enhanced Bayesian DeepONet for solving noisy parametric PDEs using accelerated replica exchange SGLD

Journal Article · · Journal of Computational Physics
 [1];  [2];  [2]
  1. Purdue Univ., West Lafayette, IN (United States); OSTI
  2. Purdue Univ., West Lafayette, IN (United States)
Here, the Deep Operator Network (DeepONet) is a neural network architecture used to approximate operators, including the solution operator of parametric PDEs. DeepONets have shown remarkable approximation ability. However, the performance of DeepONets deteriorates when the training data is polluted with noise, a scenario that occurs in practice. To handle noisy data, we propose a Bayesian DeepONet based on replica exchange Langevin diffusion (reLD). Replica exchange uses two particles. The first particle trains a DeepONet to exploit the loss landscape and make predictions. The other particle trains a different DeepONet to explore the loss landscape and escape local minima via swapping. Compared to DeepONets trained with state-of-the-art gradient-based algorithms (e.g., Adam), the proposed Bayesian DeepONet greatly improves the training convergence for noisy scenarios and accurately estimates the uncertainty. To further reduce the high computational cost of the reLD training of DeepONets, we propose (1) an accelerated training framework that exploits the DeepONet's architecture to reduce its computational cost up to 25% without compromising performance and (2) a transfer learning strategy that accelerates training DeepONets for PDEs with different parameter values. Finally, we illustrate the effectiveness of the proposed Bayesian DeepONet using four parametric PDE problems.
Research Organization:
Brookhaven National Laboratory (BNL), Upton, NY (United States); Purdue Univ., West Lafayette, IN (United States)
Sponsoring Organization:
National Science Foundation (NSF); USDOE Office of Science (SC), Advanced Scientific Computing Research (ASCR)
Grant/Contract Number:
SC0021142; SC0023161
OSTI ID:
2421766
Alternate ID(s):
OSTI ID: 1897743
Journal Information:
Journal of Computational Physics, Journal Name: Journal of Computational Physics Journal Issue: C Vol. 473; ISSN 0021-9991
Publisher:
ElsevierCopyright Statement
Country of Publication:
United States
Language:
English

References (11)

A Comprehensive Deep Learning-Based Approach to Reduced Order Modeling of Nonlinear Time-Dependent Parametrized PDEs journal April 2021
A multi-stage deep learning based algorithm for multiscale model reduction journal October 2021
Quantifying total uncertainty in physics-informed neural networks for solving forward and inverse stochastic problems journal November 2019
DeepM&Mnet: Inferring the electroconvection multiphysics fields based on operator approximation by neural networks journal July 2021
Computational multiscale methods for quasi-gas dynamic equations journal September 2021
Solving parametric PDE problems with artificial neural networks journal July 2020
Learning nonlinear operators via DeepONet based on the universal approximation theorem of operators journal March 2021
Operator learning for predicting multiscale bubble growth dynamics journal March 2021
Universal approximation to nonlinear operators by neural networks with arbitrary activation functions and its application to dynamical systems journal July 1995
Diffusion for Global Optimization in $\mathbb{R}^n $ journal May 1987
Learning Algorithms for Coarsening Uncertainty Space and Applications to Multiscale Simulations journal May 2020

Similar Records

Fed-DeepONet: Stochastic Gradient-Based Federated Training of Deep Operator Networks
Journal Article · Sun Sep 11 20:00:00 EDT 2022 · Algorithms · OSTI ID:1886960

Synergistic learning with multi-task DeepONet for efficient PDE problem solving
Journal Article · Thu Jan 02 19:00:00 EST 2025 · Neural Networks · OSTI ID:2557505