DOE PAGES title logo U.S. Department of Energy
Office of Scientific and Technical Information

Title: Reliable extrapolation of deep neural operators informed by physics or sparse observations

Journal Article · · Computer Methods in Applied Mechanics and Engineering

Deep neural operators can learn nonlinear mappings between infinite-dimensional function spaces via deep neural networks. As promising surrogate solvers of partial differential equations (PDEs) for real-time prediction, deep neural operators such as deep operator networks (DeepONets) provide a new simulation paradigm in science and engineering. Pure data-driven neural operators and deep learning models, in general, are usually limited to interpolation scenarios, where new predictions utilize inputs within the support of the training set. However, in the inference stage of real-world applications, the input may lie outside the support, i.e., extrapolation is required, which may result to large errors and unavoidable failure of deep learning models. Here, we address this challenge of extrapolation for deep neural operators. First, we systematically investigate the extrapolation behavior of DeepONets by quantifying the extrapolation complexity, via the 2-Wasserstein distance between two function spaces and propose a new strategy of bias–variance trade-off for extrapolation with respect to model capacity. Subsequently, we develop a complete workflow, including extrapolation determination, and we propose five reliable learning methods that guarantee a safe prediction under extrapolation by requiring additional information—the governing PDEs of the system or sparse new observations. The proposed methods are based on either fine-tuning a pre-trained DeepONet or multifidelity learning. We demonstrate the effectiveness of the proposed framework for various types of parametric PDEs. Furthermore, our systematic comparisons provide practical guidelines for selecting a proper extrapolation method depending on the available information, desired accuracy, and required inference speed.

Research Organization:
Univ. of Pennsylvania, Philadelphia, PA (United States)
Sponsoring Organization:
USDOE
Grant/Contract Number:
SC0022953
OSTI ID:
1991293
Alternate ID(s):
OSTI ID: 1972742
Journal Information:
Computer Methods in Applied Mechanics and Engineering, Vol. 412; ISSN 0045-7825
Publisher:
ElsevierCopyright Statement
Country of Publication:
United States
Language:
English

References (34)

Generalizing from a Few Examples journal June 2020
Extrapolation and interpolation in neural network classifiers journal October 1992
A composite neural network that learns from multi-fidelity data: Application to function approximation and inverse PDE problems journal January 2020
Approximation rates of DeepONets for learning operators arising from advection–diffusion equations journal September 2022
Extraction of mechanical properties of materials through deep learning from instrumented indentation journal March 2020
Physics-informed machine learning journal May 2021
Multifidelity deep neural operators for efficient learning of partial differential equations with application to fast inverse design of nanoscale heat transport journal June 2022
Multilayer feedforward networks are universal approximators journal January 1989
DeepXDE: A Deep Learning Library for Solving Differential Equations journal January 2021
A physics-informed variational DeepONet for predicting crack path in quasi-brittle materials journal March 2022
DeepM&Mnet: Inferring the electroconvection multiphysics fields based on operator approximation by neural networks journal July 2021
A physics-informed operator regression framework for extracting data-driven continuum models journal January 2021
Reconciling modern machine-learning practice and the classical bias–variance trade-off journal July 2019
Physics-informed neural networks: A deep learning framework for solving forward and inverse problems involving nonlinear partial differential equations journal February 2019
DeepM&Mnet for hypersonics: Predicting the coupled flow and finite-rate chemistry behind a normal shock using neural-network approximation of operators journal December 2021
Sigmoid-weighted linear units for neural network function approximation in reinforcement learning journal November 2018
Learning the solution operator of parametric partial differential equations with physics-informed DeepONets journal October 2021
Deep transfer learning and data augmentation improve glucose levels prediction in type 2 diabetes patients journal July 2021
Quantifying the generalization error in deep learning in terms of data distribution and neural network smoothness journal October 2020
Neural Networks and the Bias/Variance Dilemma journal January 1992
A seamless multiscale operator neural network for inferring bubble dynamics journal October 2021
Universal approximation to nonlinear operators by neural networks with arbitrary activation functions and its application to dynamical systems journal July 1995
Overcoming catastrophic forgetting in neural networks journal March 2017
A Survey on Transfer Learning journal October 2010
Deep solution operators for variational inequalities via proximal neural networks journal June 2022
Interfacing finite elements with deep neural operators for fast multiscale modeling of mechanics problems journal December 2022
A comprehensive and fair comparison of two neural operators (with practical extensions) based on FAIR data journal April 2022
Learning nonlinear operators via DeepONet based on the universal approximation theorem of operators journal March 2021
Deep double descent: where bigger models and more data hurt* journal December 2021
Deep Kronecker neural networks: A general framework for neural networks with adaptive activation functions journal January 2022
Operator learning for predicting multiscale bubble growth dynamics journal March 2021
A Comprehensive Survey on Transfer Learning journal January 2021
Predicting the output from a complex computer code when fast approximations are available journal March 2000
On a Formula for the L2 Wasserstein Metric between Measures on Euclidean and Hilbert Spaces journal January 1990