Skip to main content
U.S. Department of Energy
Office of Scientific and Technical Information

A comprehensive and fair comparison of two neural operators (with practical extensions) based on $$\mathrm{FAIR}$$ data

Journal Article · · Computer Methods in Applied Mechanics and Engineering
 [1];  [2];  [2];  [3];  [2];  [4];  [2]
  1. University of Pennsylvania, Philadelphia, PA (United States); OSTI
  2. Brown University, Providence, RI (United States)
  3. Xiamen University Malaysia, Sepeng (Malaysia)
  4. Worcester Polytechnic Institute, MA (United States)

Neural operators can learn nonlinear mappings between function spaces and offer a new simulation paradigm for real-time prediction of complex dynamics for realistic diverse applications as well as for system identification in science and engineering. Herein, we investigate the performance of two neural operators, which have shown promising results so far, and we develop new practical extensions that will make them more accurate and robust and importantly more suitable for industrial-complexity applications. The first neural operator, DeepONet, was published in 2019 (Lu et al., 2019), and its original architecture was based on the universal approximation theorem of Chen & Chen (1995). The second one, named Fourier Neural Operator or FNO, was published in 2020, and it is based on parameterizing the integral kernel in the Fourier space. DeepONet is represented by a summation of products of neural networks (NNs), corresponding to the branch NN for the input function and the trunk NN for the output function; both NNs are general architectures, e.g., the branch NN can be replaced with a CNN or a ResNet. According to Kovachki et al. (2021), FNO in its continuous form can be viewed conceptually as a DeepONet with a specific architecture of the branch NN and a trunk NN represented by a trigonometric basis. In order to compare FNO with DeepONet computationally for realistic setups, we develop several extensions of FNO that can deal with complex geometric domains as well as mappings where the input and output function spaces are of different dimensions. We also develop an extended DeepONet with special features that provide inductive bias and accelerate training, and we present a faster implementation of DeepONet with cost comparable to the computational cost of FNO, which is based on the Fast Fourier Transform. Here we consider 16 different benchmarks to demonstrate the relative performance of the two neural operators, including instability wave analysis in hypersonic boundary layers, prediction of the vorticity field of a flapping airfoil, porous media simulations in complex-geometry domains, etc. We follow the guiding principles of FAIR (Findability, Accessibility, Interoperability, and Reusability) for scientific data management and stewardship. The performance of DeepONet and FNO is comparable for relatively simple settings, but for complex geometries the performance of FNO deteriorates greatly. We also compare theoretically the two neural operators and obtain similar error estimates for DeepONet and FNO under the same regularity assumptions.

Research Organization:
Brown University, Providence, RI (United States)
Sponsoring Organization:
USDOE Office of Science (SC); Defense Advanced Research Projects Agency (DARPA); Air Force Office of Scientific Research (AFOSR)
Grant/Contract Number:
SC0019453
OSTI ID:
1976975
Journal Information:
Computer Methods in Applied Mechanics and Engineering, Journal Name: Computer Methods in Applied Mechanics and Engineering Journal Issue: C Vol. 393; ISSN 0045-7825
Publisher:
ElsevierCopyright Statement
Country of Publication:
United States
Language:
English

References (36)

Approximation by superpositions of a sigmoidal function journal December 1989
Multilayer feedforward networks are universal approximators journal January 1989
PPINN: Parareal physics-informed neural network for time-dependent PDEs journal October 2020
A physics-informed operator regression framework for extracting data-driven continuum models journal January 2021
Data-driven learning of nonlocal physics from high-fidelity synthetic data journal February 2021
Bayesian neural networks for uncertainty quantification in data-driven materials modeling journal December 2021
A physics-informed variational DeepONet for predicting crack path in quasi-brittle materials journal March 2022
High-order well-balanced schemes and applications to non-equilibrium flow journal October 2009
Positivity-preserving high order discontinuous Galerkin schemes for compressible Euler equations with source terms journal February 2011
Quantifying total uncertainty in physics-informed neural networks for solving forward and inverse stochastic problems journal November 2019
A composite neural network that learns from multi-fidelity data: Application to function approximation and inverse PDE problems journal January 2020
B-PINNs: Bayesian physics-informed neural networks for forward and inverse PDE problems with noisy data journal January 2021
A method for representing periodic functions and enforcing exactly periodic boundary conditions with deep neural networks journal June 2021
DeepM&Mnet: Inferring the electroconvection multiphysics fields based on operator approximation by neural networks journal July 2021
Multi-fidelity Bayesian neural networks: Algorithms and applications journal August 2021
DeepM&Mnet for hypersonics: Predicting the coupled flow and finite-rate chemistry behind a normal shock using neural-network approximation of operators journal December 2021
Learning functional priors and posteriors from data and physics journal May 2022
A seamless multiscale operator neural network for inferring bubble dynamics journal October 2021
Physics-informed machine learning journal May 2021
Learning nonlinear operators via DeepONet based on the universal approximation theorem of operators journal March 2021
Generalizing universal function approximators journal March 2021
The FAIR Guiding Principles for scientific data management and stewardship journal March 2016
Operator learning for predicting multiscale bubble growth dynamics journal March 2021
The unreasonable effectiveness of deep learning in artificial intelligence journal January 2020
Extraction of mechanical properties of materials through deep learning from instrumented indentation journal March 2020
Non-equilibrium extrapolation method for velocity and pressure boundary conditions in the lattice Boltzmann method journal March 2002
Multiple-relaxation-time lattice Boltzmann model for incompressible miscible flow with large viscosity ratio and high Péclet number journal October 2015
Universal approximation to nonlinear operators by neural networks with arbitrary activation functions and its application to dynamical systems journal July 1995
Delving Deep into Rectifiers: Surpassing Human-Level Performance on ImageNet Classification conference December 2015
fPINNs: Fractional Physics-Informed Neural Networks journal January 2019
Learning in Modal Space: Solving Time-Dependent Stochastic PDEs Using Physics-Informed Neural Networks journal January 2020
DeepXDE: A Deep Learning Library for Solving Differential Equations journal January 2021
Physics-Informed Neural Networks with Hard Constraints for Inverse Design journal January 2021
Systems biology informed deep learning for inferring parameters and hidden dynamics journal November 2020
Extended Physics-Informed Neural Networks (XPINNs): A Generalized Space-Time Domain Decomposition Based Deep Learning Framework for Nonlinear Partial Differential Equations journal June 2020
Dying ReLU and Initialization: Theory and Numerical Examples journal June 2020

Similar Records

Learning nonlinear operators via DeepONet based on the universal approximation theorem of operators
Journal Article · Thu Mar 18 00:00:00 EDT 2021 · Nature Machine Intelligence · OSTI ID:2281727

Approximation rates of DeepONets for learning operators arising from advection–diffusion equations
Journal Article · Sat Jun 25 00:00:00 EDT 2022 · Neural Networks · OSTI ID:1977482

DeepM&Mnet: Inferring the electroconvection multiphysics fields based on operator approximation by neural networks
Journal Article · Mon Mar 22 00:00:00 EDT 2021 · Journal of Computational Physics · OSTI ID:2282980