skip to main content
OSTI.GOV title logo U.S. Department of Energy
Office of Scientific and Technical Information

Title: Deep Kronecker neural networks: A general framework for neural networks with adaptive activation functions

Journal Article · · Neurocomputing
 [1];  [1];  [2];  [1]
  1. Brown University, Providence, RI (United States)
  2. Harvard University, Cambridge, MA (United States)

Here we propose a new type of neural networks, Kronecker neural networks (KNNs), that form a general framework for neural networks with adaptive activation functions. KNNs employ the Kronecker product, which provides an efficient way of constructing a very wide network while keeping the number of parameters low. Our theoretical analysis reveals that under suitable conditions, KNNs induce a faster decay of the loss than that by the feed-forward networks. This is also empirically verified through a set of computational examples. Furthermore, under certain technical assumptions, we establish global convergence of gradient descent for KNNs. As a specific case, we propose the Rowdy activation function that is designed to get rid of any saturation region by injecting sinusoidal fluctuations, which include trainable parameters. The proposed Rowdy activation function can be employed in any neural network architecture like feed-forward neural networks, Recurrent neural networks, Convolutional neural networks etc. The effectiveness of KNNs with Rowdy activation is demonstrated through various computational experiments including function approximation using feed-forward neural networks, solution inference of partial differential equations using the physics-informed neural networks, and standard deep learning benchmark problems using convolutional and fully-connected neural networks.

Research Organization:
Brown Univ., Providence, RI (United States)
Sponsoring Organization:
USDOE Office of Science (SC); US Air Force Office of Scientific Research (AFOSR)
Grant/Contract Number:
SC0019453; FA9550-20–1-0358
OSTI ID:
1977480
Alternate ID(s):
OSTI ID: 1827465
Journal Information:
Neurocomputing, Vol. 468, Issue C; ISSN 0925-2312
Publisher:
ElsevierCopyright Statement
Country of Publication:
United States
Language:
English

References (13)

Adaptive activation functions accelerate convergence in deep and physics-informed neural networks journal March 2020
Deep learning journal May 2015
Physics-informed neural networks for high-speed flows journal March 2020
Physics-Informed Neural Network for Ultrasound Nondestructive Quantification of Surface Breaking Cracks journal August 2020
Conservative physics-informed neural networks on discrete domains for conservation laws: Applications to forward and inverse problems journal June 2020
hp-VPINNs: Variational physics-informed neural networks with domain decomposition journal February 2021
Gradient-based learning applied to document recognition journal January 1998
Extended Physics-Informed Neural Networks (XPINNs): A Generalized Space-Time Domain Decomposition Based Deep Learning Framework for Nonlinear Partial Differential Equations journal June 2020
Physics-informed neural networks: A deep learning framework for solving forward and inverse problems involving nonlinear partial differential equations journal February 2019
Learning nonlinear operators via DeepONet based on the universal approximation theorem of operators journal March 2021
Locally adaptive activation functions with slope recovery for deep and physics-informed neural networks
  • Jagtap, Ameya D.; Kawaguchi, Kenji; Em Karniadakis, George
  • Proceedings of the Royal Society A: Mathematical, Physical and Engineering Sciences, Vol. 476, Issue 2239 https://doi.org/10.1098/rspa.2020.0334
journal July 2020
The Neural Network Zoo journal January 2020
Flow over an espresso cup: inferring 3-D velocity and pressure fields from tomographic background oriented Schlieren via physics-informed neural networks journal March 2021

Similar Records

Gradient-enhanced physics-informed neural networks for forward and inverse PDE problems
Journal Article · Fri Mar 18 00:00:00 EDT 2022 · Computer Methods in Applied Mechanics and Engineering · OSTI ID:1977480

Scalable algorithms for physics-informed neural and graph networks
Journal Article · Wed Jun 29 00:00:00 EDT 2022 · Data-Centric Engineering · OSTI ID:1977480

Multi-fidelity Bayesian neural networks: Algorithms and applications
Journal Article · Fri Apr 16 00:00:00 EDT 2021 · Journal of Computational Physics · OSTI ID:1977480