When and why PINNs fail to train: A neural tangent kernel perspective
Journal Article
·
· Journal of Computational Physics
- University of Pennsylvania, Philadelphia, PA (United States); OSTI
- University of Pennsylvania, Philadelphia, PA (United States)
Physics-informed neural networks (PINNs) have lately received great attention thanks to their flexibility in tackling a wide range of forward and inverse problems involving partial differential equations. However, despite their noticeable empirical success, little is known about how such constrained neural networks behave during their training via gradient descent. More importantly, even less is known about why such models sometimes fail to train at all. Here in this work, we aim to investigate these questions through the lens of the Neural Tangent Kernel (NTK); a kernel that captures the behavior of fully-connected neural networks in the infinite width limit during training via gradient descent. Specifically, we derive the NTK of PINNs and prove that, under appropriate conditions, it converges to a deterministic kernel that stays constant during training in the infinite-width limit. This allows us to analyze the training dynamics of PINNs through the lens of their limiting NTK and find a remarkable discrepancy in the convergence rate of the different loss components contributing to the total training error. To address this fundamental pathology, we propose a novel gradient descent algorithm that utilizes the eigenvalues of the NTK to adaptively calibrate the convergence rate of the total training error. Finally, we perform a series of numerical experiments to verify the correctness of our theory and the practical effectiveness of the proposed algorithms.
- Research Organization:
- Raytheon Technologies Corporation, Waltham, MA (United States); University of Pennsylvania, Philadelphia, PA (United States)
- Sponsoring Organization:
- Air Force Office of Scientific Research (AFOSR); USDOE Advanced Research Projects Agency - Energy (ARPA-E)
- Grant/Contract Number:
- AR0001201; SC0019116
- OSTI ID:
- 1977272
- Journal Information:
- Journal of Computational Physics, Journal Name: Journal of Computational Physics Journal Issue: C Vol. 449; ISSN 0021-9991
- Publisher:
- ElsevierCopyright Statement
- Country of Publication:
- United States
- Language:
- English
Similar Records
On the eigenvector bias of Fourier feature networks: From regression to solving multi-scale PDEs with physics-informed neural networks
Infinite neural network quantum states: entanglement and training dynamics
Feature learning and generalization in deep networks with orthogonal weights
Journal Article
·
Tue Jun 01 20:00:00 EDT 2021
· Computer Methods in Applied Mechanics and Engineering
·
OSTI ID:1976965
Infinite neural network quantum states: entanglement and training dynamics
Journal Article
·
Fri Jun 02 20:00:00 EDT 2023
· Machine Learning: Science and Technology
·
OSTI ID:2425540
Feature learning and generalization in deep networks with orthogonal weights
Journal Article
·
Wed Aug 06 20:00:00 EDT 2025
· Machine Learning: Science and Technology
·
OSTI ID:2575064