Temporal Subsampling Diminishes Small Spatial Scales in Recurrent Neural Network Emulators of Geophysical Turbulence
- University of Colorado, Boulder, CO (United States); National Oceanic and Atmospheric Administration (NOAA), Boulder, CO (United States)
- University of Colorado, Boulder, CO (United States); Sofar Ocean, San Francisco, CA (United States)
- University of California San Diego, La Jolla, CA (United States)
- Pacific Northwest National Laboratory (PNNL), Richland, WA (United States)
The immense computational cost of traditional numerical weather and climate models has sparked the development of machine learning (ML) based emulators. Because ML methods benefit from long records of training data, it is common to use data sets that are temporally subsampled relative to the time steps required for the numerical integration of differential equations. Here, we investigate how this often overlooked processing step affects the quality of an emulator's predictions. We implement two ML architectures from a class of methods called reservoir computing: (a) a form of Nonlinear Vector Autoregression (NVAR), and (b) an Echo State Network (ESN). Despite their simplicity, it is well documented that these architectures excel at predicting low dimensional chaotic dynamics. We are therefore motivated to test these architectures in an idealized setting of predicting high dimensional geophysical turbulence as represented by Surface Quasi-Geostrophic dynamics. In all cases, subsampling the training data consistently leads to an increased bias at small spatial scales that resembles numerical diffusion. Interestingly, the NVAR architecture becomes unstable when the temporal resolution is increased, indicating that the polynomial based interactions are insufficient at capturing the detailed nonlinearities of the turbulent flow. The ESN architecture is found to be more robust, suggesting a benefit to the more expensive but more general structure. Spectral errors are reduced by including a penalty on the kinetic energy density spectrum during training, although the subsampling related errors persist. Future work is warranted to understand how the temporal resolution of training data affects other ML architectures.
- Research Organization:
- Pacific Northwest National Laboratory (PNNL), Richland, WA (United States)
- Sponsoring Organization:
- USDOE; National Oceanic and Atmospheric Administration (NOAA); US Department of the Navy, Office of Naval Research (ONR)
- Grant/Contract Number:
- AC05-76RL01830; NA20OAR4600277; N00014-19-1-2522; N00014-20-1-2580
- OSTI ID:
- 2278630
- Report Number(s):
- PNNL-SA-183691
- Journal Information:
- Journal of Advances in Modeling Earth Systems, Vol. 15, Issue 12; ISSN 1942-2466
- Publisher:
- American Geophysical Union (AGU)Copyright Statement
- Country of Publication:
- United States
- Language:
- English
Similar Records
ClimSim: A large multi-scale dataset for hybrid physics-ML climate emulation
Sampling strategies for subsampled segmented EPI PRF thermometry in MR guided high intensity focused ultrasound