U.S. Department of Energy
Office of Scientific and Technical Information

Streamlining Ocean Dynamics Modeling with Fourier Neural Operators: A Multiobjective Hyperparameter and Architecture Optimization Approach

Journal Article · Mathematics
DOI: https://doi.org/10.3390/math12101483 · OSTI ID: 2477212
 [1];  [2];  [3];  [1];  [4];  [5]
  1. Argonne National Laboratory (ANL), Argonne, IL (United States)
  2. Florida State Univ., Tallahassee, FL (United States)
  3. Argonne National Laboratory (ANL), Argonne, IL (United States); Univ. Paris-Saclay, Gif-sur-Yvette (France)
  4. Los Alamos National Laboratory (LANL), Los Alamos, NM (United States)
  5. Oak Ridge National Laboratory (ORNL), Oak Ridge, TN (United States)
Training an effective deep learning model to learn ocean processes requires careful choice of many hyperparameters. We leverage DeepHyper’s advanced search algorithms for multiobjective optimization, streamlining the development of neural networks tailored for ocean modeling. The focus is on optimizing Fourier neural operators (FNOs), a class of data-driven models capable of simulating complex ocean behaviors. Selecting the correct model and tuning the hyperparameters are challenging tasks that require considerable effort to ensure model accuracy. DeepHyper allows efficient exploration of hyperparameters associated with data preprocessing, the FNO architecture, and model training strategies. We aim to obtain an optimal set of hyperparameters leading to the most performant model. Moreover, in addition to the commonly used mean squared error for model training, we propose adopting the negative anomaly correlation coefficient as an additional loss term to improve model performance, and we investigate the potential trade-off between the two terms. The numerical experiments show that the optimal set of hyperparameters improved model performance in single-time-step forecasting and greatly exceeded the baseline configuration in the autoregressive rollout for long-horizon forecasting up to 30 days. Utilizing DeepHyper, we demonstrate an approach to enhance the use of FNOs in ocean dynamics forecasting, offering a scalable solution with improved precision.
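The abstract names two loss terms, mean squared error and the negative anomaly correlation coefficient (ACC), but this record does not give their exact formulation. The following is a minimal sketch of how such a combined loss could be computed. It assumes the uncentered form of the ACC, with anomalies taken relative to a reference climatology; the function names, the `clim` argument, and the `weight` parameter are hypothetical and are not taken from the paper.

```python
from math import sqrt


def mse(pred, target):
    """Mean squared error between two equal-length sequences."""
    return sum((p - t) ** 2 for p, t in zip(pred, target)) / len(pred)


def neg_acc(pred, target, clim):
    """Negative anomaly correlation coefficient (uncentered form, an assumption).

    Anomalies are deviations from the reference climatology `clim`.
    The ACC lies in [-1, 1], so minimizing its negative rewards forecasts
    whose anomaly pattern matches the anomaly pattern of the target.
    """
    pred_anom = [p - c for p, c in zip(pred, clim)]
    target_anom = [t - c for t, c in zip(target, clim)]
    num = sum(a * b for a, b in zip(pred_anom, target_anom))
    den = sqrt(sum(a * a for a in pred_anom) * sum(b * b for b in target_anom))
    if den == 0.0:
        # Illustrative guard: the ACC is undefined when either anomaly field is zero.
        return 0.0
    return -num / den


def combined_loss(pred, target, clim, weight=0.5):
    """MSE plus a weighted negative-ACC term; `weight` is a hypothetical knob."""
    return mse(pred, target) + weight * neg_acc(pred, target, clim)


if __name__ == "__main__":
    clim = [1.0, 1.0, 1.0]
    target = [2.0, 0.0, 1.5]
    print(combined_loss(target, target, clim))  # perfect forecast
```

In a multiobjective search, the two terms could also be reported as separate objectives rather than summed, which is one way to examine the trade-off the abstract mentions.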
Research Organization:
Argonne National Laboratory (ANL), Argonne, IL (United States)
Sponsoring Organization:
USDOE Office of Science (SC), Advanced Scientific Computing Research (ASCR). Scientific Discovery through Advanced Computing (SciDAC); USDOE Office of Science (SC), Biological and Environmental Research (BER)
Grant/Contract Number:
89233218CNA000001; AC02-06CH11357
OSTI ID:
2477212
Journal Information:
Journal Name: Mathematics; Journal Volume: 12; Journal Issue: 10; ISSN 2227-7390
Publisher:
MDPI
Country of Publication:
United States
Language:
English
