DOE Data Explorer title logo U.S. Department of Energy
Office of Scientific and Technical Information

Title: ORBIT-2 Dataset for Scaling Exascale Vision Foundation Models for Weather and Climate Downscaling

Abstract

This dataset release corresponds to the work conducted in ORBIT-2: Scaling Exascale Vision Foundation Models for Weather and Climate Downscaling, where large-scale AI methods were applied to improve climate and weather resolution. The collection integrates four widely used, publicly available datasets: ERA5, PRISM, DAYMET, and IMERG. To prepare the data for ORBIT-2 model training and evaluation, we applied a preprocessing pipeline that generates paired low-resolution and high-resolution samples, enabling supervised downscaling experiments. The transformation from coarse to fine scales was performed using bilinear regridding, consistent with the procedures described in WeatherBench2, a community benchmark for weather and climate AI models. This dataset supports the development and evaluation of foundation models designed for weather and climate downscaling at exascale. Additional details on methodology and applications can be found in Wang et al., ORBIT-2 (arXiv:2505.04802, 2025).

Authors:
; ; ; ;
  1. Oak Ridge National Laboratory
Publication Date:
DOE Contract Number:  
AC05-00OR22725
Research Org.:
Oak Ridge National Laboratory (ORNL), Oak Ridge, TN (United States)
Sponsoring Org.:
Office of Science (SC)
Subject:
54 ENVIRONMENTAL SCIENCES; 58 GEOSCIENCES; 97 MATHEMATICS AND COMPUTING
OSTI Identifier:
2589526
DOI:
https://doi.org/10.13139/OLCF/2589526

Citation Formats

Lu, Dan, Wang, Xiao, Tsaris, Aristeidis, Choi, Jong Youl, and Ashfaq, Moetasim. ORBIT-2 Dataset for Scaling Exascale Vision Foundation Models for Weather and Climate Downscaling. United States: N. p., 2025. Web. doi:10.13139/OLCF/2589526.
Lu, Dan, Wang, Xiao, Tsaris, Aristeidis, Choi, Jong Youl, & Ashfaq, Moetasim. ORBIT-2 Dataset for Scaling Exascale Vision Foundation Models for Weather and Climate Downscaling. United States. doi:https://doi.org/10.13139/OLCF/2589526
Lu, Dan, Wang, Xiao, Tsaris, Aristeidis, Choi, Jong Youl, and Ashfaq, Moetasim. 2025. "ORBIT-2 Dataset for Scaling Exascale Vision Foundation Models for Weather and Climate Downscaling". United States. doi:https://doi.org/10.13139/OLCF/2589526. https://www.osti.gov/servlets/purl/2589526. Pub date:Fri Oct 10 04:00:00 UTC 2025
@article{osti_2589526,
title = {ORBIT-2 Dataset for Scaling Exascale Vision Foundation Models for Weather and Climate Downscaling},
author = {Lu, Dan and Wang, Xiao and Tsaris, Aristeidis and Choi, Jong Youl and Ashfaq, Moetasim},
abstractNote = {This dataset release corresponds to the work conducted in ORBIT-2: Scaling Exascale Vision Foundation Models for Weather and Climate Downscaling, where large-scale AI methods were applied to improve climate and weather resolution. The collection integrates four widely used, publicly available datasets: ERA5, PRISM, DAYMET, and IMERG. To prepare the data for ORBIT-2 model training and evaluation, we applied a preprocessing pipeline that generates paired low-resolution and high-resolution samples, enabling supervised downscaling experiments. The transformation from coarse to fine scales was performed using bilinear regridding, consistent with the procedures described in WeatherBench2, a community benchmark for weather and climate AI models. This dataset supports the development and evaluation of foundation models designed for weather and climate downscaling at exascale. Additional details on methodology and applications can be found in Wang et al., ORBIT-2 (arXiv:2505.04802, 2025).},
doi = {10.13139/OLCF/2589526},
journal = {},
number = ,
volume = ,
place = {United States},
year = {Fri Oct 10 04:00:00 UTC 2025},
month = {Fri Oct 10 04:00:00 UTC 2025}
}