DOE Data Explorer title logo U.S. Department of Energy
Office of Scientific and Technical Information

Title: ORBIT-2 Dataset for Scaling Exascale Vision Foundation Models for Weather and Climate Downscaling

Abstract

We release the dataset used in ORBIT-2, an exascale vision foundation model for weather and climate downscaling. The dataset was constructed by preprocessing raw data into paired low-resolution and high-resolution samples for model training and evaluation. Downscaling was performed using bilinear regridding following the WeatherBench2 methodology (https://weatherbench2.readthedocs.io). The dataset integrates four publicly available, open-source sources: ERA5, PRISM, DAYMET, and IMERG, obtained respectively from: • ERA5: https://weatherbench2.readthedocs.io • PRISM: https://prism.oregonstate.edu • DAYMET: https://daymet.ornl.gov • IMERG: https://gpm.nasa.gov/data/imerg Further details on the dataset construction and usage are provided in: Wang, Xiao, et al. “ORBIT-2: Scaling Exascale Vision Foundation Models for Weather and Climate Downscaling.” arXiv preprint arXiv:2505.04802 (2025).


Citation Formats

Lu, Dan, Wang, Xiao, Choi, Jong Youl, Tsaris, Aristeidis, and Ashfaq, Moetasim. ORBIT-2 Dataset for Scaling Exascale Vision Foundation Models for Weather and Climate Downscaling. United States: N. p., 2025. Web. doi:10.13139/OLCF/2589526.
Lu, Dan, Wang, Xiao, Choi, Jong Youl, Tsaris, Aristeidis, & Ashfaq, Moetasim. ORBIT-2 Dataset for Scaling Exascale Vision Foundation Models for Weather and Climate Downscaling. United States. doi:https://doi.org/10.13139/OLCF/2589526
Lu, Dan, Wang, Xiao, Choi, Jong Youl, Tsaris, Aristeidis, and Ashfaq, Moetasim. 2025. "ORBIT-2 Dataset for Scaling Exascale Vision Foundation Models for Weather and Climate Downscaling". United States. doi:https://doi.org/10.13139/OLCF/2589526. https://www.osti.gov/servlets/purl/3002026. Pub date:Mon Sep 01 04:00:00 UTC 2025
@article{osti_3002026,
title = {ORBIT-2 Dataset for Scaling Exascale Vision Foundation Models for Weather and Climate Downscaling},
author = {Lu, Dan and Wang, Xiao and Choi, Jong Youl and Tsaris, Aristeidis and Ashfaq, Moetasim},
abstractNote = {We release the dataset used in ORBIT-2, an exascale vision foundation model for weather and climate downscaling. The dataset was constructed by preprocessing raw data into paired low-resolution and high-resolution samples for model training and evaluation. Downscaling was performed using bilinear regridding following the WeatherBench2 methodology (https://weatherbench2.readthedocs.io). The dataset integrates four publicly available, open-source sources: ERA5, PRISM, DAYMET, and IMERG, obtained respectively from: • ERA5: https://weatherbench2.readthedocs.io • PRISM: https://prism.oregonstate.edu • DAYMET: https://daymet.ornl.gov • IMERG: https://gpm.nasa.gov/data/imerg Further details on the dataset construction and usage are provided in: Wang, Xiao, et al. “ORBIT-2: Scaling Exascale Vision Foundation Models for Weather and Climate Downscaling.” arXiv preprint arXiv:2505.04802 (2025).},
doi = {10.13139/OLCF/2589526},
journal = {},
number = ,
volume = ,
place = {United States},
year = {Mon Sep 01 04:00:00 UTC 2025},
month = {Mon Sep 01 04:00:00 UTC 2025}
}