Skip to main content
U.S. Department of Energy
Office of Scientific and Technical Information

Disk Failure Dataset from the Campaign Storage System

Dataset ·

This dataset consists of 1,389 disk (HDD) failure events collected from the Campaign storage system at LANL. The Campaign system supported various compute platforms throughout its lifespan, including Cielo, Fire, Ice, and notably, the Trinity supercomputer. Each recorded event includes its detection timestamp (in ISO 8601 format) and details such as its location within the storage system—rack, enclosure, and drive slot number. The data, spanning from May 4, 2021, to July 25, 2023 (2 years, 2 months, and 22 days), represents failure events from the terminal years of Campaign’s operational period, accounting for 26% of its total operational time.

Research Organization:
Oak Ridge National Laboratory (ORNL), Oak Ridge, TN (United States). Oak Ridge Leadership Computing Facility (OLCF); Los Alamos National Laboratory (LANL), Los Alamos, NM (United States)
Sponsoring Organization:
Office of Science (SC)
Contributing Organization:
Oak Ridge National Laboratory, Los Alamos National Laboratory
DOE Contract Number:
AC05-00OR22725
OSTI ID:
2446874
Country of Publication:
United States
Language:
English

Similar Records

Collection of Disk Failure Events from Alpine, the Parallel File System for Summit Supercomputer
Dataset · Thu Sep 19 00:00:00 EDT 2024 · OSTI ID:2441482

Alpine disk failure dataset
Dataset · Mon May 23 00:00:00 EDT 2022 · OSTI ID:1868941

Cielo-to-Trinity Storage Infrastructure
Technical Report · Thu Jul 21 00:00:00 EDT 2016 · OSTI ID:1291260