Disk Failure Dataset from the Campaign Storage System
- ORNL-OLCF
This dataset consists of 1,389 disk (HDD) failure events collected from the Campaign storage system at LANL. The Campaign system supported various compute platforms throughout its lifespan, including Cielo, Fire, Ice, and notably, the Trinity supercomputer. Each recorded event includes its detection timestamp (in ISO 8601 format) and details such as its location within the storage systemârack, enclosure, and drive slot number. The data, spanning from May 4, 2021, to July 25, 2023 (2 years, 2 months, and 22 days), represents failure events from the terminal years of Campaignâs operational period, accounting for 26% of its total operational time.
- Research Organization:
- Oak Ridge National Laboratory (ORNL), Oak Ridge, TN (United States). Oak Ridge Leadership Computing Facility (OLCF); Los Alamos National Laboratory (LANL), Los Alamos, NM (United States)
- Sponsoring Organization:
- Office of Science (SC)
- Contributing Organization:
- Oak Ridge National Laboratory, Los Alamos National Laboratory
- DOE Contract Number:
- AC05-00OR22725
- OSTI ID:
- 2446874
- Country of Publication:
- United States
- Language:
- English
Similar Records
Collection of Disk Failure Events from Alpine, the Parallel File System for Summit Supercomputer
Alpine disk failure dataset
Cielo-to-Trinity Storage Infrastructure
Dataset
·
Thu Sep 19 00:00:00 EDT 2024
·
OSTI ID:2441482
Alpine disk failure dataset
Dataset
·
Mon May 23 00:00:00 EDT 2022
·
OSTI ID:1868941
Cielo-to-Trinity Storage Infrastructure
Technical Report
·
Thu Jul 21 00:00:00 EDT 2016
·
OSTI ID:1291260