Disk Failure Dataset from the Campaign Storage System
Abstract
This dataset consists of 1,389 disk (HDD) failure events collected from the Campaign storage system at LANL. The Campaign system supported various compute platforms throughout its lifespan, including Cielo, Fire, Ice, and notably, the Trinity supercomputer. Each recorded event includes its detection timestamp (in ISO 8601 format) and details such as its location within the storage system: rack, enclosure, and drive slot number. The data, spanning from May 4, 2021, to July 25, 2023 (2 years, 2 months, and 22 days), represents failure events from the terminal years of Campaign's operational period, accounting for 26% of its total operational time.
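As a rough illustration of the record layout and the figures quoted in the abstract, the sketch below models one failure event and derives a few summary numbers from the stated collection window. The field names, the `parse_event` helper, and the sample values in the usage note are hypothetical; the dataset's actual column names and file layout are not given here and may differ.

```python
from dataclasses import dataclass
from datetime import date, datetime


@dataclass(frozen=True)
class FailureEvent:
    # Hypothetical field names; the dataset's real column names may differ.
    detected_at: datetime
    rack: str
    enclosure: str
    slot: int


def parse_event(timestamp: str, rack: str, enclosure: str, slot: str) -> FailureEvent:
    """Build an event from string fields, parsing the ISO 8601 timestamp."""
    return FailureEvent(datetime.fromisoformat(timestamp), rack, enclosure, int(slot))


# Collection window and event count as stated in the abstract.
FIRST_DAY = date(2021, 5, 4)
LAST_DAY = date(2023, 7, 25)
EVENT_COUNT = 1389

# Elapsed days between the two dates (the end date itself is not counted).
span_days = (LAST_DAY - FIRST_DAY).days

# Average gap between recorded failures over the window.
mean_days_between_failures = span_days / EVENT_COUNT

# If the window is 26% of the operational period, the implied total is span / 0.26.
implied_total_years = span_days / 0.26 / 365.25
```

A call such as `parse_event("2021-05-04T12:30:00", "r1", "e2", "7")` uses made-up values purely to show the shape of a record. The implied total operational period is only an estimate derived from the rounded 26% figure, so it should not be read as an exact commissioning date.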
- Authors:
- Ransom, Garret Wilson; George, Anjus
- ORNL-OLCF
- Publication Date:
- 2024-09-25
- DOE Contract Number:
- AC05-00OR22725
- Research Org.:
- Oak Ridge National Laboratory (ORNL), Oak Ridge, TN (United States). Oak Ridge Leadership Computing Facility (OLCF); Los Alamos National Laboratory (LANL), Los Alamos, NM (United States)
- Sponsoring Org.:
- Office of Science (SC)
- Collaborations:
- Oak Ridge National Laboratory, Los Alamos National Laboratory
- Subject:
- 97 MATHEMATICS AND COMPUTING; Disk failures, Parallel File System, MarFS, Campaign, Trinity, HPC storage, Supercomputer
- OSTI Identifier:
- 2446874
- DOI:
- https://doi.org/10.13139/OLCF/2446874
Citation Formats
Ransom, Garret Wilson, and George, Anjus. Disk Failure Dataset from the Campaign Storage System. United States: N. p., 2024. Web. doi:10.13139/OLCF/2446874.
Ransom, Garret Wilson, & George, Anjus. Disk Failure Dataset from the Campaign Storage System. United States. doi:https://doi.org/10.13139/OLCF/2446874
Ransom, Garret Wilson, and George, Anjus. 2024. "Disk Failure Dataset from the Campaign Storage System". United States. doi:https://doi.org/10.13139/OLCF/2446874. https://www.osti.gov/servlets/purl/2446874. Pub date: Wed Sep 25 04:00:00 UTC 2024
@article{osti_2446874,
title = {Disk Failure Dataset from the Campaign Storage System},
author = {Ransom, Garret Wilson and George, Anjus},
abstractNote = {This dataset consists of 1,389 disk (HDD) failure events collected from the Campaign storage system at LANL. The Campaign system supported various compute platforms throughout its lifespan, including Cielo, Fire, Ice, and notably, the Trinity supercomputer. Each recorded event includes its detection timestamp (in ISO 8601 format) and details such as its location within the storage system: rack, enclosure, and drive slot number. The data, spanning from May 4, 2021, to July 25, 2023 (2 years, 2 months, and 22 days), represents failure events from the terminal years of Campaign's operational period, accounting for 26% of its total operational time.},
doi = {10.13139/OLCF/2446874},
journal = {},
number = {},
volume = {},
place = {United States},
year = {2024},
month = {9}
}
