Collection of Disk Failure Events from Alpine, the Parallel File System for Summit Supercomputer
- ORNL-OLCF
This dataset contains disk (HDD) failure events collected from the Alpine storage system of the Summit supercomputer, hosted at OLCF, spanning from January 4, 2019, to December 21, 2023 (a total of 4 years, 11 months, and 18 days), covering 89% of its operational lifetime. It includes 3,766 disk failure events, each recorded with its detection timestamp (in ISO 8601 format) and detailed by its location within the storage systemârack, enclosure, and drive slot number.
- Research Organization:
- Oak Ridge National Laboratory (ORNL), Oak Ridge, TN (United States). Oak Ridge Leadership Computing Facility (OLCF); Oak Ridge National Laboratory (ORNL), Oak Ridge, TN (United States)
- Sponsoring Organization:
- Office of Science (SC)
- Contributing Organization:
- Oak Ridge National Laboratory
- DOE Contract Number:
- AC05-00OR22725
- OSTI ID:
- 2441482
- Country of Publication:
- United States
- Language:
- English
Similar Records
Alpine disk failure dataset
Disk Failure Dataset from the Campaign Storage System
Scaling the Summit: Deploying the World's Fastest Supercomputer
Dataset
·
Mon May 23 00:00:00 EDT 2022
·
OSTI ID:1868941
Disk Failure Dataset from the Campaign Storage System
Dataset
·
Wed Sep 25 00:00:00 EDT 2024
·
OSTI ID:2446874
Scaling the Summit: Deploying the World's Fastest Supercomputer
Conference
·
Sat Jun 01 00:00:00 EDT 2019
·
OSTI ID:1561654