Skip to main content
U.S. Department of Energy
Office of Scientific and Technical Information

Collection of Disk Failure Events from Alpine, the Parallel File System for Summit Supercomputer

Dataset ·
This dataset contains disk (HDD) failure events collected from the Alpine storage system of the Summit supercomputer, hosted at OLCF, spanning from January 4, 2019, to December 21, 2023 (a total of 4 years, 11 months, and 18 days), covering 89% of its operational lifetime. It includes 3,766 disk failure events, each recorded with its detection timestamp (in ISO 8601 format) and detailed by its location within the storage system—rack, enclosure, and drive slot number.
Research Organization:
Oak Ridge National Laboratory (ORNL), Oak Ridge, TN (United States). Oak Ridge Leadership Computing Facility (OLCF); Oak Ridge National Laboratory (ORNL), Oak Ridge, TN (United States)
Sponsoring Organization:
Office of Science (SC)
Contributing Organization:
Oak Ridge National Laboratory
DOE Contract Number:
AC05-00OR22725
OSTI ID:
2441482
Country of Publication:
United States
Language:
English

Similar Records

Alpine disk failure dataset
Dataset · Mon May 23 00:00:00 EDT 2022 · OSTI ID:1868941

Disk Failure Dataset from the Campaign Storage System
Dataset · Wed Sep 25 00:00:00 EDT 2024 · OSTI ID:2446874

Scaling the Summit: Deploying the World's Fastest Supercomputer
Conference · Sat Jun 01 00:00:00 EDT 2019 · OSTI ID:1561654