U.S. Department of Energy
Office of Scientific and Technical Information

Defining and measuring supercomputer Reliability, Availability, and Serviceability (RAS).

Conference · OSTI ID: 948682

The absence of agreed-upon definitions and metrics for supercomputer RAS obscures meaningful discussion of the issues involved and hinders their solution. This paper seeks to foster a common basis for communication about supercomputer RAS by proposing a system state model, definitions, and measurements. These are modeled after the SEMI-E10 specification, which is widely used in the semiconductor manufacturing industry.

Research Organization:
Sandia National Laboratories
Sponsoring Organization:
USDOE
DOE Contract Number:
AC04-94AL85000
OSTI ID:
948682
Report Number(s):
SAND2005-1574C
Country of Publication:
United States
Language:
English