Quantifying failure prediction in large scale HPC systems: A case study.
Conference
·
OSTI ID:1141919
Abstract not provided.
- Research Organization:
- Sandia National Lab. (SNL-CA), Livermore, CA (United States); Sandia National Lab. (SNL-NM), Albuquerque, NM (United States)
- Sponsoring Organization:
- USDOE National Nuclear Security Administration (NNSA)
- DOE Contract Number:
- AC04-94AL85000
- OSTI ID:
- 1141919
- Report Number(s):
- SAND2009-6079C; 507683
- Resource Relation:
- Conference: Proposed for presentation at the 24th IEEE International Parallel and Distributed Processing Symposium held April 19-23, 2010 in Atlanta, GA.
- Country of Publication:
- United States
- Language:
- English
Similar Records
Quantifying Failure Prediction in Large Scale HPC Systems: A Case Study.
Quantifying effectiveness of failure prediction and response in HPC systems : methodology and example.
Rigorous Approaches to Quantifying Cell Failure to Enable Large-Scale Failure Modeling - Materials Mechanics and Electrochemistry.
Conference
·
Tue Sep 01 00:00:00 EDT 2009
·
OSTI ID:1141919
+6 more
Quantifying effectiveness of failure prediction and response in HPC systems : methodology and example.
Conference
·
Tue Jun 01 00:00:00 EDT 2010
·
OSTI ID:1141919
+6 more
Rigorous Approaches to Quantifying Cell Failure to Enable Large-Scale Failure Modeling - Materials Mechanics and Electrochemistry.
Conference
·
Tue Oct 01 00:00:00 EDT 2019
·
OSTI ID:1141919