Scalable Information Fusion for Fault Tolerance in Large-Scale HPC.
Conference · OSTI ID: 1142166
Abstract not provided.
- Research Organization: Sandia National Lab. (SNL-CA), Livermore, CA (United States); Sandia National Lab. (SNL-NM), Albuquerque, NM (United States)
- Sponsoring Organization: USDOE National Nuclear Security Administration (NNSA)
- DOE Contract Number: AC04-94AL85000
- OSTI ID: 1142166
- Report Number(s): SAND2009-6080C; 507684
- Resource Relation: Conference: Proposed for presentation at the SIAM Conference on Parallel Processing and Scientific Computing held February 24-26, 2010 in Seattle, WA.
- Country of Publication: United States
- Language: English
Similar Records
Scalable Information Fusion for Fault Tolerance in Large-Scale HPC.
Conference · February 1, 2010 · OSTI ID:1142166
Probabilistic Approaches for Fault-Tolerance and Scalability in Extreme-Scale Computing.
Conference · February 1, 2014 · OSTI ID:1142166
Evaluation of Simple Causal Message Logging for Large-Scale Fault Tolerant HPC Systems
Conference · February 25, 2011 · OSTI ID:1142166