Addressing Failures in Exascale Computing
Journal Article · International Journal of High Performance Computing Applications
- Argonne National Laboratory (ANL)
- Intel Corporation
- unknown
- University of Illinois, Urbana-Champaign
- Purdue University
- Lawrence Livermore National Laboratory (LLNL)
- IBM T. J. Watson Research Center
- University of Chicago
- Los Alamos National Laboratory (LANL)
- University of Southern California
- Oak Ridge National Laboratory (ORNL)
- University of Texas at Austin
- Booz Allen Hamilton
- Science Applications International Corporation (SAIC), Oak Ridge, TN
- Pacific Northwest National Laboratory (PNNL)
- AMD
- Stanford University
- HP Labs
- Sandia National Laboratories (SNL)
- ARM
We present a report produced by a workshop on 'Addressing failures in exascale computing' held in Park City, Utah, 4-11 August 2012. The charter of this workshop was to establish a common taxonomy for resilience across all levels of a computing system, to discuss existing knowledge on resilience across the hardware and software layers of an exascale system, and to build on those results by examining potential solutions from both hardware and software perspectives, with a focus on a combined approach. The workshop brought together participants with expertise in applications, system software, and hardware; they came from industry, government, and academia, and their interests ranged from theory to implementation. This combination allowed broad and comprehensive discussions and led to this document, which summarizes and builds on those discussions.
- Research Organization: Oak Ridge National Laboratory (ORNL)
- Sponsoring Organization: ORNL LDRD Director's R&D
- DOE Contract Number: AC05-00OR22725
- OSTI ID: 1128984
- Journal Information: International Journal of High Performance Computing Applications; ISSN 1094-3420
- Country of Publication: United States
- Language: English
Similar Records
Addressing failures in exascale computing
Journal Article · Thu May 01 00:00:00 EDT 2014 · International Journal of High Performance Computing Applications, 28(2):129-173 · OSTI ID:1176844

Outcomes from the DOE Workshop on Turbulent Flow Simulation at the Exascale
Conference · Fri Jun 17 00:00:00 EDT 2016 · OSTI ID:1296608

Exascale Operating Systems and Runtime Software Report
Technical Report · Thu Dec 27 23:00:00 EST 2012 · OSTI ID:1471119