skip to main content
OSTI.GOV title logo U.S. Department of Energy
Office of Scientific and Technical Information

Title: Soft Error Vulnerability of Iterative Linear Algebra Methods

Abstract

Devices are increasingly vulnerable to soft errors as their feature sizes shrink. Previously, soft error rates were significant primarily in space and high-atmospheric computing. Modern architectures now use features so small at sufficiently low voltages that soft errors are becoming important even at terrestrial altitudes. Due to their large number of components, supercomputers are particularly susceptible to soft errors. Since many large scale parallel scientific applications use iterative linear algebra methods, the soft error vulnerability of these methods constitutes a large fraction of the applications overall vulnerability. Many users consider these methods invulnerable to most soft errors since they converge from an imprecise solution to a precise one. However, we show in this paper that iterative methods are vulnerable to soft errors, exhibiting both silent data corruptions and poor ability to detect errors. Further, we evaluate a variety of soft error detection and tolerance techniques, including checkpointing, linear matrix encodings, and residual tracking techniques.

Authors:
;
Publication Date:
Research Org.:
Lawrence Livermore National Lab. (LLNL), Livermore, CA (United States)
Sponsoring Org.:
USDOE
OSTI Identifier:
944290
Report Number(s):
LLNL-CONF-400659
TRN: US200902%%653
DOE Contract Number:  
W-7405-ENG-48
Resource Type:
Conference
Resource Relation:
Conference: Presented at: International Conference on Supercomputing, Kos, Greece, Jun 07 - Jun 12, 2008
Country of Publication:
United States
Language:
English
Subject:
99 GENERAL AND MISCELLANEOUS; ALGEBRA; DETECTION; ITERATIVE METHODS; SUPERCOMPUTERS; TOLERANCE; VULNERABILITY

Citation Formats

Bronevetsky, G, and de Supinski, B. Soft Error Vulnerability of Iterative Linear Algebra Methods. United States: N. p., 2008. Web.
Bronevetsky, G, & de Supinski, B. Soft Error Vulnerability of Iterative Linear Algebra Methods. United States.
Bronevetsky, G, and de Supinski, B. Sat . "Soft Error Vulnerability of Iterative Linear Algebra Methods". United States. https://www.osti.gov/servlets/purl/944290.
@article{osti_944290,
title = {Soft Error Vulnerability of Iterative Linear Algebra Methods},
author = {Bronevetsky, G and de Supinski, B},
abstractNote = {Devices are increasingly vulnerable to soft errors as their feature sizes shrink. Previously, soft error rates were significant primarily in space and high-atmospheric computing. Modern architectures now use features so small at sufficiently low voltages that soft errors are becoming important even at terrestrial altitudes. Due to their large number of components, supercomputers are particularly susceptible to soft errors. Since many large scale parallel scientific applications use iterative linear algebra methods, the soft error vulnerability of these methods constitutes a large fraction of the applications overall vulnerability. Many users consider these methods invulnerable to most soft errors since they converge from an imprecise solution to a precise one. However, we show in this paper that iterative methods are vulnerable to soft errors, exhibiting both silent data corruptions and poor ability to detect errors. Further, we evaluate a variety of soft error detection and tolerance techniques, including checkpointing, linear matrix encodings, and residual tracking techniques.},
doi = {},
journal = {},
number = ,
volume = ,
place = {United States},
year = {2008},
month = {1}
}

Conference:
Other availability
Please see Document Availability for additional information on obtaining the full-text document. Library patrons may search WorldCat to identify libraries that hold this conference proceeding.

Save / Share: