PRESAGE: Protecting Structured Address Generation against Soft Errors
Abstract
Modern computer scaling trends in pursuit of larger component counts and power efficiency have, unfortunately, lead to less reliable hardware and consequently soft errors escaping into application data ("silent data corruptions"). Techniques to enhance system resilience hinge on the availability of efficient error detectors that have high detection rates, low false positive rates, and lower computational overhead. Unfortunately, efficient detectors to detect faults during address generation (to index large arrays) have not been widely researched. We present a novel lightweight compiler-driven technique called PRESAGE for detecting bit-flips affecting structured address computations. A key insight underlying PRESAGE is that any address computation scheme that flows an already incurred error is better than a scheme that corrupts one particular array access but otherwise (falsely) appears to compute perfectly. Enabling the flow of errors allows one to situate detectors at loop exit points, and helps turn silent corruptions into easily detectable error situations. Our experiments using PolyBench benchmark suite indicate that PRESAGE-based error detectors have a high error-detection rate while incurring low overheads.
- Authors:
- Publication Date:
- Research Org.:
- Pacific Northwest National Lab. (PNNL), Richland, WA (United States)
- Sponsoring Org.:
- USDOE
- OSTI Identifier:
- 1440680
- Report Number(s):
- PNNL-SA-121138
KJ0402010
- DOE Contract Number:
- AC05-76RL01830
- Resource Type:
- Conference
- Resource Relation:
- Conference: IEEE 23rd International Conference on High Performance Computing (HiPC), December 19-22, 2016, Hyderabad, India, 252-261
- Country of Publication:
- United States
- Language:
- English
Citation Formats
Sharma, Vishal C., Gopalakrishnan, Ganesh, and Krishnamoorthy, Sriram. PRESAGE: Protecting Structured Address Generation against Soft Errors. United States: N. p., 2017.
Web. doi:10.1109/HiPC.2016.037.
Sharma, Vishal C., Gopalakrishnan, Ganesh, & Krishnamoorthy, Sriram. PRESAGE: Protecting Structured Address Generation against Soft Errors. United States. https://doi.org/10.1109/HiPC.2016.037
Sharma, Vishal C., Gopalakrishnan, Ganesh, and Krishnamoorthy, Sriram. Thu .
"PRESAGE: Protecting Structured Address Generation against Soft Errors". United States. https://doi.org/10.1109/HiPC.2016.037.
@article{osti_1440680,
title = {PRESAGE: Protecting Structured Address Generation against Soft Errors},
author = {Sharma, Vishal C. and Gopalakrishnan, Ganesh and Krishnamoorthy, Sriram},
abstractNote = {Modern computer scaling trends in pursuit of larger component counts and power efficiency have, unfortunately, lead to less reliable hardware and consequently soft errors escaping into application data ("silent data corruptions"). Techniques to enhance system resilience hinge on the availability of efficient error detectors that have high detection rates, low false positive rates, and lower computational overhead. Unfortunately, efficient detectors to detect faults during address generation (to index large arrays) have not been widely researched. We present a novel lightweight compiler-driven technique called PRESAGE for detecting bit-flips affecting structured address computations. A key insight underlying PRESAGE is that any address computation scheme that flows an already incurred error is better than a scheme that corrupts one particular array access but otherwise (falsely) appears to compute perfectly. Enabling the flow of errors allows one to situate detectors at loop exit points, and helps turn silent corruptions into easily detectable error situations. Our experiments using PolyBench benchmark suite indicate that PRESAGE-based error detectors have a high error-detection rate while incurring low overheads.},
doi = {10.1109/HiPC.2016.037},
url = {https://www.osti.gov/biblio/1440680},
journal = {},
number = ,
volume = ,
place = {United States},
year = {2017},
month = {2}
}