Home

About

Advanced Search

Browse by Discipline

Scientific Societies

E-print Alerts

Add E-prints

E-print Network
FAQHELPSITE MAPCONTACT US


  Advanced Search  

 
mSWAT: Low-Cost Hardware Fault Detection and Diagnosis for Multicore Systems
 

Summary: mSWAT: Low-Cost Hardware Fault Detection and
Diagnosis for Multicore Systems

Siva Kumar Sastry Hari, Man-Lap Li, Pradeep Ramachandran, Byn Choi, Sarita V. Adve
Department of Computer Science
University of Illinois at Urbana-Champaign
swat@cs.uiuc.edu
ABSTRACT
Continued technology scaling is resulting in systems with billions
of devices. Unfortunately, these devices are prone to failures from
various sources, resulting in even commodity systems being af-
fected by the growing reliability threat. Thus, traditional solutions
involving high redundancy or piecemeal solutions targeting specific
failure modes will no longer be viable owing to their high over-
heads. Recent reliability solutions have explored using low-cost
monitors that watch for anomalous software behavior as a symptom
of hardware faults. We previously proposed the SWAT system that
uses such low-cost detectors to detect hardware faults, and a higher
cost mechanism for diagnosis. However, all of the prior work in this
context, including SWAT, assumes single-threaded applications and

  

Source: Adve, Sarita - Department of Computer Science, University of Illinois at Urbana-Champaign

 

Collections: Computer Technologies and Information Sciences