Summary: Software-Controlled Fault Tolerance
George A. Reis
Jonathan Chang
Neil Vachharajani
Ram Rangan
David I. August
Shubhendu S. Mukherjee
Traditional fault tolerance techniques typically utilize resources ineffectively because they
cannot adapt to the changing reliability and performance demands of a system. This paper
proposes software-controlled fault tolerance, a concept allowing designers and users to tailor
their performance and reliability for each situation. Several software-controllable fault
detection techniques are then presented: SWIFT, a software-only technique, and CRAFT, a suite
of hybrid hardware/software techniques. Finally, the paper introduces PROFiT, a technique that
adjusts the level of protection and performance at fine granularities through software control.
When coupled with software-controllable techniques like SWIFT and CRAFT, PROFiT offers
attractive and novel reliability options.
Categories and Subject Descriptors: C.4.2 [Performance of Systems]: Fault tolerance
General Terms: Reliability
Additional Key Words and Phrases: software-controlled fault tolerance, fault detection, reliability


Source: August, David - Department of Computer Science, Princeton University


Collections: Computer Technologies and Information Sciences