Home

About

Advanced Search

Browse by Discipline

Scientific Societies

E-print Alerts

Add E-prints

E-print Network
FAQHELPSITE MAPCONTACT US


  Advanced Search  

 
Fail-Stutter Fault Tolerance Remzi H. Arpaci-Dusseau and Andrea C. Arpaci-Dusseau
 

Summary: Fail-Stutter Fault Tolerance
Remzi H. Arpaci-Dusseau and Andrea C. Arpaci-Dusseau
Department of Computer Sciences, University of Wisconsin, Madison
Abstract
Traditional fault models present system designers with two ex-
tremes: the Byzantine fault model, which is general and there-
fore difficult to apply, and the fail-stop fault model, which is easier
to employ but does not accurately capture modern device behav-
ior. To address this gap, we introduce the concept of fail-stutter
fault tolerance, a realistic and yet tractable fault model that ac-
counts for both absolute failure and a new range of performance
failures common in modern components. Systems built under the
fail-stutter model will likely perform well, be highly reliable and
available, and be easier to manage when deployed.
1 Introduction
Dealing with failure in large-scale systems remains a chal-
lenging problem. In designing the systems that form the
backbone of Internet services, databases, and storage sys-
tems, one must account for the possibility or even likelihood
that one or more components will cease to operate correctly;

  

Source: Arpaci-Dusseau, Andrea - Department of Computer Sciences, University of Wisconsin at Madison
Arpaci-Dusseau, Remzi - Department of Computer Sciences, Department of Computer Sciences, University of Wisconsin at Madison

 

Collections: Computer Technologies and Information Sciences