Skip to main content
U.S. Department of Energy
Office of Scientific and Technical Information

Fault tolerance via specialization: An efficient approach for tolerating general failures

Conference ·
OSTI ID:501665
 [1]
  1. Michigan State Univ., East Lansing, MI (United States)

We consider a cooperative system consisting of a set of servers that provide some service to a set of clients. The service is implemented by accessing some shared objects. The goal is to provide reliable service in spite of client or server failures such that the overhead during normal operating periods is low. We consider a relatively general fault model in which a faulty processor can write spurious data for a period of time before it is detected and removed from the system. A specialization technique is used to achieve efficient fault tolerance. Due to the specialization of servers, each server is prevented from arbitrarily damaging the system. The system is designed to tolerate the general (but not malicious) failure of a processor at any step in the computation process.

OSTI ID:
501665
Report Number(s):
CONF-961239--
Country of Publication:
United States
Language:
English

Similar Records

On fault-tolerant structure, distributed fault-diagnosis, reconfiguration, and recovery of the array processors
Journal Article · Sat Jul 01 00:00:00 EDT 1989 · IEEE Trans. Comput.; (United States) · OSTI ID:5849821

Generalized measures of fault tolerance with application to N-cube networks
Journal Article · Tue Oct 31 23:00:00 EST 1989 · IEEE (Institute of Electrical and Electronics Engineers) Transactions on Computers; (USA) · OSTI ID:5242881

Highly fault-tolerant parallel computation
Conference · Mon Dec 30 23:00:00 EST 1996 · OSTI ID:457647