Fault tolerance via specialization: An efficient approach for tolerating general failures
- Michigan State Univ., East Lansing, MI (United States)
We consider a cooperative system consisting of a set of servers that provide some service to a set of clients. The service is implemented by accessing some shared objects. The goal is to provide reliable service in spite of client or server failures such that the overhead during normal operating periods is low. We consider a relatively general fault model in which a faulty processor can write spurious data for a period of time before it is detected and removed from the system. A specialization technique is used to achieve efficient fault tolerance. Due to the specialization of servers, each server is prevented from arbitrarily damaging the system. The system is designed to tolerate the general (but not malicious) failure of a processor at any step in the computation process.
- OSTI ID:
- 501665
- Report Number(s):
- CONF-961239--
- Country of Publication:
- United States
- Language:
- English
Similar Records
Generalized measures of fault tolerance with application to N-cube networks
Highly fault-tolerant parallel computation