Distributed recovery in fault-tolerant multiprocessor networks

Yanney, R M; Hayes, J P

doi:10.1109/TC.1986.1676678

Distributed recovery in fault-tolerant multiprocessor networks

Journal Article · Wed Oct 01 04:00:00 EDT 1986 · IEEE Trans. Comput.; (United States)

DOI:https://doi.org/10.1109/TC.1986.1676678· OSTI ID:5255272

Yanney, R M; Hayes, J P

A methodology for characterizing dynamic distributed recovery in fault-tolerant multiprocessor systems is developed using graph theory. Distributed recovery, which is intended for systems with non central supervisor, depends on the cooperation of a set of processors to execute the recovery function, since each processor is assumed to have only a limited amount of information about the system as a whole. Facility graphs, whose nodes denote the system components (processors), and whose edges denote interconnection between components, are used to represent multiprocessor systems, and error conditions. A general distributed recovery strategy R, which allows global recovery to be achieved via a sequence of local actions, is given. R recovers the system in several steps in which different nodes successfully act as the local supervisor. R is specialized for two important classes of systems: loop networks and tree networks. For each of these cases, fault-tolerant designs and their associated distributed recovery strategies, which allow recovery from up to k faults within a specified number of steps, are presented.

Research Organization:: TRW, Redondo Beach, CA 90278

OSTI ID:: 5255272

Journal Information:: IEEE Trans. Comput.; (United States), Journal Name: IEEE Trans. Comput.; (United States) Vol. C-35:10; ISSN ITCOB

Country of Publication:: United States

Language:: English

Similar Records

Designing and reconfiguring fault-tolerant multiprocessor systems

Thesis/Dissertation · Sun Dec 31 23:00:00 EST 1989 · OSTI ID:7046530

On fault-tolerant mechanisms in distributed systems

Thesis/Dissertation · Thu Dec 31 23:00:00 EST 1987 · OSTI ID:6309833

Fault-tolerant interconnection networks for multiprocessor systems

Thesis/Dissertation · Sat Dec 31 23:00:00 EST 1988 · OSTI ID:6089434

Related Subjects

99 GENERAL AND MISCELLANEOUS
990200* -- Mathematics & Computers
ARRAY PROCESSORS
COMPUTER GRAPHICS
COMPUTERS
DATA PROCESSING
DIGITAL COMPUTERS
DISTRIBUTED DATA PROCESSING
EQUIPMENT INTERFACES
ERRORS
FAULT TOLERANT COMPUTERS
GRAPHS
PROCESSING
RELIABILITY

Distributed recovery in fault-tolerant multiprocessor networks

Citation Formats

Similar Records

Related Subjects