Fault tolerance for VLSI multicomputers

Tamir, Y

Fault tolerance for VLSI multicomputers

Thesis/Dissertation · Tue Jan 01 04:00:00 EST 1985

OSTI ID:5127488

Tamir, Y

The performance requirements of future high-end computers will only be met by systems that facilitate the exploitation of the parallelism inherent in the algorithms that they execute. One such system is a multicomputer that consists of hundreds or thousands of VLSI computation nodes interconnected by dedicated links. Some important applications of high-end computers, such as weather forecasting, require continuous correct operation for many hours. This requirement can only be met if the system is fault-tolerant, i.e., can continue to operate correctly despite the failure of some of its components. This dissertation investigates the use of fault tolerance techniques to increase the reliability of VLSI multicomputers. Different techniques are evaluated in the context of the entire system, its implementation technology, and intended applications. A proposed fault tolerance scheme combines hardware that performs error detection and system-level protocols for error recovery and fault treatment. Practical design and implementation tradeoffs are discussed. A fault-tolerant system must identify erroneous information produced by faulty hardware. It is shown that a high probability of error detection can be achieved with self-checking nodes implemented using duplication and comparison.

🛈

OSTI does not have a digital full text copy available. For more information, please see document availability, search WorldCat, or search Google Scholar.

Research Organization:: California Univ., Berkeley (USA)

OSTI ID:: 5127488

Country of Publication:: United States

Language:: English

Similar Records

Self-checking VLSI building blocks for fault-tolerant multicomputers

Conference · Fri Dec 31 23:00:00 EST 1982 · OSTI ID:5198147

Fault tolerant VLSI multicomputers

Book · Wed Dec 31 23:00:00 EST 1986 · OSTI ID:5384601

Fault tolerance in multistage interconnection network-based multicomputer systems

Thesis/Dissertation · Wed Dec 31 23:00:00 EST 1986 · OSTI ID:5705671

Related Subjects

99 GENERAL AND MISCELLANEOUS
990200* -- Mathematics & Computers
ALGORITHMS
COMPUTER NETWORKS
COMPUTERS
DETECTION
DIGITAL COMPUTERS
ELECTRONIC CIRCUITS
ERRORS
FAULT TOLERANT COMPUTERS
INTEGRATED CIRCUITS
MATHEMATICAL LOGIC
MICROELECTRONIC CIRCUITS
PARALLEL PROCESSING
PROGRAMMING

Fault tolerance for VLSI multicomputers

Citation Formats

Similar Records

Related Subjects