Failure detection in high-performance clusters and computers using chaotic map computations
A programmable media includes a processing unit capable of independent operation in a machine that is capable of executing 10.sup.18 floating point operations per second. The processing unit is in communication with a memory element and an interconnect that couples computing nodes. The programmable media includes a logical unit configured to execute arithmetic functions, comparative functions, and/or logical functions. The processing unit is configured to detect computing component failures, memory element failures and/or interconnect failures by executing programming threads that generate one or more chaotic map trajectories. The central processing unit or graphical processing unit is configured to detect a computing component failure, memory element failure and/or an interconnect failure through an automated comparison of signal trajectories generated by the chaotic maps.
- Research Organization:
- Oak Ridge National Laboratory (ORNL), Oak Ridge, TN (United States)
- Sponsoring Organization:
- USDOE
- DOE Contract Number:
- AC05-00OR22725
- Assignee:
- UT-Battelle, LLC (Oak Ridge, TN)
- Patent Number(s):
- 9,122,603
- Application Number:
- 13/919,601
- OSTI ID:
- 1213445
- Resource Relation:
- Patent File Date: 2013 Jun 17
- Country of Publication:
- United States
- Language:
- English
Similar Records
Fault Diagnosis of Hybrid Computing Systems Using Chaotic-Map Method
Computer Science Research Needs for Parallel Discrete Event Simulation (PDES)