Skip to main content
U.S. Department of Energy
Office of Scientific and Technical Information

Failure detection in high-performance clusters and computers using chaotic map computations

Patent ·
OSTI ID:1213445
A programmable media includes a processing unit capable of independent operation in a machine that is capable of executing 10.sup.18 floating point operations per second. The processing unit is in communication with a memory element and an interconnect that couples computing nodes. The programmable media includes a logical unit configured to execute arithmetic functions, comparative functions, and/or logical functions. The processing unit is configured to detect computing component failures, memory element failures and/or interconnect failures by executing programming threads that generate one or more chaotic map trajectories. The central processing unit or graphical processing unit is configured to detect a computing component failure, memory element failure and/or an interconnect failure through an automated comparison of signal trajectories generated by the chaotic maps.
Research Organization:
Oak Ridge National Laboratory (ORNL), Oak Ridge, TN (United States)
Sponsoring Organization:
USDOE
DOE Contract Number:
AC05-00OR22725
Assignee:
UT-Battelle, LLC (Oak Ridge, TN)
Patent Number(s):
9,122,603
Application Number:
13/919,601
OSTI ID:
1213445
Country of Publication:
United States
Language:
English

References (13)

On Dynamics of Transport Protocols Over Wide-Area Internet Connections book January 2005
Super-Scalable Algorithms for Computing on 100,000 Processors book December 2004
A Probabilistic Theory of Pattern Recognition book April 1996
Chaos: An Introduction to Dynamical Systems journal November 1997
Quasiperiodic Route to Chaotic Dynamics of Internet Transport Protocols journal May 2005
Computational complexity issues in operative diagnosis of graph-based systems journal April 1993
On polynomial-time testable combinational circuits journal January 1994
Chaos: A tutorial for engineers journal January 1987
Basic concepts and taxonomy of dependable and secure computing journal January 2004
Designing programs that check their work journal January 1995
Fail-stop processors: an approach to designing fault-tolerant computing systems journal August 1983
Toward Exascale Resilience journal September 2009
The International Exascale Software Project roadmap journal January 2011

Similar Records

Fault Diagnosis of Hybrid Computing Systems Using Chaotic-Map Method
Book · Thu Nov 01 00:00:00 EDT 2018 · OSTI ID:1561635

Fault Diagnosis of Hybrid Computing Systems Using Chaotic-Map Method
Book · Thu Nov 01 00:00:00 EDT 2018 · OSTI ID:1649633

Heterogeneous graphics processing unit for scheduling thread groups for execution on variable width SIMD units
Patent · Tue Jul 14 00:00:00 EDT 2020 · OSTI ID:1735025

Related Subjects