skip to main content
OSTI.GOV title logo U.S. Department of Energy
Office of Scientific and Technical Information

Title: Failure detection in high-performance clusters and computers using chaotic map computations

Patent ·
OSTI ID:1213445

A programmable media includes a processing unit capable of independent operation in a machine that is capable of executing 10.sup.18 floating point operations per second. The processing unit is in communication with a memory element and an interconnect that couples computing nodes. The programmable media includes a logical unit configured to execute arithmetic functions, comparative functions, and/or logical functions. The processing unit is configured to detect computing component failures, memory element failures and/or interconnect failures by executing programming threads that generate one or more chaotic map trajectories. The central processing unit or graphical processing unit is configured to detect a computing component failure, memory element failure and/or an interconnect failure through an automated comparison of signal trajectories generated by the chaotic maps.

Research Organization:
Oak Ridge National Laboratory (ORNL), Oak Ridge, TN (United States)
Sponsoring Organization:
USDOE
DOE Contract Number:
AC05-00OR22725
Assignee:
UT-Battelle, LLC (Oak Ridge, TN)
Patent Number(s):
9,122,603
Application Number:
13/919,601
OSTI ID:
1213445
Resource Relation:
Patent File Date: 2013 Jun 17
Country of Publication:
United States
Language:
English

References (13)

Test and measurement system for detecting and monitoring faults and losses in passive optical networks (PONs) patent May 2002
Integrated control and diagnostics system patent November 2007
Chaos: An Introduction to Dynamical Systems journal November 1997
Basic concepts and taxonomy of dependable and secure computing journal January 2004
Designing programs that check their work journal January 1995
Toward Exascale Resilience journal September 2009
The International Exascale Software Project roadmap journal January 2011
Quasiperiodic Route to Chaotic Dynamics of Internet Transport Protocols journal May 2005
Chaos: A tutorial for engineers journal January 1987
Computational complexity issues in operative diagnosis of graph-based systems journal April 1993
On Dynamics of Transport Protocols Over Wide-Area Internet Connections book January 2005
On polynomial-time testable combinational circuits journal January 1994
Fail-stop processors: an approach to designing fault-tolerant computing systems journal August 1983