Failure detection in high-performance clusters and computers using chaotic map computations
Abstract
A programmable media includes a processing unit capable of independent operation in a machine that is capable of executing 10.sup.18 floating point operations per second. The processing unit is in communication with a memory element and an interconnect that couples computing nodes. The programmable media includes a logical unit configured to execute arithmetic functions, comparative functions, and/or logical functions. The processing unit is configured to detect computing component failures, memory element failures and/or interconnect failures by executing programming threads that generate one or more chaotic map trajectories. The central processing unit or graphical processing unit is configured to detect a computing component failure, memory element failure and/or an interconnect failure through an automated comparison of signal trajectories generated by the chaotic maps.
- Inventors:
- Issue Date:
- Research Org.:
- Oak Ridge National Laboratory (ORNL), Oak Ridge, TN (United States)
- Sponsoring Org.:
- USDOE
- OSTI Identifier:
- 1213445
- Patent Number(s):
- 9122603
- Application Number:
- 13/919,601
- Assignee:
- UT-Battelle, LLC (Oak Ridge, TN)
- Patent Classifications (CPCs):
-
G - PHYSICS G06 - COMPUTING G06F - ELECTRIC DIGITAL DATA PROCESSING
- DOE Contract Number:
- AC05-00OR22725
- Resource Type:
- Patent
- Resource Relation:
- Patent File Date: 2013 Jun 17
- Country of Publication:
- United States
- Language:
- English
- Subject:
- 97 MATHEMATICS AND COMPUTING
Citation Formats
Rao, Nageswara S. Failure detection in high-performance clusters and computers using chaotic map computations. United States: N. p., 2015.
Web.
Rao, Nageswara S. Failure detection in high-performance clusters and computers using chaotic map computations. United States.
Rao, Nageswara S. Tue .
"Failure detection in high-performance clusters and computers using chaotic map computations". United States. https://www.osti.gov/servlets/purl/1213445.
@article{osti_1213445,
title = {Failure detection in high-performance clusters and computers using chaotic map computations},
author = {Rao, Nageswara S.},
abstractNote = {A programmable media includes a processing unit capable of independent operation in a machine that is capable of executing 10.sup.18 floating point operations per second. The processing unit is in communication with a memory element and an interconnect that couples computing nodes. The programmable media includes a logical unit configured to execute arithmetic functions, comparative functions, and/or logical functions. The processing unit is configured to detect computing component failures, memory element failures and/or interconnect failures by executing programming threads that generate one or more chaotic map trajectories. The central processing unit or graphical processing unit is configured to detect a computing component failure, memory element failure and/or an interconnect failure through an automated comparison of signal trajectories generated by the chaotic maps.},
doi = {},
journal = {},
number = ,
volume = ,
place = {United States},
year = {2015},
month = {9}
}
Works referenced in this record:
Test and measurement system for detecting and monitoring faults and losses in passive optical networks (PONs)
patent, May 2002
- Holland, William R.
- US Patent Document 6,396,575
Integrated control and diagnostics system
patent, November 2007
- Discenzo, Frederick M.
- US Patent Document 7,301,296
Chaos: An Introduction to Dynamical Systems
journal, November 1997
- Alligood, Kathleen T.; Sauer, Tim D.; Yorke, James A.
- Physics Today, Vol. 50, Issue 11
Basic concepts and taxonomy of dependable and secure computing
journal, January 2004
- Avizienis, A.; Laprie, J. -C.; Randell, B.
- IEEE Transactions on Dependable and Secure Computing, Vol. 1, Issue 1
Designing programs that check their work
journal, January 1995
- Blum, Manuel; Kannan, Sampath
- Journal of the ACM, Vol. 42, Issue 1
Toward Exascale Resilience
journal, September 2009
- Cappello, Franck; Geist, Al; Gropp, Bill
- The International Journal of High Performance Computing Applications, Vol. 23, Issue 4
The International Exascale Software Project roadmap
journal, January 2011
- Dongarra, Jack; Beckman, Pete; Moore, Terry
- The International Journal of High Performance Computing Applications, Vol. 25, Issue 1
Quasiperiodic Route to Chaotic Dynamics of Internet Transport Protocols
journal, May 2005
- Gao, Jian-Bo; Rao, Nageswara S. V.; Hu, Jing
- Physical Review Letters, Vol. 94, Issue 19
Chaos: A tutorial for engineers
journal, January 1987
- Parker, T. S.; Chua, L. O.
- Proceedings of the IEEE, Vol. 75, Issue 8
Computational complexity issues in operative diagnosis of graph-based systems
journal, April 1993
- Rao, N. S. V.
- IEEE Transactions on Computers, Vol. 42, Issue 4
On Dynamics of Transport Protocols Over Wide-Area Internet Connections
book, January 2005
- Rao, Nageswara S. V.; Gao, Jianbo; Chua, Leon O.
- Complex Dynamics in Communication Networks
On polynomial-time testable combinational circuits
journal, January 1994
- Rao, N. S. V.; Toida, S.
- IEEE Transactions on Computers, Vol. 43, Issue 11
Fail-stop processors: an approach to designing fault-tolerant computing systems
journal, August 1983
- Schlichting, Richard D.; Schneider, Fred B.
- ACM Transactions on Computer Systems, Vol. 1, Issue 3