DOE Patents title logo U.S. Department of Energy
Office of Scientific and Technical Information

Title: System and methods for hardware-software cooperative pipeline error detection

Abstract

An error reporting system utilizes a parity checker to receive data results from execution of an original instruction and a parity bit for the data. A decoder receives an error correcting code (ECC) for data resulting from execution of a shadow instruction of the original instruction, and data error correction is initiated on the original instruction result on condition of a mismatch between the parity bit and the original instruction result, and the decoder asserting a correctable error in the original instruction result.

Inventors:
; ; ; ;
Issue Date:
Research Org.:
Lawrence Livermore National Laboratory (LLNL), Livermore, CA (United States)
Sponsoring Org.:
USDOE
OSTI Identifier:
1924970
Patent Number(s):
11409597
Application Number:
16/811,499
Assignee:
Nvidia Corp. (Santa Clara, CA)
Patent Classifications (CPCs):
G - PHYSICS G06 - COMPUTING G06F - ELECTRIC DIGITAL DATA PROCESSING
DOE Contract Number:  
B620719
Resource Type:
Patent
Resource Relation:
Patent File Date: 03/06/2020
Country of Publication:
United States
Language:
English
Subject:
97 MATHEMATICS AND COMPUTING

Citation Formats

Sullivan, Michael, Hari, Siva, Zimmer, Brian, Tsai, Timothy, and Keckler, Stephen W. System and methods for hardware-software cooperative pipeline error detection. United States: N. p., 2022. Web.
Sullivan, Michael, Hari, Siva, Zimmer, Brian, Tsai, Timothy, & Keckler, Stephen W. System and methods for hardware-software cooperative pipeline error detection. United States.
Sullivan, Michael, Hari, Siva, Zimmer, Brian, Tsai, Timothy, and Keckler, Stephen W. Tue . "System and methods for hardware-software cooperative pipeline error detection". United States. https://www.osti.gov/servlets/purl/1924970.
@article{osti_1924970,
title = {System and methods for hardware-software cooperative pipeline error detection},
author = {Sullivan, Michael and Hari, Siva and Zimmer, Brian and Tsai, Timothy and Keckler, Stephen W.},
abstractNote = {An error reporting system utilizes a parity checker to receive data results from execution of an original instruction and a parity bit for the data. A decoder receives an error correcting code (ECC) for data resulting from execution of a shadow instruction of the original instruction, and data error correction is initiated on the original instruction result on condition of a mismatch between the parity bit and the original instruction result, and the decoder asserting a correctable error in the original instruction result.},
doi = {},
journal = {},
number = ,
volume = ,
place = {United States},
year = {2022},
month = {8}
}

Works referenced in this record:

Error Corrections Coding Over Multiple Memory Pages
patent-application, October 2013


Integrated circuit device and method for reducing SRAM leakage
patent-application, February 2017


Arithmetic Error Codes: Cost and Effectiveness Studies for Application in Digital System Design
journal, November 1971


Method and apparatus for providing error correction within a register file of a CPU
patent, June 2006


Processor register error correction management
patent, December 2016


Tolerating soft errors by selective duplication
patent-application, December 2011


Method and apparatus for providing error correction within a register file of a CPU
patent-application, November 2003


System and apparatus for error-correcting register files
patent, October 2012


System and method for testing a logic-based processing device
patent-application, December 2015


System and apparatus for error-correcting register files
patent-application, February 2011


Semiconductor Memory Devices and Memory Systems Including the Same
patent-application, April 2017


Compiler-Managed Software-based Redundant Multi-Threading for Transient Fault Detection
conference, March 2007


Error detection by duplicated instructions in super-scalar processors
journal, March 2002


Error correction for flash memory
patent-application, May 2010


Error correction method with instruction level rollback
patent, January 2012


Fault Free Store Data Path for Software Implementation of Redundant Multithreading Environments
patent-application, July 2006


Real-world design and evaluation of compiler-managed GPU redundant multithreading
conference, June 2014


SwapCodes: Error Codes for Hardware-Software Cooperative GPU Pipeline Error Detection
conference, October 2018


Error detection method and system for processors that employ alternating threads
patent-application, June 2005


Understanding software approaches for GPGPU reliability
conference, March 2009


Instruction and logic for support of code modification
patent-application, October 2015