DOE Patents title logo U.S. Department of Energy
Office of Scientific and Technical Information

Title: Fault isolation through no-overhead link level CRC

Abstract

A fault isolation technique for checking the accuracy of data packets transmitted between nodes of a parallel processor. An independent crc is kept of all data sent from one processor to another, and received from one processor to another. At the end of each checkpoint, the crcs are compared. If they do not match, there was an error. The crcs may be cleared and restarted at each checkpoint. In the preferred embodiment, the basic functionality is to calculate a CRC of all packet data that has been successfully transmitted across a given link. This CRC is done on both ends of the link, thereby allowing an independent check on all data believed to have been correctly transmitted. Preferably, all links have this CRC coverage, and the CRC used in this link level check is different from that used in the packet transfer protocol. This independent check, if successfully passed, virtually eliminates the possibility that any data errors were missed during the previous transfer period.

Inventors:
 [1];  [2];  [3]
  1. Croton On Hudson, NY
  2. Yorktown Heights, NY
  3. Mount Kisco, NY
Issue Date:
Research Org.:
International Business Machines Corp., Armonk, NY (United States)
Sponsoring Org.:
USDOE
OSTI Identifier:
950118
Patent Number(s):
7210088
Application Number:
10/468,996
Assignee:
International Business Machines Corporation (Armonk, NY)
Patent Classifications (CPCs):
F - MECHANICAL ENGINEERING F04 - POSITIVE - DISPLACEMENT MACHINES FOR LIQUIDS F04D - NON-POSITIVE-DISPLACEMENT PUMPS
F - MECHANICAL ENGINEERING F24 - HEATING F24F - AIR-CONDITIONING
DOE Contract Number:  
B517552
Resource Type:
Patent
Country of Publication:
United States
Language:
English

Citation Formats

Chen, Dong, Coteus, Paul W, and Gara, Alan G. Fault isolation through no-overhead link level CRC. United States: N. p., 2007. Web.
Chen, Dong, Coteus, Paul W, & Gara, Alan G. Fault isolation through no-overhead link level CRC. United States.
Chen, Dong, Coteus, Paul W, and Gara, Alan G. Tue . "Fault isolation through no-overhead link level CRC". United States. https://www.osti.gov/servlets/purl/950118.
@article{osti_950118,
title = {Fault isolation through no-overhead link level CRC},
author = {Chen, Dong and Coteus, Paul W and Gara, Alan G},
abstractNote = {A fault isolation technique for checking the accuracy of data packets transmitted between nodes of a parallel processor. An independent crc is kept of all data sent from one processor to another, and received from one processor to another. At the end of each checkpoint, the crcs are compared. If they do not match, there was an error. The crcs may be cleared and restarted at each checkpoint. In the preferred embodiment, the basic functionality is to calculate a CRC of all packet data that has been successfully transmitted across a given link. This CRC is done on both ends of the link, thereby allowing an independent check on all data believed to have been correctly transmitted. Preferably, all links have this CRC coverage, and the CRC used in this link level check is different from that used in the packet transfer protocol. This independent check, if successfully passed, virtually eliminates the possibility that any data errors were missed during the previous transfer period.},
doi = {},
journal = {},
number = ,
volume = ,
place = {United States},
year = {2007},
month = {4}
}